Web Scraping 101: A Million Dollar Project Idea

Sdílet
Vložit
  • čas přidán 28. 08. 2024

Komentáře • 361

  • @TechWithTim
    @TechWithTim  Před rokem +41

    Let me know any web scraping project ideas you have!
    Sign up for @BrightData here and get $15 in FREE credit! brdta.com/techwithtim.

    • @AR-rg2en
      @AR-rg2en Před rokem +1

      Thanks for all the videos man, I appreciate your work.

    • @jgalvan09
      @jgalvan09 Před rokem

      you know this is exactly why Twitter limit the amount of content people can look at lol use AI bot to do web scraping for you lol we're going to abuse social media now to the point every platform will have to set limits lmao

    • @nevilleachi6888
      @nevilleachi6888 Před rokem

      i am new to web scraping but i have been researching on how to make an ai webscraper that can automatically open the ispect page of a website and extract the necessary elements given to it by a user. the user only needs to specify what he wants from a website and the scraper does all the work. it can be particularly helpful for newbies

    • @AR-rg2en
      @AR-rg2en Před rokem

      @@nevilleachi6888 If you could make such a thing I'll use it 😂

    • @Pabbzzo
      @Pabbzzo Před rokem

      ​@@AR-rg2en🎉😊😢

  • @nyosdroid868
    @nyosdroid868 Před rokem +374

    I did similar projects in the past (in php long time ago with curl and tidy, now in python with bs4 or selenium). To avoid bot detection, you can use the tor network, detect captcha request, and simply reconnect tor to get a different IP. Of course, you still need to put random sleeps here and there. Also, you need to be prepared to update or rewrite your code any time the website changes.
    Nice video!

    • @sergesmitty137
      @sergesmitty137 Před rokem +44

      "Gathering" data from other people is one of the holy grails of programming.
      OpenAI is the epitome of this aren't they - Gathered data from all of the internet, regurgitated it with "AI magic" and sell it back to consumers as their own.

    • @DerickMasai
      @DerickMasai Před rokem +20

      @@sergesmitty137 You say that in quotes as if it is some gimmick yet multiple multi-billion dollar corporations weren't trying pouring a fuckton of money into similar projects and all failed or produced AI barely worth mentioning. Besides, they aren't selling the data they scrapped, that was just used to train the model, they're selling you a tool that can do practically anything digital you'd hire a minimum wage employee to do, for now.

    • @DerickMasai
      @DerickMasai Před rokem +12

      As for the original comment, thank you so much for the Tor idea! Never even thought using that as the useragent. Downloading it now.

    • @SaKa-L
      @SaKa-L Před rokem +10

      What an idea , using Tor networking for web scaraping . Thumbs up for that 👍

    • @nyosdroid868
      @nyosdroid868 Před rokem +18

      @@DerickMasai The UserAgent string is simply an up-to-date Firefox/Chrome. What I meant was to use tor as a proxy to hide and replace your real IP quickly.
      tags=b.findAll('p',attrs={"class":"mt-4"}) # whatever style their captcha text is using..
      if(len(tags)>0 and 'Your IP made too many requests' in tags[0].text): # we got caught
      print("CAPTCHA, restart tor")
      restart_tor()
      and that function does:
      def restart_tor():
      try:
      print("restarting tor..")
      time.sleep(random.randint(10,15))
      x=subprocess.run("sudo /etc/init.d/tor restart", shell=True, timeout=30)
      except:
      return False
      print("done, waiting")
      time.sleep(random.randint(25,30))
      return (x.returncode==0)
      I hope the indents stay ok. Anyway: just detect if the target site caught you scraping, and instead of solving the captcha, just restart tor. When it reconnects, the new IP you get can redo the last request. (Also I've put my user into the sudoers so the script can do the restart part without my password.) The timing magic numbers were a bit more sophisticated, but it works.

  • @TomGmaan
    @TomGmaan Před 2 měsíci +469

    I just turned 41 and awfully late to investing with barely any portfolio except my 401k, I have a decent amount of cash saved up and with inflation currently soaring AGAIN, I’m getting worried about retirement, my intention is to retire at 65 atleast, so how best do I maximize my savings of over $500k

    • @KeanuReevesf
      @KeanuReevesf Před 2 měsíci +3

      Retirement is now more difficult than it was in the past. it's all about balancing your risk tolerance with your long-term goals. Maybe consider speaking to an advisor to help in diversifying your portfolio to spread out the risk.

    • @NicoleAniston-o
      @NicoleAniston-o Před 2 měsíci +2

      Generally speaking, a good number of people discredit the effectiveness of financial advisor in planning for retirement, For over the past 10years, I’ve had a financial advisor consistently restructure and diversify my portfolio/expenses and I’ve made over $3m in gains… might not be a lot but retirement doesn’t seem so farfetched anymore.

    • @EmpressDelivery
      @EmpressDelivery Před 2 měsíci

      Your advisor must be really good. How I can get in touch? My retirement portfolio's decline is a concern, and I could use some guidance.

    • @NicoleAniston-o
      @NicoleAniston-o Před 2 měsíci +3

      Melissa Elise Robinson has always been on the top of my list. She is regarded as a genius in her area and well knowledgeable about financial markets. I highly recommend you look her up if you want excellent collaboration.

    • @EmpressDelivery
      @EmpressDelivery Před 2 měsíci

      Thanks for sharing. I curiously searched for her full name and her website popped up after scrolling a bit. I looked through her credentials and did my due diligence before contacting her.

  • @twentytwentyeight
    @twentytwentyeight Před rokem +111

    Just an anecdote, but when I first got into a data role I built a competitor price web scraper for my company. At the time it was supposed to be a show and tell for my boss in case python work came up in the future. It worked well enough for the business to utilize for beating the comp to vendors on price. We could keep tabs on them at a quick enough pace to adjust quotes and win. Didn't always work out but being able to say a python project netted three extra contracts is really cool

    • @moeyali123
      @moeyali123 Před rokem +3

      Nice, I’m in a data role

    • @gustavonovakoski4867
      @gustavonovakoski4867 Před 5 měsíci +2

      I am currently developing this exact project with Power BI as dataviz tool. Currently have nearly 60 websites scraped and nearly 3300 urls daily updating.
      Kinda sad because everything I do is local and dont have the infrastructure to put it in the cloud or run from VMs (also I dont really know how to get more deep than this lol)
      The company is a multinacional and they DO NOT have market intelligence, it is bizarre

    • @MuhammadFAH33M
      @MuhammadFAH33M Před 2 měsíci +1

      ​@@gustavonovakoski4867
      Awesome 😎
      Would you like to share your knowledge & scripts. How you are managing live data updates from 60 + websites
      How are you then comparing it
      I would love to see your video tutorials or guides
      Thanks 🙏

    • @The_Consciousness
      @The_Consciousness Před měsícem

      ​@MuhammadFAH33M he won't share is soup with strangers

  • @flin4557
    @flin4557 Před rokem +68

    So this is basically an elaborate commercial

    • @UltraPayrollMaster
      @UltraPayrollMaster Před měsícem +4

      i agree. this video is very misleading. and so is every video with titles about web scraping but then uses a paid 3rd party product.

  • @voitkampff
    @voitkampff Před rokem +49

    Great video! Thank you for talking about web scraping as an IT subset. What most people don't know is that it's extremely hard to build your career around it. I have been running my own web scraping software house for several years now and getting clients is more difficult than if you build apps, create websites, or do virtually anything else. People and companies simply don't have enough awareness what access to real time data can do for their businesses. Creating this awareness is actually the biggest part of my job. Writing code is just a formality afterwards.

    • @samtx
      @samtx Před rokem

      If most of these are e-commerce only

    • @MARTIN-101
      @MARTIN-101 Před 10 měsíci

      what is the name of your company ? are you on linkedin ?

    • @shannoncole6425
      @shannoncole6425 Před 5 měsíci +1

      Wouldn’t it be better to build tools that solves specific problems more than trying to convince people they need tools they don’t know about?
      For example I pay more than I want in my business to access a tool that does a small part of a problem I’m trying to fix. I found another tool that fixes the whole problem and I’m happy to spend $30 a month to solve that problem.

  • @nitindevatraj
    @nitindevatraj Před rokem +72

    Need a whole series with videos like these

  • @krvns
    @krvns Před rokem +505

    Thumbs up if sponsored tools are not your content!

    • @thetruthsayer8347
      @thetruthsayer8347 Před rokem +22

      But man’s gotta eat tho

    • @user-nq2jx6md4h
      @user-nq2jx6md4h Před rokem +55

      @@thetruthsayer8347That’s his 1 million Dollar project (Let viewers use the sponsor) 😂

    • @johnsamuel6080
      @johnsamuel6080 Před rokem +2

      His bills won't sort themselves out

    • @11bftw
      @11bftw Před rokem +4

      He mentioned that companies are fighting against scraping...

    • @adamrhea2339
      @adamrhea2339 Před rokem +10

      He literally used the sponsor's tool in the project. I would use the tool regardless if it was a sponsorship or not so in this case case he makes money at my benefit. I have no problem with this.

  • @LeoTang2307
    @LeoTang2307 Před rokem +14

    I'm trying to build as many projects as I can right now to upgrade my skills and cv. This project idea is really helpful to my resume, I'll try to do it using fastapi and mysql (if possible I'll try to track stock price). Truly appreciated for sharing this project, sir!

  • @shitinsideyou
    @shitinsideyou Před rokem +12

    Bright Data makes it easier to steal data from websites that try to protect themselves... Great job BrightData!

  • @codesandtags
    @codesandtags Před rokem +26

    I totally agree, I've done a couple of web scraping scripts for some companies, and to get some specific data, however there are a lot of challenges. But in general the rate limiting for APIs and changes in DOM, or catchaps are the most common ones.

    • @normallyChallenged
      @normallyChallenged Před rokem +2

      What does DOM stand for?

    • @74G_
      @74G_ Před rokem +3

      ​@@normallyChallengeddocument object model

    • @camilocastrillon2030
      @camilocastrillon2030 Před 10 měsíci

      Since react doesn’t modify the dom directly I’m guessing I could take an advantage and sort that out, right?

    • @Dom-zy1qy
      @Dom-zy1qy Před 7 měsíci

      ​@@camilocastrillon2030react does modify the DOM. The dom is updated with the diff of the react virtual Dom.
      This isn't really the issue with web scraping & Dom changes though. They mean "changes to the DOM" as in the websites html just changes as the developers of the website add new features, change the layout, etc.

    • @Brodragon2225
      @Brodragon2225 Před 6 měsíci

      @@camilocastrillon2030 now its saturated hey can u think of how free saas app can make money

  • @SamuXZamorionVII
    @SamuXZamorionVII Před rokem +8

    He is right, I work for tech consulting company. One of the client projects I work on is web scraping project that collects doctor disciplinary action data using python and they pay a lot for us to do that for them.

    • @anonfourtyfive
      @anonfourtyfive Před rokem +6

      dude, I need money.
      I'm literally scraping day and night.
      is there anyway I can help ?
      I'm hungry for data.

    • @vatsalyavigyaverma5494
      @vatsalyavigyaverma5494 Před rokem

      @@anonfourtyfivehow you store those data 😊

    • @anonfourtyfive
      @anonfourtyfive Před rokem +1

      @@vatsalyavigyaverma5494 there's multiple way of storing data...
      I prefer Tinydb or SQlite, but if you work on a bigger project, I would recommend using a cloud storage like mongodb or firebase.

    • @pawsstory1288
      @pawsstory1288 Před 8 měsíci

      @@anonfourtyfivewhat is your goa scraping day and night?

    • @financestrategistbolt1041
      @financestrategistbolt1041 Před měsícem

      How many languages would you recommend someone learn inorder to work in tech consulting like you, currently learning python.

  • @miyu545
    @miyu545 Před 3 měsíci +3

    A simple Chrome extension build would have made this so much easier for you and it's completely transparent for bot blocking.

    • @pf4934
      @pf4934 Před měsícem

      Could you elaborate on this?
      I've tried an extension easyscraper and that too got my IP blocked.

  • @workin3d
    @workin3d Před 8 měsíci +3

    Good idea. If u r in 2013. That's plenty of services who are provided same possibilities

  • @SpacefootLife
    @SpacefootLife Před rokem +58

    Great tool Tim, i look forward to playing around with this. I still have a lot to learn about the Data industry though, keep up the amazing work.

  • @0x656e
    @0x656e Před rokem +8

    I love web scrapping; web scrapping is really satisfying when you finish your project.

  • @blizzard18
    @blizzard18 Před 4 měsíci +5

    this video is the reason removing the dislike button was a bad idea

  • @Coole986
    @Coole986 Před rokem +6

    20$ / GB is huge and you will hit that really fast lol

  • @yamani3882
    @yamani3882 Před 9 měsíci +5

    Just the cost of Bright Data makes this not applicable. I want to make 10,000 requests a day at least. Anyone found a work around?

  • @3x10.8_ms
    @3x10.8_ms Před rokem +18

    youtube videos these days are becoming ads

  • @isaacbawangisah6096
    @isaacbawangisah6096 Před rokem +23

    Tim deserves more than 1.26 million subscribers. Fantastic job!

  • @enlyxtyrill6127
    @enlyxtyrill6127 Před rokem +12

    Wow! I was literally thinking of doing the same thing, but really had no idea it was that hard or that some many people have already tried the same thing and failed. Before I even started facing the problems, you gave me the solutions for free... So thank you!

  • @EuricoAbel
    @EuricoAbel Před 5 měsíci +1

    Zeus Proxy ensures anonymity and privacy while performing SEO tasks, enhancing security and reliability.

  • @pauljoseph3081
    @pauljoseph3081 Před rokem +4

    I've seen people exploiting online job portal sites like Upwork. They hire freelancers from the Philippines to scrap websites rather than building a program, and of course they *CHEAP AF* like we're talking about $2-3 per 100+ websites.
    Most of them charged $2-3 per hour which is absolutely disgusting.

  • @alainportant6412
    @alainportant6412 Před 9 měsíci +4

    Ok so it's an ad for a paid service.
    Fuck that, if I wanted a paid solution I would have looked for a paid solution already, not a video tutorial.

  • @cuturbs
    @cuturbs Před rokem +4

    I tought I had some good skills on web scraping, then you showed me I'm just a baby still. Loved the idea and the fact you gave it out for free

    • @charrow10
      @charrow10 Před rokem

      not free, it is just an ad

  • @jeremyyd1258
    @jeremyyd1258 Před rokem +4

    More uses:
    - job searches
    - combine with NLP to obtain live information regarding sentiment towards products or services

    • @coder_117
      @coder_117 Před rokem +1

      What's NLP?

    • @TauvicRitter
      @TauvicRitter Před rokem +1

      ​@@coder_117natural language processing. Analyses text

  • @3x10.8_ms
    @3x10.8_ms Před rokem +28

    if it is a million dollar idea then why you don't do this and show us your success...who are you fooling it's just an ad of bright data

    • @bengabp
      @bengabp Před rokem +4

      Calm down bruh

    • @bigpickles
      @bigpickles Před rokem +1

      Clowns like you have said the same thing for years, and still remain employees. If you think one man can take the whole pie, you're very, very mistaken.

    • @themindhelp9584
      @themindhelp9584 Před rokem

      Bahagaaaa😢😢😢
      😂😂😂😂😂oh ms goshhh

    • @TauvicRitter
      @TauvicRitter Před rokem +1

      Bright data is making millions

  • @alimihakeem841
    @alimihakeem841 Před 3 měsíci

    Thanks for this valuable content Tim. I found it helpful

  • @Faisal1504
    @Faisal1504 Před 8 měsíci +1

    Very helpful looking for data will check out bright. Thank you.

  • @salimtlemcani4122
    @salimtlemcani4122 Před 23 dny

    would be awesome to have a step by step tutorial on the app you have developed

  • @rafaelfernandez742
    @rafaelfernandez742 Před rokem +1

    Who is buying scraped data - companies would just build it ? or is the value proposition any community that wants/needs to know if data has changed and building some kind of tool to do that?

  • @philphillips907
    @philphillips907 Před rokem +2

    Tim, your videos look very professional.
    I see the gear/hardware you use. May I ask what software you use to record your screen and include a picture-in-picture of yourself?
    I did not see this info anywhere.
    Thanks in advance!

  • @tylerk2913
    @tylerk2913 Před rokem +7

    Great video ❤ one thing I noticed that might be helpful on your code is comments when you’re walking through what each part does. That way anyone who uses it from your GitHub doesn’t have to reference your video each time to hear what it does. Just a small detail I noticed. Otherwise, fantastic idea and project!

    • @ttpoh4688
      @ttpoh4688 Před rokem

      ChatGPT will do that for you.

  • @susanwojcickiisalonelyrath7311

    This is exactly what I was looking for - thank you!

  • @planktonfun1
    @planktonfun1 Před rokem +1

    I did this back then with food delivery services for example burger it will display the cheapest delivery and price from a pool of available service.

  • @mad-hat
    @mad-hat Před rokem +2

    Thanks for inspiring me to learn programming, you're really cool. This is a very cool and interesting project. ⌨💪🧑‍💻

  • @gurudaki
    @gurudaki Před rokem +4

    I find your content and expertise level brilliant. Are you self taught or You have a BS in IT or attended a bootcamp?

    • @TechWithTim
      @TechWithTim  Před rokem +12

      Self taught, but I did do 5 semester of a CS degree before dropping out (I have videos discussing that)

  • @bengabp
    @bengabp Před rokem +3

    I am planning on working on something like this, but I am from africa and i dont have too much knowledge of these GDPR laws and so on, could you make a video on that please?

  • @simplyeleza
    @simplyeleza Před 11 měsíci +1

    thanks for the video. I have been thinking of making something similar. Even bought a domain for it.

  • @flowbotgirl
    @flowbotgirl Před rokem

    Thank you! This is the replacement for the outwit browser I've been searching for!

  • @awakenwithoutcoffee
    @awakenwithoutcoffee Před 5 měsíci

    appreciated brother. I was actually building something very similar but yours is definitely better build. Do you recommend against doing this in Javascript ?

  • @guidoumfaroukomar6282
    @guidoumfaroukomar6282 Před rokem +7

    Man u gave us so many valuable informations,we appreciate it❤

  • @Ronaldograxa
    @Ronaldograxa Před rokem +2

    Wouldn’t it be illegal to do if the terms and conditions of the website tells you not to scrape it?

  • @Entertainmentexe
    @Entertainmentexe Před rokem +1

    Please keep project ideas video coming 🙏

  • @LalitaPaswan-yq6ll
    @LalitaPaswan-yq6ll Před rokem +1

    Sir i watch your all videos and they are very helpful thankyou to provide as like that informative content.....👌

  • @pedromonteiro8792
    @pedromonteiro8792 Před rokem +4

    Isn't Web Scrapping Illegal? Some time ago I had the idea to do a project just like this one and I ended up giving it up because I thought I was illegal. I read it on the website's terms and conditions.

    • @resresres1
      @resresres1 Před měsícem

      Web scraping is perfectly legal. What you do with the data may or may not be legal though.

  • @screweddevelopment12
    @screweddevelopment12 Před rokem +9

    I did something like this for a company I worked for. Setting up slowly changing dimensions tables in python is p hard when there aren’t primary keys for the data you’ve scraped.

    • @TannerBarcelos
      @TannerBarcelos Před rokem +2

      This is the issue I am in right now. I am scraping tons of data but I need to create the data model but I am lacking primary keys, etc.

    • @LGL1999
      @LGL1999 Před 11 měsíci

      @@TannerBarcelos Hey, did you figure it out?

    • @resresres1
      @resresres1 Před měsícem

      ​@@TannerBarcelosyou are scraping tons of data? Why would you need primary keys? Just collect the data you want, pass it into a pandas dataframe and give it whatever header names (keys) that you want

  • @kamilmarszakowski8694

    That's awesome, keep going bro

  • @dariocannistra3322
    @dariocannistra3322 Před 5 měsíci

    Hey Tim, you always have amazing content. Keep it up! Greetings from Italy!

  • @Autoscraping
    @Autoscraping Před 7 měsíci

    A wonderful video that we've used as a reference for our recent additions. Your sharing is highly appreciated!

  • @subhranshudas8862
    @subhranshudas8862 Před rokem

    This product would be a smash hit among Coupon Grandmas in Texas.

  • @pythoneatssquirrel
    @pythoneatssquirrel Před 11 měsíci +2

    The idea is cool(the frontend part) except the "pay us to run your code and provide you the data data your code got" part, but what if you would want to scrape like 100k distinct items? The frontend will become just a long scrollbar.

    • @pythoneatssquirrel
      @pythoneatssquirrel Před 9 měsíci +1

      I build a "free" scraper to this project that you can find in my videos.

  • @brianhauk8136
    @brianhauk8136 Před rokem +1

    How many pages can you scrape using this approach before Amazon blocks your scraper, and what's the best approach to avoiding detection by Amazon and other sites?

  • @melvinjisieike7889
    @melvinjisieike7889 Před 10 měsíci +2

    Please can anyone give a RESOURCE for data analyst who want to be able to code things like this. I honestly just need like a roadmap to understand how to even get to this level

    • @MeghModhaa
      @MeghModhaa Před 10 měsíci

      I am so down to collaborate!

    • @melvinjisieike7889
      @melvinjisieike7889 Před 10 měsíci

      Just saw your message I'm down to collaborate . Just getting started in this field need some tips on where to go@@MeghModhaa

  • @proxyscrape
    @proxyscrape Před 8 měsíci +1

    Don't forget the proxies while web-scraping 😄

  • @themindlesstruffle
    @themindlesstruffle Před rokem +8

    Would you consider a full length setup and code tutorial for this project or a similar one?

  • @AliAliOxenFree
    @AliAliOxenFree Před rokem +8

    Great video. scrapers + llm apps are going to dominate data very soon (if they aren't already doing this)

  • @quinndaisies6949
    @quinndaisies6949 Před rokem +5

    Funny enough I used PyQt5 to build a web scraper, I can scrape Amazon without getting block... But with this new information I can extend my web scraper a lot further to make a more complex project. Thank You Very Much Tim...

  • @viktorvegh7842
    @viktorvegh7842 Před 11 měsíci

    Great stuff Bro i was wondering how to do this for long time

  • @matiasbarrera6959
    @matiasbarrera6959 Před rokem +3

    Is there any ethical considerations?? Like legal problems based on the terms and conditions of the website and stuff like that??

    • @b_nnett
      @b_nnett Před rokem +1

      Morally, maybe. Legally, no issues.

    • @CyberKnightProbably
      @CyberKnightProbably Před rokem

      The Legality of web-scraping is still in the air. But whole thing dates back since 1993, when Google wasn’t invented and engines needed to search for web pages efficiently, like Jumpstation. So it is a real thing you could do, but there are acts you can check before trying it yourself.

    • @maksimluzin1121
      @maksimluzin1121 Před rokem

      If you have an open source content, legally it is 'free for use'. This is how the OSI (Open Source Intelligence) is working. By the way, I've read one report few years ago, the ~70% of information the CIA is gathering, actually, is the Open source content in the Web... Think about it... ;-)

    • @TauvicRitter
      @TauvicRitter Před rokem

      It depends. If you are small and the website you scrape cannot afford lawsuits then no problem

  • @user-mz8rz9lk8n
    @user-mz8rz9lk8n Před rokem

    What language would be the best to learn to create an app? Java? C#? Which one should I choose? Also, I want to know the prospects of each language. Can you tell me?

  • @royem9511
    @royem9511 Před rokem

    the fact he started with little subscriber and how young he was 5 years ago I made my first project thanks to him time flies

  • @RolandoLopezNieto
    @RolandoLopezNieto Před 8 měsíci

    Super helpful, thanks

  • @felixinit
    @felixinit Před rokem

    Amazing project scraping the web

  • @Grinwa
    @Grinwa Před rokem +4

    The biggest question is where to sell how to build trust
    I already made few tools and selling them but i have no idea how to sell web scraping data or tools
    Also thanks ❤ that's a good starting point
    Edit: 9:02 after seeing this i remember i made something similar but it scraps anime each one with it episodes and each episode with stream url and title desc... and then display and stream anime with electron ui but i never finished it cuz i have no idea how to make ui with js

  • @learner8053
    @learner8053 Před rokem +1

    Can you please build a live streaming web app & show us?

  • @zeburgerkang
    @zeburgerkang Před 8 měsíci +1

    I'm trying to scrap documents from a website with multi layers, so I can use those files with a logical chat bot to reference what I'm looking for, please any help will be appreciated, thanks in advance.

  • @Virtual869
    @Virtual869 Před rokem +1

    very great vedio like every time

  • @HappyAnimals3D
    @HappyAnimals3D Před rokem

    Thanks Tim. Any plan for a detailed tutorial?

  • @sacrista
    @sacrista Před 8 měsíci +1

    bear in mind this is illegal in most cases, if you try to start a saas doing this you most probably end up broke if the source of your scraping is copyright protected

  • @Militarychannel12
    @Militarychannel12 Před rokem

    Hey i watched some of your videos and i like what you do here ... i was thinking may be if you could do a tutorial on how to Develop an API with Python or JavaScript

  • @aleksatanasov8793
    @aleksatanasov8793 Před rokem +13

    But Tim you know what the problem is. When a Co website have T&C that does not allow machine access to their website and your scraper crashes it then they can come after you for damages if they lost a lot of money. And you are the guy who sold this piece of software to the user. Imagine you sold dozens of such scrapers doesnt this put your legal risk very high?
    As freelancers we need to be very careful about the legal repercussions of our work.

    • @Ironpants57
      @Ironpants57 Před rokem +8

      If you don't want to be apart of something that can be a legit legal issue. Do what I do, steer away from it. Use API's instead so you and/or your company can safely exist without getting sued in the future.

  • @demaxl732
    @demaxl732 Před rokem +1

    I'm definitely adding this one to my resume

  • @ChristopherPhillips
    @ChristopherPhillips Před 3 měsíci +3

    This is just an advert

  • @heliahaghighi3597
    @heliahaghighi3597 Před rokem +1

    Hey, thanks for the great content!
    Guide me please, what is the road map for learning the tools to understand what you have done in this project and be able to do it on my own?

    • @MARTIN-101
      @MARTIN-101 Před 10 měsíci

      you can think of a project of your own. and try to make it. if you ran into problems than search on internet.
      but these kinds of projects are tricky. so you need solid advice. and one thing you do will not work after few months. 😂

    • @MARTIN-101
      @MARTIN-101 Před 10 měsíci

      one time i found a scraper on internet. but it was slow. so i made it 50x faster. you can think of it like this too. find someting and than optimize it.

  • @samhhhhh
    @samhhhhh Před rokem +5

    Oh so it's just an ad. Cool

  • @mandongpedroza7486
    @mandongpedroza7486 Před 5 měsíci

    good day tim, i just want to mention and ask permission that i will extend this base code. for now i made a private repo and soon on the right time i will make it public and will credit you. thanks

  • @wmelon3392
    @wmelon3392 Před rokem

    thank you my king this sounds so cool

  • @smosa605
    @smosa605 Před rokem +2

    why not use requests and bs4 instead of flask? Or is there some performance benefits.

    • @flimdejong2030
      @flimdejong2030 Před rokem +1

      I believe bs4 only works for raw html code. Many websites use Dynamic code like Js. Then it doesn't work

  • @MARKOTHEDEV
    @MARKOTHEDEV Před rokem +1

    I love this dude deeply this was the reason I started learning code❤ now some one is give me a repo

  • @mgusa9372
    @mgusa9372 Před rokem

    Tech With Tim, dude, big fan, suggestion, get a ton of sleep like a TON like 24 hours then eat and take a shower then a really cold shower, and then go get a professional picture made or just use your camera and then use that pic as your profile pic cause on your profile pic u look like you're going to pass out from too much hard work too many hours (which I understand being a techie myself) your eyes are giving you away. Love your stuff man, thank you tons, keep it up PLEASE, TKX!

  • @JTJaniceTek
    @JTJaniceTek Před 9 měsíci

    hello Tim, is it possible to make a video on how to download the newest version of chrome driver, been stucked on it for days... i wanna crai

  • @adamdived6456
    @adamdived6456 Před rokem +1

    I saw the same idea before in another video but in puppeteer -_- a stupid ad

  • @derrickpham7987
    @derrickpham7987 Před rokem

    Thanks for info

  • @theDemolisher13
    @theDemolisher13 Před 5 měsíci

    Just wondering haven't played with the program yet. But does the program also record prices differences if items have multiple versions example Red rain coat vs the same rain coat but it's Blue and cost $1 more.

  • @naidolkusa
    @naidolkusa Před rokem +1

    You remind me of that guy who made a song I wanna be a millionaire so f, baad

  • @amirt6063
    @amirt6063 Před rokem +2

    Hello tim
    I have a problem because I asked for a code for the enter button (when you search in Google and when you press the enter button, it gives you the result) Thank you for helping me.

  • @renrenklein_th
    @renrenklein_th Před rokem +4

    Always love seeing your videos really helped me when I was starting out and still a great source for learning new things here and there.
    Also I know your go to tends to be python and that you have done content with SQL and APIs with Python but in terms of teaching new programmers especially those that come to work on my team the hardest concept to teach is just the overarching idea of how The web in my case that's usually an Angular Project with TS interacts with API (Endpoints etc) and how that API links to our SQL database etc basically just teaching how it all connects from an oversight perspective. I think a video like that would be helpful to newcomers obviously the individual pieces are necessary to know but how they work together is important to ^_^
    Regardless Keep up the Great vids :)

  • @cmd_processor9168
    @cmd_processor9168 Před 4 měsíci

    Thanks for this video. How did you bypass Playwright default timeout? It sees that Playwright has a default timeout of 3000ms regardless of setting await in line 106 to 60000 or even Wait=commit

  • @tecTitus
    @tecTitus Před rokem +2

    just a Brightdata ad and python is extremely slow

  • @satwikawasthi2002
    @satwikawasthi2002 Před rokem

    Can we run whole project without brightdata functionality for website which do not blocks or medium level security in it?

  • @killua7270
    @killua7270 Před rokem

    Can you tell, what thing did you do to monetize through webscraping

  • @terenceundbud
    @terenceundbud Před 7 měsíci

    a 20 year old idea that had been exploited to the maximum already. i myself used it to steal real time data in commercial goods trading. dont want to get specific to avoid saturation in that field next to the legal issues.

  • @KrAsHeDD
    @KrAsHeDD Před rokem

    Thank you

  • @ryanhoughton7415
    @ryanhoughton7415 Před 10 měsíci

    I can build a web scraper in Python selenium that spits data into a csv. But where could I go to learn to do something like this?

  • @PRIVAT_USER
    @PRIVAT_USER Před rokem

    Refined marketing approach... Brilliantly done

  • @Alldeveloper2356
    @Alldeveloper2356 Před 7 měsíci

    Do you have a video of web scraping data using javascript and nodejs?

  • @coddude3284
    @coddude3284 Před rokem

    Mr Tim is automation with java a a good thing ?
    wich better java or python for automation

  • @ordep.dutra1454
    @ordep.dutra1454 Před rokem

    Thanks for sharing this project with us !!! 👍