🛑 How to Scrape UNSCRAPABLE data! (super simple!) Node.js + API

  • Added on 26 Aug 2024
  • Click here to follow along: brightdata.grs...
    ⭐ Sign up for my Full Stack Developer Course: www.codewithan...
    ⭐ Check out my IDE here and get 1 month free: jb.gg/get_webs...
    ⭐ New to code and none of this is making sense? Watch my '12hr+ YouTube Coding Bootcamp' in which you will learn HTML, CSS and JavaScript Fundamentals completely from scratch. It's on my channel and it's 100% free.
    ⭐ In most videos I use Tabnine as my A.I autocompletion tool. You can download it for free here (I get no commission from this link, but am in a partnership): bit.ly/tabnine-...
    ⭐ You can get a blockchain domain with my affiliate link here: bit.ly/get-a-cr...
    ⭐ If you would like to buy me a coffee, well thank you very much that is mega kind! : www.buymeacoff...
    ⭐ Sign up for weekly coding tips from my newsletter partnership: bit.ly/JS-tips
    You can also find me on:
    Twitter: / ania_kubow
    Instagram: / aniakubow

Comments • 106

  • @Hobbitstomper
    @Hobbitstomper 1 year ago +10

    The best solution imo is NodeJS + Puppeteer + puppeteer-extra-plugin-stealth plugin.
    It's free, doesn't rely on any 3rd party APIs and works 100% to avoid cloudflare blocking and other captchas. You can even log into any website, even if it uses OAuth for Google, Facebook, Amazon, Microsoft, Twitter, Apple etc.
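
A minimal sketch of the setup this comment describes, assuming the `puppeteer`, `puppeteer-extra` and `puppeteer-extra-plugin-stealth` packages are installed from npm. The function name `fetchRenderedHtml` is made up for illustration, and whether a given site's bot protection is actually bypassed is not something this sketch can promise.

```javascript
// Sketch only: assumes puppeteer, puppeteer-extra and
// puppeteer-extra-plugin-stealth are installed in the project.
let stealthRegistered = false;

async function fetchRenderedHtml(url) {
  // Required lazily so that loading this file does not need the packages.
  const puppeteer = require('puppeteer-extra');
  const StealthPlugin = require('puppeteer-extra-plugin-stealth');

  // Register the stealth plugin once, before the first launch.
  if (!stealthRegistered) {
    puppeteer.use(StealthPlugin());
    stealthRegistered = true;
  }

  const browser = await puppeteer.launch({ headless: true });
  try {
    const page = await browser.newPage();
    // Wait until network activity has mostly settled so that
    // client-side content has a chance to render.
    await page.goto(url, { waitUntil: 'networkidle2' });
    return await page.content(); // HTML after scripts have run
  } finally {
    await browser.close(); // always release the browser process
  }
}
```

Usage would look like `fetchRenderedHtml('https://example.com').then(console.log)`, where the URL is a placeholder.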

  • @dystopian_1
    @dystopian_1 1 year ago +14

    See you in 3 days, Mother of Dragons.

  • @joelapablaza7722
    @joelapablaza7722 1 year ago +5

    So... if you want to scrape a dynamic website, you just go to the sponsor of this video... really?

    • @atlantic_love
      @atlantic_love 10 months ago +1

      Yeap, she's just getting subscribers off her looks, and using these stupid sponsors as her "content". I disliked this video, and another one. In watching the previous one I couldn't figure out whether she just can't type or she doesn't really know what the heck she's talking about.

  • @silversolver7809
    @silversolver7809 1 year ago +7

    Ania, FYI, in case it might affect the algorithm: "unscrappable" should only have one 'P', i.e. "unscrapable" :)
    Also, not sure it's a common word.

    • @aniakubow
      @aniakubow 1 year ago +1

      You make a very good point! Thanks for having my back 🙌🙌🙌. What do you think would be a better title?

    • @silversolver7809
      @silversolver7809 1 year ago +2

      @@aniakubow You're welcome :)
      Ideally put the most relevant words first, as it's likely algos will regard those as more important than later words. So instead of, say, "Make your videos better on YouTube", you should have "YouTube videos: improve yours". Also, it's usually better to use positive words rather than negative ones, e.g. "You will win" > "You can't lose". So:
      Scrape ALL Data
      Scrape EVERYthing
      Scrape and Catch ALL Data
      Emphasize ALL and EVERY, because that's the unique point of this video. If everything is in CAPS, then nothing is emphasized.

  • @marekr.9339
    @marekr.9339 1 year ago +4

    Greetings from Poland, and I wish you continued success in growing the channel!

  • @gaia2933
    @gaia2933 1 year ago

    Although there are many software solutions for automating and extracting data from a website, using NodeJS and its library ecosystem remains the most flexible option offering endless possibilities.

  • @joseluisperez5137
    @joseluisperez5137 1 year ago +1

    A small note: at 7:08, use the -D flag when installing nodemon, since nodemon is only needed for development in this example.

    • @joseluisperez5137
      @joseluisperez5137 1 year ago +1

      Another note: as the documentation says, Cheerio is not a web browser instance. It only parses the HTML it is given, so if we do this on websites that don't do any server-side rendering, we will be missing some information, since it may be loaded afterwards by external sources such as additional scripts or API calls.

    • @producdevity
      @producdevity 1 year ago

      This sticknote is more than just a note. It’s the difference between pulling your hair out and understanding right away why some values are populated and some are not.
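
The point made in this thread can be illustrated without any scraping library. The sketch below uses a hypothetical page whose product list is filled in by a script, and a small regex helper (`innerHtmlById`, made up here) as a stand-in for an HTML parser. It is only meant to show why a parser that does not execute JavaScript sees an empty container.

```javascript
// Hypothetical HTML exactly as the server sends it: the list is empty,
// and a script would fill it in later inside a real browser.
const serverHtml = `
<html><body>
  <h1 id="title">Products</h1>
  <ul id="products"></ul>
  <script>
    fetch('/api/products').then(/* render list items into #products */);
  </script>
</body></html>`;

// Tiny stand-in for an HTML parser: returns the inner HTML of the element
// with the given id, or null if there is none. A regex is enough for this
// fixed snippet; real pages need a real parser.
function innerHtmlById(html, id) {
  const pattern = new RegExp(`<(\\w+)[^>]*id="${id}"[^>]*>([\\s\\S]*?)</\\1>`);
  const match = html.match(pattern);
  return match ? match[2].trim() : null;
}

// Server-rendered content is present in the raw HTML...
console.log(JSON.stringify(innerHtmlById(serverHtml, 'title')));    // "Products"
// ...but the script never runs, so the list stays empty.
console.log(JSON.stringify(innerHtmlById(serverHtml, 'products'))); // ""
```

When the second value comes back empty, the usual options are to call the underlying API directly or to render the page in a headless browser first.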

  • @shrikantjha5630
    @shrikantjha5630 1 year ago

    Yes, I have also used cheeriojs with react native as an experiment and it worked well.

  • @briandsouza7854
    @briandsouza7854 1 year ago +17

    I think scheduling the premiere 24 hours ahead would be better. This long wait feels annoying!

    • @StephenChapman
      @StephenChapman 1 year ago +2

      It doesn't have to feel annoying. Just tap/click the notify button, then put it out of your mind and move on to thinking about literally anything else in the world.

    • @christian-schubert
      @christian-schubert 1 year ago

      I'll second that. I'd even go as far as calling all those announcements years in advance spam; it literally made me unsubscribe from this channel.
      Now, that's not to say that the content itself isn't of high quality. Ania is a real gem - I keep checking back occasionally. 👍

  • @irobot8297
    @irobot8297 1 year ago

    Would love to see your computer setup, your desk, keyboard, chair etc. :)

  • @DevMadeEasy
    @DevMadeEasy 1 year ago +2

    Great content, as usual. Thank you so much for sharing it with us; I know how hard it is to build a project, then edit it, post it...
    Thanks🙏

  • @hazemelbatawy1242
    @hazemelbatawy1242 1 year ago

    Great job! I followed your steps and it really was fantastic. I am a data scientist and you impressed me. God bless you, and if you need anything like machine learning, I am working on algorithms.

  • @ROVAKAN
    @ROVAKAN 1 year ago +2

    In the last video you used the axios + express modules, but when I tried it in React the result was CORS errors. Maybe this video will cover that kind of error, and maybe proxy setups too.

    • @aniakubow
      @aniakubow 1 year ago

      I hope it solves your issues too :)
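
For context on the CORS question above: browsers block cross-origin responses that lack the right headers, while a Node.js server can fetch the same URL freely. One common workaround is to route the request through your own small backend. The sketch below uses only Node's built-in `http` module and the global `fetch` (Node 18+); `corsHeaders`, `createScrapeProxy`, the target URL and the port are all hypothetical names and values chosen for illustration, not the setup from the video.

```javascript
const http = require('node:http');

// Headers that tell the browser which front-end origin may read the response.
function corsHeaders(allowedOrigin) {
  return {
    'Access-Control-Allow-Origin': allowedOrigin,
    'Access-Control-Allow-Methods': 'GET, OPTIONS',
  };
}

// Build (but do not start) a server that fetches targetUrl on the browser's behalf.
function createScrapeProxy(targetUrl, allowedOrigin) {
  return http.createServer(async (req, res) => {
    // Answer CORS preflight requests without contacting the target site.
    if (req.method === 'OPTIONS') {
      res.writeHead(204, corsHeaders(allowedOrigin));
      res.end();
      return;
    }
    try {
      const upstream = await fetch(targetUrl); // global fetch, Node 18+
      const body = await upstream.text();
      res.writeHead(upstream.status, {
        ...corsHeaders(allowedOrigin),
        'Content-Type': upstream.headers.get('content-type') || 'text/html',
      });
      res.end(body);
    } catch (err) {
      // The target site could not be reached.
      res.writeHead(502, corsHeaders(allowedOrigin));
      res.end('Upstream request failed');
    }
  });
}

// Example (not run here):
// createScrapeProxy('https://example.com', 'http://localhost:3000').listen(8000);
```

The React app would then request `http://localhost:8000` instead of the target site, so the browser only ever talks to a server that sends the CORS headers.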

  • @drucifer6
    @drucifer6 1 year ago +4

    Amazing content! I'd be curious how to scrape/store data in a database and use that for my own frontend.

  • @PySnek
    @PySnek 1 year ago

    You're the best Ania! Thank you so much!

  • @BreakfastCupNoodles
    @BreakfastCupNoodles 10 months ago

    The problem with a managed one is the cost. With a custom one, you can pay as little as $19/month for 100,000 pages. It's also not hard to scale.

  • @Crakkovia
    @Crakkovia 1 year ago

    Good work, Ania!

  • @SamLinnett
    @SamLinnett 1 year ago

    Nice one Ania - this is really great

  • @ClimbHighWithAI
    @ClimbHighWithAI 6 months ago

    I need to do this with more than 5,000 products, and I also need the description, price, etc. How can I do it?

  • @jesusmoran1356
    @jesusmoran1356 1 year ago +1

    Didn't work for me

  • @paulthomas1052
    @paulthomas1052 1 year ago +1

    Thanks - very useful as usual :)

  • @philipbengtsson2186
    @philipbengtsson2186 1 year ago

    After viewing this video it would be interesting to see what we can do to prevent others from scraping our own website projects. 😅

  • @Bot-kl1gs
    @Bot-kl1gs 1 year ago +4

    Hey Ania, can you also include the part where you store the fetched data in a database (like MongoDB) and then show it to the user? It would be a great help OwO OwO

    • @avenazpk
      @avenazpk 1 year ago

      Supabase is the best choice

  • @thunde7226
    @thunde7226 1 year ago

    That was great Ania.....................take care ........................:) bye

  • @NAHChannel
    @NAHChannel 1 year ago +1

    Great video! I've had a better experience scraping with XPath expressions instead of class names on sites that generate their class names dynamically. But it seems to come down to the content being scraped. Scraping with CSS selectors also seems to be faster.
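
One way to read the comment above: when part of a class name is generated, match on the stable part only. The sketch below builds an equivalent CSS selector and XPath expression from such a fragment. The helper name `selectorsForClassFragment` and the class name `ProductCard_price__x7Yz9` are hypothetical examples, not taken from the video.

```javascript
// Build selectors that match on a stable fragment of the class attribute
// instead of the full, generated class name.
function selectorsForClassFragment(fragment) {
  return {
    css: `[class*="${fragment}"]`,
    xpath: `//*[contains(@class, "${fragment}")]`,
  };
}

// Hypothetical case: the site renders class="ProductCard_price__x7Yz9",
// where only the "__x7Yz9" suffix changes between builds.
const { css, xpath } = selectorsForClassFragment('ProductCard_price');
console.log(css);   // [class*="ProductCard_price"]
console.log(xpath); // //*[contains(@class, "ProductCard_price")]
```

Both forms are substring matches, so they can also hit unrelated classes that happen to contain the fragment; a longer fragment narrows that down.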

  • @richardmasters2045
    @richardmasters2045 1 year ago

    Thank you so much Ania 🥰

  • @iankila5618
    @iankila5618 1 year ago

    Ty for these tutorials!

  • @mgusa9372
    @mgusa9372 1 year ago

    Killer look, light pink that's definitely you. Scrap 'UNSCRAPPABLE' data yeah I'm in, I'll be back, spoken in an Arnold hillbilly German accent. Love your stuff GO Ania.

  • @hussainnajaf332
    @hussainnajaf332 1 year ago

    Which language do you use, and why do you use it?

  • @Erwin_t
    @Erwin_t 1 year ago

    WOW amazing tutorial! I love your style and your approach. I am starting web development. I want to learn Vanilla JS your way. What is the best practice to learn and retain the methodology of JS? Please help :)

  • @joelayoub2774
    @joelayoub2774 1 year ago

    Looking forward to this.

  • @2ru2pacFan
    @2ru2pacFan 1 year ago +1

    Hey Ania, do you know how to scrape websites blocked by Cloudflare? X

  • @golgappayadav1864
    @golgappayadav1864 6 months ago

    Can you make a Myntra scraper video?

  • @coderizer
    @coderizer 1 year ago

    great video, keep up the good work

  • @halowarstier3147
    @halowarstier3147 1 year ago

    I am stuck on npm init, not sure how to follow instructions. Please help

  • @hassaneoutouaya
    @hassaneoutouaya 1 year ago

    Thank you so much !

  • @srishimalah9561
    @srishimalah9561 1 year ago

    @Code with Ania Kubów, hi, your battleship video is unavailable. Can you please look into it? The unavailable video is part 1, while parts 2 and 3 are working. I am trying to study the game logic, and it would be very helpful if you could re-upload the video. Thank you.

  • @js_models
    @js_models 1 year ago

    I wish there was an npm Ania command, because she is the total package. 😉

  • @miguelbcn
    @miguelbcn 1 year ago

    I'm trying to do the same with Twitter to get the tweets from any user, and it seems impossible. Could you help me?

  • @rachest
    @rachest 1 year ago

    I need you as my technical partner

  • @aleksandrkobelev8868
    @aleksandrkobelev8868 1 year ago

    I love you so much! You are the best!)

  • @AMoktar
    @AMoktar 1 year ago

    Amazing you are ❤

  • @christiandanielmoralesagui4659

    See you soon Teacher

  • @pastuh
    @pastuh 1 year ago

    At such times I would say... AI must understand what to scrape.

  • @godswillhycinth9809
    @godswillhycinth9809 9 months ago

    Thanks for the video, can this also scrape out Instagram HTML content?

  • @urosjovicic3988
    @urosjovicic3988 1 year ago

    I came here to learn, instead i fell in love :D

  • @startupshorts
    @startupshorts 1 year ago

    Hi
    Can you please explain how to scrape emails from LinkedIn?

    • @aniakubow
      @aniakubow 1 year ago

      I think the video should help with that :)

  • @thefeelingofunfair4052

    Do you have your series 7 ?

  • @techwithulises
    @techwithulises 1 year ago

    I don't want it, I need it

  • @albaraasaad4498
    @albaraasaad4498 1 year ago

    Thank you for the great content. I have a request because I've been searching all over to find a good explanation on how to scrape pages that have a load more button - NOT DIFFERENT PAGES - using Cheerio and Puppeteer. I can scrape a page when it's auto-loading when scrolling down but still couldn't make it by clicking the load more button😭.
    Thank you.

    • @qualitytransportation
      @qualitytransportation 1 year ago

      Just click it with puppeteer then load with cheerio

    • @albaraasaad4498
      @albaraasaad4498 1 year ago

      @@qualitytransportation I know that it should click, but whenever I try, it's not working. I mean Puppeteer will not click the load more button. I did locate the button to click, but I don't know why it's not working.
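
A rough sketch of the "click it with Puppeteer, then load with Cheerio" suggestion from this thread. It assumes `page` behaves like a Puppeteer `Page` (it relies only on `page.$()`, `elementHandle.click()` and `page.content()`), and the function name, the `.load-more` selector and the fixed delay are placeholders. It does not diagnose why the click failed in the case described above.

```javascript
// `page` is expected to behave like a Puppeteer Page: it needs $() and
// content(), and the handles returned by $() need click().
async function loadAllThenGetHtml(page, buttonSelector, { maxClicks = 20, waitMs = 1000 } = {}) {
  for (let clicks = 0; clicks < maxClicks; clicks++) {
    const button = await page.$(buttonSelector); // null once the button is gone
    if (!button) break;
    await button.click();
    // Crude pause so the newly requested items have time to render.
    await new Promise((resolve) => setTimeout(resolve, waitMs));
  }
  // Full HTML after all clicks, ready to hand to cheerio.load(html).
  return page.content();
}

// Example with a real Puppeteer page (not run here):
// const html = await loadAllThenGetHtml(page, '.load-more');
// const $ = require('cheerio').load(html);
```

The `maxClicks` cap is there so a button that never disappears cannot loop forever.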

  • @1A_B_C1
    @1A_B_C1 1 year ago +1

    Queen 👸

  • @screendice4107
    @screendice4107 1 year ago

    Ma'am, I am waiting. Why didn't you list this video at the top?

  • @balakumar.n4891
    @balakumar.n4891 1 year ago

    How to scrape Formula 1 data ?

    • @aniakubow
      @aniakubow 1 year ago

      This video should help I think :)

  • @socar-pl
    @socar-pl 1 year ago +1

    >How to scrape data
    >Use paid service that sponsors this video
    ayyyyyyyyyy lmao

    • @aniakubow
      @aniakubow 1 year ago

      I show two ways to do it so you can choose :)

  • @screendice4107
    @screendice4107 1 year ago

    Thanks mam

  • @roostermarques3583
    @roostermarques3583 1 year ago

    I think someone is trolling off your comments.

  • @dystopian_1
    @dystopian_1 1 year ago +2

    Titanic was lost in your bright eyes.... lovely, lovely you...

    • @Qasim6
      @Qasim6 1 year ago

      😜

    • @trolley2327
      @trolley2327 1 year ago

      I think there is a whole generation of programmers in love with her :))

  • @luissosa7685
    @luissosa7685 1 year ago

    Hello 👋

  • @kacperkepinski4990
    @kacperkepinski4990 1 year ago

    How did you get that accent?

  • @ashwinr8317
    @ashwinr8317 1 year ago

    hi ania

  • @jeanmi8184
    @jeanmi8184 1 year ago

    how to scrape your ❤

  • @i-am-your-conscience
    @i-am-your-conscience 1 year ago

    I am almost embarrassed to admit on how much easier it is to learn such stuff when your teacher is just smokin' hot :D
    besides being an amazing teacher already, dont get me wrong :)

  • @VesuviusAntaria
    @VesuviusAntaria 1 year ago

    Hi Ania! 🙂🌸🏵🌹🌺🌼🌻🌷

    • @VesuviusAntaria
      @VesuviusAntaria 1 year ago

      Thank you for your kiss! You have made my day! 🙂🌺

  • @lowlufi
    @lowlufi 1 year ago

    Lovely

  • @gileneusz
    @gileneusz 1 year ago

    You're really putting in the effort ;) Scraping is a tough business... I've been playing around with Diffbot myself lately

  • @michaelallen1154
    @michaelallen1154 1 year ago

    I'm here to learn. 🙄

  • @abhishekkaith1686
    @abhishekkaith1686 1 year ago

    I personally use jsdom, don't know why lol

  • @atlantic_love
    @atlantic_love 10 months ago +1

    Nothing but a sponsor video.

  • @__--JY-Moe--__
    @__--JY-Moe--__ 1 year ago

    🥳

  • @---fq2kd
    @---fq2kd 1 year ago

    You're great!

  • @desi_vlogs005
    @desi_vlogs005 1 year ago

    👍

  • @wgalloPT
    @wgalloPT 1 year ago

    I wish so much I had a girlfriend just like you...smart, beautiful and a coder!!

  • @yobi3d
    @yobi3d 1 year ago

    Как обычно, все "очень просто"! ) Как её смотреть то? Стояк мешает )

  • @AADJgroup
    @AADJgroup 1 year ago

    😱😇

  • @richardmasters2045
    @richardmasters2045 1 year ago +2

    My cyber girlfriend the smartest woman I know. You have my undying love, respect and devotion 🥰 I can't wait seriously on the edge of my seat 🤓

  • @illegalsmirf
    @illegalsmirf 1 year ago

    Using PHP or Perl?

  • @code.design
    @code.design 1 year ago +2

    SCRAPE ME! Do you have an OF?

  • @user-mh1km3mv7z
    @user-mh1km3mv7z 9 months ago

    um,
    update the old video so that it actually works
    then do this
    christ, I'd like to do your projects but I don't know this node.js technology for new versions!

    • @aniakubow
      @aniakubow 9 months ago

      You can change the version of node.js to the one I am using in the video. Just check the package.json for the version :)