ScrapeGraphAI - REVOLUTION in WEB SCRAPING!!!

Sdílet
Vložit
  • čas přidán 29. 08. 2024

Komentáře • 28

  • @ShihChiehLee
    @ShihChiehLee Před 9 dny

    can you do a walkthrough to read a given url of news web page to come up with context of the news and come up with a summary through LLM, ideally using Ollama

  • @MrMoonsilver
    @MrMoonsilver Před 3 měsíci +5

    It extracted the titles, not the articles themselves.

    • @raunaqss
      @raunaqss Před 3 měsíci

      Thanks for writing this

    • @ShihChiehLee
      @ShihChiehLee Před 9 dny

      can you do a walkthrough to read a given url of news web page to come up with context of the news and come up with a summary through LLM, ideally using Ollama

  • @Bbyanar
    @Bbyanar Před měsícem +1

    Great video thx !

  • @userou-ig1ze
    @userou-ig1ze Před 3 měsíci

    Thanks for demo-ing the tool

  • @amandamate9117
    @amandamate9117 Před 3 měsíci

    this video tutorial was suprising beginner friendly

    • @TomsTechAcademy
      @TomsTechAcademy  Před 3 měsíci

      Thank you for your comment Amanda, and glad my video was useful for you :) I always try to make my videos as beginner friendly as possible

  • @iamrrricardo
    @iamrrricardo Před 2 měsíci

    Thank you so much for showing the demo! I still have a question:does scrapegraphai support full stack crawling? or it only works for one page

  • @aleksdeveloper698
    @aleksdeveloper698 Před 19 dny

    Can you scrape data from Google maps?

  • @SynergyOfTwo
    @SynergyOfTwo Před 2 měsíci +2

    I found ScrapeGraphAI pretty useless, it doesn't except for the simplistic examples.

    • @TomsTechAcademy
      @TomsTechAcademy  Před 2 měsíci

      It’s a great tool for people without Python / BeautifulSoup skills. For most cases which are a bit more complex, the conventional approach is probably the best ;)

  • @256Services
    @256Services Před 8 dny

    Video could have been more helpful if it was about scraping helpful marketing information like emails and phone numbers

    • @TomsTechAcademy
      @TomsTechAcademy  Před 4 dny

      Thanks for your comment. It’s impossible for me to make a video about every possible use case. Instead I try to make videos which can be easily applied to other scenarios. In this specific case, it’s just a matter of modifying the prompt and the url and you’re good to go 😉

  • @anishmanandhar1203
    @anishmanandhar1203 Před 3 měsíci

    whenever I install scrapegraphai I am hit by AttributeError: module 'pkgutil' has no attribute 'ImpImporter'. Did you mean: 'zipimporter'?
    [end of output]

    • @TomsTechAcademy
      @TomsTechAcademy  Před 3 měsíci +1

      Hi, I think I had this exact same error - are you running Python in a virtual environment? If you run Python 3.12 you can try switching to 3.11

  • @DuhaMasoodi
    @DuhaMasoodi Před 2 měsíci

    When i am trying this i get the error asyncio error cannot be called

    • @TomsTechAcademy
      @TomsTechAcademy  Před 2 měsíci

      Hi, thanks for your comment. I have seen this error before, is it possible you're running Python 3.12? If so, try running it in a venv with 3.11. Good luck! If you need more help, you can reach me on my Discord (link in description).

  • @zemmouripopodori
    @zemmouripopodori Před 2 měsíci +1

    i get api error

    • @TomsTechAcademy
      @TomsTechAcademy  Před 2 měsíci

      If you receive any error, feel free to join my discord (link in description) and post your question there 😀

  • @s6yx
    @s6yx Před 2 měsíci

    I just give chat gpt the html data as example. Give it what I want scraped and it’ll generate the code for me

    • @TomsTechAcademy
      @TomsTechAcademy  Před 2 měsíci

      Thanks for Sharing your expertise, that’s another way of doing it 😀

  • @metazoo4407
    @metazoo4407 Před 3 měsíci

    I keep getting this error "PermissionError: [Errno 13] Permission denied: 'wired.xlsx'" and I've even typed the code shown in your video exactly and still I get that error, any help would be greatly appreciated

    • @TomsTechAcademy
      @TomsTechAcademy  Před 3 měsíci +2

      Hi Meta, it looks like you're trying to write a file to a directory where you don't have enough permissions. Assuming you're using Windows, can you try the following?
      Navigate to your Desktop, right click on any of your files there. Click properties. Go to the tab "general" and read location. This should be something like: "C:\Users\thoma\Desktop" (where thoma is your username). Copy this location to VS Code. Now add \wired.xlsx and change all slashes for double slashes (you should now have something like this:
      df.to_excel("C:\\Users\\thoma\\Desktop\\wired.xlsx",index=False)
      Run the code again. Let me know whether this works for you. If not, feel free to join my Discord if you need more help (link in video description).

    • @metazoo4407
      @metazoo4407 Před 2 měsíci

      @@TomsTechAcademy Hi, I managed to get it to produce two results and now I'm having issues again, It is now giving me this message "Traceback (most recent call last):
      File "c:\scrapperai\main.py", line 26, in
      df = pd.DataFrame(result['pages'])
      ~~~~~~^^^^^^^^^" any help would be greatly appreciated!