Crawl4AI - Crawl the web in an LLM-friendly Style
- Published 16. 05. 2024
- Welcome to the detailed walkthrough of Crawl4AI v0.2.0! 🚀
In this video, I'll dive deep into the code base of Crawl4AI, our powerful web crawling tool designed for AI enthusiasts and developers. We'll explore all the new and exciting features that make this release a game-changer:
- 🕷️ Efficient web crawling to extract valuable data from websites
- 🤖 LLM-friendly output formats (JSON, cleaned HTML, markdown)
- 🌍 Supports crawling multiple URLs simultaneously
- 🌃 Replace media tags with ALT
- 🆓 Completely free to use and open-source
- 📜 Execute custom JavaScript before crawling
- 📚 Chunking strategies: topic-based, regex, sentence, and more
- 🧠 Extraction strategies: cosine clustering, LLM, and more
- 🎯 CSS selector support
- 📝 Pass instructions/keywords to refine extraction
I explain all these features in detail in the video. No API key, signup, or other boring stuff required! 🌐
Check out the repo: [Crawl4AI on GitHub](github.com/unclecode/crawl4ai)
If you find this tool useful, please star the repo and leave a comment! Your feedback helps us improve and support the project.
Follow me on Twitter (X) for updates on my research on function-calling for LLMs and AI agents: x.com/unclecode
I appreciate your feedback and thoughts on this project.
#Crawl4AI #WebCrawling #AI #LLM #Colab #WebScraping #OpenSource #GitHub #OpenSourceAI - Science & Technology
Love how excited you are about your project! Keep it up, man! Great project.
Thanks! Will do!
Very useful project, I must admit! Is it a recursive crawler? When I say recursive, I mean it (not restricted to a depth threshold). Also, how different is this from FireCrawl in terms of functionality and other aspects? I can't wait to get started using this project and give it a shot! Thanks!
Looks exciting. Have you considered a nix script?
Hey man. I'll be honest: I'm new to data scraping and wanted to ask if Crawl4AI can be used to scrape data from TikTok. They have implemented some harsh measures with request rate limits and login requirements. From what I saw, Crawl4AI has a login feature, but I just wanted to ask if I'm going in the right direction. Otherwise, it looks great.
Possible to put up a prebuilt Docker image, including the models? I had a problem downloading the models during the Docker build. Thanks!
I will work on that. I'm also trying to provide a version without the model dependency.
Really cool man! Can I crawl all accessible subpages from a main page? So I crawl 2 levels in total?
You can send multiple links, so first crawl the main page, then get its links and send them again. However, I will soon release the ability to set the depth and get a nicely structured result for that.
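That two-step pattern (crawl the main page, collect its links, crawl those) can be sketched with a small standard-library link extractor. The crawl calls themselves are omitted here; in practice the `html` string would come from the crawler's result for the main page:

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

class LinkExtractor(HTMLParser):
    """Collect absolute hrefs from <a> tags, resolving them against a base URL."""
    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(urljoin(self.base_url, value))

def extract_links(html, base_url):
    parser = LinkExtractor(base_url)
    parser.feed(html)
    return parser.links

# html would normally be the main page's crawl result:
html = '<a href="/docs">Docs</a> <a href="https://example.org/x">X</a>'
print(extract_links(html, "https://example.com"))
# ['https://example.com/docs', 'https://example.org/x']
```

The returned links are what you would pass back into the crawler for the second level.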
WHAT HAPPENED TO THE FLUTE UNCLE CODE
Hahahaha!! Ok, ok, message received
I got a result object. How do I parse it?
Result is an object like this:

```python
class CrawlResult(BaseModel):
    url: str
    html: str
    success: bool
    cleaned_html: str = None
    markdown: str = None
    extracted_content: str = None
    metadata: dict = None
    error_message: str = None
```

So you can access these properties directly (`cleaned_html`, `markdown`, `extracted_content`), or dump the model into a Python dictionary using `result.model_dump()`.
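A minimal, self-contained sketch of reading those fields. It uses a local copy of the model above (with `Optional` added for strictness); the sample `CrawlResult` instance is a stand-in for what the crawler would actually return:

```python
from typing import Optional
from pydantic import BaseModel

# Local copy of the CrawlResult model shown above, for illustration only.
class CrawlResult(BaseModel):
    url: str
    html: str
    success: bool
    cleaned_html: Optional[str] = None
    markdown: Optional[str] = None
    extracted_content: Optional[str] = None
    metadata: Optional[dict] = None
    error_message: Optional[str] = None

# Stand-in for what the crawler would hand back:
result = CrawlResult(
    url="https://example.com",
    html="<html><body><h1>Hi</h1></body></html>",
    success=True,
    markdown="# Hi",
)

if result.success:
    print(result.markdown)    # the LLM-friendly markdown view
data = result.model_dump()    # the whole result as a plain dict
print(data["url"])
```

Fields you didn't populate (like `extracted_content` here) simply come back as `None` in the dumped dictionary.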