How To Scrape Reddit & Automatically Label Data For NLP Projects | Reddit API Tutorial

  • Published 23 Jul 2024
  • In this tutorial I show you how to scrape Reddit with the Reddit API and automatically label the data for NLP projects. We use PRAW (the Python Reddit API Wrapper) to download the data and NLTK for sentiment classification, assigning positive, negative, and neutral labels.
    Get my Free NumPy Handbook:
    www.python-engineer.com/numpy...
    ✅ Write cleaner code with Sourcery, instant refactoring suggestions in VS Code & PyCharm: sourcery.ai/?... *
    ⭐ Join Our Discord : / discord
    📓 ML Notebooks available on Patreon:
    / patrickloeber
    If you enjoyed this video, please subscribe to the channel:
    ▶️ : / @patloeber
    Resources:
    Reddit API Setup: / apps
    PRAW: praw.readthedocs.io/
    PRAW submission docs: praw.readthedocs.io/en/latest...
    ~~~~~~~~~~~~~~~ CONNECT ~~~~~~~~~~~~~~~
    🖥️ Website: www.python-engineer.com
    🐦 Twitter - / patloeber
    ✉️ Newsletter - www.python-engineer.com/newsl...
    📸 Instagram - / patloeber
    🦾 Discord: / discord
    ▶️ Subscribe: / @patloeber
    ~~~~~~~~~~~~~~ SUPPORT ME ~~~~~~~~~~~~~~
    🅿 Patreon - / patrickloeber
    #Python
    Timeline:
    00:00 - Introduction
    01:20 - Part 1: Reddit API
    07:43 - Part 2: Label The Data
    ----------------------------------------------------------------------------------------------------------
    * This is an affiliate link. By clicking on it you will not have any additional costs, instead you will support me and my project. Thank you so much for the support! 🙏
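The pipeline the video describes (download headlines with PRAW, score them with NLTK's VADER, threshold the compound score into labels) can be sketched roughly as below. The credentials, subreddit name, and 0.2 threshold are placeholders, not values from the video:

```python
def fetch_headlines(limit=100):
    """Download hot-post titles via PRAW (third-party: pip install praw)."""
    import praw  # deferred import so the labeling helpers work without praw installed
    reddit = praw.Reddit(client_id="YOUR_CLIENT_ID",
                         client_secret="YOUR_CLIENT_SECRET",
                         user_agent="my-nlp-app by u/yourname")
    return [post.title for post in reddit.subreddit("politics").hot(limit=limit)]

def score_headline(headline):
    """Return VADER polarity scores (requires nltk and the vader_lexicon)."""
    import nltk
    from nltk.sentiment.vader import SentimentIntensityAnalyzer
    nltk.download("vader_lexicon", quiet=True)
    return SentimentIntensityAnalyzer().polarity_scores(headline)

def label_from_compound(compound, threshold=0.2):
    """Map a VADER compound score (range -1..1) to one of three labels."""
    if compound > threshold:
        return "positive"
    if compound < -threshold:
        return "negative"
    return "neutral"
```

The deferred imports keep the thresholding helper usable on its own; the threshold value is a common but arbitrary choice.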

Comments • 45

  • @NoIntroTutorials · 2 years ago

    MAGNIFICENT! I just needed the first part, getting the post titles. Thank you, man!

  • @dabunnisher29 · 3 years ago +7

    You are certainly one of my favorite Python Masters. I really needed to learn how to do this for stocks. Thank you sooooooo much! You are AWESOME!!!!!

    • @patloeber · 3 years ago +1

      Glad you like it :)

    • @dabunnisher29 · 3 years ago

      I looked all around today and I couldn't find how to search a subreddit by a keyword like "PLTR", get the results, and use the NLTK library. If anyone can help, I would appreciate it.
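A hedged sketch for the question above: PRAW's subreddit.search() accepts a query string, and an already-downloaded list of titles can also be filtered locally before NLTK scoring (the subreddit name and keyword below are just examples):

```python
def search_titles(reddit, query="PLTR", subreddit="stocks", limit=50):
    """Titles of posts matching `query`, via PRAW's built-in subreddit search."""
    return [post.title
            for post in reddit.subreddit(subreddit).search(query, limit=limit)]

def filter_titles(titles, keyword="PLTR"):
    """Keep only already-downloaded titles that mention the keyword (case-insensitive)."""
    return [t for t in titles if keyword.lower() in t.lower()]
```

search_titles expects a configured praw.Reddit instance; filter_titles works on any list of strings.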

  • @chasengonzales85 · 1 year ago

    This is really awesome, thank you for taking the time to put this together.

  • @dodgewagen · 3 years ago +5

    Thanks! Definitely, do more of these API consumption/analysis videos.

  • @carfromcars3679 · 11 months ago

    wowwww easiest tutorial to follow by FAR. thank you!!!!

  • @ElectroCoderEC · 1 year ago

    Woooow amazing. You saved my life. Very useful. Thanks a lot! :)

  • @moy92 · 3 years ago

    Thanks for doing this! I have been wanting to scrape Reddit for a while for exploratory analysis.

  • @Asianyoungman22 · 1 month ago

    Thank you very much, you saved my life and my dissertation for my master's degree.

  • @bthapa94 · 2 years ago +17

    Great video and well explained! How do you scrape ALL the posts for a certain time period? I am looking at a small subreddit and require a lot of data.

  • @catalina5382 · 2 years ago

    This is exactly what I wanted. I would like to know what modifications I have to make in order to get the headlines with the flair as well.

  • @paulsastre9833 · 10 months ago +1

    Thank you for this wonderful video. But how did you get the URL used in the beginning?

  • @varinderjitkaur3656 · 2 years ago

    Great video. I am trying to get the historical daily number of members on a subreddit. Is it possible using PRAW?

  • @tazrinkhan1297 · 3 years ago

    Thank you for this video. This is really helpful. I am trying to get data for a particular time period (March 2020–November 2020). Can you please tell me how to write the code for this?

  • @miaoinperth680 · 1 year ago

    Thanks so much for your video. Will you share the code on GitHub or somewhere?

  • @basemgoueli · 2 years ago +2

    I have a project I could use the help of someone of your caliber with. I want to determine the five stocks mentioned most frequently on Reddit's WallStreetBets page on a given day, from January 2022 to August 2022 (I have the CSV file for this). After that I want to take the five most commonly mentioned stocks based on number of days in the top 5 from the aforementioned analysis. I would like to plot the number of mentions of the given stock per day against its stock price for the designated time frame. Any help you can offer would be greatly appreciated.

  • @anny23108 · 3 years ago +1

    Could you do a tutorial for mining historical data as well? thank you

  • @wasgeht2409 · 2 years ago

    Hey, thanks for the video :) I read below that you are from Germany. I have a question: is it also possible to use LDA to assign incoming text messages to topic areas?

  • @prod.kashkari3075 · 3 years ago

    Wow push and praw!

  • @fernandosantos3576 · 3 years ago

    Yes, I would love it if you published a video on a complete project.
    Thank you.

  • @Probly · 2 years ago

    Do you know how to scrape in a specified time period, so I can compare sentiment towards a stock within r/wallstreetbets or r/investments against the historical stock price of the same period?

  • @blancaherrerosdetejada7160

    Is there a way to automatically scrape any new posts in a subreddit (without having to re-run the program)?

  • @mealone007 · 3 years ago

    Great video! Quick question: how do I scrape historical headlines with a date stamp?

    • @bitsinbytes9002 · 3 years ago

      The created_utc attribute will give you the Unix timestamp; then you just have to convert it. Getting historical headlines may be a little trickier, as the PRAW API allows you to iterate through the following "submission" listings: controversial, gilded, hot, new, rising, top.
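As the reply notes, a submission's created_utc is a Unix timestamp; converting it with Python's standard library might look like this (the example value is arbitrary, not from the video):

```python
from datetime import datetime, timezone

def submission_date(created_utc):
    """Convert a PRAW created_utc Unix timestamp to an aware UTC datetime."""
    return datetime.fromtimestamp(created_utc, tz=timezone.utc)

when = submission_date(1609459200.0)  # 2021-01-01 00:00:00 UTC
```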

  • @prajjwalsinha1187 · 11 months ago +1

    How do I scrape comments from Reddit posts?

  • @limjuroy7078 · 3 years ago

    Why is the user_agent not "Example"?

  • @gsom2000 · 3 years ago

    Great tutorial! Thanks a lot! Is there any opportunity to do the same with Twitter data?

    • @patloeber · 3 years ago

      I already have two tutorials using the Twitter API (TensorFlow NLP and a Flask Twitter bot). Maybe you can apply the knowledge from those videos here.

    • @gsom2000 · 3 years ago

      @@patloeber Nice! Apparently I just missed them! Thanks!

  • @selcukturk3550 · 8 months ago

    How can I get this code?

  • @JackFrost1206 · 3 years ago +4

    Maybe you can scrape the subreddit wallstreetbets :D

  • @fintech1378 · 1 year ago

    Why do I always get a 401 error?

  • @samarendrapradhan5067

    I'm using Python 3.9, so older versions may differ for my comment below. Thanks.

  • @knowledgeshack5040 · 3 years ago

    First!

  • @JackFrost1206 · 3 years ago

    Are you German?

  • @gardnmi · 3 years ago +4

    Just went to that politics subreddit. It's laughably biased. Thanks for the tutorial.

  • @samarendrapradhan5067

    Please import the following:
    import matplotlib.pyplot as plt
    import seaborn as sns
    nltk.download('vader_lexicon')
    Use from vaderSentiment.vaderSentiment import SentimentIntensityAnalyzer as SIA in place of from nltk.sentiment.vader import SentimentIntensityAnalyzer as SIA.
    Also, please advise: width=100 is showing the error "'width' is an invalid keyword argument for print()".