NLP Demystified 3: Basic Preprocessing (case-folding, stop words, stemming, lemmatization)
- Added 2 Aug 2024
- Course playlist: • Natural Language Proce...
Depending on our goal, we may preprocess text further. We'll cover case-folding, stop word removal, stemming, and lemmatization. We'll go over their use cases, their tradeoffs, and how to get them done using spaCy.
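To make the four steps concrete, here is a minimal sketch in plain Python. It is not the spaCy code from the video or the notebook: the stop word list, the lemma lookup table, and the suffix-stripping stemmer are all tiny hypothetical stand-ins, chosen only to show what each step does and where it can lose information.

```python
import re

# Hypothetical, deliberately tiny stop word list (real lists are much larger).
STOP_WORDS = {"the", "a", "an", "is", "are", "was", "were", "to", "of", "and", "in", "on"}

# Hypothetical lookup table standing in for a real lemmatizer's dictionary.
LEMMA_TABLE = {"is": "be", "are": "be", "was": "be", "were": "be",
               "mice": "mouse", "running": "run"}


def tokenize(text):
    """Split text into alphabetic tokens (a crude stand-in for a real tokenizer)."""
    return re.findall(r"[A-Za-z']+", text)


def case_fold(tokens):
    """Case-folding: map every token to a single case."""
    return [t.casefold() for t in tokens]


def remove_stop_words(tokens):
    """Stop word removal: drop tokens found in the stop word list."""
    return [t for t in tokens if t not in STOP_WORDS]


def naive_stem(token):
    """Toy stemmer: strip one common suffix. This is NOT the Porter algorithm."""
    for suffix in ("ing", "ies", "ed", "es", "s"):
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def lemmatize(token):
    """Lookup-based lemmatization: return the dictionary form if known."""
    return LEMMA_TABLE.get(token, token)


tokens = tokenize("The mice were running in the US")
folded = case_fold(tokens)            # "US" collapses into "us"
filtered = remove_stop_words(folded)  # ['mice', 'running', 'us']
stems = [naive_stem(t) for t in filtered]   # ['mice', 'runn', 'us']
lemmas = [lemmatize(t) for t in filtered]   # ['mouse', 'run', 'us']

print(folded)
print(filtered)
print(stems)
print(lemmas)
```

Two tradeoffs show up even in this toy example: case-folding turns "US" into "us", which is indistinguishable from the pronoun, and the crude stemmer yields the non-word "runn" where the lookup-based lemmatizer returns "run".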
Colab notebook: colab.research.google.com/git...
Timestamps:
00:00:00 Basic Preprocessing
00:00:35 Case-folding and its tradeoffs
00:02:40 Stop word removal (tradeoffs and how it can go wrong)
00:04:40 Stemming (tradeoffs and things to watch out for)
00:06:28 Lemmatization and its advantages over stemming
00:07:52 DEMO: basic processing with spaCy
00:10:37 Basic preprocessing recap
This video is part of Natural Language Processing Demystified, a free, accessible course on NLP.
Visit www.nlpdemystified.org/ to learn more.
Concise and easily understandable. Thanks a lot for the series.
This is the best NLP series I have ever watched
This content is simple and easy to understand.
Well done, thanks!