StatQuest: edgeR, part 1, Library Normalization

StatQuest with Josh Starmer

40,693 views

Published April 2, 2017
edgeR, like DESeq2, is a complicated program used to identify differentially expressed genes. Here I clearly explain how it normalizes libraries.
For a complete index of all the StatQuest videos, check out:
statquest.org/video-index/
If you'd like to support StatQuest, please consider...
Buying The StatQuest Illustrated Guide to Machine Learning!!!
PDF - statquest.gumroad.com/l/wvtmc
Paperback - www.amazon.com/dp/B09ZCKR4H6
Kindle eBook - www.amazon.com/dp/B09ZG79HXC
Patreon: / statquest
...or...
CZcams Membership: / @statquest
...a cool StatQuest t-shirt or sweatshirt:
shop.spreadshirt.com/statques...
...buying one or two of my songs (or go large and get a whole album!)
joshuastarmer.bandcamp.com/
...or just donating to StatQuest!
www.paypal.me/statquest
Lastly, if you want to keep up with me as I research and create new StatQuests, follow me on twitter:
/ joshuastarmer
#statquest #rnaseq #edger

Comments • 59

@statquest 2 years ago
Support StatQuest by buying my book The StatQuest Illustrated Guide to Machine Learning or a Study Guide or Merch!!! statquest.org/statquest-store/
@dany271197 5 years ago ⁺⁵
By the way, you did a great job explaining statistical analysis for dummies in a very nice way!!
@statquest 5 years ago ⁺¹
Thank you! :)
@tkdlvk27 4 years ago ⁺²
Wow... this is an amazing method, and so is your explanation.
@elizabethblears2919 6 years ago
This is so helpful! Thank you! Keep up the good work!
@sunnetinternationalbusines9910 1 year ago ⁺¹
Thanks for the in-depth explanation
@statquest 1 year ago
bam!
@reytns1 6 years ago ⁺¹
Dear Joshua, I have watched your video; it is really interesting and helpful for people who are new to the RNA-seq world. I have a question about normalization: is there any relationship between edgeR and the hypergeometric distribution?
@liangcheng7824 5 years ago
This is really great! I'm a little bit confused: don't people use some conserved genes that have a relatively steady expression level as references to normalize their data?
@brettvanderwerff6917 6 years ago ⁺¹
This is amazing, thanks
@statquest 6 years ago
Hooray! :)
@NumptyBrainStorm 1 year ago ⁺¹
Learning R and differential analysis for ChIP-seq differential analysis (DiffBind), THANKS!!!
@statquest 1 year ago
bam! :)
@blankaroje8853 4 years ago ⁺²
Thank you!
@statquest 4 years ago
:)
@victorhigareda4716 3 years ago ⁺¹
The reference sample could be one of the treatments or one of the controls in an RNA-seq experiment, is that correct? Thank you for your great explanation
@statquest 3 years ago
Yes.
@ElNick09 3 years ago ⁺²
This is an explanation of the process executed in TMM normalization, as made clear at 10:37. I'm just saying this in case anyone has come to this video, as I have, looking for an explanation of TMM normalization.
@statquest 3 years ago ⁺¹
Yep.
@Reza_Ghamsari 4 years ago ⁺¹
This is great, thank you. I don't understand how you calculated the weighted average at 12:28. Is it just the average of the log ratios?
@statquest 4 years ago ⁺¹
I'll be honest, I made this video a while ago and haven't thought about it much since, so I can't give you any more details about how edgeR works.
@simonhuang4807 2 years ago
The weights are the inverses of the approximate asymptotic variances (calculated using the delta method).
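A rough sketch of what such inverse-variance weights could look like, assuming the form given in the published TMM method description (a binomial-style variance for each gene's log ratio, approximated with the delta method). The function name `tmm_weight` and all counts below are made up for illustration; this is not edgeR's own code.

```python
def tmm_weight(obs_count, obs_lib_size, ref_count, ref_lib_size):
    """Inverse of the approximate variance of one gene's log ratio.

    Assumed variance form (delta method):
        (N_obs - y_obs) / (N_obs * y_obs) + (N_ref - y_ref) / (N_ref * y_ref)
    where N is the library size and y is the gene's read count.
    """
    variance = (
        (obs_lib_size - obs_count) / (obs_lib_size * obs_count)
        + (ref_lib_size - ref_count) / (ref_lib_size * ref_count)
    )
    return 1.0 / variance


# Hypothetical genes in two libraries of one million reads each.
w_high = tmm_weight(1000, 1_000_000, 900, 1_000_000)  # well-covered gene
w_low = tmm_weight(10, 1_000_000, 9, 1_000_000)       # sparsely covered gene

print(w_high > w_low)
```

Under this assumption, genes with more reads have smaller variance and therefore get larger weights, which is why read depth ends up driving the weighted average.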
@c.p.8689 2 years ago ⁺¹
Love you!!
@statquest 2 years ago
Thanks!
@garyhokawai 7 years ago ⁺²
Just wondering: comparing edgeR to DESeq2, which one makes more sense for single-cell RNA-seq normalization?
@garyhokawai 7 years ago ⁺¹
So if my data has a large number of zero-value genes, DESeq2 is preferable? BTW, usually I would use ERCC spike-ins for the size factor calculation and apply it to the endogenous ones.
@Adelphos0101 4 years ago
Is there any reason for edgeR to use the 75th quantile instead of the median to pick the reference sample?
Very nice video to understand edgeR.
@statquest 4 years ago
I think the point is to just exclude outliers with excessive read counts.
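For anyone who wants to see the reference-sample step in code, here is a minimal sketch. It assumes the reference is the sample whose 75th quantile of library-size-scaled counts is closest to the average 75th quantile across samples. The helper names and the toy counts are hypothetical, and edgeR's real implementation may differ in details.

```python
from statistics import mean, quantiles


def upper_quartile(values):
    # 75th quantile with linear interpolation ("inclusive" method).
    return quantiles(values, n=4, method="inclusive")[2]


def pick_reference(samples):
    """samples maps a sample name to its list of raw read counts per gene."""
    scaled_q75 = {}
    for name, counts in samples.items():
        lib_size = sum(counts)
        # Scale each count by the library size before taking the quantile.
        scaled_q75[name] = upper_quartile([c / lib_size for c in counts])
    average = mean(list(scaled_q75.values()))
    # The reference is the sample closest to the average 75th quantile.
    return min(scaled_q75, key=lambda n: abs(scaled_q75[n] - average))


# Hypothetical counts for five genes in three samples.
samples = {
    "s1": [10, 20, 30, 40, 100],
    "s2": [5, 5, 10, 20, 160],
    "s3": [20, 30, 40, 60, 50],
}
reference = pick_reference(samples)
print(reference)
```

With these toy counts the scaled 75th quantiles are 0.2, 0.1 and 0.25, so `s1` is closest to their average and becomes the reference.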
@igumnov.daniel 2 years ago
Ty
@ns43253 3 years ago
Do you have suggestions on whether someone should use edgeR or DESeq2 for 16S analysis of soil communities?
@statquest 3 years ago ⁺¹
To be honest, they are about the same. However, I know Mike Love is still adding tons of new visualizations to DESeq2, so that might be my favorite.
@LayneSadler 1 year ago
I'm trying to think of a reason why I shouldn't just compare the case-control distributions with: KS test pval (y axis cutoff 0.05) over difference in normalized means (x axis cutoff +/- 50 TPM). We want to know if they come from the same distribution and don't want to pick up tiny TPM changes.
@statquest 1 year ago ⁺¹
Unfortunately it's been way too long since I made this video or did any kind of bioinformatics work to give you a reasonable answer. However, my rough memory is that these methods (edgeR and DESeq2) gain power by pooling genes to estimate variation, and then gain more power by using a parametric test based on the negative binomial distribution. I think if you just went with a straight KS test, you wouldn't have any power.
@binnylinny 2 years ago
edgeR just seems far more complicated than DESeq2. Is there any advantage edgeR has over DESeq2, apart from the artistic signature you mentioned towards the end? :P
@statquest 2 years ago ⁺¹
Not that I know of. I used to use edgeR, but switched to DESeq2 with no regrets.
@jesusmateoamillanocisneros6192 3 years ago
Hello! Is it possible to find an association between an environmental variable and bacterial abundance? Sorry for my English!
@statquest 3 years ago
I have no idea. Maybe someone else can help.
@dany271197 5 years ago ⁺¹
So you mean that edgeR needs weighted trimmed mean normalization, but DESeq2 does not?
@statquest 5 years ago
DESeq2 has its own normalization that is similar, but a little different. Here's the link to my StatQuest that describes the method: czcams.com/video/UFB993xufUU/video.html
@suryakantastat0275 1 year ago
How are the weights for the weighted log2 ratios calculated in this library?
@statquest 1 year ago
What time point in the video, minutes and seconds, are you asking about?
@suryakantastat0275 1 year ago
12:20. How are the assigned weights calculated?
@statquest 1 year ago
@@suryakantastat0275 I believe edgeR uses the number of reads per gene in each sample to calculate the weighted average of the log values. For example, if we had two genes: Gene A, with 100 reads and log2() = 0.05 and Gene B, with 50 reads and log2() = 0.1, then the weighted average would be ((100*0.05) + (50*0.1))/(100 + 50) = 0.067. For more details on how to calculate a weighted average, see en.wikipedia.org/wiki/Weighted_arithmetic_mean
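The arithmetic in the reply above can be checked with a few lines of Python. This is only a sketch of a plain weighted average using read counts as weights, as in the two-gene example; the function name is made up, and it is not a claim about how edgeR computes its weights internally.

```python
def weighted_average(values, weights):
    """Weighted arithmetic mean: sum(w * v) / sum(w)."""
    total_weight = sum(weights)
    if total_weight == 0:
        raise ValueError("weights must not sum to zero")
    return sum(v * w for v, w in zip(values, weights)) / total_weight


# Gene A: 100 reads, log2 ratio 0.05; Gene B: 50 reads, log2 ratio 0.1.
log_ratios = [0.05, 0.1]
read_counts = [100, 50]

result = weighted_average(log_ratios, read_counts)
print(round(result, 3))  # 0.067
```

Gene A has twice the reads of Gene B, so its log ratio pulls the average twice as hard, which is why the result sits closer to 0.05 than to 0.1.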
@kimseonhoon9704 2 years ago ⁺¹
12:31 I like it
@statquest 2 years ago
:)
@LayneSadler 1 year ago
It's bananas that the top/bottom 30% of fold changes are discarded. Is the reason that they are prone to being +/- inf? It's tricky that values less than 1 lead to exploding ratios.
@statquest 1 year ago
Can you tell me what time point you're asking about (minutes and seconds)?
@LayneSadler 1 year ago ⁺¹
@@statquest 9:47 but it appears they aren't actually dropped from the analysis, just the calculation of the scaling factor, which makes sense
@statquest 1 year ago ⁺¹
@@LayneSadler Yep, that's correct. We just want the housekeeping genes for the scaling factor.
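To make the trimming idea concrete, here is a simplified sketch, assuming the scaling factor is built from the log2 ratios that survive after the top and bottom 30% are set aside. It leaves out the weighting and the separate trimming on average expression, and the function name and counts are hypothetical, so treat it as an illustration rather than edgeR's actual procedure.

```python
import math


def trimmed_scaling_factor(sample, reference, trim=0.30):
    """Unweighted trimmed mean of log2 ratios, converted back to a ratio."""
    sample_total, ref_total = sum(sample), sum(reference)
    # Log2 ratio of library-size-scaled counts; skip genes with zero reads,
    # whose ratios would be infinite or undefined.
    log_ratios = sorted(
        math.log2((s / sample_total) / (r / ref_total))
        for s, r in zip(sample, reference)
        if s > 0 and r > 0
    )
    cut = int(len(log_ratios) * trim)
    kept = log_ratios[cut:len(log_ratios) - cut]  # middle genes only
    return 2 ** (sum(kept) / len(kept))


# Hypothetical data: nine unchanged genes and one strongly induced gene.
reference = [100] * 10
sample = [100] * 9 + [1100]

factor = trimmed_scaling_factor(sample, reference)
print(factor)
```

The induced gene doubles the sample's library size, so every unchanged gene looks half as abundant after scaling. Because the extreme ratio is trimmed away, the factor comes out at 0.5 and reflects only the unchanged genes; all ten genes would still be kept for the downstream analysis.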
@henricker 3 years ago
I really laughed my ass off at 12:30, thanks for the video.
To my understanding, isn't it weird that it's possible to have a reference sample for a gene where there are 0 reads on that gene? Wouldn't it be possible to take a reference sample for each gene to avoid this issue? I don't see how this makes sense logically, but I might have missed something. Thank you!
@statquest 3 years ago
What time point, minutes and seconds, are you asking about?
@someone_there 2 years ago
Well, fine, but how do I use edgeR?
@statquest 2 years ago ⁺²
To be honest, I found the manual for edgeR relatively easy to follow. It has a lot of examples.
@someone_there 2 years ago ⁺²
@@statquest Actually, I couldn't find any good workflow tutorial for edgeR on YouTube, with coding explanations, etc. If you have time to publish a good video about that, it would be extremely helpful.
@statquest 2 years ago ⁺²
@@someone_there I wish I could, but it's been years since I used edgeR. :(
@someone_there 2 years ago ⁺²
@@statquest oh I see... well, thanks a lot for your answers anyway :)
@sunnetinternationalbusines9910 1 year ago
edgeR seems to make more sense than DESeq2 to me.
@statquest 1 year ago
Noted
@BiologyIsHot 17 days ago
I've always felt that edgeR's approach seems more arbitrary.
