WGS Variant Calling: Variant Filtering and Annotation - Part 2 | Detailed NGS Analysis Workflow

Sdílet
Vložit
  • čas přidán 29. 06. 2024
  • This is the detailed workflow tutorial on how to filter variants and annotate them using GATK's Funcotator tool. In this video I discuss the need of filtering and annotating variants, the questions we may ask once we call variants, two approaches to filter variants i.e. Variant Quality Score Recalibration (VQSR) & hard filtering, the data sources available, genotype refinement and finally talking about Funcotator. Further, I demonstrate how to filter variants at the site level as well apply genotype filters and annotating variants using Funcotator. Lastly, I demonstrate how to wrangle the output from Funcotator and get the annotations in a tabular format which is helpful for further processing and visualization. I hope you find this video helpful! Leave your thoughts in the comment section below!
    ▸ Code:
    github.com/kpatel427/CZcamsT...
    ▸ Part 1 of this video:
    • WGS Variant Calling: V...
    ▸ VCF file format video:
    • Understanding File For...
    ▸ GATK hard-filters recommendations for germline variants:
    gatk.broadinstitute.org/hc/en...
    Chapters:
    0:00 Intro
    0:43 Overview of variant calling steps
    1:34 Questions we may want to ask after calling variants
    3:00 Variant data sources
    5:16 Two approaches to filter variants
    8:38 Hard filtering: Site-level filtering
    9:14 Hard filtering: Sample-level filtering
    10:15 Genotype Refinement
    10:54 Data used for today’s demo
    11:22 Pre-requisites
    12:22 Filtering SNPs
    16:54 Filtering INDELS
    17:35 Understanding output after filtering (FILTER column)
    19:19 Select variants that passed filters
    20:39 Exclude variants that failed genotype filters
    25:49 GATK Funcotator tool
    28:10 Funcotator data sources
    29:59 Annotate variants using Functotator
    32:07 Understanding output after annotation (Funcotator output)
    33:45 Getting annotations into a tabular format
    You can show your support and encouragement by buying me a coffee:
    www.buymeacoffee.com/bioinfor...
    To get in touch:
    Website: bioinformagician.org/
    Github: github.com/kpatel427
    Email: khushbu_p@hotmail.com
    #bioinformagician #bioinformatics #funcotator #variantcalling #variants #gatk #vcf #gvcf #haplotype #alleles #geneticvariants #mutations #gff3 #gff #gtf #sam #bam #phred #fasta #fastq #singlecell #10X #ensembl #biomart #annotationdbi #annotables #affymetrix #microarray #affy #ncbi #genomics #beginners #tutorial #howto #omics #research #biology #GEO #rnaseq #ngs

Komentáře • 34

  • @julinha_87
    @julinha_87 Před rokem +8

    I usually never comment on anything, really, but I would be remiss if I didn't mention what a great job you are doing. There are not a lot of videos, or at least none I could really find, that make video tutorials on such specific topics like this. Really good job :) Thank you!

  • @belizg1
    @belizg1 Před rokem +1

    Thank you very much for Part 2 and especially for focusing on hard filtering as we requested you to do the video

  • @415chung
    @415chung Před rokem +2

    Fantastic! Your presentation is clear and detail, helps me a lot in understanding the hard filtering process😊

  • @hayatdeen
    @hayatdeen Před rokem

    i wish i found you sooner, i had to painstakingly browse though tones of GATK web pages to get me started with the Best practice guidelines. you made it clearer

  • @user-dk4cx9mw5g
    @user-dk4cx9mw5g Před rokem

    Awesome tutorial! Thank you so much!

  • @venkatakrishnakandalai2571

    Thank you for this very informational video!

  • @mentarikasih8677
    @mentarikasih8677 Před rokem

    your tutorial really help me a lot to understand the workflow. If you have experience analyze gwas and also want to make the tutorial, I would appreciate it. Thank you so much ma'am ☺️

  • @harshasatuluri4540
    @harshasatuluri4540 Před rokem +1

    Thank you ma'am for sharing ur knowledge.... Happy to be ur student form online😊

  • @pauladisanto919
    @pauladisanto919 Před 6 měsíci

    You are great! Thanks for this amazing tutorial!

  • @wagawsendeku7750
    @wagawsendeku7750 Před 7 měsíci

    Thanks for the excellent tutorial👍

  • @rezomgeladze5750
    @rezomgeladze5750 Před rokem

    Great explanation!

  • @ramachandran8106
    @ramachandran8106 Před rokem +1

    you are great teacher.. thankyou.

  • @farhinnikhat4493
    @farhinnikhat4493 Před rokem

    thanks alot for this amazing video.

  • @madhavanjn
    @madhavanjn Před rokem

    Thanks for ur amazing explanation. Plz keep posting more videos, can u make a video on how to use restapi, github, json, aws etc.

  • @hadieidgah4582
    @hadieidgah4582 Před rokem

    Fantastic work, thank you and it is always worth to watch your videos till the last second. Can one use this pipeline for WES as well?

  • @OmPrakash-ob5cz
    @OmPrakash-ob5cz Před 29 dny

    Any script explanation for diplotype calling?, In the same way as you explained variant calling in a very clear way.

  • @Testtesztest12123
    @Testtesztest12123 Před 7 měsíci

    Good job

  • @ayushsafar6289
    @ayushsafar6289 Před rokem

    thank you very very very much

  • @osmandoluca8347
    @osmandoluca8347 Před 9 měsíci

    @22:00 Instead of doing all that, you can add '-select-genotype "GQ > 10 && DP > 10" \' to the SelectVariants commandline.

  • @bioramg
    @bioramg Před 2 měsíci

    Thank you for sharing your knowledge with me. I understood everything well and was able to perform my analysis perfectly. I have a question regarding the view results. The command you provided was used only for SNPs. Should I use the same command for indels as well? The command (VIEW RESULTS STEP) is: "cat ${results}/analysis-ready-indels-filteredGT-functotated.vcf | grep " Funcotation fields are: " | sed 's/|/\t/g' > ${results}/output_curated_variants_indels.txt" and "cat ${results}/output_indels.table | cut -f 5 | sed 's/|/\t/g' >> ${results}/output_curated_variants_indels.txt". Please let me know. Thank you once again for your help.

  • @sgrtools
    @sgrtools Před rokem +2

    This is a great tutorial !.
    Can you please make a video on "Call somatic mutations using GATK4 Mutect2"?? and a video on "maftools" - R package to analyze somatic .vcf files??"
    Highly appreciated !

    • @Bioinformagician
      @Bioinformagician  Před rokem +1

      I will surely consider covering these topics. Thank you for the suggestion:)

  • @alialjizani
    @alialjizani Před 8 měsíci

    Thank you that was helpful, but what if I have the WES data in form of Excel sheet how can I filter my variants? and is there a tool or a program can do that?

  • @user-ci4xh3uv1i
    @user-ci4xh3uv1i Před rokem +1

    Thanks very much for the video. Could you continue to explain Call somatic mutations with Mutect2?

    • @Bioinformagician
      @Bioinformagician  Před rokem

      I will surely consider making a video on mutect2

    • @gana1263
      @gana1263 Před 10 měsíci

      ​@@Bioinformagicianyes ... requesting a tutorial for somatic variant calling workflow....thank you

    • @rajeshsingh-xv7wy
      @rajeshsingh-xv7wy Před 8 měsíci

      @Bioinformagician please.....As soon as possible.

  • @sinugeorge9485
    @sinugeorge9485 Před rokem +1

    Could you please make a video for GTAK-mutect2 variant calling?

  • @nikitamaurya4518
    @nikitamaurya4518 Před měsícem

    Does this include somatic and germline variant calling workflow?

  • @user-zl4rp4cj2o
    @user-zl4rp4cj2o Před měsícem

    how to set those filters ? what do all do we have to consider ?

  • @sanjaisrao484
    @sanjaisrao484 Před rokem

    Mam, please upload the visualization part

  • @c.p.8689
    @c.p.8689 Před rokem

    Why don't you just merge indels and snps in one main file and annotate this one main file right away?

  • @vardansaroyan7634
    @vardansaroyan7634 Před rokem

    You are amazing. Thanks!!!