NCBI Blast, Sequence Analysis & Result Interpretation: Lecture 2 part 2 by Dr. Muhammad Naveed

Sdílet
Vložit
  • čas přidán 22. 08. 2024
  • NCBI Blast, Sequence Analysis, Result Interpretation & Result Submission.
    Background
    BLAST, which The New York Times called the Google of biological research,[2] is one of the most widely used bioinformatics programs for sequence searching.[3] It addresses a fundamental problem in bioinformatics research. The heuristic algorithm it uses is much faster than other approaches, such as calculating an optimal alignment. This emphasis on speed is vital to making the algorithm practical on the huge genome databases currently available, although subsequent algorithms can be even faster.
    Input
    Input sequences (in FASTA or Genbank format) and weight matrix.
    Output
    BLAST output can be delivered in a variety of formats. These formats include HTML, plain text, and XML formatting. For NCBI's web-page, the default format for output is HTML. When performing a BLAST on NCBI, the results are given in a graphical format showing the hits found, a table showing sequence identifiers for the hits with scoring related data, as well as alignments for the sequence of interest and the hits received with corresponding BLAST scores for these. The easiest to read and most informative of these is probably the table.
    If one is attempting to search for a proprietary sequence or simply one that is unavailable in databases available to the general public through sources such as NCBI, there is a BLAST program available for download to any computer, at no cost. This can be found at BLAST+ executables. There are also commercial programs available for purchase. Databases can be found from the NCBI site, as well as from Index of BLAST databases (FTP).
    Process
    Using a heuristic method, BLAST finds similar sequences, by locating short matches between the two sequences. This process of finding similar sequences is called seeding. It is after this first match that BLAST begins to make local alignments. While attempting to find similarity in sequences, sets of common letters, known as words, are very important. For example, suppose that the sequence contains the following stretch of letters, GLKFA. If a BLAST was being conducted under normal conditions, the word size would be 3 letters. In this case, using the given stretch of letters, the searched words would be GLK, LKF, KFA. The heuristic algorithm of BLAST locates all common three-letter words between the sequence of interest and the hit sequence or sequences from the database. This result will then be used to build an alignment. After making words for the sequence of interest, the rest of the words are also assembled. These words must satisfy a requirement of having a score of at least the threshold T, when compared by using a scoring matrix.
    Algorithm
    To run the software, BLAST requires a query sequence to search for, and a sequence to search against (also called the target sequence) or a sequence database containing multiple such sequences. BLAST will find sub-sequences in the database which are similar to sub sequences in the query. In typical usage, the query sequence is much smaller than the database, e.g., the query may be one thousand nucleotides while the database is several billion nucleotides.
    #NCBIBlast #BLAST #QueryCoverage

Komentáře • 147

  • @mariachaudhary4854
    @mariachaudhary4854 Před 4 lety +2

    Lagta tha ab yeh lecture dobra sun' ne ko nahi mile gaa but sir you did it😇😇

  • @bioworld7565
    @bioworld7565 Před 3 měsíci

    you are a talent of Pakistan

  • @muhammadyaqoob8843
    @muhammadyaqoob8843 Před 8 měsíci

    Great sir
    Really you made much difficult task easy for students
    Appreciate your efforts

  • @diverseujjwal2619
    @diverseujjwal2619 Před 3 lety +3

    Really informative and great lectures. Thanks a lot sir.

  • @nimramushtaq4508
    @nimramushtaq4508 Před 4 lety

    It's like a blessing in disguise Sir.I'm a student of BS(Hons.) Zoology Department, actually i was searching BLAST for my presentation ..i did not found it in a precise way..finally a day before my presentation ,i found your lecture. i hope it would help me tomorrow ..Thank you very much..you did a fantastic job..keep it up Sir...

  • @jigyasatiwari
    @jigyasatiwari Před rokem +1

    Great sir, This video is so helpful for me....can you make the video on " interpretation of the clustal omega results (alignments, result summary, phylogenetic tree [ real ]. I need of this video till Sunday 🥺...please can you make sir. I will be very thankful to you 🙏

  • @professortraveler
    @professortraveler Před 2 lety

    Your lectures are awesome Sir, Your lectures are very easy to understand. Keep up the good work.

  • @asifakram7060
    @asifakram7060 Před 5 měsíci

    excellent

  • @atif8183
    @atif8183 Před 3 lety

    Thank you so much sir! Love from Gomal university D.i Khan.
    aap nay both easy tarekay say samjha diya mashallah ❤️

  • @jazzypakistan140
    @jazzypakistan140 Před rokem

    I used to watch your videos very often .. and maine aaj ghor kia k aap ko mai jaanti hu mili b hu.🤣🤣🤣

  • @MujahidAli-dz6ln
    @MujahidAli-dz6ln Před 3 lety +1

    Really Good Work! Please keep it up.

  • @swatikanani323
    @swatikanani323 Před 3 lety +4

    Whole genome sequencing results interpretation

  • @mamtapanda6567
    @mamtapanda6567 Před 3 lety +1

    Which one is more significant when comparing sequences for its function prediction....E value or identity % and why
    How much E value can be considered significant?

  • @A4arooj
    @A4arooj Před 9 měsíci

    There is one question, that keep on pinching me; can't query coverage be defined as a factor which describes the homology or evolutionary relationship between two sequences. as you mentioned in the class that it always must be 100.... which according to my understanding actually tells weather or not they are homologous to each other.

  • @babarbch
    @babarbch Před 4 lety +1

    Informative lecture
    Excelent

  • @zeekhan4040
    @zeekhan4040 Před rokem

    I love your videos. Excellent job.

  • @uzmaramzan4654
    @uzmaramzan4654 Před 3 lety

    Great sir. Your lectures are very helpful.

  • @aqsakashigar2282
    @aqsakashigar2282 Před 2 lety +1

    Thanks sir ❤️

  • @bukhtaweranwer1352
    @bukhtaweranwer1352 Před 4 lety +1

    Brilliant

  • @sanianisar6184
    @sanianisar6184 Před 3 měsíci

    Assalamu Alikum Sir, I am a phd Student and now doing work on transcriptome analysis in garlic.how i will interpret my result using bioinformatices.kindly let me know step wise and dataset which will be good for this species.? Regards

  • @badirajpoot3740
    @badirajpoot3740 Před 2 lety +1

    assalam o alaikum dear sir,
    sir ye Blast ma description ki thori c guidelines dobara dijiya ga kch smaj ni ah rhi.
    sir for example hamara sequence 1584 bp ka ha lakin uski first description ma 2918 show ho rha ha max score or total score , agay query cov and persent identity ka boht difference ha ...

  • @iqrasaeed642
    @iqrasaeed642 Před 3 lety +1

    sir what if I get the negative E-value? should I have to select only 0.0 e-values for comparison?

  • @experimentaleducation-by1cy

    in case of new strains similarity would be less than 85%? and can we claim based on these result that' it's new starins ..( I am referring fungal starin here ) or we need to go further gene analysis ,please guide about it if you can ?thanks

  • @ridwanunuhuahmad5945
    @ridwanunuhuahmad5945 Před rokem

    Hello Sir, I need more clarification on how to run blast and phylogenetic analysis. I have my sequence result from 4 18S rRNA isolate. I run blastn and all the 4 isolate have 95+ percent identity with previously published sequence/ species in the NCBI which is C.parvum. other published species showed lower than 95+ percent identity and when I run phylogenetic tree some of the isolate attached to Cryptosporidium tortoise instead of C. parvum that showed higher percentage identity. How do I interprete phylogenetic tree ? What do I do to identity species from my sequence result and do I deposit my sequence result to NCBI to obtain my sequence assertion number ?
    Thank you

  • @Mr.Abd0001
    @Mr.Abd0001 Před rokem

    Aslamulikum sir. Sir apka bioinformatics wala playlist sequence main nahi. Samjh nai ati kon si video sy start laen. Kindly guide.

  • @havvaunal3957
    @havvaunal3957 Před 11 měsíci

    hi, when I run the global align, I get the different identities percentage for the same sequences each time. Why, can you explain this? thank you

  • @sultanajannat15
    @sultanajannat15 Před rokem

    How to find out the function of an unknown gene using BLAST?

  • @jojithomas587
    @jojithomas587 Před 3 lety +1

    VERY INFORMATIVE VIDEO. Sir, i have the sequence result but when i use blast the similarity is different in term of organism name as well as the gene. is that possible? kindly advice. thank you.

  • @namrahqaisar9945
    @namrahqaisar9945 Před 3 lety +1

    how to know the difference using two different scoring schemes while using blast

  • @waqarhussain4406
    @waqarhussain4406 Před rokem

    Thanks you too much 💝

  • @faheemkhan-vj9qg
    @faheemkhan-vj9qg Před 4 lety +1

    good lecture. sir explain ho wo draw phylogenetic tree?

  • @adilqadir3407
    @adilqadir3407 Před rokem

    Sir do you have any reference which we follow and we can write our blast like that...??

  • @yasirnawaz3910
    @yasirnawaz3910 Před 4 lety +1

    Sir kuch topic send kru wo lecture prepare kroge

  • @imtiazahmad9238
    @imtiazahmad9238 Před 4 lety

    use of query and subject ?
    how to select range or how it varies?

  • @sahajlaboratory7342
    @sahajlaboratory7342 Před 4 lety +1

    NICE LECTURE ,i, m a research student , few quereies like i hv result of my query seq from ncbi blast which shows 86.88% identity ,so is it ok.
    further i hv wgs results of a bacteria,in which i wanna go for pathogeny island prediction for which i was using PAIDB v2.0.cn u pls help me how to use tht particular software??

  • @sumankarmakar9352
    @sumankarmakar9352 Před 3 lety

    How to determine that first organism will be selected for further analysis if there were first 5 organism have identity at 100%....and max score,total score all are same....sir please elaborate this....

    • @Prof.Dr.MuhammadNaveed
      @Prof.Dr.MuhammadNaveed  Před 3 lety

      then means your sequence is copied from these as not possible 5 sequences have 100% similarity

  • @hussain223
    @hussain223 Před 4 lety +5

    Could you speak English, please because I am a foreign researcher

    • @Prof.Dr.MuhammadNaveed
      @Prof.Dr.MuhammadNaveed  Před 4 lety +3

      Dear will share in Eglish version after few months as first to complete in native language ok

  • @imadkhan2029
    @imadkhan2029 Před 2 lety

    Sir can you upload lecture on Illumina Sequencing and Amplicon sequencing .

  • @hirondevnath2513
    @hirondevnath2513 Před 4 lety +1

    Sir I see a option in BLAST where it shows it can compare two sequences and we get results. From there we can have the variant point but if i want to compare more than 10 sequences to find out the variation point then how it can be done.
    You know, from blast we can detect the variantion point of two sequences but if i just count and then place it in my paper it’s possible a mistake may happen. As i want to work with 50 sequences, it’s diffcult to run blast everytime and count them. please let me know if any Software or easy solution is available.
    Take love from Bangladesh. Thanks in advance.

  • @shanzaaslam4826
    @shanzaaslam4826 Před 2 měsíci

    Kindly gap penalty bta day r gaps k bary m bta day

  • @farooqhaider7365
    @farooqhaider7365 Před rokem

    Dear sir, I hope u will be fine.
    Sir which test is further proceed for novel genus of organism. Plz explain

  • @rizwanhafeez9497
    @rizwanhafeez9497 Před rokem

    Sir my question is
    gpaphics mein similarity kis kay sath show ho rahee hai...Becoz sequence tou aik hi species ka hai yahan pay. Thanks

  • @zainabwaheed3606
    @zainabwaheed3606 Před 2 lety +1

    I want to ask one question that how we know that the query sequence is novel?

    • @Prof.Dr.MuhammadNaveed
      @Prof.Dr.MuhammadNaveed  Před 2 lety

      if percentage identity and query coverage are less than 85% then do sequencing again for novelty

    • @zainabwaheed3606
      @zainabwaheed3606 Před rokem

      Thank you sir
      Please guide that how to do sequencing again

    • @zainabwaheed3606
      @zainabwaheed3606 Před rokem

      If percentage identity is 84% and Query Cover is 94% then it is novel or not?

  • @mohdawas4285
    @mohdawas4285 Před 3 lety +1

    Aslamualikum Sir
    thanks for these fruitful lectures, but i m facing a problem in using the NCBI blast where in my system only description showed in detail, while all other option showed nothing like graphic and so on. If possible plz reply i m thankful to you sir forever.
    Regard Mohd Awas (research Scholar)
    University of Kashmir india

  • @uneebashahid5440
    @uneebashahid5440 Před 4 lety +1

    sir I have all the results are almost 100 % , the sequence i used was from GenBank. where I did mistake ? any idea? nucleotide was Saccharomyces cerevicae S288C (PPQ1).
    And your lecture is very informative,Thanks!!

  • @user-lg4yo8me6j
    @user-lg4yo8me6j Před 3 lety +1

    assalam o alikum- sir mujhy ye samjh nhi aya k" stain novel" hony sy kya murad h?

    • @Prof.Dr.MuhammadNaveed
      @Prof.Dr.MuhammadNaveed  Před 3 lety

      means every day new strain are discovered in word means new bacteria so NCBI is first method to confirm this novelty on the basis of similarity ok

    • @user-lg4yo8me6j
      @user-lg4yo8me6j Před 3 lety

      Ok thank you so much

  • @muhammadshahab1421
    @muhammadshahab1421 Před 3 lety +1

    Sir can we convert the sequence into 4 Letter idea of protein

  • @iqralone9785
    @iqralone9785 Před 2 lety

    sir after getting fasta sequencing of my sample i got 93% query coverage and 97% identity and the species was polaromonas glacialis...is it reliable

  • @goodboy-gt3te
    @goodboy-gt3te Před rokem

    Blast me tools konsy use ho rahy

  • @factology2642
    @factology2642 Před 3 lety

    Assalamo alikum sir...
    how we made algorithm table by blasting....?
    please help me out

  • @gopalkalwan7465
    @gopalkalwan7465 Před 2 lety

    Dear sir i am doing genome wide analysis of gene family and I got 58 hits. What will be the criteria of filteration of these hits for validation purpose or i have to select all

  • @rabiahabib883
    @rabiahabib883 Před rokem

    How we trimm sequence

  • @sanasheikh5033
    @sanasheikh5033 Před 3 lety +1

    How to check bit score ?

    • @Prof.Dr.MuhammadNaveed
      @Prof.Dr.MuhammadNaveed  Před 3 lety

      The bit score, S', is derived from the raw alignment score, S, taking the statistical properties of the scoring system into account. Because bit scores are normalized with respect to the scoring system, they can be used to compare alignment scores from different searches.

  • @muhammadayazfaridshah3694

    sir, kindly ye btain k cancer genes r healthy person k genes kasy download kar skty ham ??? NCBI sy cancer k to mil jaty but healthy person k ni mil rahy mujy.

  • @aaaa5nfgfghf
    @aaaa5nfgfghf Před 2 lety

    sir, plz make a video on how use MODELLER software.

  • @ashirali195
    @ashirali195 Před 3 lety

    sir what is the meaning of rid parameters must be applied when i download the all sequnces from blast?

  • @ambreeniqbal6396
    @ambreeniqbal6396 Před 3 lety +1

    Sir kindly Chau fasman Py aik lecture plz

  • @payalprasad5614
    @payalprasad5614 Před 4 lety +1

    sir what is the the meaning of e value?could you please elaborate

  • @shahnazsalamat1271
    @shahnazsalamat1271 Před 4 lety +1

    If i can not find any similarity about my protein than what does that mean?

  • @gloryparmar1904
    @gloryparmar1904 Před 4 lety +1

    sir, how we can submit SNP to dbSNP or clinvar? can you plz help?

    • @Prof.Dr.MuhammadNaveed
      @Prof.Dr.MuhammadNaveed  Před 4 lety

      The method is same as you submit your sequence on NCBI GenBank. You need to sign in first and then there is a submission option. You will submit your sequence there and and a specific accession number will be allotted to you.

  • @nabighafaheem8502
    @nabighafaheem8502 Před 2 lety

    Assalamualaikum Respected Sir
    Please guide us about internship program for bs bioinformatics students

  • @oldisgold-nc1zh
    @oldisgold-nc1zh Před 3 lety +1

    Sir uniquely identified species kese select kare?

  • @shabinakhan6930
    @shabinakhan6930 Před 2 lety

    Hello sir how to find similarities in genome of two viruses

  • @poonamkumari-nn8yg
    @poonamkumari-nn8yg Před 2 lety

    Good evening Sir...I have made mistake in gene bank submission.I have to write in the data isolate instead of voucher specimen.Can I edit my gene bank records ...please help Sir..

  • @user-bz9xn3rm1f
    @user-bz9xn3rm1f Před měsícem

    Sir , Articles,
    Research paper
    Aur publications ma kia frq hy

  • @poonamkumari-nn8yg
    @poonamkumari-nn8yg Před 2 lety

    Good evening Sir....I have done sequencing of Mitochondrial COX1 gene of insect specimens. How can I submit these sequences to NCBI.....

  • @sarimaziz3374
    @sarimaziz3374 Před 2 lety

    KINDLY make video on CARD. DATABASE

    • @Prof.Dr.MuhammadNaveed
      @Prof.Dr.MuhammadNaveed  Před 2 lety +1

      noted

    • @sarimaziz3374
      @sarimaziz3374 Před 2 lety

      @@Prof.Dr.MuhammadNaveed sir try to make full sessions on it. I am currently working on it as student but i think the way you explain stuff would be better . Jazakallaha

  • @healthlinepakistan3157

    sir how to check SNP rs numbers of stroke.plz guide

  • @nisharaj8967
    @nisharaj8967 Před 2 lety

    Hello sir, Hope you are doing good, I would like to learn Primer design and sequence analysis.

  • @VivoVivo-ne3qt
    @VivoVivo-ne3qt Před 3 lety

    AoA! Sir how to find homozygous and heterozygous through sequence

  • @hamzarehman9913
    @hamzarehman9913 Před rokem

    Informative lecture. Can I get your contact Dr Sb. Please reply me

  • @AartiYadav-ns8db
    @AartiYadav-ns8db Před 3 lety

    Sir unknown sequence ko known sequence se kaise match karna h

  • @nehanadeem8988
    @nehanadeem8988 Před 2 lety

    Sir what E value depicts in blast output?

  • @iqraafzal4736
    @iqraafzal4736 Před 2 lety

    Awsome👍

  • @dr.muhammadimran5501
    @dr.muhammadimran5501 Před 3 lety

    Slow down your speed

  • @sabgaminggamer2256
    @sabgaminggamer2256 Před 4 lety

    can i do ms computer science after bs in bioinformatics.

    • @Prof.Dr.MuhammadNaveed
      @Prof.Dr.MuhammadNaveed  Před 4 lety

      Yes dear possible

    • @sabgaminggamer2256
      @sabgaminggamer2256 Před 4 lety

      @@Prof.Dr.MuhammadNaveed thanks sirrr.... in which university do you are studing!!

    • @Prof.Dr.MuhammadNaveed
      @Prof.Dr.MuhammadNaveed  Před 4 lety +1

      @@sabgaminggamer2256 Teaching in University of Central Punjab, Lahore

    • @sabgaminggamer2256
      @sabgaminggamer2256 Před 4 lety

      @@Prof.Dr.MuhammadNaveed great.... well I'm a student of bs in bioinformatics first semester...but i love programming.... like python... artificial intelligence . machine learning... etc..
      so... i wanna confirm that from you... ms in computer science...

    • @sabgaminggamer2256
      @sabgaminggamer2256 Před 4 lety

      plz make. a tour to our university... which is gcu faisalabad. thanks

  • @albertescamilla
    @albertescamilla Před 3 lety

    I have no idea what you're saying.

  • @divyanikhode8693
    @divyanikhode8693 Před 3 lety +1

    so fast talking nothing understand