NCBI Blast, Sequence Analysis & Result Interpretation: Lecture 2 part 2 by Dr. Muhammad Naveed

Dr. Muhammad Naveed

zhlédnutí 42 755

Přidat do
- Můj playlist
- Přehrát později
Sdílet

Sdílet

Vložit

Velikost videa:

Zobrazit ovladače přehrávání

Automatické přehrávání

Přehrát

čas přidán 22. 08. 2024
NCBI Blast, Sequence Analysis, Result Interpretation & Result Submission.
Background
BLAST, which The New York Times called the Google of biological research,[2] is one of the most widely used bioinformatics programs for sequence searching.[3] It addresses a fundamental problem in bioinformatics research. The heuristic algorithm it uses is much faster than other approaches, such as calculating an optimal alignment. This emphasis on speed is vital to making the algorithm practical on the huge genome databases currently available, although subsequent algorithms can be even faster.
Input
Input sequences (in FASTA or Genbank format) and weight matrix.
Output
BLAST output can be delivered in a variety of formats. These formats include HTML, plain text, and XML formatting. For NCBI's web-page, the default format for output is HTML. When performing a BLAST on NCBI, the results are given in a graphical format showing the hits found, a table showing sequence identifiers for the hits with scoring related data, as well as alignments for the sequence of interest and the hits received with corresponding BLAST scores for these. The easiest to read and most informative of these is probably the table.
If one is attempting to search for a proprietary sequence or simply one that is unavailable in databases available to the general public through sources such as NCBI, there is a BLAST program available for download to any computer, at no cost. This can be found at BLAST+ executables. There are also commercial programs available for purchase. Databases can be found from the NCBI site, as well as from Index of BLAST databases (FTP).
Process
Using a heuristic method, BLAST finds similar sequences, by locating short matches between the two sequences. This process of finding similar sequences is called seeding. It is after this first match that BLAST begins to make local alignments. While attempting to find similarity in sequences, sets of common letters, known as words, are very important. For example, suppose that the sequence contains the following stretch of letters, GLKFA. If a BLAST was being conducted under normal conditions, the word size would be 3 letters. In this case, using the given stretch of letters, the searched words would be GLK, LKF, KFA. The heuristic algorithm of BLAST locates all common three-letter words between the sequence of interest and the hit sequence or sequences from the database. This result will then be used to build an alignment. After making words for the sequence of interest, the rest of the words are also assembled. These words must satisfy a requirement of having a score of at least the threshold T, when compared by using a scoring matrix.
Algorithm
To run the software, BLAST requires a query sequence to search for, and a sequence to search against (also called the target sequence) or a sequence database containing multiple such sequences. BLAST will find sub-sequences in the database which are similar to sub sequences in the query. In typical usage, the query sequence is much smaller than the database, e.g., the query may be one thousand nucleotides while the database is several billion nucleotides.
#NCBIBlast #BLAST #QueryCoverage

Komentáře • 147

@mariachaudhary4854 Před 4 lety ⁺²
Lagta tha ab yeh lecture dobra sun' ne ko nahi mile gaa but sir you did it😇😇
@Prof.Dr.MuhammadNaveed Před 3 lety
pleasures
@bioworld7565 Před 3 měsíci
you are a talent of Pakistan
@muhammadyaqoob8843 Před 8 měsíci
Great sir
Really you made much difficult task easy for students
Appreciate your efforts
@diverseujjwal2619 Před 3 lety ⁺³
Really informative and great lectures. Thanks a lot sir.
@Prof.Dr.MuhammadNaveed Před 3 lety
You are most welcome
@nimramushtaq4508 Před 4 lety
It's like a blessing in disguise Sir.I'm a student of BS(Hons.) Zoology Department, actually i was searching BLAST for my presentation ..i did not found it in a precise way..finally a day before my presentation ,i found your lecture. i hope it would help me tomorrow ..Thank you very much..you did a fantastic job..keep it up Sir...
@Prof.Dr.MuhammadNaveed Před 4 lety
czcams.com/video/hSCu1fx_4aA/video.html
@jigyasatiwari Před rokem ⁺¹
Great sir, This video is so helpful for me....can you make the video on " interpretation of the clustal omega results (alignments, result summary, phylogenetic tree [ real ]. I need of this video till Sunday 🥺...please can you make sir. I will be very thankful to you 🙏
@professortraveler Před 2 lety
Your lectures are awesome Sir, Your lectures are very easy to understand. Keep up the good work.
@Prof.Dr.MuhammadNaveed Před 2 lety
Thank you, I will
@asifakram7060 Před 5 měsíci
excellent
@atif8183 Před 3 lety
Thank you so much sir! Love from Gomal university D.i Khan.
aap nay both easy tarekay say samjha diya mashallah ❤️
@Prof.Dr.MuhammadNaveed Před 3 lety
pleasures dear
@jazzypakistan140 Před rokem
I used to watch your videos very often .. and maine aaj ghor kia k aap ko mai jaanti hu mili b hu.🤣🤣🤣
@MujahidAli-dz6ln Před 3 lety ⁺¹
Really Good Work! Please keep it up.
@Prof.Dr.MuhammadNaveed Před 3 lety
welcome and sure dear
@swatikanani323 Před 3 lety ⁺⁴
Whole genome sequencing results interpretation
@Prof.Dr.MuhammadNaveed Před 3 lety
sure very soon
@mamtapanda6567 Před 3 lety ⁺¹
Which one is more significant when comparing sequences for its function prediction....E value or identity % and why
How much E value can be considered significant?
@Prof.Dr.MuhammadNaveed Před 3 lety ⁺¹
both and E value should be 0 or less than 0
@A4arooj Před 9 měsíci
There is one question, that keep on pinching me; can't query coverage be defined as a factor which describes the homology or evolutionary relationship between two sequences. as you mentioned in the class that it always must be 100.... which according to my understanding actually tells weather or not they are homologous to each other.
@babarbch Před 4 lety ⁺¹
Informative lecture
Excelent
@Prof.Dr.MuhammadNaveed Před 3 lety ⁺¹
welcome dear
@kazmi_collection7065 Před 3 lety
@@Prof.Dr.MuhammadNaveed sir u r amazing
@zeekhan4040 Před rokem
I love your videos. Excellent job.
@uzmaramzan4654 Před 3 lety
Great sir. Your lectures are very helpful.
@Prof.Dr.MuhammadNaveed Před 3 lety
Glad to hear that
@aqsakashigar2282 Před 2 lety ⁺¹
Thanks sir ❤️
@bukhtaweranwer1352 Před 4 lety ⁺¹
Brilliant
@sanianisar6184 Před 3 měsíci
Assalamu Alikum Sir, I am a phd Student and now doing work on transcriptome analysis in garlic.how i will interpret my result using bioinformatices.kindly let me know step wise and dataset which will be good for this species.? Regards
@badirajpoot3740 Před 2 lety ⁺¹
assalam o alaikum dear sir,
sir ye Blast ma description ki thori c guidelines dobara dijiya ga kch smaj ni ah rhi.
sir for example hamara sequence 1584 bp ka ha lakin uski first description ma 2918 show ho rha ha max score or total score , agay query cov and persent identity ka boht difference ha ...
@Prof.Dr.MuhammadNaveed Před 2 lety
waslam just follow identities and coverage not max score
@iqrasaeed642 Před 3 lety ⁺¹
sir what if I get the negative E-value? should I have to select only 0.0 e-values for comparison?
@Prof.Dr.MuhammadNaveed Před 3 lety
yes but query coverage and percentage identity as well
@experimentaleducation-by1cy Před rokem
in case of new strains similarity would be less than 85%? and can we claim based on these result that' it's new starins ..( I am referring fungal starin here ) or we need to go further gene analysis ,please guide about it if you can ?thanks
@ridwanunuhuahmad5945 Před rokem
Hello Sir, I need more clarification on how to run blast and phylogenetic analysis. I have my sequence result from 4 18S rRNA isolate. I run blastn and all the 4 isolate have 95+ percent identity with previously published sequence/ species in the NCBI which is C.parvum. other published species showed lower than 95+ percent identity and when I run phylogenetic tree some of the isolate attached to Cryptosporidium tortoise instead of C. parvum that showed higher percentage identity. How do I interprete phylogenetic tree ? What do I do to identity species from my sequence result and do I deposit my sequence result to NCBI to obtain my sequence assertion number ?
Thank you
@Mr.Abd0001 Před rokem
Aslamulikum sir. Sir apka bioinformatics wala playlist sequence main nahi. Samjh nai ati kon si video sy start laen. Kindly guide.
@havvaunal3957 Před 11 měsíci
hi, when I run the global align, I get the different identities percentage for the same sequences each time. Why, can you explain this? thank you
@sultanajannat15 Před rokem
How to find out the function of an unknown gene using BLAST?
@jojithomas587 Před 3 lety ⁺¹
VERY INFORMATIVE VIDEO. Sir, i have the sequence result but when i use blast the similarity is different in term of organism name as well as the gene. is that possible? kindly advice. thank you.
@Prof.Dr.MuhammadNaveed Před 3 lety ⁺¹
yes but for only for some conserved genes like 16S rRNA
@namrahqaisar9945 Před 3 lety ⁺¹
how to know the difference using two different scoring schemes while using blast
@Prof.Dr.MuhammadNaveed Před 3 lety
It depend upon %age identity based and which give better result use it
@waqarhussain4406 Před rokem
Thanks you too much 💝
@faheemkhan-vj9qg Před 4 lety ⁺¹
good lecture. sir explain ho wo draw phylogenetic tree?
@Prof.Dr.MuhammadNaveed Před 4 lety
My next lecture 14 is on phylogenetic tree by MEGA
@adilqadir3407 Před rokem
Sir do you have any reference which we follow and we can write our blast like that...??
@yasirnawaz3910 Před 4 lety ⁺¹
Sir kuch topic send kru wo lecture prepare kroge
@imtiazahmad9238 Před 4 lety
use of query and subject ?
how to select range or how it varies?
@sahajlaboratory7342 Před 4 lety ⁺¹
NICE LECTURE ,i, m a research student , few quereies like i hv result of my query seq from ncbi blast which shows 86.88% identity ,so is it ok.
further i hv wgs results of a bacteria,in which i wanna go for pathogeny island prediction for which i was using PAIDB v2.0.cn u pls help me how to use tht particular software??
@Prof.Dr.MuhammadNaveed Před 3 lety
will make video on it
@sumankarmakar9352 Před 3 lety
How to determine that first organism will be selected for further analysis if there were first 5 organism have identity at 100%....and max score,total score all are same....sir please elaborate this....
@Prof.Dr.MuhammadNaveed Před 3 lety
then means your sequence is copied from these as not possible 5 sequences have 100% similarity
@hussain223 Před 4 lety ⁺⁵
Could you speak English, please because I am a foreign researcher
@Prof.Dr.MuhammadNaveed Před 4 lety ⁺³
Dear will share in Eglish version after few months as first to complete in native language ok
@imadkhan2029 Před 2 lety
Sir can you upload lecture on Illumina Sequencing and Amplicon sequencing .
@hirondevnath2513 Před 4 lety ⁺¹
Sir I see a option in BLAST where it shows it can compare two sequences and we get results. From there we can have the variant point but if i want to compare more than 10 sequences to find out the variation point then how it can be done.
You know, from blast we can detect the variantion point of two sequences but if i just count and then place it in my paper it’s possible a mistake may happen. As i want to work with 50 sequences, it’s diffcult to run blast everytime and count them. please let me know if any Software or easy solution is available.
Take love from Bangladesh. Thanks in advance.
@Prof.Dr.MuhammadNaveed Před 4 lety
then in simple do run clustalW
@hirondevnath2513 Před 4 lety
@@Prof.Dr.MuhammadNaveed Sir it always shows command timeout.
@shanzaaslam4826 Před 2 měsíci
Kindly gap penalty bta day r gaps k bary m bta day
@farooqhaider7365 Před rokem
Dear sir, I hope u will be fine.
Sir which test is further proceed for novel genus of organism. Plz explain
@rizwanhafeez9497 Před rokem
Sir my question is
gpaphics mein similarity kis kay sath show ho rahee hai...Becoz sequence tou aik hi species ka hai yahan pay. Thanks
@Prof.Dr.MuhammadNaveed Před rokem
with all relevant species but different strains
@zainabwaheed3606 Před 2 lety ⁺¹
I want to ask one question that how we know that the query sequence is novel?
@Prof.Dr.MuhammadNaveed Před 2 lety
if percentage identity and query coverage are less than 85% then do sequencing again for novelty
@zainabwaheed3606 Před rokem
Thank you sir
Please guide that how to do sequencing again
@zainabwaheed3606 Před rokem
If percentage identity is 84% and Query Cover is 94% then it is novel or not?
@mohdawas4285 Před 3 lety ⁺¹
Aslamualikum Sir
thanks for these fruitful lectures, but i m facing a problem in using the NCBI blast where in my system only description showed in detail, while all other option showed nothing like graphic and so on. If possible plz reply i m thankful to you sir forever.
Regard Mohd Awas (research Scholar)
University of Kashmir india
@Prof.Dr.MuhammadNaveed Před 3 lety
Try again
@uneebashahid5440 Před 4 lety ⁺¹
sir I have all the results are almost 100 % , the sequence i used was from GenBank. where I did mistake ? any idea? nucleotide was Saccharomyces cerevicae S288C (PPQ1).
And your lecture is very informative,Thanks!!
@Prof.Dr.MuhammadNaveed Před 3 lety ⁺¹
its impossible have 100% value means you are using same sequence in query
@uneebashahid5440 Před 3 lety
@@Prof.Dr.MuhammadNaveed ok sir thanks!
@user-lg4yo8me6j Před 3 lety ⁺¹
assalam o alikum- sir mujhy ye samjh nhi aya k" stain novel" hony sy kya murad h?
@Prof.Dr.MuhammadNaveed Před 3 lety
means every day new strain are discovered in word means new bacteria so NCBI is first method to confirm this novelty on the basis of similarity ok
@user-lg4yo8me6j Před 3 lety
Ok thank you so much
@muhammadshahab1421 Před 3 lety ⁺¹
Sir can we convert the sequence into 4 Letter idea of protein
@Prof.Dr.MuhammadNaveed Před 3 lety
no dear
@iqralone9785 Před 2 lety
sir after getting fasta sequencing of my sample i got 93% query coverage and 97% identity and the species was polaromonas glacialis...is it reliable
@Prof.Dr.MuhammadNaveed Před 2 lety
yes more than 90% reliable
@goodboy-gt3te Před rokem
Blast me tools konsy use ho rahy
@factology2642 Před 3 lety
Assalamo alikum sir...
how we made algorithm table by blasting....?
please help me out
@gopalkalwan7465 Před 2 lety
Dear sir i am doing genome wide analysis of gene family and I got 58 hits. What will be the criteria of filteration of these hits for validation purpose or i have to select all
@Prof.Dr.MuhammadNaveed Před 2 lety ⁺¹
pick hits which have more than 97% query coverage and percentage identity
@rabiahabib883 Před rokem
How we trimm sequence
@sanasheikh5033 Před 3 lety ⁺¹
How to check bit score ?
@Prof.Dr.MuhammadNaveed Před 3 lety
The bit score, S', is derived from the raw alignment score, S, taking the statistical properties of the scoring system into account. Because bit scores are normalized with respect to the scoring system, they can be used to compare alignment scores from different searches.
@muhammadayazfaridshah3694 Před 3 lety ⁺¹
sir, kindly ye btain k cancer genes r healthy person k genes kasy download kar skty ham ??? NCBI sy cancer k to mil jaty but healthy person k ni mil rahy mujy.
@Prof.Dr.MuhammadNaveed Před 3 lety
all from NCBI GenBank just look gene name from literature then search on NCBI
@muhammadayazfaridshah3694 Před 3 lety
@@Prof.Dr.MuhammadNaveed kindly guide me. I have no idea about that. JazakAllah
@aaaa5nfgfghf Před 2 lety
sir, plz make a video on how use MODELLER software.
@Prof.Dr.MuhammadNaveed Před 2 lety
noted sure
@ashirali195 Před 3 lety
sir what is the meaning of rid parameters must be applied when i download the all sequnces from blast?
@Prof.Dr.MuhammadNaveed Před 3 lety ⁺¹
elaborate your query
@ambreeniqbal6396 Před 3 lety ⁺¹
Sir kindly Chau fasman Py aik lecture plz
@Prof.Dr.MuhammadNaveed Před 3 lety
sure and noted
@payalprasad5614 Před 4 lety ⁺¹
sir what is the the meaning of e value?could you please elaborate
@Prof.Dr.MuhammadNaveed Před 3 lety
excepted value error mean of query coverage and % identity
@shahnazsalamat1271 Před 4 lety ⁺¹
If i can not find any similarity about my protein than what does that mean?
@Prof.Dr.MuhammadNaveed Před 4 lety
Your sequences are not good or have addition of other sequence or sequencing error
@shahnazsalamat1271 Před 4 lety ⁺¹
@@Prof.Dr.MuhammadNaveed if they have addition of other sequence than what should do ?
@Prof.Dr.MuhammadNaveed Před 4 lety
Removed it by Bioedit or check via clustalw
@gloryparmar1904 Před 4 lety ⁺¹
sir, how we can submit SNP to dbSNP or clinvar? can you plz help?
@Prof.Dr.MuhammadNaveed Před 4 lety
The method is same as you submit your sequence on NCBI GenBank. You need to sign in first and then there is a submission option. You will submit your sequence there and and a specific accession number will be allotted to you.
@nabighafaheem8502 Před 2 lety
Assalamualaikum Respected Sir
Please guide us about internship program for bs bioinformatics students
@oldisgold-nc1zh Před 3 lety ⁺¹
Sir uniquely identified species kese select kare?
@Prof.Dr.MuhammadNaveed Před 3 lety
use type strain
@shabinakhan6930 Před 2 lety
Hello sir how to find similarities in genome of two viruses
@Prof.Dr.MuhammadNaveed Před 2 lety
BY BLAST for sequence similarities
@poonamkumari-nn8yg Před 2 lety
Good evening Sir...I have made mistake in gene bank submission.I have to write in the data isolate instead of voucher specimen.Can I edit my gene bank records ...please help Sir..
@Prof.Dr.MuhammadNaveed Před 2 lety
yes you can delete and add new detail
@poonamkumari-nn8yg Před 2 lety
@@Prof.Dr.MuhammadNaveed but...how I can edit...please help...sir
@user-bz9xn3rm1f Před měsícem
Sir , Articles,
Research paper
Aur publications ma kia frq hy
@Prof.Dr.MuhammadNaveed Před 28 dny
all same but in publication there are both research and review articles
@poonamkumari-nn8yg Před 2 lety
Good evening Sir....I have done sequencing of Mitochondrial COX1 gene of insect specimens. How can I submit these sequences to NCBI.....
@Prof.Dr.MuhammadNaveed Před 2 lety ⁺¹
watch my lecture on sequence submission on NCBI
@sarimaziz3374 Před 2 lety
KINDLY make video on CARD. DATABASE
@Prof.Dr.MuhammadNaveed Před 2 lety ⁺¹
noted
@sarimaziz3374 Před 2 lety
@@Prof.Dr.MuhammadNaveed sir try to make full sessions on it. I am currently working on it as student but i think the way you explain stuff would be better . Jazakallaha
@healthlinepakistan3157 Před 2 lety
sir how to check SNP rs numbers of stroke.plz guide
@Prof.Dr.MuhammadNaveed Před 2 lety
look my lecture on mutation identification
@nisharaj8967 Před 2 lety
Hello sir, Hope you are doing good, I would like to learn Primer design and sequence analysis.
@Prof.Dr.MuhammadNaveed Před 2 lety
sure and firs watch my lectures on it
@VivoVivo-ne3qt Před 3 lety
AoA! Sir how to find homozygous and heterozygous through sequence
@Prof.Dr.MuhammadNaveed Před 3 lety
only possible by phylogentic tree
@hamzarehman9913 Před rokem
Informative lecture. Can I get your contact Dr Sb. Please reply me
@AartiYadav-ns8db Před 3 lety
Sir unknown sequence ko known sequence se kaise match karna h
@Prof.Dr.MuhammadNaveed Před 3 lety ⁺¹
Blast do this you just need to paste the sequence
@AartiYadav-ns8db Před 3 lety
@@Prof.Dr.MuhammadNaveed ok
@nehanadeem8988 Před 2 lety
Sir what E value depicts in blast output?
@muhammadzarghamali4804 Před rokem
It is significance value similar to those in statistics (P value).
@iqraafzal4736 Před 2 lety
Awsome👍
@dr.muhammadimran5501 Před 3 lety
Slow down your speed
@Prof.Dr.MuhammadNaveed Před 3 lety
sure
@sabgaminggamer2256 Před 4 lety
can i do ms computer science after bs in bioinformatics.
@Prof.Dr.MuhammadNaveed Před 4 lety
Yes dear possible
@sabgaminggamer2256 Před 4 lety
@@Prof.Dr.MuhammadNaveed thanks sirrr.... in which university do you are studing!!
@Prof.Dr.MuhammadNaveed Před 4 lety ⁺¹
@@sabgaminggamer2256 Teaching in University of Central Punjab, Lahore
@sabgaminggamer2256 Před 4 lety
@@Prof.Dr.MuhammadNaveed great.... well I'm a student of bs in bioinformatics first semester...but i love programming.... like python... artificial intelligence . machine learning... etc..
so... i wanna confirm that from you... ms in computer science...
@sabgaminggamer2256 Před 4 lety
plz make. a tour to our university... which is gcu faisalabad. thanks
@albertescamilla Před 3 lety
I have no idea what you're saying.
@Prof.Dr.MuhammadNaveed Před 3 lety
very soon you can understand in english version
@divyanikhode8693 Před 3 lety ⁺¹
so fast talking nothing understand
@Prof.Dr.MuhammadNaveed Před 3 lety
Noted dear and will slow down

Další v pořadí

Automatické přehrávání

NCBI BLAST, Mutation Identification & Analysis (SNPs), Lecture 2 Part 3 by Dr. Muhammad Naveed