Silhouette Score for clustering Explained | Silhouette (clustering)- Validating Clustering Models

Interview Questions On Density Based Clustering | DBSCAN clustering Interview question

9 Interview questions on K Means clustering | Data Science Interview Questions and answers

ZTRATIL JSEM SE NA OSTROVĚ…

【鬥羅大陸】小舞真的錯怪唐舞桐了! #斗羅大陸 #唐三 #小舞 #唐舞桐 #唐舞麟

Lady Plays Hide and Seek with Her Dog

K Means Clustering Interview Questions | Data Science Interview Questions On K means algorithm

Unfold Data Science

zhlédnutí 13 274

Přidat do
- Můj playlist
- Přehrát později
Sdílet

Sdílet

Vložit

Velikost videa:

Zobrazit ovladače přehrávání

Automatické přehrávání

Přehrát

čas přidán 15. 05. 2021
K Means Clustering Interview Questions | Data Science Interview Questions On K means algorithm
#KMeansInterviewQuestions #UnfoldDataScience
Hello ,
My name is Aman and I am a Data Scientist.
About this video:
In this video, I explain different topics for interview question in K-means clustering. I explain what are the areas interviewers might touch in K-means clustering and what are some of the most important interview question in K-means clustering. Below topics are explained in this video:
1. K Means Clustering Interview Questions
2. Data Science Interview Questions On K means algorithm
3. Convergence in K-means clustering
4. Deciding number of clusters in K-means
5. Application of K-means clustering
About Unfold Data science: This channel is to help people understand basics of data science through simple examples in easy way. Anybody without having prior knowledge of computer programming or statistics or machine learning and artificial intelligence can get an understanding of data science at high level through this channel. The videos uploaded will not be very technical in nature and hence it can be easily grasped by viewers from different background as well.
If you need Data Science training from scratch . Please fill this form (Please Note: Training is chargeable)
docs.google.com/forms/d/1Acua...
Book recommendation for Data Science:
Category 1 - Must Read For Every Data Scientist:
The Elements of Statistical Learning by Trevor Hastie - amzn.to/37wMo9H
Python Data Science Handbook - amzn.to/31UCScm
Business Statistics By Ken Black - amzn.to/2LObAA5
Hands-On Machine Learning with Scikit Learn, Keras, and TensorFlow by Aurelien Geron - amzn.to/3gV8sO9
Ctaegory 2 - Overall Data Science:
The Art of Data Science By Roger D. Peng - amzn.to/2KD75aD
Predictive Analytics By By Eric Siegel - amzn.to/3nsQftV
Data Science for Business By Foster Provost - amzn.to/3ajN8QZ
Category 3 - Statistics and Mathematics:
Naked Statistics By Charles Wheelan - amzn.to/3gXLdmp
Practical Statistics for Data Scientist By Peter Bruce - amzn.to/37wL9Y5
Category 4 - Machine Learning:
Introduction to machine learning by Andreas C Muller - amzn.to/3oZ3X7T
The Hundred Page Machine Learning Book by Andriy Burkov - amzn.to/3pdqCxJ
Category 5 - Programming:
The Pragmatic Programmer by David Thomas - amzn.to/2WqWXVj
Clean Code by Robert C. Martin - amzn.to/3oYOdlt
My Studio Setup:
My Camera : amzn.to/3mwXI9I
My Mic : amzn.to/34phfD0
My Tripod : amzn.to/3r4HeJA
My Ring Light : amzn.to/3gZz00F
Join Facebook group :
groups/41022...
Follow on medium : / amanrai77
Follow on quora: www.quora.com/profile/Aman-Ku...
Follow on twitter : @unfoldds
Get connected on LinkedIn : / aman-kumar-b4881440
Follow on Instagram : unfolddatascience
Watch Introduction to Data Science full playlist here : • Data Science In 15 Min...
Watch python for data science playlist here:
• Python Basics For Data...
Watch statistics and mathematics playlist here :
• Measures of Central Te...
Watch End to End Implementation of a simple machine learning model in Python here:
• How Does Machine Learn...
Learn Ensemble Model, Bagging and Boosting here:
• Introduction to Ensemb...
Build Career in Data Science Playlist:
• Channel updates - Unfo...
Artificial Neural Network and Deep Learning Playlist:
• Intuition behind neura...
Natural langugae Processing playlist:
• Natural Language Proce...
Understanding and building recommendation system:
• Recommendation System ...
Access all my codes here:
drive.google.com/drive/folder...
Have a different question for me? Ask me here : docs.google.com/forms/d/1ccgl...
My Music: www.bensound.com/royalty-free...

Komentáře • 77

@jaysoni7812 Před 3 lety ⁺²⁶
k refers to number of studies, in research world, that's why it use k in k means it means here we are studying on cluster, and
n refers to number of outcomes in research world, so that's why in sklearn the parameter name is n_clusters because using that algorithm we wants outcome after running that algorithm it will gives us the n number of outcomes.
I hope my research is correct 😊
@UnfoldDataScience Před 3 lety ⁺⁸
Three 👏👏👏 for you.
@muhammedthayyib9202 Před rokem ⁺¹
Oh great. I commented another answer, from common sense. 😀
@makemoneywithamar Před 2 lety ⁺³
Till yesterday, I generally followed only Krish Naik for any enquiry related to Data Science & today, suddenly found you and boom !!!! I am apologised to subscribe your channel. Awesome step-by-step clearer, you are man....Hats off
@UnfoldDataScience Před 2 lety
Welcome to Unfold Data Science Amarjit 🎉🎉🎉
@vinodbiradar5266 Před 3 lety ⁺⁷
Would like to add one more point in KMEANS++, It internally analyzes the pattern of the data. Such as the spread of data (whether it is spherical, rectangle, oval etc.) and then initialize the centroids as explained.
@UnfoldDataScience Před 3 lety ⁺³
Yes Vinod. Thanks for adding it.
@SESHUNITR Před rokem ⁺¹
Very Good information from interview. keep doing thanks.
@ramyaanand3668 Před 3 lety ⁺⁵
Exactly i was looking for same thing n i found it by u aman great video its has so much information....thnku so much aman keep exploring more
@UnfoldDataScience Před 3 lety
Welcome Ramya.
@AnkitSingh-rd3he Před 3 lety ⁺³
Due to its ubiquity, it is often called "the k-means algorithm"; it is also referred to as Lloyd's algorithm, particularly in the computer science community. It is sometimes also referred to as "naïve k-means", because there exist much faster alternatives
@Monuchaitu44 Před 2 lety ⁺⁴
Your videos were like cheat sheets for revising and remembering concepts very easily. Good and Great Job.
@UnfoldDataScience Před 2 lety ⁺¹
Thanks Again. Please share in your data science groups if possible. That will be helpful for channel.
@Monuchaitu44 Před 2 lety
@@UnfoldDataScience Sure, I will make it to happen.
@vishalbhapkar2359 Před 3 lety ⁺¹
I have been following this channel since very beginning, now I can say this works pretty much for me, thanks @unfold data science and Mr. Aman Sir
@UnfoldDataScience Před 3 lety ⁺¹
Thanks Vishal. :)
@nishanthvirat9044 Před 4 měsíci
thank you so much sir
@sandipansarkar9211 Před 2 lety
finished watching
@chandramouli5881 Před 2 lety ⁺¹
This video helped me to understand K means. Thanks for the sharing
@UnfoldDataScience Před 2 lety
Welcome Chandra.
@sudheeshe1384 Před 3 lety ⁺¹
Thanks for the valuable contents
@UnfoldDataScience Před 3 lety
Welcome Sudheesh :)
@ajaykushwaha-je6mw Před 3 lety ⁺¹
very very informative video.
@UnfoldDataScience Před 3 lety
Thanks Ajay.
@kaanchii123 Před 3 lety ⁺¹
Thank you, you are a great teacher!
@UnfoldDataScience Před 3 lety ⁺¹
You're very welcome!
@theethumnandrumpirartharav4137 Před rokem
Awesome👍
@UnfoldDataScience Před rokem
Thank you! Cheers!
@harithavalmiki9390 Před 2 lety
Thank you so much for this explanation Aman!
@UnfoldDataScience Před 2 lety
My pleasure
@himanshugautam1421 Před 3 lety ⁺¹
Loved it.
@UnfoldDataScience Před 3 lety
Thanks Himanshu :)
@yash422vd Před 3 lety
N number of appreciation for your style of explanation is less, another great video. Your simplicity is your best asset.
@UnfoldDataScience Před 3 lety
So nice of you Vishal. :)
@callmace Před 3 lety ⁺¹
Gr8
@UnfoldDataScience Před 3 lety
Thanks Tausif :)
@muhammedthayyib9202 Před rokem
K stands for a number. That number in a whole number. It cannot have 1.5 number of cluster. In cross validation we use K-flod. Then why not n. n is like a random selection but K is like a choose the best number. Thank you aman
@souravbiswas6892 Před rokem
Excellent video. I wish I would have seen this video before my final round of interview in Walmart. I became heartbroken when I was not selected :(
@samruddhideshmukh5928 Před 3 lety
Great video!!!
@UnfoldDataScience Před 2 lety
Thanks Samruddhi.
@srprev Před 3 lety ⁺¹
Due to its ubiquity, it is often called "the k-means algorithm" :)
@kushalhu7189 Před 2 lety ⁺¹
Brilliant Sir.....
@UnfoldDataScience Před 2 lety
Thanks Kushal.
@vallimuthaiyah5098 Před 3 lety ⁺¹
Thank you sir for such a valuable content and information on silhouette score.. please upload more interviews questions with hidden information.. K in k means clustering refers to number of clusters but not sure why it is called as using letter K
@UnfoldDataScience Před 3 lety
Thanks a lot for watching.
@MohitGupta-sz4bh Před 2 lety
Very informative and helpful video Aman. keep up the good work. We would like to have this kind of interview questions and answers video on every Machine Learning Algorithm to crack the interview. Please do create video on other algorithms. Again superb a wonderful job :)
@UnfoldDataScience Před 2 lety
Thanks Mohit. Sure.
@pramodyadav4422 Před 3 lety ⁺¹
Eagerly waiting to know why it's called K-Means
@qazibasheer443 Před rokem ⁺²
The k-means clustering algorithm is called "k-means" because it specifically partitions the data into "k" clusters based on the mean of the data points.
Other clustering algorithms may use different criteria for clustering, such as "n-means" which partitions the data into "n" clusters, or "s-means" which partitions the data based on the sum of squared distances. However, the k-means algorithm uses the mean of the data points to calculate the centroids, and it partitions the data into "k" clusters. Therefore, it is called k-means.
@terryterry3733 Před 2 lety
Super bro nice explanation and one thing i want to understand HOW KMEAN GETS OVERFIT? Pls give me the couple of details i didnt get the ans in internet .
@UnfoldDataScience Před 2 lety ⁺¹
Overfitting is typically a problem in supervised learning, not k-means generally.
@ArunSingh-bj6ux Před 3 lety ⁺¹
Hi ,
Could you cover the logic behind croston method forecasting
@UnfoldDataScience Před 3 lety
Thanks Arun for feedback. Will add.
@yt-1161 Před rokem
In order to get people to confuse it with K nearest neighbors
@amarmemane2583 Před 3 lety ⁺¹
Hello sir, please make this kind of interview qun video on each machine learning algorithm if u want we are ready to fee for that also😊
@UnfoldDataScience Před 3 lety
Thanks Amar for suggestion. Noted.
@shashankhegde1258 Před 2 lety
The elbow curve comes in the shape of K ?
@rishigupta2342 Před 2 lety
Could you discuss interview question based on Decision tree & Random forest?
@UnfoldDataScience Před 2 lety
Sure,
@praveenkuthuru7439 Před 3 lety
In my opinion, the k-NN algorithm which was coined in 1951 tries to find out the nearest neighbor w.r.t. the distance function similar to k-Means which was coined post 1951, due to this reasons the 'k' is maintained as is since then and not any other letter. Is it right????
@UnfoldDataScience Před 3 lety ⁺²
This one i did not hear yet. What I know is, in statistics K is typically used for number of groups to analyze, hence.
@sampathvinaykumarreddymajj790 Před 3 lety ⁺¹
Need these kind of videos
But why it is called K-Means ??
@UnfoldDataScience Před 3 lety
Thanks Sampath. Pls do try to find out 😁😄
@abithaanand7170 Před rokem
Sir in 3:01 sec, I don't understand thw concept of how the convergence speed would be slow if two clusters are located near . Similarly, how would the convergence speed be faster if two clusters are not located together?
@abhinavkhandelwal1045 Před 3 lety ⁺¹
I have a question.. if I have trained my data on 2 models for instance Random forest and logistic regression and it is giving me the same accuracy then what should be the basis to decide which one the two algorithms should I select for my data
@UnfoldDataScience Před 3 lety
Depends on business need.
@abhinavkhandelwal1045 Před 3 lety ⁺¹
If a business sets free then what must be a parameter to strike out one of the Random forest and logistic regression if giving same accuracy?
@jaysoni7812 Před 3 lety ⁺¹
@@abhinavkhandelwal1045 choose whichever model is fast or give quick prediction. if your both model gives same accuracy then choose that model which is faster, it will help you to quick prediction
@souravbiswas6892 Před rokem
Because any parameter which can be tuned/tweaked, is represented by 'k' and not by a,b,c,d..
@nishah4058 Před rokem
Can u pls elaborate your answer?
@vignesan4197 Před 2 lety
Hello sir please any junior level data scientist job available please inform.
@bangarrajumuppidu8354 Před 2 lety
who will take care of random picking points for initialization of centroid
@UnfoldDataScience Před 2 lety ⁺¹
Python itself through "k-means" module
@bangarrajumuppidu8354 Před 2 lety
@@UnfoldDataScience thank u sir
@letslearndatasciencetogeth479 Před 3 lety ⁺¹
Sir pls make a video on the mathematics behind silhouette score in detail
@UnfoldDataScience Před 3 lety ⁺¹
I was thinking someone will ask, I will do it :)
@letslearndatasciencetogeth479 Před 3 lety
@@UnfoldDataScience thanks sir for the amazing explanation

Další v pořadí

Automatické přehrávání

Silhouette Score for clustering Explained | Silhouette (clustering)- Validating Clustering Models

Silhouette Score for clustering Explained | Silhouette (clustering)- Validating Clustering Models

Interview Questions On Density Based Clustering | DBSCAN clustering Interview question

Interview Questions On Density Based Clustering | DBSCAN clustering Interview question

9 Interview questions on K Means clustering | Data Science Interview Questions and answers

9 Interview questions on K Means clustering | Data Science Interview Questions and answers

ZTRATIL JSEM SE NA OSTROVĚ…

ZTRATIL JSEM SE NA OSTROVĚ…

【鬥羅大陸】小舞真的錯怪唐舞桐了! #斗羅大陸 #唐三 #小舞 #唐舞桐 #唐舞麟

【鬥羅大陸】小舞真的錯怪唐舞桐了! #斗羅大陸 #唐三 #小舞 #唐舞桐 #唐舞麟

Lady Plays Hide and Seek with Her Dog

Lady Plays Hide and Seek with Her Dog

Water park😍🔥 Do you want result?📱😈

Water park😍🔥 Do you want result?📱😈

StatQuest: K-means clustering

StatQuest: K-means clustering

K Means Clustering in 15 Minutes | K means clustering explained | K means clustering in python

K Means Clustering in 15 Minutes | K means clustering explained | K means clustering in python

Live Day 6- Discussing KMeans,Hierarchical And DBScan Clustering Algorithms

Live Day 6- Discussing KMeans,Hierarchical And DBScan Clustering Algorithms

Git vs Gitlab vs Github | Git vs Github vs Gitlab for beginners | Git vs Gitlab vs Github

Git vs Gitlab vs Github | Git vs Github vs Gitlab for beginners | Git vs Gitlab vs Github

Interview Questions On Feature Scaling | normalization vs standardization machine learning

Interview Questions On Feature Scaling | normalization vs standardization machine learning

K-Modes intuition and example

K-Modes intuition and example

Learn Exploratory Data Analysis (EDA) from Scratch | EDA in 5 hours | Satyajit Pattnaik

Learn Exploratory Data Analysis (EDA) from Scratch | EDA in 5 hours | Satyajit Pattnaik

#JasonDeruloTV // Wow 🤩 #GotPermissionToPost From @fasheroisbrasil #FromTheIslands

#JasonDeruloTV // Wow 🤩 #GotPermissionToPost From @fasheroisbrasil #FromTheIslands

When your dad misses your husband more than you do🤣♥️ #husband #dad

When your dad misses your husband more than you do🤣♥️ #husband #dad

Growing An Ear In Your Arm 😨

Growing An Ear In Your Arm 😨

Uděláme koncert, kde uvidite AINKU naživo??

Uděláme koncert, kde uvidite AINKU naživo??

This pasta HACK is almost approved

This pasta HACK is almost approved

World’s smallest 4K headset 😎 #tech #vr #technology #virtualreality #insideout2

World’s smallest 4K headset 😎 #tech #vr #technology #virtualreality #insideout2

Un cane intelligente aumenta i suoi soldi in modo intelligente #shorts

Un cane intelligente aumenta i suoi soldi in modo intelligente #shorts