What are Elasticsearch shards? Why do they matter? Elasticsearch cluster architecture explained.

Sdílet
Vložit
  • čas přidán 21. 07. 2021
  • Elasticsearch is a fantastic tool but it's easy to muddle through without knowing the fundamentals. It's only a matter of time before your cluster performance drops, errors start happening, and you wonder just what a shard actually is.
    I talk and wave my hands about, explaining how we could have - maybe - built - some of - Elasticsearch ourselves. At the end of the video you'll know what a shard is, why they're so important, and want to learn more to really improve the performance of your cluster.
    Watch on (and watch the rest of the free course linked below) to figure out how to answer that mysterious question: How many primary shards do I need for my index?
    My course - Fundamentals of Elasticsearch architecture and shards - is available for free here: school.georgebridgeman.com/p/...
    You can read more about Elasticsearch and The Elastic Stack in general on my blog: georgebridgeman.com/posts/

Komentáře • 64

  • @nch77884
    @nch77884 Před 2 lety +42

    Hands down the best explanation and introduction to Elasticsearch. Can't thank enough for making this video.

    • @GeorgeBridgemanData
      @GeorgeBridgemanData  Před 2 lety

      Thank you for your lovely comment. I'm so pleased you enjoyed the video!

    • @zaeemahmedabbasi
      @zaeemahmedabbasi Před 10 měsíci

      ​@@GeorgeBridgemanData⁷77777777777777777777777777777777777777777777777777777777777777777777😅😮

  • @HaisumUsman
    @HaisumUsman Před rokem +3

    Man! You are not from this planet! You deserve a thousand thumbs up.

  • @netlob
    @netlob Před 2 lety +13

    Holy sh*t! This must've been the best tutorial I've ever seen on CZcams. High production, clear presentation and well thought through. +1 subscriber for sure!

  • @sridharnuthi1
    @sridharnuthi1 Před rokem +7

    This is a video that should go to a reference library about Elasticsearch. Thank you for putting such a good, clear and methodical overview of ES. Just brilliant!

  • @lawlade
    @lawlade Před rokem +3

    Watched this 5 times, rewound several times and i understand it FULLY. Thanks so much for such clear explaining

    • @GeorgeBridgemanData
      @GeorgeBridgemanData  Před rokem

      You're welcome! I'm so pleased you understood everything. I hope it helps!

  • @harrisonleong4283
    @harrisonleong4283 Před 2 lety +3

    I really wish this was the 1st elasticsearch video that I had watched, so as to save me so much time watching other video which could not teach me the same level of information that I need. Thank you very much, and I shall check out your courses.

    • @GeorgeBridgemanData
      @GeorgeBridgemanData  Před 2 lety

      That's the reaction I was hoping for - a useful first video on Elasticsearch. Thank you so much for posting!

  • @yazzy9975
    @yazzy9975 Před 11 měsíci

    This video changed my life. No exaggeration.

  • @TheOtmane007
    @TheOtmane007 Před rokem +2

    What a clear , and progressively explained architecture. Thank you so much

  • @Transactional
    @Transactional Před rokem +1

    Thank you. It feels like my brain is getting clearer.

  • @blossomwithcurls
    @blossomwithcurls Před 6 měsíci

    I just started learning Elastic search and this is the best an clear information on Elastic search architecture. Thanks for sharing!

  • @crujzojam7004
    @crujzojam7004 Před 2 lety +3

    Please post more videos…ur videos are easy to understand and quite informative….please carry on the good work

    • @GeorgeBridgemanData
      @GeorgeBridgemanData  Před 2 lety +1

      Done! New video just posted!
      There will be more. I have lots of ideas but I'm really trying to get this my Elasticsearch course finished, and work the day job.

  • @jhoyl
    @jhoyl Před 2 lety +1

    Thanks - the perfect introduction to Elasticsearch architecture.

  • @thomasanderson8478
    @thomasanderson8478 Před 2 lety +3

    This is the best explanation of elasticsearch I've ever seen. So many videos skip over the details, and it's been making it difficult to understand what elasticsearch is doing under the hood. I normally don't comment on videos, but this is too high quality not to. Please continue to put out content!

  • @user-tm6cd8sg5l
    @user-tm6cd8sg5l Před 3 měsíci

    Many thanks for your work! It's awesome video!

  • @tkousek1
    @tkousek1 Před 2 lety +1

    Thank you very much sir for this information. Awesome people like you are what's good about this world!!! Much appreciated!!!

  • @jupudivinod
    @jupudivinod Před 2 lety +1

    This is fantastic! Bricks till walls in a nutshell! Thanks much for this great presentation.

  • @sobhan285
    @sobhan285 Před 2 lety +1

    Wonderful. Looking forward to more courses from you.

    • @GeorgeBridgemanData
      @GeorgeBridgemanData  Před 2 lety

      Thanks so much! Elasticsearch Engineer Essentials is in the works, and I'll be posting shorter content on here as well.

  • @user-nt7of9us8p
    @user-nt7of9us8p Před 5 měsíci

    Very clear! Thanks!

  • @hnyc1986
    @hnyc1986 Před 2 lety +1

    Awesome explanation about Elasticsearch!!!

  • @thsu1
    @thsu1 Před 2 lety +1

    thanks for the clear and awesome explanation to Elasticsearch and Lucene. really appreciate this useful content

  • @riazbacchus3962
    @riazbacchus3962 Před 10 měsíci

    this is great content. thank you.

  • @hieungo770
    @hieungo770 Před 6 měsíci

    Its two year late but the lesson is extremely value

  • @pseudolimao
    @pseudolimao Před 2 měsíci

    where was thsi video 1 month ago. you should be paid by these software companies... bless your heart

  • @moritzlgrs401
    @moritzlgrs401 Před 2 lety +1

    Absolutely fantastic!

  • @johnsonakanbi367
    @johnsonakanbi367 Před 3 lety

    Thanks so much for this great presentation.

  • @arpit9163
    @arpit9163 Před rokem +1

    Thank You for making this fantastic video !

  • @danielsantiago11
    @danielsantiago11 Před rokem +1

    Premium content, thank you!

  • @cicd
    @cicd Před 2 lety +1

    Great content, thanks for sharing!

  • @rakeshkush1234
    @rakeshkush1234 Před 2 lety +1

    wonderful technical story.

  • @Milostrosic
    @Milostrosic Před rokem +1

    Very clear explanation!

  • @ucthuannguyen6432
    @ucthuannguyen6432 Před 2 lety +1

    Wonderful. Thank you so much.

  • @slapcanister
    @slapcanister Před 2 lety +1

    This is so good.

  • @PhanTanThangTH
    @PhanTanThangTH Před 4 měsíci

    Thank you so much :)

  • @samrybkin9184
    @samrybkin9184 Před 27 dny

    Hi thanks for a video.
    For example we have: "unassigned_shards" : 40,
    When we run:
    GET _cluster/allocation/explain?filter_path=index,node_allocation_decisions.node_name,node_allocation_decisions.deciders.*
    {
    "index": "elastalert_past",
    "shard": 0,
    "primary": false
    }
    We reiceve next answer:
    "explanation" : "a copy of this shard is already allocated to this node [[elastalert_past][0], node[JaLzrdasdajQ], [P], s[STARTED], a[id=OmY9kwpHTlybJfSrWvdsadada6g]]"
    We have only one node and what we can do in this situation ?
    Also we have "number_of_replicas" : "0", "auto_expand_replicas" : "false", what we can do in this situation ?
    GET /.kibana/_settings
    {
    ".kibana_2" : {
    "settings" : {
    "index" : {
    "number_of_shards" : "1",
    "auto_expand_replicas" : "false",
    "provided_name" : ".kibana_2",
    "creation_date" : "1601664093",
    "number_of_replicas" : "0",
    "uuid" : "WKdIpzLFSP-ydObLw",
    "version" : {
    "created" : "7090299"
    }
    }
    }
    }
    }

  • @andy_ltluan
    @andy_ltluan Před 11 měsíci +1

    I think that shard in ES has the same concept with partition in Kafka when they have all partition replicas in different nodes

  • @DrewIsFail
    @DrewIsFail Před 2 lety +1

    Is it fair to say you could build ES from dynamoDB? I'm trying to compare the two.
    I would love a video on the query language, does it have a mathematical basis like sql does to sets?
    It goes without saying, but I'm say it, thanks for making this clear, concise, focused high level content.

    • @GeorgeBridgemanData
      @GeorgeBridgemanData  Před 2 lety

      Hi there. I'm really pleased you enjoyed the video.
      I'm not sure you could build an equivalent of Elasticsearch using DynamoDB. There's a *lot* more to Elasticsearch than I talked about in this video!
      There's more content coming, including an introduction to the query language. The in-depth content will be in a training course instead of CZcams, though. I've never considered if there's a mathematical basis to the query language. I doubt there is in terms of what Elasticsearch offers, but all Elasticsearch queries are converted to Lucene query language, which may be more thoroughly researched. Interesting question!

  • @akshaychawla7413
    @akshaychawla7413 Před rokem

    I am not able to enroll for your course, tried with 2 different emails, please have a look into this.

    • @GeorgeBridgemanData
      @GeorgeBridgemanData  Před rokem

      Sorry for the very late response. I've had feedback from a couple of people using Firefox, who worked around it by using a different browser. I'm not sure if that's your issue but thought I'd mention it. Let me know if you're still having issues and I'll try responding quicker this time!

  • @carlosroberto366
    @carlosroberto366 Před 2 lety

    Isn't the cluster the server (i.e. AWS EC2 instance) itself? To my mind, a node is not a server because you can create several nodes in the same machine. I was expecting to see MyCluster1 and MyCluster2 each having a single node, hence, high availability via cross-cluster communication.
    11:11 node = server in his example
    15:05 node = process

    • @GeorgeBridgemanData
      @GeorgeBridgemanData  Před 2 lety +1

      A node is an Elasticsearch process running on a host. You're right that you can run multiple nodes on the same host (even not containerised), but it's not recommended and it's widely accepted that you only run a single node on a host.
      If you did run two nodes on a single host, you could have either one or two clusters on that host. The node is configured with the cluster name it's expected to join, so you could configure each node with a different cluster name and have two clusters on that host!
      Cluster formation can get quite involved. There are configuration settings that need to be applied specifically at the formation stage. I can do a video on how that works at some stage.

  • @yazzy9975
    @yazzy9975 Před 11 měsíci

    If elasticsearch distributes the data between the shards of an index such that each lucene store roughly holds the same number of documents, when you run a search query, elasticsearch, despite the inter-node communication, only knows which shards hold that index and not which particular shard will have that document? So it has to run the query against all the shards and merge results, it cannot just search the one shard that contains that document? It does not know beforehand based on how documents are distributed among shards.

  • @sv_n
    @sv_n Před 5 měsíci

    1000th like 😅