NSDI '21 - Running BGP in Data Centers at Scale

  • Added on 4 July 2024
  • NSDI '21 - Running BGP in Data Centers at Scale
    Anubhavnidhi Abhashkumar and Kausik Subramanian, University of Wisconsin-Madison; Alexey Andreyev, Hyojeong Kim, Nanda Kishore Salem, Jingyi Yang, and Petr Lapukhov, Facebook; Aditya Akella, University of Wisconsin-Madison; Hongyi Zeng, Facebook
    Border Gateway Protocol (BGP) forms the foundation for routing in the Internet. More recently, BGP has made serious inroads into data centers on account of its scalability, extensive policy control, and proven track record of running the Internet for a few decades. Data center operators are known to use BGP for routing, often in different ways. Yet, because data center requirements are very different from the Internet, it is not straightforward to use BGP to achieve effective data center routing.
    In this paper, we present Facebook's BGP-based data center routing design and how it marries data center's stringent requirements with BGP's functionality. We present the design's significant artifacts, including the BGP Autonomous System Number (ASN) allocation, route summarization, and our sophisticated BGP policy set. We demonstrate how this design provides us with flexible control over routing and keeps the network reliable. We also describe our in-house BGP software implementation, and its testing and deployment pipelines. These allow us to treat BGP like any other software component, enabling fast incremental updates. Finally, we share our operational experience in running BGP and specifically shed light on critical incidents over two years across our data center fleet. We describe how those influenced our current and ongoing routing design and operation.
    View the full NSDI '21 program at www.usenix.org/conference/nsd...
  • Science & Technology

Comments • 14

  • @aphrozz 2 years ago +20

    Seems to work fine

  • @somename8831 2 years ago +24

    Well this aged like milk. :D

  • @netcruzer 2 years ago +11

    This network config is having a massive failure right now. FB, Messenger, WhatsApp, and Instagram all down.

  • @pvk93 2 years ago +3

    I would assume the recovery time for routes to propagate around the world is measured in hours.

  • @LLMA2 2 years ago +1

    The technical level of this team is very good; I hope they don't get fired...

  • @IshamMohamedIqbal 2 years ago +8

    Did they implement this? Could this shed light on the current outage?

    • @Jccke 2 years ago +3

      Yes. I guess Kim introduced a bug and it has destroyed everything.

  • @abrand305 2 years ago +2

    1st day at work. Go ahead and configure production BGPs… Lol

  • @Cuwubiq 2 years ago

    Great... I was there.

  • @FrankFloresRGVZGM 2 years ago

    Move fast and break everything.

  • @SpaceFederation 2 years ago +2

    Next time use staging to test your changes 😬

  • @kcwithbrim2582 2 years ago +1

    LMAO

  • @garyha2650 2 years ago

    Can you make a version for those who speak English?

  • @Jccke 2 years ago +1

    That's your fault, Kim.