PEPR '24 - Deleting Data at Organizational Scale

Sdílet
Vložit
  • čas přidán 17. 07. 2024
  • PEPR '24 - Deleting Data at Organizational Scale
    Diogo Lucas, Stripe
    Deleting a million records from a dataset can be hard. Deleting one record from a million datasets can be even harder.
    Data has a tendency to sprawl. In today's information-hungry world, information is replicated and permutated in a myriad of ways in data marts, lakes, and warehouses. This proliferation can add massive volume and variety, turning a single input point into many thousands of somewhat related downstream entries.
    So when it comes to observing a person's right to be forgotten, how can we find their information's needle in a company's data-hungry haystack? How can we do that in a world of architectural sprawl and data repurposing? And how do we do all that without breaking legitimate data usage cases?
    In this session, we will evaluate the fundamental building blocks and practices that allow Stripe to guarantee our customer's (direct and indirect) rights for data deletion. Those include detection and attribution of sensitive data and its affiliation, impact analysis through exploration, and the combined use of deletion propagation and orchestration.
    View the full PEPR '24 program at www.usenix.org...

Komentáře •