Thomas Bierhance: Polars - make the switch to lightning-fast dataframes

Sdílet
Vložit
  • čas přidán 2. 08. 2024
  • In this talk, we will report on our experiences switching from Pandas to Polars in a real-world ML project. Polars is a new high-performance dataframe library for Python based on Apache Arrow and written in Rust. We will compare the performance of polars with the popular pandas library, and show how polars can provide significant speed improvements for data manipulation and analysis tasks. We will also discuss the unique features of polars, such as its ability to handle large datasets that do not fit into memory, and how it feels in practice to make the switch from Pandas. This talk is aimed at data scientists, analysts, and anyone interested in fast and efficient data processing in Python.
    github.com/datenzauberai/PyCo...
  • Věda a technologie

Komentáře • 13

  • @zerdofish9989
    @zerdofish9989 Před rokem +9

    Polars changed my whole pipeline. I love it!

    • @datenzauberai
      @datenzauberai Před měsícem

      I love it too! It really makes a difference!

  • @rokaskarabevicius
    @rokaskarabevicius Před 2 měsíci +1

    glad to hear I'm not the only one who finds pandas multi-index confusing.

    • @datenzauberai
      @datenzauberai Před měsícem +1

      I think I've never met someone in person who is fluent in "multi-index-filtering" 😂

    • @ryan_chew97
      @ryan_chew97 Před 20 dny

      @@datenzauberaipretty much. I just ask chatgpt and half the time it’s wrong

  • @chobblegobbler6671
    @chobblegobbler6671 Před 10 měsíci

    Herr Schuler.. Offnen Sie die tur!

  • @rubendevroomen2637
    @rubendevroomen2637 Před 6 měsíci +1

    I cant use polars until it supports complex numbers

    • @datenzauberai
      @datenzauberai Před měsícem

      It's definitely not a replacement for numpy for this kind of scientific computations.

  • @ScienceMinisterZero
    @ScienceMinisterZero Před 8 měsíci +2

    Rust is the future of data science.

    • @floopybits8037
      @floopybits8037 Před 7 měsíci +1

      It is good for backuend programming. Not for actual DS

  • @slavikdoter
    @slavikdoter Před rokem +1

    whenever i see a new orm i try to avoid it as long as possible

    • @datenzauberai
      @datenzauberai Před měsícem

      It's not a tool for object-relational-mapping, so it would be totally fine to have look 😉