Dask Futures Tutorial: Parallelize Python Code with Dask

Sdílet
Vložit
  • čas přidán 29. 08. 2024
  • In this lesson, we'll parallelize a custom Python workflow that scrapes, parses, and cleans data from Stack Overflow. We'll get to:
    - Learn how to do arbitrary task scheduling using the Dask Futures API
    - Utilize blocking and non-blocking distributed calculations
    Notebook here: github.com/coi...
    Tutorial repo: github.com/coi...
    ----
    What is Dask?
    Dask is a free and open-source library for parallel computing in Python. Dask is a community project maintained by developers and organizations.
    Use Coiled for data transformation at scale with Python.
    Coiled is a Dask company. With Coiled's rock-solid infrastructure, you can quickly and securely create Dask clusters in your cloud account.
    Learn more about Coiled and get started for free
    coiled.io/start
    More content on our blog:
    coiled.io/blog

Komentáře •