Introduction to Model Deployment with Ray Serve

Sdílet
Vložit
  • čas přidán 27. 05. 2023
  • Speakers:
    Jules Damji, Lead Developer Advocate, Anyscale Inc
    Jules S. Damji is a lead developer advocate at Anyscale Inc, an MLflow contributor, and co-author of Learning Spark, 2nd Edition. He is a hands-on developer with over 25 years of experience and has worked at leading companies, such as Sun Microsystems, Netscape, @Home, Opsware/LoudCloud, VeriSign, ProQuest, Hortonworks, and Databricks, building large-scale distributed systems. He holds a B.Sc and M.Sc in computer science (from Oregon State University and Cal State, Chico respectively), and an MA in political advocacy and communication (from Johns Hopkins University).
    Archit Kulkarni, Software Engineer, Anyscale Inc
    Archit Kulkarni is a software engineer on the Platform team at Anyscale working on the open-source library Ray Serve. Prior to Anyscale, he was a PhD student at UC Berkeley
    Abstract:
    This is a two-part introductory and hands-on guided tutorial of Ray and Ray Serve.
    Part one covers a hands-on coding tour through the Ray core APIs, which provide powerful yet easy-to-use design patterns (tasks and actors) for implementing distributed systems in Python.
    Building on the foundation of Ray Core APIs, part two of this tutorial focuses on Ray Serve concepts, what and why Ray Serve, scalable architecture, and model deployment patterns. Then, using code examples in Jupyter notebooks, we will take a coding tour of creating, exposing, and deploying models to Ray Serve using core deployment APIs.
    And lastly, we will touch upon Ray Serve’s integration with model registries such as MLflow, walk through an end-to-end example, and discuss and show Ray Serve’s integration with FastAPI.
    Key takeaways from students:
    * Use Ray Core APIs to convert Python function/classes into a distributed setting
    * Learn to use Ray Serve APIs to create, expose, and deploy models with Ray Server APIs
    * Access and call deployment endpoints in Ray Serve via Python or HTTP
    * Configure compute resources and replicas to scale models in production
    * Learn about Ray Serve integrations with MLflow and FastAPI

Komentáře • 1

  • @parasetamol6261
    @parasetamol6261 Před 2 měsíci

    can you give me an example notbook to do this. in video.