Janakiram MSV
Janakiram MSV
  • 223
  • 1 362 671
Transforming AI | Inference and Fine-Tuning | Lin Qiao on Fireworks AI
Join us for an in-depth conversation with Lin Qiao, the CEO and founder of Fireworks AI, on this episode of The Encounter. Lin shares her incredible journey from working at Meta, where she played a key role in developing the PyTorch AI framework, to founding Fireworks AI. Discover how Fireworks AI is revolutionizing the AI landscape with its cutting-edge inference and fine-tuning services.
Learn about the challenges enterprises face in transitioning to an AI-first approach and how Fireworks AI is designed to help them overcome these hurdles efficiently. Dive into Lin's vision for the future of AI and understand what sets Fireworks AI apart from the competition. Don't miss this fascinating discussion packed with insights and innovations in the world of artificial intelligence.
TIME STAMPS :
00:00 - Trailer
1:22 - Introduction
1:58 - Motivation behind starting Fireworks
4:05 - What to expect from Fireworks AI?
5:40 - What differentiates Fireworks AI from others?
8:33 - How to ensure low latency for global customers?
13:43 - Why one should only choose Fireworks AI?
17:56 - Reason behind building own function model?
21:21 - Why partner with Now's Research for Hermes 2 Pro?
23:41 - Where do you see agentic workflow going?
#AI #ArtificialIntelligence #FireworksAI #LinQiao #Meta #PyTorch #InferenceAsAService #FineTuning #TechInterview #TheEncounter #StartupJourney #Innovation #TechTalk
Follow: www.linkedin.com/in/lin-qiao-22248b4/
zhlédnutí: 388

Video

Fireworks AI | Revolutionizing Inference | Fine-Tuning | Lin Qiao |
zhlédnutí 137Před měsícem
Get ready for an insightful journey with Lin Qiao, CEO and founder of Fireworks AI, in this exciting episode of The Encounter! Discover how Lin transitioned from her pivotal role at Meta, where she helped build the revolutionary AI framework PyTorch, to creating Fireworks AI, a cutting-edge platform offering inference and fine-tuning as a service. Learn about the unique challenges faced by ente...
Building Conversational Agents | The Agent vs. Chatbot Difference | Priyanka Vergadia
zhlédnutí 857Před 2 měsíci
In this insightful interview, Janakiram MSV sits down with Priyanka Vergadia to delve into the nuances between conversational agents and chatbots. Priyanka expertly explains how to define goals, provide instructions, and design these AI entities to handle specific tasks, using natural language prompts. Discover the key distinctions and gain valuable insights into building effective conversation...
Google Cloud Next 2024 Recap | Top Announcements Unveiled | Priyanka Vergadia | Stay Tuned!
zhlédnutí 64Před 2 měsíci
In this exclusive teaser, join Janakiram MSV as he sits down with Priyanka Vergadia, Google Cloud's star evangelist, to discuss the most exciting announcements from Google Cloud Next 2024. Get a sneak peek into their insightful conversation as they demystify the latest technologies and innovations. Stay tuned for the full interview, coming soon! Follow: www.linkedin.com/in/pvergadia/ #GoogleClo...
Decade of Kubernetes | Priyanka Sharma | Discusses Its Impact & Future at CNCF
zhlédnutí 259Před 2 měsíci
Join Janakiram MSV as he hosts Priyanka Sharma, Executive Director of the Cloud Native Computing Foundation, in a detailed conversation about Kubernetes' impactful journey and its future. Celebrating its 10th anniversary, Kubernetes has evolved from an innovative open-source project to a foundational technology akin to Linux, influencing vast areas including AI and edge computing. Hear Priyanka...
Kubernetes Turns 10 | Priyanka Sharma Explores Its Impact & Future at CNCF | Stay Tuned!
zhlédnutí 49Před 2 měsíci
Stay tuned to this exhilarating episode of The Encounter as Priyanka Sharma, Executive Director of the CNCF, reflects on Kubernetes' transformative journey over the past decade. From its inception to becoming the backbone of modern infrastructure, hear Priyanka share insider insights on Kubernetes' milestones and its pivotal role in shaping the tech landscape. Don't miss this special edition wh...
Building End-to-End RAG Pipelines Locally on Kubernetes
zhlédnutí 609Před 3 měsíci
Watch the replay and a deep dive into my KubeCon Paris demo. Tune in if you are interested in learning how to run an end-to-end RAG pipeline on cloud native infrastructure.
Mastering Gemini Function Calling & LangChain Agents | A Comprehensive Tutorial
zhlédnutí 2,2KPřed 3 měsíci
Dive into the world of Gemini Function Calling and LangChain agents with our step-by-step tutorial. This video is designed for developers and enthusiasts keen on exploring advanced generative AI and cloud-native technologies. Learn how to seamlessly integrate Gemini Function Calling with LangChain agents to build, deploy, and manage intelligent applications. Whether you're looking to enhance yo...
Harnessing the Power of Multimodal Interactions with Gemini | A Comprehensive Tutorial
zhlédnutí 312Před 4 měsíci
This in-depth tutorial explores the revolutionary capabilities of Gemini for creating and leveraging multimodal applications. Covering a broad spectrum of functionalities, including image prompts, combined image text prompts, multimodal embeddings, and semantic search for images, this guide is designed for developers and researchers eager to delve into the integration of various data types for ...
Saad Malik's Insights on Spectro Cloud's Strategy Kubernetes Evolution Integration at the Edge
zhlédnutí 89Před 4 měsíci
This interview is a treasure trove for anyone keen on understanding the future of cloud-native technologies, Kubernetes evolution, and the strategic role of AI at the edge. Join us as Saad Malik from Spectro Cloud shares his invaluable perspectives and vision for navigating the complex landscape of modern computing infrastructure. TIME STAMPS 00:00 - Trailer 01:31 - Introduction 1:55 - Has Kube...
Kubernetes & Beyond | Saad Malik on future of Cloud Native Tech | Spectra Cloud
zhlédnutí 61Před 4 měsíci
Gear up for a session filled with groundbreaking insights that promise to inspire and propel your tech initiatives into the future. Join Saad Malik, the visionary co-founder and CTO of Spectra Cloud, for an exhilarating journey into the heart of cloud-native technologies. In this dynamic conversation, uncover how Spectra Cloud is transforming Kubernetes, edge computing, and AI into powerful too...
Enhancing AI Deployments on Google Cloud | Erwan Menard
zhlédnutí 162Před 4 měsíci
Journey into the digital frontier with Erwan from Google Cloud, as we navigate the evolving landscape of AI and cloud technology on The Encounter. As the compass of Cloud AI and Vertex AI, Erwan charts a course through the transformative impact of AI on cloud-native platforms and the metamorphosis of DevOps practices. We delve into the confluence of predictive and generative AI, unearthing how ...
Step Into Tomorrow | How Erwan's Insights from Google Cloud Empower Your DevOps Strategy |
zhlédnutí 39Před 4 měsíci
Tune in for a glimpse into the exciting advancements shaping our digital world! Unveiling the Future of Cloud-Native Tech: Join us for a riveting chat with Erwan from Google Cloud! This episode peels back the layers on Vertex AI, the impact of generative AI (Gen AI) in the tech sphere, and the pioneering concept of LLM Ops. Perfect for tech enthusiasts, developers, and AI experts eager for a de...
Using LangChain with Gemini and Chroma DB
zhlédnutí 2,5KPřed 4 měsíci
This session covers how to use LangChain framework with Gemini and Chroma DB to implement Q&A and Summarization use cases. The code is available at gist.github.com/janakiramm/6546d9734c7872f111b139cda1a8e0de (Q&A) and gist.github.com/janakiramm/8fcfc6c055c09a6f5dc5248b890f0567 What you'll learn: - Detailed session on integrating LangChain framework with Gemini and Chroma DB. - Focus on developi...
Revolutionizing IT Ops | Corey Harrison | Flip AI's Breakthrough in Observability
zhlédnutí 277Před 4 měsíci
Join us for this compelling discussion on The Encounter, where generative AI and operational excellence converge to usher in a new era for the tech industry. Looking towards the future, Corey envisions a world where AI not only diagnoses but also automatically remediates technical issues, significantly elevating operational efficiency and allowing engineers to focus on more strategic tasks. As ...
How AI is Changing DevOps | Insights from Flip AI's | Corey Harrison
zhlédnutí 100Před 4 měsíci
How AI is Changing DevOps | Insights from Flip AI's | Corey Harrison
Analyzing Tamil Movie Scripts with Google's Most Capable Model - Gemini 1.5 Pro
zhlédnutí 332Před 4 měsíci
Analyzing Tamil Movie Scripts with Google's Most Capable Model - Gemini 1.5 Pro
Master RAG on Vertex AI with Vector Search and Gemini Pro
zhlédnutí 5KPřed 4 měsíci
Master RAG on Vertex AI with Vector Search and Gemini Pro
Shaping the Future with Percona | Open Source, Cloud and Beyond | Ann Schlemmer
zhlédnutí 133Před 4 měsíci
Shaping the Future with Percona | Open Source, Cloud and Beyond | Ann Schlemmer
Ann Schlemmer's Vision | CEO of Percona | Redefining Cloud & Database Landscapes
zhlédnutí 41Před 4 měsíci
Ann Schlemmer's Vision | CEO of Percona | Redefining Cloud & Database Landscapes
Mastering Gemini | A Developer's Practical Guide for Hands-On Learning
zhlédnutí 437Před 4 měsíci
Mastering Gemini | A Developer's Practical Guide for Hands-On Learning
Spotlight on Cloud Native Storage | Murli Thirumale | Portworx | Pure Storage
zhlédnutí 127Před 5 měsíci
Spotlight on Cloud Native Storage | Murli Thirumale | Portworx | Pure Storage
Getting Started with Generative AI on Google Cloud
zhlédnutí 1,6KPřed 5 měsíci
Getting Started with Generative AI on Google Cloud
Cloud Native Storage Trends | What's Shaping the Landscape? | Murli Thirumale | Stay Tuned!
zhlédnutí 30Před 5 měsíci
Cloud Native Storage Trends | What's Shaping the Landscape? | Murli Thirumale | Stay Tuned!
Ori Goshen - Co-Founder, Co-CEO of AI21 Labs - Shares Vision and Insights
zhlédnutí 311Před 5 měsíci
Ori Goshen - Co-Founder, Co-CEO of AI21 Labs - Shares Vision and Insights
Ever Wondered How AI21's Jurassic 2 Works? | Ori Goshen | Co-Founder & Co-CEO at AI21 Labs
zhlédnutí 150Před 5 měsíci
Ever Wondered How AI21's Jurassic 2 Works? | Ori Goshen | Co-Founder & Co-CEO at AI21 Labs
Exclusive Encounter | Janakiram MSV and Mumshad Mannambeth | Exploring KodeKloud's Impact
zhlédnutí 772Před 5 měsíci
Exclusive Encounter | Janakiram MSV and Mumshad Mannambeth | Exploring KodeKloud's Impact
Ready to Master Cloud DevOps with Mumshad Mannambeth? | KodeKloud
zhlédnutí 364Před 5 měsíci
Ready to Master Cloud DevOps with Mumshad Mannambeth? | KodeKloud
Walkthrough of Perplexity Labs AI API
zhlédnutí 4,9KPřed 5 měsíci
Walkthrough of Perplexity Labs AI API
Next-Gen Observability Insights and Innovations | GroundCover CEO Shahar Azulay | Janakiram MSV
zhlédnutí 129Před 5 měsíci
Next-Gen Observability Insights and Innovations | GroundCover CEO Shahar Azulay | Janakiram MSV

Komentáře

  • @captainmcduckyYT
    @captainmcduckyYT Před 5 hodinami

    Very helpful. Thank you so much good sir.

  • @jagatmohansarvari5681

    really helpful for understanding the concept of embedding and retrieval. Thanks.

  • @anilaknb1
    @anilaknb1 Před 9 dny

    Great video & playlist ! Can you please share the source code ?

  • @dhananjaypathak15
    @dhananjaypathak15 Před 9 dny

    i want same thing in nodej can some one please help on which library to use

  • @kaushikdas5115
    @kaushikdas5115 Před 13 dny

    Sir, I have a long text which contains five bidder details and their financial values such as turnover, revenue etc. and that is stored in a variable, bidders_data. could you pls let me know how can I create a tool or function to draw a bar chart for the bidder names and turnover or any other financial parameters, when I give a prompt such as ''draw a bar chart for bidders (bidder names) and their turnover", so it should draw a bar chart as response. Awaiting for you reply sir!

  • @Manish69
    @Manish69 Před měsícem

    great work sir , this was very useful but throughout the whole video the audio was coming only from left side of headphones i hope you will fix this issue next time

  • @birolyildiz
    @birolyildiz Před měsícem

    ❤🙏

  • @AyushMandloi
    @AyushMandloi Před měsícem

    No voice

  • @wanderlust8367
    @wanderlust8367 Před měsícem

    the code link u have shared is incomplete, load_file is missing and other few stuffs,

  • @ShaliniMohan-vr4rb
    @ShaliniMohan-vr4rb Před měsícem

    Good Content and explanation for all the beginners

  • @jeffpowell860
    @jeffpowell860 Před měsícem

    Why is Chroma such hot garbage?

  • @shaktipawar5372
    @shaktipawar5372 Před měsícem

    Hi Janakiram. Can you explain Self attention and Feed forwarder layer in more detail plz ? Example what are the fundamental purpose / responsibility of each of them...

  • @brenoav99
    @brenoav99 Před 2 měsíci

    Thank you for the useful video. I'm having a problem using the chroma and google embedding (new versions): >>> vectordb=Chroma.from_documents(pages,embeddings) ValueError: Expected each embedding in the embeddings to be a list, got ['Repeated'] Do you have any ideas? Thanks in advance!

  • @shaktipawar5372
    @shaktipawar5372 Před 2 měsíci

    Janakiram, This is one of the best Fine Tuning Video I have came across. I need to thank you for getting this concept clear in such a simple manner. Keeping making such videos, it helps community to grow. Thank you !!

  • @tubasweb
    @tubasweb Před 2 měsíci

    Can you make an update video onnx?

  • @IanMcAleer-op1xj
    @IanMcAleer-op1xj Před 2 měsíci

    Thanks, this is tremendously helpful One point to note - you need to upload the embed file, not the sentence file -> upload_file(bucket_name,embed_file_path)

  • @AaronNicholsonAI
    @AaronNicholsonAI Před 2 měsíci

    Super helpful! Thank you so much.

  • @ainanirina758
    @ainanirina758 Před 2 měsíci

    Awesome work as always! Do you have a link for the collab? Thanks

  • @awakenwithoutcoffee
    @awakenwithoutcoffee Před 2 měsíci

    can we make our own "online" models with the Brave-API ?

  • @VijayKumar-ws3gv
    @VijayKumar-ws3gv Před 2 měsíci

    Nice Explanation Anna :) expecting Enterprise Gen AI on AWS/AZURE/GCP

  • @larsfrommiddle2162
    @larsfrommiddle2162 Před 2 měsíci

    Do the online models also give back the source of the websites or papers it uses, when asked?

    • @Statsjk
      @Statsjk Před 2 měsíci

      Did you figure it out?

  • @suryarp
    @suryarp Před 2 měsíci

    What a pleasure to see Priyanka on your show! Awesum❤❤

  • @suryarp
    @suryarp Před 2 měsíci

    Teaser really teased😊

  • @MarceloFerreira-rl6hh
    @MarceloFerreira-rl6hh Před 2 měsíci

    Great job! Thanks a lot. What’s the difference between this approach and using langchain?

  • @softreviewed
    @softreviewed Před 2 měsíci

    perplexity now supports mixtral-8x22b-instruct so if we uses can we get more accurate answers ?

  • @GAURAVRAUT007
    @GAURAVRAUT007 Před 2 měsíci

    Excellent video - can u please do same with Langchain with retrieval

  • @tavishi3884
    @tavishi3884 Před 2 měsíci

    Sir Can we use this gemini 1.5 pro with Langchian ..?

  • @fusionxfitness_
    @fusionxfitness_ Před 3 měsíci

    you didn't shared the pdf

  • @vikasbammidi1340
    @vikasbammidi1340 Před 3 měsíci

    Can you please do a video on "How to use the same in Langchain with retrieval"

  • @ammvr
    @ammvr Před 3 měsíci

    how can we deploy this to the web?

  • @elibessudo
    @elibessudo Před 3 měsíci

    Thanks for sharing. Do you have any insight on how to incorporate the feature where the response includes citation URL's?

  • @harishankar7587
    @harishankar7587 Před 3 měsíci

    impressive slide show

  • @user-mk3qb3iv2i
    @user-mk3qb3iv2i Před 3 měsíci

    why always python is there any way to use js?

  • @venkatathota633
    @venkatathota633 Před 3 měsíci

    could you please provide git repo for the above code?

  • @AIWithShrey
    @AIWithShrey Před 3 měsíci

    Any reason why you chose BAAI and not any other Embedding model? What are the impacts of mix and matching the Embedding model and the LLM? My current app works just fine with GPT4ALL Embeddings, and Gemma 1.1 7B. Another note: Deploying a quantized LLM will significantly reduce VRAM usage, Gemma 7B Q8_0 quantized takes up 12 gigs of VRAM for me. Implementing KEDA and using Quantization in tandem will be a game-changer.

  • @goldendrake1387
    @goldendrake1387 Před 3 měsíci

    To whoever is experiencing problems with the following lines of code: URI='http:...' client = InferenceClient(model=URI) Do this instead: find a model that you want to use on huggingface, such as Mistral 7B, create an access token, save the model URL and use the following code: API_URL = "your_chosen_model_url" headers = {"Authorization": "Bearer your_created_token"} inference_client = InferenceClient(model=API_URL, headers=headers)

  • @sharathkumar8422
    @sharathkumar8422 Před 3 měsíci

    Concise and to the point. Superbly explain! Thank you...

  • @Hitish99999
    @Hitish99999 Před 3 měsíci

    Thanks for the tutorial. I am bit confused which file to be uploaded to bucket. sentence file or embedding file

  • @ashokvaddepally5181
    @ashokvaddepally5181 Před 3 měsíci

    This was a great close encounter and looking forward for the GenAI courses from Jani sir in collab with kode kloud way of doing 😃

  • @AhmedBesbes
    @AhmedBesbes Před 3 měsíci

    Thanks for the tutorial! Instead of going through the ids in the json file to fetch the sentences, is it possible to integrate those directly as metadata in the index?

  • @sureshkumarselvaraj8911
    @sureshkumarselvaraj8911 Před 3 měsíci

    Great video! What is the difference between Vertex Search service VS Vector Search for RAG application? which one is better in terms of handling better retrieval of relevant documents for RAG application where we deal with 100+ PDF documents? Can you share some insights?

  • @sagarvaiyata9510
    @sagarvaiyata9510 Před 3 měsíci

    What is the confusion at 14:23? What are meta data files? What are these phrases in a realtime scenario? You are doing good but please give examples that are actually possible in realtime scenarios

  • @jagatchaitanyaprabhala8668
    @jagatchaitanyaprabhala8668 Před 3 měsíci

    is oscer-index file saved in s3?

    • @Janakirammsv
      @Janakirammsv Před 3 měsíci

      No. It’s just an empty collection at this point. Watch part 1 on how to ingest the data from the CSV file into the collection.

    • @jagatchaitanyaprabhala8668
      @jagatchaitanyaprabhala8668 Před 3 měsíci

      @Janakirammsv yes found it after this comment 😊. What are the accesses/setups I need to ask my MLOps team to enable me start working on bedrock?

  • @ShirishBhagnani
    @ShirishBhagnani Před 3 měsíci

    Thank you for putting all the content in a very simple and understandable way!

  • @sreehari514
    @sreehari514 Před 3 měsíci

    Source code please

  • @shraddha9640
    @shraddha9640 Před 3 měsíci

    is this on sagemaker?

    • @Janakirammsv
      @Janakirammsv Před 3 měsíci

      No. This is a new service called Bedrock.

  • @sachinworld_
    @sachinworld_ Před 3 měsíci

    Can we add two different documents in collection ?

    • @Janakirammsv
      @Janakirammsv Před 3 měsíci

      Absolutely. The vector index doesn't know the source. You can add the details in the metadata of each item in the collection.

  • @TomFord-mv2mx
    @TomFord-mv2mx Před 4 měsíci

    Great Video. One question, I noticed you used a different model (gecko) to Gemini Pro for the embeddings. Is this ok to do? I assumed the models needed to be the same for both training and inference? Thanks again

    • @Janakirammsv
      @Janakirammsv Před 3 měsíci

      Text embedding models are independent of LLMs. You only have to ensure that the same embedding model is used for indexing the documents and the query. This is critical to retrieving the context based on the similarity.

  • @arvindmathur6574
    @arvindmathur6574 Před 4 měsíci

    Great!

  • @ShahidGhetiwala-dg3ol
    @ShahidGhetiwala-dg3ol Před 4 měsíci

    Great Video, thank you soo much........