Analyzing the Costs of Large Language Models in Production

  • date added 10 Sep 2024

Comments • 15

  • @CresGallego • 8 months ago • +2

    Really great insights. Economics is well explained.

  • @mohamedfouad1309 • 9 months ago • +2

    😊

  • @billykotsos4642 • 8 months ago • +1

    Being handed a bill based on tokens generated by a model is preposterous...
    These LLM apps cost so much right now that you need a solid use case in mind...
    Otherwise you just wait a couple more years until running inference on these LLMs won't be as expensive... the only reason these LLMs are so expensive to run is that they are SOTA and Nvidia is the only player right now.

  • @billykotsos4642 • 8 months ago • +1

    the economics are broken because the hardware setup just isn't there...
    instead of paying by the hour you pay by the token/call, which is insane..... Cloud has been built on the idea that you fire up an instance and you know what you pay.... but these days you need huge cloud instances to run these huge models...
    The costs of running these models will go down significantly in about 3 years.... you won't have to think about these things...

  • @lionhuang9209 • 8 months ago • +1

    Where can we download the slides?

  • @loopaal • 7 months ago

    fantastic

  • @balainblue • 7 months ago • +1

    Can you explain the math of 5 requests per minute translating to $9,000 per month?

    • @tensorops • 7 months ago • +1

      We recommend looking here:
      gptforwork.com/tools/openai-chatgpt-api-pricing-calculator
      Assuming 220K requests, with proper prompts that are usually 1000-2000 tokens, you can get to these costs.
      Additionally, we want to remind you that a single request to an LLM application often triggers more than one API call to an LLM.
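      The arithmetic behind the question above can be sketched in a few lines. Note that every price and token count here is an assumption chosen for illustration (roughly GPT-4-era list prices), not a figure from the video; 5 requests/minute does work out to about 216K requests/month, close to the 220K mentioned in the reply.

      ```python
      # Back-of-the-envelope LLM API cost estimate.
      # All pricing and token-count numbers are illustrative assumptions;
      # plug in your provider's real rates and your measured prompt sizes.

      REQUESTS_PER_MINUTE = 5
      MINUTES_PER_MONTH = 60 * 24 * 30           # 43,200 minutes in a 30-day month

      INPUT_TOKENS_PER_REQUEST = 1_000           # assumed prompt size
      OUTPUT_TOKENS_PER_REQUEST = 200            # assumed completion size
      PRICE_PER_1K_INPUT = 0.03                  # assumed $/1K input tokens
      PRICE_PER_1K_OUTPUT = 0.06                 # assumed $/1K output tokens

      requests_per_month = REQUESTS_PER_MINUTE * MINUTES_PER_MONTH  # 216,000

      cost_per_request = (
          INPUT_TOKENS_PER_REQUEST / 1_000 * PRICE_PER_1K_INPUT
          + OUTPUT_TOKENS_PER_REQUEST / 1_000 * PRICE_PER_1K_OUTPUT
      )

      monthly_cost = requests_per_month * cost_per_request  # ~$9,000/month under these assumptions
      print(f"{requests_per_month:,} requests/month -> ${monthly_cost:,.0f}/month")
      ```

      With these assumed rates the total lands near $9,000/month; larger prompts (the 1000-2000 token range mentioned above) or multiple API calls per request push it higher.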

    • @balainblue • 7 months ago

      @@tensorops Thank you so much.

    • @balainblue • 6 months ago

      @@tensorops Can you please elaborate on that? "A single request to an LLM application triggers more than one API call to an LLM"

    • @tensorops • 6 months ago • +1

      @@balainblue We give an example in the next webinar where one query triggers many LLM calls. Sometimes even simple chains like Map-Reduce or Refine can cause many LLM calls to OpenAI for an action as simple as summarization.

    • @balainblue • 6 months ago

      @@tensorops Thank you. I look forward to it.