sentdex
sentdex
  • 1 254
  • 114 872 520
Building an LLM fine-tuning Dataset
Going through the building of a QLoRA fine-tuning dataset for a language model.
NVIDIA GTC signup: nvda.ws/3XTqlB6
Fine-tuning code: github.com/Sentdex/LLM-Finetuning
5000-step Walls1337bot adapter: huggingface.co/Sentdex/Walls1337bot-Llama2-7B-003.005.5000
WSB Dataset: huggingface.co/datasets/Sentdex/WSB-003.005
"I have every reddit comment" original reddit post and torrent info: www.reddit.com/r/datasets/comments/3bxlg7/i_have_every_publicly_available_reddit_comment/
2007-2015 Reddit Archive.org: archive.org/download/2015_reddit_comments_corpus/reddit_data/
Reddit BigQuery 2007-2019 (this has other data besides reddit comments too!): reddit.com/r/bigquery/comments/3cej2b/17_billion_reddit_comments_loaded_on_bigquery/
Contents:
0:00 - Introduction to Dataset building for fine-tuning.
02:53 - The Reddit dataset options (Torrent, Archive.org, BigQuery)
06:07 - Exporting BigQuery Reddit (and some other data)
14:44 - Decompressing all of the gzip archives
25:13 - Re-combining the archives for target subreddits
28:29 - How to structure the data
40:40 - Building training samples and saving to database
48:49 - Creating customized training json files
54:11 - QLoRA training and results
Neural Networks from Scratch book: nnfs.io
Channel membership: czcams.com/channels/fzlCWGWYyIQ0aLC5w48gBQ.htmljoin
Discord: discord.gg/sentdex
Reddit: www.reddit.com/r/sentdex/
Support the content: pythonprogramming.net/support-donate/
Twitter: sentdex
Instagram: sentdex
Facebook: pythonprogramming.net/
Twitch: www.twitch.tv/sentdex
zhlédnutí: 38 203

Video

Visualizing Neural Network Internals
zhlédnutí 41KPřed 4 měsíci
Visualizing some of the internals of a neural network during training and inference. Starting and full code: github.com/Sentdex/neural-net-internals-visualized Neural Networks from Scratch book: nnfs.io Channel membership: czcams.com/channels/fzlCWGWYyIQ0aLC5w48gBQ.htmljoin Discord: discord.gg/sentdex Reddit: www.reddit.com/r/sentdex/ Support the content: pythonprogramming.net/support-donate/ T...
Getting Back on Grid
zhlédnutí 18KPřed 4 měsíci
Establishing an internet connection in an internet desert, then figuring out (well, starting to) networking. Combined with Starlink as my internet provider, I ended up going with a wifi bridge implementation with a couple of Ubiquiti nanostation AC locos to network between buildings at 100 meters of distance. The Ubiquiti units can also do point to point (ptp), but so far the wifi bridge setup ...
Open Source AI Inference API w/ Together
zhlédnutí 30KPřed 5 měsíci
Exploring the Together Inference API (www.together.ai/) Together API basics jupyter notebook examples: github.com/Sentdex/Together-API-Basics Neural Networks from Scratch book: nnfs.io Channel membership: czcams.com/channels/fzlCWGWYyIQ0aLC5w48gBQ.htmljoin Discord: discord.gg/sentdex Reddit: www.reddit.com/r/sentdex/ Support the content: pythonprogramming.net/support-donate/ Twitter: twitter.co...
INFINITE Inference Power for AI
zhlédnutí 25KPřed 6 měsíci
Testing and enjoying the Comino Grando Server machine with 6x RTX 4090s from Comino (www.comino.com/) Neural Networks from Scratch book: nnfs.io Channel membership: czcams.com/channels/fzlCWGWYyIQ0aLC5w48gBQ.htmljoin Discord: discord.gg/sentdex Reddit: www.reddit.com/r/sentdex/ Support the content: pythonprogramming.net/support-donate/ Twitter: sentdex Instagram: sentd...
Pandas Dataframes on your GPU w/ CuDF
zhlédnutí 41KPřed 7 měsíci
An overview and some quick examples of using CuDF's Pandas accelerator and how much faster it can be than vanilla Pandas for data analysis. Colab demo of Rapids: nvda.ws/3LWggQj AI and Data Science Virtual Summit: nvda.ws/3ZR3wjL Notebook in this video: gist.github.com/Sentdex/469c30385d06719519af13125db85edc Install CuDF: pip install cudf-cu11 extra-index-url=pypi.nvidia.com (or cu12) Neural N...
QLoRA is all you need (Fast and lightweight model fine-tuning)
zhlédnutí 65KPřed 9 měsíci
Learning and sharing my process with QLoRA (quantized low rank adapters) fine-tuning. In this case, I use a custom-made reddit dataset, but you can use anything you want. I referenced a LOT of stuff in this video, I will do my best to link everything, but let me know if I forget anything. Resources: WSB-GPT-7B Model: huggingface.co/Sentdex/WSB-GPT-7B WSB-GPT-13B Model: huggingface.co/Sentdex/WS...
Chat Interface for your Local Llama LLMs
zhlédnutí 20KPřed 9 měsíci
A tutorial of sorts covering how to create streaming chat interfaces using Gradio for the various chat/instruct large language models from HuggingFace. Sample code: huggingface.co/spaces/Sentdex/StableBeluga-7B-Chat/blob/main/app.py Neural Networks from Scratch book: nnfs.io Channel membership: czcams.com/channels/fzlCWGWYyIQ0aLC5w48gBQ.htmljoin Discord: discord.gg/sentdex Reddit: www.reddit.co...
Gzip is all You Need! (This SHOULD NOT work)
zhlédnutí 150KPřed 10 měsíci
Github code: github.com/Sentdex/Simple-kNN-Gzip Neural Networks from Scratch book: nnfs.io Channel membership: czcams.com/channels/fzlCWGWYyIQ0aLC5w48gBQ.htmljoin Discord: discord.gg/sentdex Reddit: www.reddit.com/r/sentdex/ Support the content: pythonprogramming.net/support-donate/ Twitter: sentdex Instagram: sentdex Facebook: pythonprogramming.net/ Twitc...
Better Attention is All You Need
zhlédnutí 62KPřed 11 měsíci
Addressing the current state of attention for artificial intelligence and why it's currently holding back maximum context lengths. Neural Networks from Scratch book: nnfs.io Channel membership: czcams.com/channels/fzlCWGWYyIQ0aLC5w48gBQ.htmljoin Discord: discord.gg/sentdex Reddit: www.reddit.com/r/sentdex/ Support the content: pythonprogramming.net/support-donate/ Twitter: sentdex I...
The BEST Open Source LLM? (Falcon 40B)
zhlédnutí 97KPřed 11 měsíci
TII Call for Proposals with Falcon 40B: falconllm.tii.ae/proposal.php Falcon Github samples: github.com/Sentdex/Falcon-LLM TermGPT: czcams.com/video/O4EmRi0_CI4/video.html GPT-4 Overview: czcams.com/video/lJNblY3Madg/video.html Neural Networks from Scratch book: nnfs.io Channel membership: czcams.com/channels/fzlCWGWYyIQ0aLC5w48gBQ.htmljoin Discord: discord.gg/sentdex Reddit: www.reddit.com/r/s...
OpenAI GPT-4 Function Calling: Unlimited Potential
zhlédnutí 227KPřed rokem
Function calling is a new capability for OpenAI's GPT-4 and GPT-3.5 via the API. Function-calling allows you to extract structured outputs from the GPT model. Github notebook: github.com/Sentdex/ChatGPT-API-Basics/blob/main/function_calling.ipynb Neural Networks from Scratch book: nnfs.io Channel membership: czcams.com/channels/fzlCWGWYyIQ0aLC5w48gBQ.htmljoin Discord: discord.gg/sentdex Reddit:...
Letting GPT-4 Control My Terminal (TermGPT)
zhlédnutí 74KPřed rokem
Giving LLMs like GPT-4 the ability to plan and execute terminal commands. TermGPT github: github.com/Sentdex/TermGPT/ OpenAI Chat API tutorial: github.com/Sentdex/ChatGPT-API-Basics Neural Networks from Scratch book: nnfs.io Channel membership: czcams.com/channels/fzlCWGWYyIQ0aLC5w48gBQ.htmljoin Discord: discord.gg/sentdex Reddit: www.reddit.com/r/sentdex/ Support the content: pythonprogramming...
Building an Open Assistant API
zhlédnutí 30KPřed rokem
Working with one of the Open Assistant models, a 12B parameter Pythia model (huggingface.co/OpenAssistant/oasst-sft-4-pythia-12b-epoch-3.5) Github: github.com/Sentdex/OpenAssistant_API_Pythia_12B Neural Networks from Scratch book: nnfs.io Channel membership: czcams.com/channels/fzlCWGWYyIQ0aLC5w48gBQ.htmljoin Discord: discord.gg/sentdex Reddit: www.reddit.com/r/sentdex/ Support the content: pyt...
Sparks of AGI? - Analyzing GPT-4 and the latest GPT/LLM Models
zhlédnutí 58KPřed rokem
An in-depth look into the current state of the art of Generative Pre-trained Transformer (GPT) language models, with a specific focus on the advancements and examples provided by OpenAI in their GPT4 Technical Report (arxiv.org/abs/2303.08774) as well as the Microsoft "Sparks of AGI" Paper (arxiv.org/abs/2303.12712). Neural Networks from Scratch book: nnfs.io Channel membership: czcams.com/chan...
ChatGLM: The ChatGPT killer? Checking out ChatGLM6B
zhlédnutí 102KPřed rokem
ChatGLM: The ChatGPT killer? Checking out ChatGLM6B
GPT Journey - A text and image game with ChatGPT
zhlédnutí 35KPřed rokem
GPT Journey - A text and image game with ChatGPT
ChatGPT API in Python
zhlédnutí 178KPřed rokem
ChatGPT API in Python
Image Editing A.I.
zhlédnutí 21KPřed rokem
Image Editing A.I.
The AI wars: Google vs Bing (ChatGPT)
zhlédnutí 109KPřed rokem
The AI wars: Google vs Bing (ChatGPT)
ChatGPT Writes a Chatbot AI
zhlédnutí 188KPřed rokem
ChatGPT Writes a Chatbot AI
OpenAI's ChatGPT is a MASSIVE step forward in Generative AI
zhlédnutí 470KPřed rokem
OpenAI's ChatGPT is a MASSIVE step forward in Generative AI
Google A.I. Diffusion Image Editing w/ Prompt to Prompt
zhlédnutí 41KPřed rokem
Google A.I. Diffusion Image Editing w/ Prompt to Prompt
Google's DreamFusion AI: Text to 3D
zhlédnutí 200KPřed rokem
Google's DreamFusion AI: Text to 3D
Open AI’s Whisper is Amazing!
zhlédnutí 479KPřed rokem
Open AI’s Whisper is Amazing!
The Future of User Interfaces with A.I.
zhlédnutí 70KPřed rokem
The Future of User Interfaces with A.I.
Creating Stable Diffusion Interpolation Videos
zhlédnutí 54KPřed rokem
Creating Stable Diffusion Interpolation Videos
Exploring an AI’s Imagination (Stable Diffusion and MidJourney)
zhlédnutí 146KPřed rokem
Exploring an AI’s Imagination (Stable Diffusion and MidJourney)
$5 MILLION AI for FREE
zhlédnutí 426KPřed rokem
$5 MILLION AI for FREE
Does a Deep Learning Laptop Exist? - Tensorbook Review
zhlédnutí 50KPřed rokem
Does a Deep Learning Laptop Exist? - Tensorbook Review

Komentáře

  • @alessandrocoppelli3056
    @alessandrocoppelli3056 Před 15 hodinami

    hello,i'm trying to use PPO and A2C for my discrete-box environment. i have set negative rewards in order to teach the agent to avoid impossible operation in my environment. most of the training time is spent to learn to avoid those operations with negative rewards. Is there a method to directly "tell" the agent (inside the agent itself) to avoid those operations, instead of spend training time? thanks in advance

  • @Harshit-qc5ck
    @Harshit-qc5ck Před 17 hodinami

  • @Harshit-qc5ck
    @Harshit-qc5ck Před 17 hodinami

  • @Ukraine-0
    @Ukraine-0 Před 20 hodinami

    WTF MAN WHY DID I STUDY VERTEXES AND JOINTS AND ALL THAT STUFF?!

  • @Codingskills-ne2lh

    Hey man looks like the command "py -3.7" is not working for me. I doo have python 3.12 installed and when I check the list of environments it shows 3.12 and 3.8. How can I add 3.7 here. Should I go and add it in the env path and downloading python 3.7?

  • @michaelmoore7568
    @michaelmoore7568 Před 2 dny

    Is Sentdex the best Python coder in the world?

  • @snipplerifsky
    @snipplerifsky Před 4 dny

    4 years later and still one of the best series on the subject

  • @helloeveryone6614
    @helloeveryone6614 Před 4 dny

    Thankyou bro❤

  • @Anandgamerzzz-ru8xt

    Get an error string indices must be integers, not 'str'

  • @rachaelkibicho7461
    @rachaelkibicho7461 Před 4 dny

    Can anyone access his python shell scripts? I'm having trouble with the input at the attribute category_slug. Were we not supposed to fill that attribute at the python shell. I feel lost

  • @Harshit-qc5ck
    @Harshit-qc5ck Před 4 dny

  • @imad-ice
    @imad-ice Před 4 dny

    why is this resource being underrated, people nowadays import tensorflow, and PyTorch, -> say u build a neural network, but this man combined theory and lab in such a way that it is easy to understand

  • @Uncreeperble
    @Uncreeperble Před 5 dny

    "You won't get through this in a weekend" Challenge accepted.

  • @No7es-Txken
    @No7es-Txken Před 5 dny

    Loved the videos, really helped me understand Neural Networks deeper. I’m looking forward to buying the book but I also enjoy the videos that you do. Are you going to continue video support for this series or is the book my only way forward from here?

  • @norimashiron3635
    @norimashiron3635 Před 5 dny

    for row in range(len(game)): diag1.append(game[row][row]) diag2.append(game[row][len(game)-1-row])

  • @souravdey1227
    @souravdey1227 Před 6 dny

    Your videos always feel so grounded. Most other videos on such topics make me feel so insignificant. Also, I have a friend, Zeke, from Florida, and you two are soo uncannily similar!!!!

  • @namitshah9535
    @namitshah9535 Před 6 dny

    That ain't a home lab it feels like a tiny datacenter

  • @proflead
    @proflead Před 6 dny

    Epic video! :)

  • @Eisenwolf.Ironwulf
    @Eisenwolf.Ironwulf Před 7 dny

    Still here ... still waiting ... with my book and my unwavering Will to wait

    • @DawitMengistuAbajifar
      @DawitMengistuAbajifar Před dnem

      I can't afford to buy the book? Any way you can sent it to me?

    • @Eisenwolf.Ironwulf
      @Eisenwolf.Ironwulf Před dnem

      @@DawitMengistuAbajifar Sorry, but i cant give my copie away, I still need it. But if you cant afford it, ask sentdex, mby he can give you a code or something. Thats far better than pirating it.

    • @DawitMengistuAbajifar
      @DawitMengistuAbajifar Před dnem

      @@Eisenwolf.Ironwulf alright, found it anyway.

  • @rajvaidya1691
    @rajvaidya1691 Před 7 dny

    this is going over my head now

  • @GamerDevIND
    @GamerDevIND Před 7 dny

    Brain is glitching

  • @VyreBET
    @VyreBET Před 8 dny

    Can I Also try this on Pycharm IDE? Or do I Really need to Install SUblime?

  • @gamagambogam
    @gamagambogam Před 8 dny

    I just purchased your book! You are too kind

  • @prathyushmadhu2861
    @prathyushmadhu2861 Před 8 dny

    Does anybody know about that copilot he used to speed up the decompressing process?

  • @randomdude5634
    @randomdude5634 Před 8 dny

    hi

  • @gamagambogam
    @gamagambogam Před 8 dny

    bless you!

  • @rook451
    @rook451 Před 8 dny

    Love your website. Thank you.

  • @sakethsaketh750
    @sakethsaketh750 Před 8 dny

    Nice video

  • @itzkritiandjoseph6380

    am literally doing this just to communicate with my girl😭😭😭😭

  • @SUPERKK11
    @SUPERKK11 Před 9 dny

    what the heck was that 35.7

  • @Harshit-qc5ck
    @Harshit-qc5ck Před 9 dny

  • @prabhdeepsingh8726
    @prabhdeepsingh8726 Před 9 dny

    For people getting error while loading the saved model, use the activation functions of keras, not of tensorflow and specify input_shape of the first layer. Following are the code changes - from keras import activations model = tf.keras.models.Sequential() model.add(tf.keras.layers.Flatten(input_shape=(28,28))) model.add(tf.keras.layers.Dense(128, activation=activations.relu)) model.add(tf.keras.layers.Dense(128, activation=activations.relu)) model.add(tf.keras.layers.Dense(10, activation=activations.softmax))

  • @Stafford674
    @Stafford674 Před 9 dny

    This is a good course, but may I make a friendly positive comment.You need to separate out the discursive 'chat' from the Really Important Stuff. The RIS needs to be taken a little slower, and a little more carefully and with a slightly greater focus. And make sure all your code is visible on the screen.

  • @Harshit-qc5ck
    @Harshit-qc5ck Před 9 dny

  • @Harshit-qc5ck
    @Harshit-qc5ck Před 9 dny

  • @Harshit-qc5ck
    @Harshit-qc5ck Před 9 dny

  • @Harshit-qc5ck
    @Harshit-qc5ck Před 9 dny

  • @sinasasani1994
    @sinasasani1994 Před 9 dny

    I didn't understand why you put last 5 percent for index

  • @irfannafiz
    @irfannafiz Před 10 dny

    I think one big issue with the model not working well, may be because of the traction and handling of the vehicle? Playing around with the vehicle using custom controls, I saw some serious handling and traction issues at high speeds, whereas vehicles like ambulances were handling really well in comparison. Just my two cents.

    • @anshumansinha1145
      @anshumansinha1145 Před 8 dny

      were you able to follow all the previous videos and do exactly what he did? I mean, libraries and packages have upgraded.

  • @qwerty.mnbdudeehfurhfnvur

    What do i do if the transposition of the weights isnt either the right shape not sure if this question makes sense i thought youd use a for loop and then make a output of the dot product for each input node for each output node

  • @qwerty.mnbdudeehfurhfnvur

    Shout out daniel

  • @thundersepp
    @thundersepp Před 11 dny

    I think now would be a perfect time for a part 4 😁

  • @abhishekvs9076
    @abhishekvs9076 Před 11 dny

    Jesse Pinkman if he did math instead of meth

  • @Luiz-fy2lz
    @Luiz-fy2lz Před 12 dny

    pls continue

  • @blewmyload
    @blewmyload Před 12 dny

    wow nice tactic, just stop halfway and go quiet. what you want us to buy the book? no way dude. not supporting you.

  • @lookIdrewstuff-sr4oc
    @lookIdrewstuff-sr4oc Před 12 dny

    love this tutorial and i followed it religiously but he said first-ith and i cried because i watched that part so much

  • @joemw8806
    @joemw8806 Před 13 dny

    Elon watched this and was like, X is the name of training data. Twitter data is training data. Twitter is X;

  • @Muhammad_Abdullah_Sultan

    Wo what I am 8 years late

  • @GOBPK
    @GOBPK Před 13 dny

    Thanks sentdex. Love the videos!

  • @artin.a1985
    @artin.a1985 Před 13 dny

    can you make a video about gradient decent optimization and backpropagation and their implemetations?