How to hack a LLM using PyReft (using your own data for Fine Tuning!)

Sdílet
Vložit
  • čas přidán 4. 05. 2024
  • 🚀 Sign up to the newsletter
    go.coursesfromnick.com/newsletter
    👨‍💻 Sign up for the Full Stack course and use CZcams50 to get 50% off:
    www.coursesfromnick.com/bundl...
    Hopefully you enjoyed this video.
    💼 Find AWESOME ML Jobs: www.jobsfromnick.com
    🤖 Get the Code: github.com/nicknochnack/PyReft
    Disclaimer: This has been developed for academic purposes. Nothing herein is financial advice, and NOT a recommendation to trade real money. Please use common sense and always first consult a professional before trading or investing.
    Oh, and don't forget to connect with me!
    LinkedIn: bit.ly/324Epgo
    Facebook: bit.ly/3mB1sZD
    GitHub: bit.ly/3mDJllD
    Patreon: bit.ly/2OCn3UW
    Join the Discussion on Discord: bit.ly/3dQiZsV
    Happy coding!
    Nick
  • Věda a technologie

Komentáře • 57

  • @aryamanarora4967
    @aryamanarora4967 Před 27 dny +36

    Hey, I'm one of the authors of the ReFT paper and pyreft, thanks for making this awesome walkthrough! Let us know if you had any trouble setting anything up / you want any improvements to the documentation :)

    • @NicholasRenotte
      @NicholasRenotte  Před 27 dny +8

      First up, I just want to say you are freaking amazing! Congratulations to you and your for tackling the research and creating such a straight forward library! Would love some clarity on the unit location mapping during inference!

  • @therealsirben
    @therealsirben Před 27 dny +5

    Before I watch the video, just want to let you know that I was really waiting for you to do an LLM Finetuning video. Really excited for this one

  • @coreuped
    @coreuped Před 28 dny +6

    Slow tokenizers (use_fast=False) are those written in Python inside the Transformers library, while the fast versions (use_fast=True) are the ones provided by Tokenizers, which are written in Rust.

    • @NicholasRenotte
      @NicholasRenotte  Před 27 dny +1

      Ohhh got it! Cheers man, gotta do a deep dive on transformers soon.

  • @user-zh3tr3cz1g
    @user-zh3tr3cz1g Před 5 dny

    Welcome Back ! The Nicholas's community need you to provide edge production topics like this :DDDDDD

  • @TheGermanPlopis
    @TheGermanPlopis Před 27 dny +1

    I love your Videos. I always learn new things. Thank you so much. You are such an awesome guy, please keep doing it, I always wait for new content from you.

  • @kenchang3456
    @kenchang3456 Před 28 dny +2

    Excellent video that I need to try. Thank you for sharing.

  • @Orgest
    @Orgest Před 28 dny +4

    always on top with the value

  • @jameswhitaker4357
    @jameswhitaker4357 Před 28 dny

    Does this work with Mac M3 pro? Any parameters I’d need to change? Looks great!! Very excited about this

  • @rishichowdhury4296
    @rishichowdhury4296 Před 20 dny +1

    Hey Nick I am getting this error while trying to train the pyreft model
    TypeError: Object of type type is not JSON serializable

  • @flychuban9896
    @flychuban9896 Před 25 dny

    Great video Nick! Do you know what params should I use when working on Macbook M3 Max. In the video you set_device('cuda').

  • @dhanushtharun8190
    @dhanushtharun8190 Před 7 dny +1

    Hi man i have one doubt you should clear that (book vs online tutorials/course) which is best
    What you learn for programming.

  • @n3ophytus
    @n3ophytus Před 26 dny

    Hey, i think i saw once on your ideas list something about a self driving car, possible to do that? Would be really interested :)

  • @user-ps1hm3tl3e
    @user-ps1hm3tl3e Před 27 dny

    Hi sir 👋.
    It is possible to used the same code for binary classification like the labels are 0 and 1 only ?
    Instead of prompt and response i have a dataset contains the headlines and labels.

  • @maziarhatami-ig9gq
    @maziarhatami-ig9gq Před 22 dny

    Hi, I have a question. Can we write a reinforcement learning model that connects to a real environment instead of simulated environments?

  • @paulmiller591
    @paulmiller591 Před 27 dny +1

    Great video mate! How easy would this be to apply to the Llama 3 version of instruct 8b?

    • @NicholasRenotte
      @NicholasRenotte  Před 27 dny +3

      Gotta double check if pyvene supports it but I can't imagine it'd be that hard. Might tackle that next week!

  • @moviesnowhere9660
    @moviesnowhere9660 Před 20 dny

    Love your videos ❤😅

  • @rafaeldesantis7580
    @rafaeldesantis7580 Před 21 dnem

    Hey Nicholas could you do a video about video classification ??? It’s been so hard to learn about it…

  • @CharifMakaoui
    @CharifMakaoui Před 28 dny +1

    Good one 🎉

  • @swapneelbanerjee8958
    @swapneelbanerjee8958 Před 25 dny

    for this specifc Q/A purpose isn't RAG particularly useful than fine-tuning ?

  • @DrumAndSpaces
    @DrumAndSpaces Před 28 dny +1

    I'm struggling to figure out the format for phi3 mini dataset. Any ideas? Can I still do it exactly like the video for my case?
    Also great video perfect timing for just what I needed

    • @NicholasRenotte
      @NicholasRenotte  Před 27 dny

      Can take a look, needs to be in HF format for it work - there's a list of models on the pyreft/pyvene site that lists compatible models.

  • @yaqubnaqiyev131
    @yaqubnaqiyev131 Před 3 dny

    How long have you been an ai engineer? I want to start i don't know where and what math and level of math needed. It would be very useful if you gives clear road map

  • @souravbarua3991
    @souravbarua3991 Před 28 dny +1

    Please do make a video on fine tuning LLM with your own dataset using transformers trainer. Thank you.

  • @codewithtj
    @codewithtj Před 26 dny

    Is that safe to train LLM on corporate machine ?

  • @laurenbliss8709
    @laurenbliss8709 Před 4 dny

    Hello Nicholas can you help on how to build a resume filtering using machine learning and NLP to quickly and accurately filter and rank resumes based on job description.Can you please if there is any step by step video I can watch for it,kindly recommend

  • @cihan1403
    @cihan1403 Před 3 dny +1

    anyone knows how to push the finetuned model to hf hub?

  • @rijo1254
    @rijo1254 Před 28 dny +9

    Hi bro can you redo beginner to advance ml projects for 2024 , big fan was with you since 2k subscribers ,😊😊😊😊❤❤❤❤❤

    • @DadtotheMax7
      @DadtotheMax7 Před 27 dny +1

      I second this Notion lol

    • @NicholasRenotte
      @NicholasRenotte  Před 27 dny

      Oh, like the old series? Just staight up Deep Learning projects from scratch?

    • @rijo1254
      @rijo1254 Před 27 dny

      @@NicholasRenotte nope , the beginner to advance ml projects , thanks for replying, big fan 🥰🥰🥰🥰🥰

  • @damoncraige7613
    @damoncraige7613 Před 23 dny

    Hi Nick. Great video. Im trying to run the face mask detection model from your previous videos. I think the Object detection api is outdated i have reached out to you on fb. can you please help me out for a few minutes?

  • @qureshimustaqahmed1221

    Hey, can you make Voice based payments application for regional languages??

  • @wasgeht2409
    @wasgeht2409 Před 28 dny +2

    THANK U

  • @yolow8126
    @yolow8126 Před 27 dny

    Hey I am an old sub watching your great content for about a year I have a challenge for you how about you build an LLM from scratch and then use human reinforcement learning plz

  • @sentinelspace
    @sentinelspace Před 28 dny +2

    Yay UTS!

  • @AbulHassankakakhel
    @AbulHassankakakhel Před 27 dny +1

    Hi, Nick we are waiting for the Transformers lectures.

    • @NicholasRenotte
      @NicholasRenotte  Před 27 dny

      Yeah was trying to punch it out this week, realised I had a ton more to do on it. This week is probs qlora.

  • @ashleysami1640
    @ashleysami1640 Před 28 dny +1

    👏🏽

  • @mdfarhananis8950
    @mdfarhananis8950 Před 28 dny +5

    Please make long videos this 15 min thing is kinda too constrained

  • @thelyncan7534
    @thelyncan7534 Před 28 dny +1

    First

  • @MarxOrx
    @MarxOrx Před 28 dny +1

    First 😂

  • @salsabilafirus9413
    @salsabilafirus9413 Před 25 dny

    Hi Nic, I support your video, I try your old video from 2 years ago about Real Time Sign Language Detection with Tensorflow Object Detection and Python Deep Learning SSD and I have problem with step 2 Create TF records in 23:15 cause i got red output (error)
    here the error
    Traceback (most recent call last):
    File "C:\Users\ACER\Tensorflow\scripts\generate_tfrecord.py", line 27, in
    import tensorflow.compat.v1 as tf
    File "C:\Users\ACER\AppData\Roaming\Python\Python311\site-packages\tensorflow\__init__.py", line 45, in
    from tensorflow.python import tf2 as _tf2
    File "C:\Users\ACER\AppData\Roaming\Python\Python311\site-packages\tensorflow\python\tf2.py", line 21, in
    from tensorflow.python.platform import _pywrap_tf2
    ImportError: DLL load failed while importing
    hope you see my comment and help me, thanks before ✨