my AI model box

Sdílet
Vložit
  • čas přidán 18. 05. 2024
  • Setting up AI models on the DAS and speed comparisons - visual studio / virtual machine tests.
    Temperature/fan on your Mac: www.tunabellysoftware.com/tgp... (affiliate link)
    Run Windows on a Mac: prf.hn/click/camref:1100libNI (affiliate)
    Use COUPON: ZISKIND10
    🛒 Gear Links 🛒
    * 🍏💥 New MacBook Air M1 Deal: amzn.to/3S59ID8
    * 💻🔄 Renewed MacBook Air M1 Deal: amzn.to/45K1Gmk
    * 🎧⚡ Great 40Gbps T4 enclosure: amzn.to/3JNwBGW
    * 🛠️🚀 My nvme ssd: amzn.to/3YLEySo
    * 📦🎮 My gear: www.amazon.com/shop/alexziskind
    🎥 Related Videos 🎥
    * 🌗 RAM torture test on Mac - • TRUTH about RAM vs SSD...
    * 🛠️ Host the PERFECT Prompt - • Hosting the PERFECT Pr...
    * 🛠️ Set up Conda on Mac - • python environment set...
    * 🛠️ Set up Node on Mac - • Install Node and NVM o...
    * 🤖 INSANE Machine Learning on Neural Engine - • INSANE Machine Learnin...
    * 💰 This is what spending more on a MacBook Pro gets you - • Spend MORE on a MacBoo...
    * 🛠️ Developer productivity Playlist - • Developer Productivity
    🔗 AI for Coding Playlist: 📚 - • AI
    Repo
    github.com/open-webui/open-webui
    Docs
    docs.openwebui.com/
    Docker Single Command
    docker run -d --network=host -v open-webui:/app/backend/data -e OLLAMA_BASE_URL=127.0.0.1:11434 --name open-webui --restart always ghcr.io/open-webui/open-webui:main
    - - - - - - - - -
    ❤️ SUBSCRIBE TO MY CZcams CHANNEL 📺
    Click here to subscribe: / @azisk
    - - - - - - - - -
    Join this channel to get access to perks:
    / @azisk
    - - - - - - - - -
    📱 ALEX ON X: / digitalix
    #machinelearning #llm #softwaredevelopment
  • Věda a technologie

Komentáře • 83

  • @aliBoumedyen
    @aliBoumedyen Před 14 dny +4

    Never bored with this crazy experiments 💜

  • @froggy5967
    @froggy5967 Před 14 dny +3

    Easy Alex. Just get a 8TB M4 Ultra next time 😂

  • @JiBe128
    @JiBe128 Před 14 dny

    Thanks for your videos ! Love them. It would be very nice to get your review on a Sinology NAS, I am thinking of buying 1 of those.

  • @kilobitz8639
    @kilobitz8639 Před 14 dny +5

    Great video.

    • @AZisk
      @AZisk  Před 14 dny +2

      Glad you enjoyed it

  • @mahesh5452
    @mahesh5452 Před dnem

    Great stuff

  • @le_bouvier
    @le_bouvier Před 14 dny +15

    Get one of the Ugreen NAS. If you get teh 6 bay you get 6 3.5 Drive Bays, 2 m.2 Slots (aside from the OS drive) Finally it has Thunderbolt 4 ports in addition to the 10 Gbit ports

    • @AZisk
      @AZisk  Před 14 dny +10

      ordered :)

    • @eriglac
      @eriglac Před 14 dny

      oh yeah, totally. i would run it on TB if you can afford to buy TB drives. for a poor grad student like myself, it’ll just have to be a makeshift NAS and external drives.

    • @zezhenxu9113
      @zezhenxu9113 Před 14 dny

      Do not buy ugreen nas, they suck

    • @AZisk
      @AZisk  Před 14 dny +4

      @@zezhenxu9113Ive heard “they suck” about every piece of gear I use from one person or another. What are your reasons?

    • @eriglac
      @eriglac Před 14 dny

      They suck because I'm green with envy that I can't afford them. I don't know if ugreen with envy too.
      Ugreen should have an Envy line of products but I think hachpee got that covered. Thank goodness it's not Compaq or eMachines haha.
      Sorry I couldn't help it. Seriously though, wish I can afford that ugreen NAS. Have to make do with proxmox and truenas.

  • @tibbydudeza
    @tibbydudeza Před 14 dny

    Quick question - how many tokens per second do you get on say 8B and 70B local LLM on the Mac ???.
    I want to buy a server dedicated to LLM but adding an NVidia GPU to my PC is not what I had in mind - currently have a Radeon RX 6600XT - it spins up and makes a loud noise when using Ollama.

  • @sveinjohansen6271
    @sveinjohansen6271 Před 14 dny +1

    Just wait for the 400b model coming soon hehe

  • @carloseduardoalmeida6469

    Hey Alex, great content! Would love to see some practical examples of what you have been using LLAMA for.
    Don’t know if I got it right, but what are the advantages over using web ChatGPT, for example?

  • @rithikkumar7683
    @rithikkumar7683 Před 14 dny

    Please make a video which model a software dev should have and others model can be may have, because not all can have resources for this , thanks

  • @HMexperience
    @HMexperience Před 14 dny

    My exact same experience. My new laptop is way too small for AI models. I can only do a few 8B p. models before my SSD is full. Cloud based models will not go away anytime soon they are better and they are fast despite being in the cloud.

  • @EricHarmon67
    @EricHarmon67 Před 9 dny

    Would you suggest the Samsung SSDs with or without the heat sinks for that particular setup?

  •  Před 14 dny +2

    I wonder if a external storage with network connectivity would be fast enough. You could match it with a VPN like Tailscale and have your models available anywhere.

    • @AZisk
      @AZisk  Před 14 dny +1

      i’ll let you know when i get my NAS :) although might need to upgrade my network first

  • @razorgarf
    @razorgarf Před 14 dny

    why so many different AI models though, would be interesting to know what sets them apart

  • @Winnetou17
    @Winnetou17 Před 14 dny

    LoL I can't believe it! 4 SSDs in RAID 0 for gigantic speed, only to be bottlenecked by the 10 Gbps USB transfer rates :)) If that wasn't a bottleneck, those 4 drives, if they were decent PCI-E 3.0 ones, can go over 10 GB/s (that is gigaBYTES). Fast PCI-E 5.0 ones could probably go over 30 GB/s (I remember Corsair has a 10 GB/s SSD, so 4 of them + a bit of overhead should be able to do 30 GB/s). Anyway, the thing that triggered me was that Apple's SSD is much faster at 4:19 ... I really doubt it is. Compared to 4-RAID 0 normal SSDs, that is.
    Also the breakdown at 1:37 is pure gold. Thanks Apple!
    Edit: ok, wanted to check something and rewatched a bit. No mention of that USB 3.2 what type it is, but from the end tests on the Windows VM, reaching over 3 GB/s, makes me think it's actually a 20 Gbps (USB 3.2 gen 2x2 F*** the USB comitee for these absolute i-diotic names). Still, 20 / 8 means only 2.5 GB/s theoretical, more like 2.0 GB/s practical so where's the 3.3 GB/s coming from ? Not sure.
    Also realized that the SSDs are Samsung 980 (not Pro), which is PCI-E 3.0, so around 3 GB/s each (it even says it on the box at 2:46 ). So the mention at 3:27 "It's only USB 3.2 But you don't need than 'cuz the fastest drive in there is gonna be uuhm 1 GB/s" is VERY wrong.

  • @abduislam23
    @abduislam23 Před 14 dny

    So using this solution, I should not care about space customization while making purchasing decisions?

  • @tutacat
    @tutacat Před 10 dny

    Why keep below 34b than 7b. Or just keep the quantized version, you can delete or store in 16bit/8bit

  • @AlmorTech
    @AlmorTech Před 14 dny +1

    No way, how big SSD is big enough for you, monster! 😅

  • @RomPereira
    @RomPereira Před 14 dny +3

    Proxmox + truenas on an inexpensive mini pc (intel n305, if not thunderbolt) with 2.5 Gbit ethernet with thunderbolt or USB 3.2 port eith this DAS box.

    • @AZisk
      @AZisk  Před 14 dny

      i thought about doing this, but then just ordered the new ugreen nas instead :)

  • @DS-pk4eh
    @DS-pk4eh Před 10 dny

    Just download more storage (and RAM)?

  • @terencedodge3249
    @terencedodge3249 Před 14 dny +1

    So much fun…

  • @trenxnet
    @trenxnet Před 14 dny

    🤣🤣 I had the same problem and configured a NAS with some n100 mini pcs, then it wasn't enought so I got a new PC with a 4090 and like 16TB storage. LLMs are the perfect excuse to need storage.

  • @OlegShulyakov
    @OlegShulyakov Před 14 dny +2

    Some day you’ll just buy Synology

  • @RedDragon72q
    @RedDragon72q Před 14 dny

    you can buy the SD card adapter that allows an SD drive to be inserted sideways. I did that and put a 2T SD in there and with that card the the 2Tb SSD in my M3 I have a ton of room for models on the SD card. Love it.

    • @AZisk
      @AZisk  Před 13 dny

      what model do you have?

    • @RedDragon72q
      @RedDragon72q Před 13 dny

      @@AZisk M3 Pro 16 with the Max chip and 64 GB 2TB. I bought this to hide the SD card. BASEQI UHS-II Aluminum microSD Adapter for 2021 M1 MacBook Pro 14 & 16” (Silver) Model USHii-420A

    • @DanielHarrisCodes
      @DanielHarrisCodes Před 13 dny

      @@RedDragon72qWhat are the speeds like on it? I got a Transcend JetDrive for my M1 Pro MacBook and TBH haven’t really used the storage for anything. It’s too slow for most things but it’s there if I need it for storing large files. I keep a backup on my Parallels VM on there but it’s too slow to actually run from

    • @RedDragon72q
      @RedDragon72q Před 13 dny

      @@DanielHarrisCodes standard speeds for an SD card, maybe a bit slower on read for some reason. I keep long term files and models on it. Loading the model takes a bit longer but once it it loaded you're all good.

  • @eriglac
    @eriglac Před 14 dny

    haha. omg alex, seriously put your stuff on a NAS or an external drive. i put everything either on NAS, Dropbox (if i need to share with my lab), or on external drive (spinning disks). have you considered doing a hackathon for those near you?

    • @AZisk
      @AZisk  Před 14 dny +1

      i have a dropbox subscription. i’m sick and tired of the costs associated with it, and the lack of immediate availability of my data. NAS is next

  • @gadaao
    @gadaao Před 14 dny

    وماذا عن كمية الشحنة الكهربائية داخلها كيف نعرف

  • @max75025
    @max75025 Před 14 dny

    why ollama not LMStudio?

  • @ElbayMalik
    @ElbayMalik Před 14 dny

    What is your old time machine? Could you show us?

    • @AZisk
      @AZisk  Před 14 dny

      yes, i’m considering making a vid

  • @_jerieljan
    @_jerieljan Před 14 dny

    If you're eating that much storage, then yeah, you should really be offloading them when not in use to a NAS or external media. It's not like you'll use all these models and whatever quantization or version they have at all times, right?

  • @edvardasjuodakis7644
    @edvardasjuodakis7644 Před 14 dny

    Why not to just remote desk into a desktop?

  • @dtesta
    @dtesta Před 14 dny +2

    Wait wait wait! Hold up! So you are using usb 3.2? So maximum 20gbit, giving you like maximum 2500mb/second. Slower than what ONE of those nvme drives can do! What exactly do you think you gain by putting them in a stripe raid???

    • @DS-pk4eh
      @DS-pk4eh Před 10 dny

      Probably the total capacity of all 4. Maybe a bit better than just JBOD.

    • @dtesta
      @dtesta Před 10 dny

      @@DS-pk4eh With JBOD, he would not lose ALL data if one drive fails. The stripe raid give no benefit at all in this setup. Stripe raid is for maximising throughput at the expense of seek-time, as all drives needs to seek for one read. Does not hurt as much on SSDs of course, but still hurts.

  • @Scarrus666
    @Scarrus666 Před 14 dny +1

    That's a lot of money for "only" computing.

  • @BelarusianInUk
    @BelarusianInUk Před 14 dny

    For your sd raid 0 you are limited by usb3.

  • @mattisrensen9162
    @mattisrensen9162 Před 14 dny

    Why use a das when you can use a nas, so you can also stream films and series + run your vms

    • @AZisk
      @AZisk  Před 14 dny +1

      already ordered

  • @ericy91745
    @ericy91745 Před 14 dny

    Why not use services like Backblaze to increase your cold storage space? Yes, you don’t get the convience of local redundancy, But it’s cold storage! If local HDD fails, get the copy online.

    • @AZisk
      @AZisk  Před 14 dny +1

      Ideally I should, but I don't like paying monthly storage fees.

  • @williamsquires3070
    @williamsquires3070 Před 14 dny +7

    Now put a sign on the black box that says, “do not feed the A.I.” 😀

  • @AndreasMolnar-Dev
    @AndreasMolnar-Dev Před 14 dny +2

    Why didn't you get a dedicated AI server?

    • @AZisk
      @AZisk  Před 14 dny +4

      if i build out a server like that, i’ll want to spec it out with nvidia stuff, and i’m waiting to see what the 50xx series do

  • @itzhexen0
    @itzhexen0 Před 14 dny +2

    Wow, look at that shit.

    • @AZisk
      @AZisk  Před 14 dny +2

      Check it out!

  • @sativagirl1885
    @sativagirl1885 Před 14 dny

    Alex, you need to show #AI who is THE BOSS (you).
    Put each LLM on a 2TB ext. USB so they don't conspire to take your fame & fortune and go to Las Vegas to gamble with other #AI

  • @tutacat
    @tutacat Před 10 dny

    Fine tuning doesn't mean software development.

  • @adrimathlener8008
    @adrimathlener8008 Před 14 dny

    remember Bill Gates:
    Here’s the legend: at a computer trade show in 1981, Bill Gates supposedly uttered this statement, in defense of the just-introduced IBM PC’s 640KB usable RAM limit: “640K ought to be enough for anybody.”

  • @asksearchknock
    @asksearchknock Před 14 dny +1

    RAID 0 is not raid… the clue is in the name 😂😂😂😂

    • @AZisk
      @AZisk  Před 14 dny +2

      lol. i suppose we can just call it AID :)

    • @asksearchknock
      @asksearchknock Před 4 dny

      @@AZisk I have at one time or another used:
      Risky Arrangement Inviting Disaster
      Really Awful Idea for Data
      Reckless Architecture Ignoring Durability
      Reliably Arranging Imminent Deletion

  • @aeonlancer
    @aeonlancer Před 14 dny

    I guess professional video editors are the piggest ones

  • @HadesTimer
    @HadesTimer Před 14 dny

    Wow, Alex DIDN'T get sponsored for this? Who'd you piss off man? Every other creator has one of these and they are all sponsored.😅

  • @mlnima
    @mlnima Před 5 dny

    are you kidding me? if you download others along llm 2 tb is like a joke xD

  • @Aygross
    @Aygross Před 9 dny

    Raid 0 is stupid your limited by usb not the drives .

  • @leomogiano27
    @leomogiano27 Před 14 dny +2

    second comment :)

  • @michalrybinski3233
    @michalrybinski3233 Před 14 dny

    Right off the bat, bro, Ironwolfs pro instead of exos? most probably you have overpaid dearly for inferior product...

    • @AZisk
      @AZisk  Před 14 dny

      they were pricey. the pros were recommended for das, why exos are better?

    • @michalrybinski3233
      @michalrybinski3233 Před 14 dny +1

      @@AZisk pretty much twice the MTBF, and twice allowed TB/year