The Petabyte Pi Project

Sdílet
Vložit
  • čas přidán 31. 05. 2024
  • By far the most storage ever attached to a single Raspberry Pi. Massive thanks to 45Drives for making this possible!
    Check out their storage solutions at www.45Drives.com/
    Support me on Patreon: / geerlingguy
    Sponsor me on GitHub: github.com/sponsors/geerlingguy
    Merch: redshirtjeff.com
    2nd Channel: / geerlingengineering
    #PetabytePi #RaspberryPi #45Drives
    Mentioned in this video:
    - 45Drives (HUGE THANKS!): www.45drives.com/
    - Storinator XL60: www.45drives.com/products/sto...
    - My new rack build video on Geerling Engineering: • Massive Rack Upgrade f...
    - 3-2-1 Backup Plan video: • Backups: You're doing ...
    - GitHub issue with tons of detail: github.com/geerlingguy/raspbe...
    - Raspberry Shake (seismograph): raspberryshake.org/
    Contents:
    00:00 - The Petabyte Pi Project
    00:24 - Challenges
    01:35 - What is a Petabyte?
    02:00 - 45Drives sent a Storinator XL60!
    03:12 - Ripping out the Xeon
    04:57 - Replacing it with a Pi
    07:26 - Backplanes and power
    08:37 - Pre-build tests
    09:06 - So. Many. Drives.
    11:37 - Pi OS mods
    12:17 - First boot!
    13:12 - Did you try restarting?
    14:42 - RAID 0 test
    15:51 - Don't yell at your JBODs
    16:39 - ZFS test
    16:59 - Btrfs test
    18:46 - Back to the basics
    19:50 - 192 days
    20:59 - So why, PCI?
    21:57 - Don't skip this part
    Music attributions:
    The Curtain Rises by Kevin MacLeod
    Link: incompetech.filmmusic.io/song...
    License: filmmusic.io/standard-license
  • Věda a technologie

Komentáře • 2,3K

  • @izzieb
    @izzieb Před 2 lety +5925

    More bottlenecks than a Coca Cola factory.

  • @bartz0rt928
    @bartz0rt928 Před 2 lety +2161

    60 HDDs in RAID 0 is the definition of "all gas no brakes"

    • @staceixan
      @staceixan Před 2 lety +88

      Glorious, glorious RAID 0 🤩

    • @aim-at-me
      @aim-at-me Před 2 lety +78

      glass cannon lol

    • @monad_tcp
      @monad_tcp Před 2 lety +35

      Who needs raid, why aren't you using LVM and just mounting it as direct-IO.

    • @ampex189
      @ampex189 Před 2 lety +12

      And no steering

    • @himmelsrand7527
      @himmelsrand7527 Před 2 lety +27

      @@monad_tcp Who needs LVM. Just use ZFS.

  • @Axel_Andersen
    @Axel_Andersen Před 2 lety +53

    I had this colleague who told this story:
    I had boss who did not understand anything about computer or electronics. When ever I was trouble shooting something he would come and watch over my shoulder and comment "Have you checked the power?" This was very annoying as he did not understand anything. What made it doubly annoying that his advice was spot on so many times.
    Check the power!!

  • @myrkat
    @myrkat Před rokem +74

    Pushing the limits of "hackey" tech is what most hardware and software engineers should be shooting for. Very well done. Kudos to 45 Drives!

  • @RaidOwl
    @RaidOwl Před 2 lety +1806

    A petabyte and a raspberry pi, the crossover we didn’t know we needed. This is sick.

    • @plica06
      @plica06 Před 2 lety +17

      Except it's so slow it's like writing to one of those fake Chinese 2TB USB keys! Yes I fell for that scam on eBay.

    • @adrianteri
      @adrianteri Před 2 lety +4

      @@plica06 Why would you want a 2TB Thumb drive/Flash drive instead of an external HDD? They wear more quickly due to storage technology. Heck even utilities like #Ventoy now support HDDs which are more reliable and you can carry more OSes...

    • @AmruthReddi
      @AmruthReddi Před 2 lety +8

      literally Pi-tabyte lol

    • @dieSpinnt
      @dieSpinnt Před 2 lety

      @@adrianteri Man, it is a joke, a statement and also a confession by plica06.
      A joke like Jeff's project. Which doesn't matter, because it is not about if it is useful or practical, but: Showing that it CAN BE DONE!
      BTW Great job Jeff and respect to plica06!:)
      P.S.: I get it that you are joking, too: "They wear more quickly ..." NO! They don't wear, because they aren't real! Hehehehe

    • @dieSpinnt
      @dieSpinnt Před 2 lety +1

      @@adrianteri On the other side, when I take your comment serious ... I can't take you serious:) A portable media, 2022 in the 2TB range ..... and you say spinning media? That must be some back-to-the-future joke I do not understand.
      Anyway, misunderstanding plica06's grotesque comparison and taking it as a spin(a straw-man!) to compare USB-Sticks[2] against enterprise level hard disks (as in the video, or consumer ones doesn't matter) is like comparing apples to hard drives. Enterprise level HDDs have to be compared to enterprise level SSDs. Latter one beat the first in every point, including price[1], reliability, data safety and energy efficiency in that sector.
      [1] The purchase price is simply irrelevant compared to the maintenance costs, since the former is already planned in the product and can simply be written off after the service life of the medium.
      [2] actually 2TB(etc.) NVME "USB-Sticks"(miniature-Adapters) exist. Try to beat their MTBF (vs. spinning media that IS transported around) ... ;)

  • @SaxaphoneMan42
    @SaxaphoneMan42 Před 2 lety +1160

    the PetaPi seems like a winner, I can only imagine the folks at 45 drives watching this with a mix of awe and horror.

    • @hannes-
      @hannes- Před 2 lety

      PetaPite

    • @glenncaughey5044
      @glenncaughey5044 Před 2 lety +27

      Just like watching two ships colliding. 😎🚢

    • @BradCozine
      @BradCozine Před 2 lety +53

      It IS a petabyte file server... but probably SHOULDN'T go with the name "PetaFile".

    • @michaelterrell
      @michaelterrell Před 2 lety

      @@BradCozine PETAFile, = a database of People Eating Tasty Animals!

    • @JasperJanssen
      @JasperJanssen Před 2 lety

      @@BradCozine petaPile?

  • @RickSaffery
    @RickSaffery Před rokem +18

    This is a bonkers crazy setup. It's so mad I just had to watch, 100%. Kudos to you for getting it to work at all. Your persistence is inspiring.

  • @t7732155980
    @t7732155980 Před rokem +4

    Even though the idea is crazy, Jeff knows his craft. The tips at time 09:00 on how to select hard drives for the task are priceless!

  • @MarcoGPUtuber
    @MarcoGPUtuber Před 2 lety +179

    I think i heard Seagate in the US having a heart attack watching drives being juggled.
    I live in Taiwan.

  • @happilicious
    @happilicious Před 2 lety +377

    How about one controller to one pi, then stripe 4 pis together, that should increase the throughput and scale down the issue to a more managable chunk, and still a pi project.

    • @JeffGeerling
      @JeffGeerling  Před 2 lety +189

      This is probably the most reasonable way to do it, and use Ceph (or another network-based filesystem)... and indeed I will be testing that out soon. Still bottlenecked but probably more reliable and would not run into as many errors on the PCIe bus!

    • @qazwsx000xswzaq
      @qazwsx000xswzaq Před 2 lety +16

      I think you will need more than 4 Raspberry Pis unless the pool is for backup or archiving purposes only.
      Even a low-to-mid end Intel or AMD system would make more sense.. though then the project will lose its “Piliness”. 😂

    • @paulz1780
      @paulz1780 Před 2 lety +6

      @@qazwsx000xswzaq Wirh these Network Speeds you would need 1 Pi per Disk to max out the disks🤣

    • @qazwsx000xswzaq
      @qazwsx000xswzaq Před 2 lety +8

      ​@@paulz1780 We can then expose each drive as an iSCSI target and aggregate them at a server over ethernet. And viola we have a not-so-poor man's SAN haha. It reminds me of those shiny new network addressable NVMoF SSDs btw.

    • @jamesfmilne
      @jamesfmilne Před 2 lety +1

      Gluster might be a good option too.

  • @ChrisContin
    @ChrisContin Před rokem +2

    Fascinating! A lot of work! The issue it seems is having to tether to the hard-disk through a band shared with other drives, rather than universal. There is a technique where using data-points written to each hard-drive mathematical computations can rewrite and measure the data for inference machiningly- without even using the shared io-bus anymore! You’d see a slowdown in computation monologue but all speed ahead on as many drives as you want!

  • @zb9458
    @zb9458 Před 2 lety +3

    Jeff you never cease to deliver, you're an absolute legend, great video!!

  • @twiceineverymoment
    @twiceineverymoment Před 2 lety +383

    "I was going to unbox this on camera but Fedex already did"
    As someone who's had multiple expensive items destroyed by Fedex and actively avoids doing business with companies that ship with them, I felt that...

    • @junkbob6832
      @junkbob6832 Před 2 lety +18

      The reason for FedEx packages being more damaged than USPS packages actually isn't FedEx's fault. They accept heavier packages than USPS, so that means if a shipper doesn't pack their package correctly, when that 150lb box of farm tools slams into your 6lb box of plastic in the sorting machine, your package gets crunched. Meanwhile USPS only goes up to 50(?) lbs, so your box just doesn't get slammed as hard.
      TL:DR your packages get damaged in FedEx because the shipper didn't pack them right. Source: I ship around 5 packages a day with all 3 major US carriers and rarely have an issue since I actually package my shit

    • @twiceineverymoment
      @twiceineverymoment Před 2 lety +27

      @@junkbob6832 That doesn't mean Fedex isn't partially at fault. Especially because on the rare occasion that I do get a damaged shipment from UPS, they leave a note with a number to call offering to pay for a replacement if the item is broken. Fedex just delivers my $300 PC case looking like a truck backed over it and acts like it's no problem. And that's without even mentioning the countless packages that never showed up at all, or that got delivered to the wrong address and sent me on a trip across town trying to find it.

    • @topquark22
      @topquark22 Před rokem +5

      Yep. I had a laptop battery crunched by FedEx. It's a wonder that they can stay in business this way.

    • @atlantic_love
      @atlantic_love Před rokem

      You just didn't package correctly, how is that FedEx's fault?

    • @adwilson0286
      @adwilson0286 Před rokem +4

      In 2021 I had 5 items fulfilled through FedEx. 1 never arrived, 2 arrived damaged, 1 arrived late, 1 was early and undamaged.
      "20% delivery reliability!" 😵‍💫

  • @SeanHodgins
    @SeanHodgins Před 2 lety +403

    Do I need a petabyte of storage? No. Would I mount one in my rack? Yes.

    • @kevinbissinger
      @kevinbissinger Před 2 lety +29

      30 years ago: Do I need a Gigabyte of storage? No.

    • @monad_tcp
      @monad_tcp Před 2 lety +4

      I absolutely need it to store my GPT-3 training data. Turns out having 16 GPUs wasn't enough, but I run out of space even before running out of compute time. Its probably more on the scale of 1PB + 8192 GPUs.

    • @monad_tcp
      @monad_tcp Před 2 lety +1

      10 years from now, Do I need a Petabyte, yes.

    • @sparkyispog
      @sparkyispog Před rokem +6

      do i really need 128 gb of storage on my mac?
      "your mac is almost out of storage"
      wait- whY IS MY SYSTEMS FOLDER 70GB
      whY does microsoft word take up 2gb?

    • @geraldcreager4432
      @geraldcreager4432 Před rokem

      @@kevinbissinger I still remember buying my first gigabyte hard drive. Kept a grad student in school.

  • @BrenskiIP
    @BrenskiIP Před 2 lety +4

    As someone who is about to endeavor on a new NAS project, this was a fun watch! Thanks!

    • @JeffGeerling
      @JeffGeerling  Před 2 lety +2

      just keep upgrading the drives, I'm sure there'll be 1 PB 3.5" drives in like 20 years 🤪

  • @bryanenglish7841
    @bryanenglish7841 Před 2 lety

    You are an absolute madman and I'm rooting for you the whole way

  • @45Drives
    @45Drives Před 2 lety +358

    Woah.

  • @KlausWulfenbach
    @KlausWulfenbach Před 2 lety +261

    1942: "We need to figure out a solution to digitally store dozens of bytes at a time. Vaccuum tubes, maybe? This is going to cost us millions, but it will be worth it to finally have accurate artillery range tables!"
    2022: "I'm going to hook up this petabyte of data storage to this cheap single board computer!"

    • @vaisakhkm783
      @vaisakhkm783 Před 2 lety +41

      1969: "we need to figure out how to use our cutting edge 1.5 million doller, 4kb ram 32kb harddrive to bring people moon"
      2025: lauching a rocker with a single board computer...

    • @Nordlicht05
      @Nordlicht05 Před 2 lety +6

      Wait until the average person doesn't click on a video below 16k 120fps. There will be always a way to fill it 😅 but I remember it's gotten better over time.

    • @NovemberOrWhatever
      @NovemberOrWhatever Před rokem +11

      @@vaisakhkm783 Pi's and Arduino's are already used for avionics on model rockets, and cubesats can have total build costs of like $50,000. It's amazing how far the industry has come and is going

  • @jerryw1608
    @jerryw1608 Před rokem +30

    Well, pairing €20.000 - €24.000 worth of drives with a 100 dollar raspberry seemed like a solid plan to start with. Having it run a 1pb raid0 config while hosting that storage seems like a task made for this little arm processor 😂 Nice to see you try out such extreme things with the raspberry😀

  • @abdusaidabduraufov5615
    @abdusaidabduraufov5615 Před rokem +2

    My God! 20TB 60HDD I never dreamed of such a volume

  • @AlbaxArcade
    @AlbaxArcade Před 2 lety +287

    I always like to think that every time Jeff publishes a new video, the raspberry pi design team feels a disturbance in the force.

    • @jstan5802
      @jstan5802 Před 2 lety +15

      As they should, it's been 3 years, where's the raspberry pi 5?

    • @UnderEu
      @UnderEu Před rokem +2

      I find your lack of faith disturbing 😅

    • @Thinktank-rn6dm
      @Thinktank-rn6dm Před 7 měsíci

      @@jstan5802 you're gonna be happy

    • @stickmanland
      @stickmanland Před 5 měsíci

      @@jstan5802 Here.

  • @CLU2O10
    @CLU2O10 Před 2 lety +278

    Linus would be proud

    • @hardwarefromthegarbage3446
      @hardwarefromthegarbage3446 Před 2 lety +42

      Torvalds too. Nice example of the Linux versatility

    • @shivsankermondal
      @shivsankermondal Před 2 lety +16

      electroboom too.

    • @TheBacktimer
      @TheBacktimer Před 2 lety +9

      I was waiting for the water bottle :D

    • @rvmiv_
      @rvmiv_ Před 2 lety +6

      It's a good example of why the ltt pedibyte project is so expensive

    • @subhimesto7123
      @subhimesto7123 Před 29 dny

      ​@@rvmiv_
      First of all and most importantly, the speed, those were gen 4 nvme ssd drives with extreme speeds, second of all this is just a storage rig but what ltt built is a server, I mean have you seen how they run NASA simulation?
      And lastly there drives are extremely reliable

  • @Echobar
    @Echobar Před 2 lety

    Jeff another wonderful video. Keep up the great work.

  • @rollerboogie
    @rollerboogie Před měsícem

    As an HDD engineer usually drives are built to compensate for vibration due to certain fan RPMs. Especially if we have a big customer we'll optimize things for the frequencies of vibration in their trays.

  • @tyrdchaos
    @tyrdchaos Před 2 lety +114

    I hope LTT sees this. Great stuff as always, Jeff!

    • @LyokoisGreat2
      @LyokoisGreat2 Před 2 lety +6

      I wondered how long it would take for LTT to be mentioned in the vid

  • @scbtripwire
    @scbtripwire Před 2 lety +27

    Oh my goodness, that juggling of those drives.😳

    • @wayland7150
      @wayland7150 Před 2 lety +30

      Every IT engineer has a stack of broken hard drives just for juggling with.

    • @JeffGeerling
      @JeffGeerling  Před 2 lety +10

      Haha true.

    • @SyphistPrime
      @SyphistPrime Před 2 lety +7

      @@wayland7150 yep, pretty much. My old boss when I worked in computer repair used laptop hard drives to level out his microscope. Funny thing is the drives weren't even dead, they were just something like 160GB 5400RPM drives that were more useful for that task than storing data.

  • @markonfilms
    @markonfilms Před 2 lety +1

    This is an awesome project. Makes me wanna build a large storage server.

  • @InterprisesTV
    @InterprisesTV Před rokem +3

    Jeff, I'm catching up with thx, since I've admired your ability for a while. I am an old geek and you do magic. Reminds me of my S-100 days with CromixOS.

    • @spoils8179
      @spoils8179 Před rokem

      1 month ago and no recognition? Especially for a $50 USD donation? Sadge.

    • @InterprisesTV
      @InterprisesTV Před rokem

      @@spoils8179 Thanks for the sentinent, Aiden, but no problem. Hope he's doing well, and you as well for that matter. 👍

  • @MichaelDude12345
    @MichaelDude12345 Před 2 lety +60

    I saw you posted some on the homelab subreddit last week, THIS is what you were hiding from us?? What a fun idea. Can't wait to see your next project!

  • @jmr
    @jmr Před 2 lety +114

    You tried all the things I wanted to see. You know your audience!

    • @JeffGeerling
      @JeffGeerling  Před 2 lety +21

      The last thing I wanted to try (even had it in the final edit but cut it for time) was 4x hardware RAID cards... but I only have one on hand. I was thinking of setting up 4 hardware RAID 6 arrays, then uniting them on the Pi as a RAID 0 array and seeing if that performed better since individual drives would all go through one HW raid card, and that would also give redundancy.
      (And who said hardware RAID is dead? You still need it if your computer performs like one from 2010!).

    • @Winnetou17
      @Winnetou17 Před 2 lety

      @@JeffGeerling Yeah, exactly, who said hardware RAID is dead ? Clearly unrelated, did you ever thought of doing a collaboration with Wendell ? Him and Red Shirt Jeff would surely push themselves to insanity :D Seriously speaking you two seem to do similar exploratory courses, though, of course, he's much less Pi-centric.

  • @adrianaa3059
    @adrianaa3059 Před rokem +2

    I think Arthur C Clarke hypothesized that this would be enough to store a few people's minds into it

  • @communitycollegegenius9684

    I just completed a 24 X 18T build that's almost 1/2 PB. I bought the drives a few at a time all Segate recertified it seems to be the sweet spot in price for me. I had to upgrade and rebuild things several times. Your video was just like my experience switching OS, FS, etc. I had endless drives/arrays just dropping out, mostly on startup. I tried Fedora, Centos, Open Suse, Suse JeOs, and Ubuntu Server all let me down for one reason or another. I tried them with various shares, raid arrays and file systems; plus not all would run my app. I ultimately got it working with Ubuntu workstation and the Ubuntu share - no samba. I'm not happy with ZFS. I had to kill the swap and add more RAM (which meant a new motherboard) to keep ZFS cache from crashing. I solved the dropouts problem by putting the drives really close to the host board and using short/expensive data cables that are all the same length. I'm using a very old Athlon FX-8300 8 core and 64 gig of Ram. I found a great last generation Adaptec 52445 raid card new old stock. I had to install 2 power supplies and rewire one to all molex to get enough amperage on the 5 volt rail. New 1200 watt supplies have plenty of 12v but almost no 5v power. I also upgraded to 2.5 gbps network card. The write speed is NOT stellar. With Raid0 it goes real fast at first 450mbps filling all that cache, but slows down to about 150mbps. With a single JBOD I only get 130mbps, two drives at the same time still go 130 each, and I can transfer to 4 drives at the same time before it bogs down the network and write speed drops to 70 each drive or 280 total. I already got 4 sas expanders and plan to continue adding drives (and power supplies) up to 2.5 PB. My box is an old IBM 2401 tape drive converted to rack space. I yell at the You Tube screen, not my computers. That's not true I also yell at my computer at work (it's windows).

  • @MarcoGPUtuber
    @MarcoGPUtuber Před 2 lety +12

    Jeff Geerling: My storage setup registers on the Richter Scale

  • @TheOleHermit
    @TheOleHermit Před 2 lety +39

    Glad to see Red Shirt Jeff back. I was wondering what happened to him.
    Geez, your projects are soooo extreme and cutting edge. Your troubleshooting processes are very informative and helpful to your viewers, who otherwise wouldn't have a clue where to look. We never see that on "How to" setup videos, where everything just automagically works.
    BTW, I recently used Styrofoam to separate 2 prototype boards while testing them out. It got hotter than expected and the foam sagged, allowing the power rail of one board to touch the other board, which fried as soon as I powered up the following morning.
    Don't try this at home, folks. Cardboard insulation good. Styrofoam bad.
    Thanks for sharing Jeff. I'm fully confident that you'll get your bandwidth up on the 60 HDD RAID. Looks to me like you're already nearly there.
    Instead of installing it inside a server rack, perhaps a locked cage would be a better idea to protect it from Red Shirt Jeff. 😎

    • @JeffGeerling
      @JeffGeerling  Před 2 lety +5

      Heh, cardboard insulation 'better', but I now have it on a 3D printed box that's a little more secure too.

    • @TheOleHermit
      @TheOleHermit Před 2 lety +2

      @@JeffGeerling Good point. 👍
      Now, my 2nd attempt is safely mounted inside a custom acrylic case, too. One less thing to worry about, amongst the multitude of other potential mistakes. Live & learn, eh?
      Just gotta keep putting one foot in front of the other until the final goal is achieved, right?
      The hardware for the Teensy Laser Synth waveform module w/ILDA DAC is complete. Only need to flush out the code to full functionality, before moving on to the custom MIDI controller.
      BR 😎

  • @StrongbowTX
    @StrongbowTX Před 2 lety

    OK, this is the kind of cool content I like seeing appear on my birthday! (even if I didn't see it until 5 days later.) Just ONE of those drives is more storage than everything in my house, and I do Unixy networky stuff for a living. :)

  • @Bronathan
    @Bronathan Před 7 měsíci +1

    This showed up in my recommendations. I have no clue what happened here and I know nothing about data managment or IT in general. But I watched it to the end not realizing that this vid is 22 minutes long 😅good content 9/10

  • @haidenshober6732
    @haidenshober6732 Před 2 lety +23

    Many Pi’s running something like Minio might allow for some interesting single box hardware redundancy. Also might be able to get over the 1Gb limitation since you’d have many pi’s each with their own 1Gb connections.

  • @wildbikerbill6530
    @wildbikerbill6530 Před 2 lety +5

    When I first saw the title I figured this would be one of your massive multiprocessor/multi-Pi projects combined with a massive amount of storage. Something closer to 20 TB per Raspberry Pi. At 60 Drives and 60 Raspberry Pi's, that would still be way beyond any normal homebrew project.

  • @ryda2l
    @ryda2l Před 2 lety

    Awesome video & project!

  • @z98i87
    @z98i87 Před rokem

    This tutorial helped me so much!! Thanks!

  • @caseyhefner1966
    @caseyhefner1966 Před 2 lety +26

    Peta-Pi go BRRRRR
    Im genuinely surprised this worked, very impressive. As for what to do with it, a video on downloading/hosting a local copy of Wikipedia would be pretty cool.

  • @maxd7228
    @maxd7228 Před 2 lety +4

    Jeff, you have carved yourself a niche channel in an overcrowded tech community, taking the Pi to new heights in every video. Keep it up. Loving every video. I'm left astounded on the Pi's capabilities and untapped potential.

  • @EDATEC
    @EDATEC Před 9 měsíci +1

    It's impressive that this works at all.

  • @ronguin7062
    @ronguin7062 Před 2 lety +1

    This guy is the only human capable of winning in a debate with Data from star trek.

  • @nezu_cc
    @nezu_cc Před 2 lety +27

    My NAS is a pi4 with a desoldered USB chip to expose the PCIe bus. It's connected to a PCIe switch(to improve transmission and prevent crashes) and then to a cheap ASMedia SATA controller. The kernel is also patched to force PCIe gen 1 to prevent crashes(this is probably the same thing that happened to you in the video btw). PCIe gen1 is still faster than gigabit so it doesn't matter. Over samba or NFS I can get the full gigabit speed even on large data transfers so no bottlenecks there. Would I recommend this setup? no. Does it work? hell yeah (longest uptime was like 90 days or something and then a power outage killed it, I need to get(or more likely make) a UPS, I know). If anyone wants photos lmk

    • @JeffGeerling
      @JeffGeerling  Před 2 lety +6

      Nice. Coreforge also suggested forcing Gen 1 speeds elsewhere in the comments, so I may need to test that out.

    • @mthqwork123
      @mthqwork123 Před 2 lety

      @@JeffGeerling That was my first idea as well. Those switches are used to run gen1 or maybe max gen2, when used for GPU mining. Above that it will throw errors.

  • @b00573d
    @b00573d Před 2 lety +38

    Be careful with those sata to molex adapters...they are prone to fires!

  • @thomasgoirand488
    @thomasgoirand488 Před rokem

    Hi there! I work for Infomaniak, and I am managing storage networks. We offer backup services and connected drive (we call it kDrive, it's a bit like google drive, just we protect your data and they are hosted in Switzerland).
    Adding 96 Exos 20TB HDD in my swift storage cluster is what I do every day. I manage "moderately large" Swift clusters. On them, we add 6 2U servers at a time, each of them holding 16 HDD (so 6*16 = 96 HDDs). In total, I calculated that I am managing more than 5000 spinning drives in our OpenStack swift clusters, which amounts probably around 100PB (so not just 1 PB like you're doing...). Just this week, I added 12 HDD storage nodes, and 6 proxies (with 2x SSD each, plus the system drive), and 9x NVMe storage nodes (10 NVMe each, to be used in a Ceph cluster).
    Your idea of a Raspbery-Pi is fun, but instead of one RPi for all HDDs, I would setup one RPi *PER* HDD, and then it all makes sense, and you may have decent speed.
    BTW the casing you bought seems of very bad quality compared to what we get from HPe or Lenovo.
    I've been running this service for nearly 5 years now, and we never lost a single bit of data... :)

  • @aajpeter
    @aajpeter Před 2 lety +1

    The slow storage movement. Bytes aged to perfection, not for those in a hurry but for folks who appreciate a good long wait for quality unbalanced spectacle in design.

  • @whitey4986
    @whitey4986 Před 2 lety +37

    Hey Jeff, I’ve been using your ansible roles for almost 10 years. Love seeing you around HN and the other traps, great to see your CZcams channel is doing so well. Very cool projects!

  • @devnol
    @devnol Před 2 lety +7

    One thing I saw you doing when wiring up the NAS was that you connected the molex adapters to two sata power connectors on the same line. If the psu has another line, try connecting to that, as each line can draw a limited amount of current and there might be an issue there. Then again, the issue might be anywhere else but that's a thing you can easily try

  • @soulofjacobeh
    @soulofjacobeh Před rokem

    What PCIe Gen (e.g. Gen3) does the CM4 carrier run? I have some experience trying to push FPGAs over those crummy 1x risers, and almost always had to drop to PCIe Gen2 or even Gen1 to maintain signal integrity over even the shortest USB cables.
    I'm curious if it would be more stable, at the cost of being even more painfully slow on the upper end.
    Edit: Oh wow just saw the follow-up video to this, and sure enough, dropping to gen2 improved stability a bit, and dropping to gen1 seemed to make things solid. Would not surprise me if just about all of it came down to the riser and USB cable carrying PCIe signaling.

  • @badwolf1984
    @badwolf1984 Před 2 lety

    Ripps out high end setup for a PI! You Monster! Though a Fun Project to play with a pi, maybe in the future with a PI 14 hope you enjoy your new Perabyte Server

  • @crashmatrix
    @crashmatrix Před 2 lety +9

    Well Jeff, one thing's for sure, if you and others aren't pushing the pi to bleed on the edge, no progress will ever be made in this direction. I'm not sure what kind of useful stuff this direction will yield, but it surely will yield something. Keep on pushing ya madlad!

    • @JeffGeerling
      @JeffGeerling  Před 2 lety +6

      My hope is the next Pi at least has the PCIe bus bugs sorted so any card will 'just work'. After that, any more bandwidth they could squeeze out would be appreciated.
      The CM4 is actually great for many 1 Gbps network use cases-but with a little more bandwidth, it could be great for 2.5 Gbps (or heck, more than that if we're dreaming!).

    • @levygaming3133
      @levygaming3133 Před 2 lety

      @@JeffGeerling with a USB3.something port, it could get almost-5Gbe, so not that much of a stretch for faster than 2.5gbe

  • @jaxxarmstrong
    @jaxxarmstrong Před 2 lety +23

    60x USB-connected HDDs... I'm telling you, a missed opportunity :D

    • @marcogenovesi8570
      @marcogenovesi8570 Před 2 lety +5

      Yeah use the maximum possible number of USB hub daisy chaining

    • @JeffGeerling
      @JeffGeerling  Před 2 lety +22

      Heh, the USB controller would probably just set itself on fire!

  • @stuffinfinland
    @stuffinfinland Před rokem

    Yet another absurd carage setup. Just love it!

  • @Neilhuny
    @Neilhuny Před 2 lety +15

    Absolutely fascinating and completely, totally barmy! What an immense amount of work you obviously put in to this - research, sweet talking 45Drives, research, RAID solutions, research, talking to 45Drives techs, research, etc.
    I am *extremely* impressed, both by you AND by 45Drives for their courage!
    And my abiding thought? "Chassis" is pronounced "shassey", not "chassey"! Pfft!
    So I looked up chassis in Cambridge Dictionary to back my obviously accurate opinion and ... well, blow me, North Americans really do say "chassey"!
    Every day is a learning day! Pedantry isn't good

    • @JeffGeerling
      @JeffGeerling  Před rokem +2

      lol we North Americans are weirdos. Or maybe you are... I guess it's a matter of perspective, neighbour!

    • @Neilhuny
      @Neilhuny Před rokem

      @@JeffGeerling I have to picture you in dungarees, a battered straw hat, and with a long stalk of grass hanging from you mouth when you call me neighbour! Hock-diggardy, or something like that

  • @SpiritmanProductions
    @SpiritmanProductions Před 2 lety +100

    Maybe the CABD order is like the firing sequence of a 4-cylinder petrol engine lol 🤔

    • @falxonPSN
      @falxonPSN Před 2 lety +6

      The amount of power this thing draws could probably be measured more easily in horsepower, so you're not wrong there!

    • @JasperJanssen
      @JasperJanssen Před 2 lety +4

      @@falxonPSN the PSU isn’t that big - a horsepower is roughly 750W, so it’s going to be around the 1-2 mark.

    • @falxonPSN
      @falxonPSN Před 2 lety +3

      @@JasperJanssen fair enough. I can't argue with good pedantry! 🤪

    • @fohkukohgeki
      @fohkukohgeki Před 2 lety +2

      @@JasperJanssen Could run a petabyte server off a lawnmower engine...

    • @falxonPSN
      @falxonPSN Před 2 lety

      @@fohkukohgeki this is the project we need to see!

  • @volvaary2724
    @volvaary2724 Před rokem

    Thanks for this idea to store my stuff.

  • @braydennturner
    @braydennturner Před 2 lety +5

    Amazing content as always, keep up the great work!

  • @MrSnuffyX
    @MrSnuffyX Před 2 lety +4

    It almost hurts to watch the board being removed. I've been wishing for a storeage like this for decades.
    Still a very interesting project.

  • @jeffreyumeh8580
    @jeffreyumeh8580 Před rokem

    Hu, I know that modern cases have that 1 stud so you can place your motherboard in on the stud which makes it easier to align the rest of the screws, but even then every motherboard I have installed has had 8 - 9 screw holes, sometimes there is a heatsink or something in the way of having all 9 screw holes, but 8 is the minium I have seen, this is only for full ATX though not ITX / mATX.

  • @miolini
    @miolini Před 6 měsíci

    Integrating a Pi compute module with Ethernet into each hard drive can make scaling easier. It lets each drive connect to a network independently, simplifying data handling in big storage setups.

  • @MegaManNeo
    @MegaManNeo Před 2 lety +9

    **CM4:** Hey Jeff! What are we going to do today?
    **Jeff:** You will handle 60 enterprise grade hard drives.
    **CM4:** Oof

  • @JPToto
    @JPToto Před 2 lety +18

    This is amazing. Good job charming 45drives into sending the case! They must be Red Shirt fans 🤣

  • @97oweb
    @97oweb Před rokem

    The real impressive thing ist that he was able to get a pi compute module 4
    I am trying to find one now for almost a year but the only place I even found one listed was on Amazon in the us, it is sold out since then, here in europe I did not even find a seller who listed it

  • @gmaildinozz
    @gmaildinozz Před 2 lety +6

    "I was thinking about unboxing it on camera but fedex already did". Best punchline :'-)

  • @mahtin
    @mahtin Před 2 lety +11

    Even before doing a raid or zfs test; I’d have run a single drive test (either benchmark or simple linear read/write). Loop that 60 times and see success. Then repeat with two drives in parallel and loop 30 times. Redo with three in parallel 20 times, etc etc. When you start seeing failures you really will see the cause-effect point. There is technically no reason why this won’t work with 60 drives - if you ignore performance. Any bugs that are exposed that can be fixed will simply improve the base users world. This is an awesome test that pushes the RPi and kernel to the limit. Making this work at that limit helps all of us just running one drive. Plus, the errors you saw, as bad as they were, should somehow restart the drive (without a reboot).

    • @JeffGeerling
      @JeffGeerling  Před 2 lety +6

      When those errors occurred, the HBA reset itself and the drives always came back-at least 15/16 of them!
      That was one concern from the Broadcom engineer I spoke with, and the reason he really wanted me to run the latest firmware. Unfortunately due to time constraints I couldn't flash all the cards in a separate PC then bring them back to the Pi and re-test. But I plan on trying that out.

    • @JeffGeerling
      @JeffGeerling  Před rokem

      Extra testing has been done-tl;dr the breaking point is 3 cards (or more than 30 direct attached drives). But forcing PCIe Gen 1 speed also fixes the issue. More to come in my next video!

  • @tankgrrl
    @tankgrrl Před rokem

    You gotta admire a guy who takes $50k worth of server and... plops a Raspberry Pi in it. Brave and crazy. :)

  • @pantegministries
    @pantegministries Před rokem

    I have a Pi running RAID1 with 2 External SSDs and on times I have to reboot as it seems to forget that the disk exists. That's Pi 4 and Samba and a Window share. So to get as far as you did its amazing.

  • @niklasxl
    @niklasxl Před 2 lety +3

    i was guessing and hoping this was the project in progress when i saw the rack update video :D

  • @zambonidriver42
    @zambonidriver42 Před 2 lety +8

    I have smaller versions of those EXOS drives.
    >200 drives, over the past 5 years, I’ve had 4 failures. 3 covered under RMA. 1 had just expired.

    • @JeffGeerling
      @JeffGeerling  Před 2 lety +5

      For every anecdote (usually it's "all my Seagate drives exploded in giant fireballs!"), there's an opposite anecdote. In aggregate, if drives like these were truly failing at the rates some people think, Seagate would not be in business :)

    • @llortaton2834
      @llortaton2834 Před 2 lety +5

      @@JeffGeerling The reason all of this information was popularized in the first place is because of backblaze's reports (back in the day) but if we look at their stats now, year over year, seagate is constant.

    • @marcogenovesi8570
      @marcogenovesi8570 Před 2 lety +4

      EXOS and businness drives in general are fine, it's the consumer lines that are more "hit and miss", but even then it's easy to find a pattern unless you buy hundreds of them. I.e. a brand doesn't just "consistenly fail 4x more than another"

  • @_caith
    @_caith Před 4 měsíci

    I built my first datahoarder set up using nothing but 8tb SSDs XD a whole bunch of them. solves the issues of "spinning" in an easy way

  • @ronwatkins5775
    @ronwatkins5775 Před rokem

    Is there a Raspberry PI which fits inside a standard size 3.5" drive enclosure? I don't plan to use a SATA connector, just want the form factor. I was thinking of stacking a 3.5" NAS vertically with a PI stacked on top, so it would need the same form factor as the drives.

  • @hoagy_ytfc
    @hoagy_ytfc Před 2 lety +5

    Jeff doesn't have the word "why" in his vocabulary :)
    (Just kidding, things like this are fun, which is "why" enough for me).

  • @marcogenovesi8570
    @marcogenovesi8570 Před 2 lety +6

    45drives marketing team is on a roll

    • @45Drives
      @45Drives Před 2 lety +4

      hi

    • @wayland7150
      @wayland7150 Před 2 lety

      Were they taking ab big chance? What if Jeff had proved you could do it all better with a PI? Hahaha.

    • @YeOldeTraveller
      @YeOldeTraveller Před 2 lety

      @@wayland7150 Not much of a chance. They are well aware of the performance of their product, and the limitations of a Pi.

  • @ferrinheight
    @ferrinheight Před 25 dny

    I work with a 100x drive server similar in design that uses 8 fans. I am sure with the pi/limited activity those 2 fans will suffice but withe the stock internals I doubt the drives will be reliably cooled below tdp. You would need to keep it a room controlled to

  • @weimenli7342
    @weimenli7342 Před rokem +1

    When the price of hard drives keeps falling, and the price of Raspberry Pis keeps riding.

  • @falcychead8198
    @falcychead8198 Před 2 lety +4

    As a St. Vincent fan, I'm calling it "Pietabyte" whether anybody else does or not.

  • @zambonidriver42
    @zambonidriver42 Před 2 lety +6

    How many times did you recompile the kernel?

  • @red03golf
    @red03golf Před 11 měsíci

    Jeff - I mourn you missed seizing the opportunity to officially name this The Pi-tabyte Project - a portmanteau teed-up for a long drive, but you duffed it, lol. - great vid, keep 'em coming, I'm having a grand time doing some of your projects. Cheers.

  • @Tannius
    @Tannius Před měsícem

    If nothing else, this gave me an idea for converting an old outdated and unsupported six bay NAS box to actually run on a pi. I was trying to figure a way to put six drive on a single pi and the PCIe adaptor with an HBA seems like a solution. I'll need to play around with it but the backplate the drives plug into in the NAS itself may actually be an HBA daughter board. We'll see, but at least I now have another avenue of attack.

  • @BioToxin
    @BioToxin Před 2 lety +3

    I would really like to see you build a cluster based file system like gluster or ceph, run that on a pi cluster with 2 to 4 drives per pi, then show performance

  • @davidmcken
    @davidmcken Před 2 lety +16

    I'd almost be interested in seeing each raid card assigned to 1 PI and then them clustered together.

  • @cervyvin
    @cervyvin Před rokem

    Very interesting watching "The PP Project"! :D

  • @RedVRCC
    @RedVRCC Před 11 dny

    1:54 _looks at my 3.5tb unity project and blender folder that's still rapidly growing_
    "Yeaaaah sure"

  • @Space_Reptile
    @Space_Reptile Před 2 lety +6

    Next step: 3.14PB on a PI

  • @Finkelfunk
    @Finkelfunk Před 2 lety +6

    If lsblk starts to run out of letters and shows drives as "sdaa, sdab, sdac" etc. you know you have a data hoarding problem.

  • @marcodebortoli
    @marcodebortoli Před rokem

    It was only about time that the new storage measurement of reference was going to be the petapite... I'll show myself out :D

  • @xWatexx
    @xWatexx Před 7 měsíci

    Dude can fit 0.01% of the internet on this damn thing. Insane.

  • @jbrown-acuity
    @jbrown-acuity Před 2 lety +5

    Would love to see this with a RISC V processor

  • @popcorny007
    @popcorny007 Před 2 lety +5

    They must really trust you to borrow $35k+ worth of disks

  • @brandonjohnson3566
    @brandonjohnson3566 Před 8 měsíci

    Ive always wondered if there was a way to make a external ram board like hp has one that adds 4 ram slots on a pcie. Ive ran into issues of bottlenecked performance not because of cpu but because ram capped out. The pi has only max of 8gb so i would assume max speed on drives could only be met with maybe 6 to 10 drives at most. If you were to add an additional 128gb i wonder if that would help out. Plus the raid cards you had i couldnt tell if they had their own memory or passthrough.

  • @ksp2viking
    @ksp2viking Před 5 měsíci

    Omg... 11:42
    The hilarity of having all that storage and still booting from a micro SD card :D

  • @Jimmy_Jones
    @Jimmy_Jones Před 2 lety +7

    "I grabbed a small piece of cardboard to insulate the boards from each other. At least temporarily."
    Yeah. Sure.
    I think red shirt Jeff was trying the break back into the room.

    • @wayland7150
      @wayland7150 Před 2 lety +1

      You can see red shirt Jeff is actually in the room with Jeff at one point in the video.

  • @2nd_bloxx
    @2nd_bloxx Před 2 lety +2

    *happy linus noises*

    • @JeffGeerling
      @JeffGeerling  Před 2 lety +3

      How many days until "I lost all my data" video? :)
      But hey, if it summons Wendell from L1Techs, it's kind of a win-win.

  • @zer0reaction313
    @zer0reaction313 Před 2 měsíci

    My first thought was that you are going to store a petabyte of PI digits, but this is also interesting

  • @Captn_Grumpy
    @Captn_Grumpy Před rokem

    OMG the original hardware is so beautiful it brought a tear to my eye to see it removed. Given the resources I would have 2 or 3, one for local redundancy and one for off site but given net speeds the off site would mostly be pointless. And no I don't need one, I just want one so bad it feels like I need it :)

  • @loganiushere
    @loganiushere Před rokem +5

    it’s a shame you didn’t call it “PetaPi”

  • @GH05TYPLAYz
    @GH05TYPLAYz Před 2 lety +3

    dang, fedex unboxed it first huh.
    not surprised

  • @Kwpe_
    @Kwpe_ Před 7 měsíci

    The linus video is literally in the other tab on opera. I was bouta watch it next.