More POWER for my HomeLab! // Proxmox

Sdílet
Vložit
  • čas přidán 24. 07. 2024
  • Use code christianlempa at the link below to get an exclusive 60% off an annual Incogni plan: incogni.com/christianlempa
    Thanks to Incogni for sponsoring this video.
    In this video, I will be sharing my experience of building a Proxmox Cluster with 2 Nodes in my Homelab. This allows me to unite the power of my servers, migrate VMs, and achieve High Availability for all my services. I will be discussing the pitfalls and challenges I faced during this project, and sharing my tips and tricks for successfully creating a Proxmox Cluster. Get ready to learn and have some fun with me!
    References
    - Minisforum BD770i Testing: • This mainboard is PERF...
    - Silverstone RM23-502 Mini: kit.co/christianlempa/my-favo...
    ________________
    💜 Support me and become a Fan!
    → christianlempa.de/patreon
    💬 Join our Community!
    → christianlempa.de/discord
    ________________
    Read my Tech Documentation
    christianlempa.de/docs
    My Gear and Equipment-*
    christianlempa.de/kit
    ________________
    Timestamps:
    00:00 - Introduction
    02:27 - Proxmox cluster planning
    03:40 - This server case is amazing
    05:15 - Preparing the Proxmox server
    06:32 - Setup the Proxmox network
    08:19 - Building the Proxmox cluster
    10:20 - Migrating VMs on Proxmox cluster
    11:55 - Online migration benefits
    13:41 - How to get quorum in a 2 node cluster?
    15:16 - HA testing and what I need to improve
    16:35 - Final thoughts and future projects
    ________________
    All links with `*` are and/or include affiliate links.
  • Věda a technologie

Komentáře • 213

  • @christianlempa
    @christianlempa  Před 3 měsíci +5

    Use code christianlempa at the link below to get an exclusive 60% off an annual Incogni plan: incogni.com/christianlempa
    Thanks to Incogni for sponsoring this video.

    • @KILLERTX95
      @KILLERTX95 Před 3 měsíci

      Hey Christian. Just so you know, you don't need a dedicated "nic" for proxmox clustering. Just a dedicated network, a vlan works more than fine. Using a vlan also means you can bond your NICs giving you overall better performance and redundancy.
      I recommend in your case, bonding the SFP ports, then bonding the bond with the inbuilt ethernet for fail over. At that point you can vlan out to your hearts content :). This is also how I handle my ceph network.
      (My team and I manage a proxmox cluster at work, I only watched this video because I really enjoy your content)

  • @Angyrr
    @Angyrr Před 3 měsíci +1

    Been following your vids for a while and they just keep getting better. Finally have the confidence to press forward with my homelab thanks to you. Rock on brutha!

    • @christianlempa
      @christianlempa  Před 3 měsíci

      Thank you so much for your support 👊 bro! Glad it helps you with your homelab 🫶

  • @scockman
    @scockman Před 2 měsíci +1

    Christian, I have been running Proxmox for ~3 years now in my home lab and used a TrueNAS shared storage solution. But mid-year last year I decided to add a 3rd node to my setup and implemented an all SSD Ceph Storage. So far it has been flawless.

  • @wstrater
    @wstrater Před 3 měsíci +6

    I vote for a CEPH cluster! I would love to see you set up a CEPH Object Store and File System on your 3 ProxMox servers instead of relying on a TrueNAS server or other NAS. I understand that CZcams content creators have a need for a lot of storage and high speed networking but not the average person running a home lab. Having a “local” file system for your work load that is mirrored to other servers would be very appealing.

  • @jkotran
    @jkotran Před 3 měsíci +20

    Christian, you can do a poor man's HA with local disk by scheduling replication of your VM's. That way a copy of your VM is standing by on the failover node. Also, I didn't have luck with SSD RAID for hosting VM's. The cost was exorbitant, the capacity was too low for my purposes, and the i/o wasn't much better. I went with more HDD's and an SSD ZFS read/write cache. P.S. You're doing great work. Thank you for sharing and for your joyful presentation style.

    • @lensherm
      @lensherm Před 3 měsíci

      Exactly! I used to have a 3rd server running TrueNAS Scale with NVME shared storage. Eventually I retired it, since I got a little tired of the extra high-pitched 1U noise. Now I just have NVMEs in the two ProxMox servers for VM storage and have set up replication for every 5 minutes. Works fine for my needs.

  • @solverz4078
    @solverz4078 Před 3 měsíci +42

    FYI, check your VM CPU type is not "Host" as both PVE hosts will definitely need the same CPU if "Host" is choose but should be fine with most of the other CPU Types.

    • @joost00719
      @joost00719 Před 3 měsíci +6

      This is what I do. I can migrate a vm from an Intel based node to an amd based node

    • @solverz4078
      @solverz4078 Před 3 měsíci

      @@joost00719 *Live migrate*

    • @Darkk6969
      @Darkk6969 Před 3 měsíci

      @@joost00719 Same here and works fine. I also have a mix Intel and AMD CPUs.

    • @zyghom
      @zyghom Před 3 měsíci

      I am wondering: I have 2 machines, one is Ryzen 7 and another is Ryzen 9 - would in this case "host" be ok or not? I understand it is abouch architecture nor really the CPU type

    • @joost00719
      @joost00719 Před 3 měsíci

      @@zyghom Just use kvm64

  • @jttech44
    @jttech44 Před 3 měsíci +29

    I run proxmox at work, with a big cluster, multiple shared storage devices. It works great, and, I'm happy to see you added a qdev to your cluster, because it really is needed to maintain cluster integrity.
    As far as shared storage goes, any cheap NAS will work. I'd recommend something that has NFS shares. For homelab, gigabit is mostly fine unless you have a ton of storage. The price jump networking wise to 10gbps isn't bad, but, the storage price jump that can actually utilize that network speed can be quite expensive.

    • @shephusted2714
      @shephusted2714 Před 3 měsíci +1

      I just went to 2.5 - it was unimpressive but going to jumbo frames adds 35% boost

    • @kristopherleslie8343
      @kristopherleslie8343 Před 3 měsíci +1

      A 10 gig card is dirt cheap

    • @jttech44
      @jttech44 Před 3 měsíci

      @@kristopherleslie8343 Yes, as I said, networking isn't too bad, getting storage fast enough to actually use it is fairly expensive.

    • @jonathandavis4711
      @jonathandavis4711 Před 3 měsíci

      Isn't any cheap NAS a massive failure point that can take out the entire cluster?

    • @kristopherleslie8343
      @kristopherleslie8343 Před 3 měsíci

      @@jonathandavis4711 depends how you build things. Just saying you have something doesn’t imply it works or is bad. How was it setup and engineered?
      It’s people with raspberri pies as nas. You can use the synology os on x86.

  • @KingsOfNerdonia
    @KingsOfNerdonia Před 3 měsíci +4

    I have successfully running a Proxmox cluster with 2 nodes and a quorum device in my small home lab with a simple storage sharing solution without external network shares. Both nodes have a dedicated SSD with the same size just for this purpose. How to set it up? On the first node you go to disks, create ZFS, select the disk and give a name (this name should be a general name, not dedicated to a node, like 'pve-data' or something similar), select add storage, everything else on default and check the box on the disk you want to have as storage in the device list, click on create. The first node now have a new storage. On node 2 do the same, go to disk, create zfs, now important use the same name like on node 1 (in my example 'pve-data') and also it is very important to deselect the checkbox 'Add Storage', select the disk from the device list and click on create. After that you can go to Datacenter -> Storage, select the ZFS storage 'pve-cluster' and click on Edit. In the dialog you select all nodes in the dropdown menu named Nodes. After that the nodes will be come the storage listed as local storage on each node. With replication tasks for each VM or LXC you can set how often a storage of an VM or LXC will be synchronized between the nodes. In HA you can also set the list of containers or VMs you want to have migrated live between the nodes. You have to do this for each VM or LXC container, but it works very well for me. Thank you for covering this Proxmox topic. Hope to see more of this.

  • @cbaservs
    @cbaservs Před 3 měsíci

    so many good pointers Christian. thank you so much for this valuable information

  • @joshuakunz5368
    @joshuakunz5368 Před 3 měsíci +20

    Christian during a Stream: "you know energy is realy expensive here, i was thinking about downscaling"
    Also Chsiatian: "wanna se my new 2 node pve cluster"😂😂😂
    Ps. Love your content

  • @fedefede843
    @fedefede843 Před 3 měsíci +4

    Nice work. Proxmox is such a nice project. I came here attrscted by the 2 nodes tittle. I imagened the voting issue and the qdevice to help out. HA is really nice. As you said a 3rd fully cspable node allows you Ceph among other things. And that is another rabbit hole.
    Regards Christian!

  • @NianNordic
    @NianNordic Před 3 měsíci

    Haven't seen the video... Already "LIKED" it! You KNOW it's good stuff when mr. Lempa uploads! THX for the TOP QUALITY content, as always!! 👌💯

  • @ivanmaglica264
    @ivanmaglica264 Před 3 měsíci +19

    I run two node Proxmox cluster at home with one of VMs being a router. Nothing beats migrating router to second node, while performing upgrades on first one 😀

    • @ryuzakisama
      @ryuzakisama Před 3 měsíci

      Agree. I have the same setup

    • @zyghom
      @zyghom Před 3 měsíci +4

      nothing beats router to be on ... dedicated machine ;-)

    • @Carlos-Rodrigues
      @Carlos-Rodrigues Před 3 měsíci

      Agree. Same setup here too.

  • @royborgen
    @royborgen Před 3 měsíci +1

    Currently I am not running a cluster, but have plan for this in future. I was planing on failing services over, but I see that is going to take more than initially expected. At least I can do like you are doing now.

  • @djcmike
    @djcmike Před 3 měsíci

    Got a cluster and am quite happy with it. I've changed hostnames on them as well, without much trouble, so it can be done.

  • @MarcBehar
    @MarcBehar Před 3 měsíci +1

    I was just looking at this case, glad to see it in use

  • @dmkanter
    @dmkanter Před 3 měsíci

    very fun! Thanks for the great video! have a beelink mini pc (intel) as primary, and an old 2012 MacBook Air (intel) that I recently put in a cluster. They are sharing storage for backups. Migration works great so far, even with the CPUs being different models. Looking forward to your SSD NAS storage in the next 1/2 of the year.

    • @christianlempa
      @christianlempa  Před 3 měsíci

      Awesome! Thanks, and stay tuned for the storage server video :D

  • @stum9y867
    @stum9y867 Před 3 měsíci +1

    Hi @ChristianLempa one note i made was when you used qdevice when running HA you need to ensure that the q device has root access which I had to do to get my cluster working but great video I use 1 dell 5090 & lenovo in my ha setup pm cluster & have a raspberry pi 4 (which also runs 24/7/365) as my q device but great content

  • @AlexOlivieruk
    @AlexOlivieruk Před 3 měsíci +5

    You can do HA with two nodes by setting up a replication job of the VM between the two nodes. That way there is an (older) copy of the disks on the other node so HA will be able to start up on the 2nd node.

    • @solverz4078
      @solverz4078 Před 3 měsíci +2

      ALL cluster nodes need to have the latest live data for it to be classed as HA. Otherwise you could run into some nasty split brain situations depending on what you are running in the VMs.

  • @PhilMalle
    @PhilMalle Před 3 měsíci +1

    Thanks for the video. I have a proxmox cluster running, but I need to reinstall proxmox on the cluster. There are a few things I didn’t take care at the first time. Hopefully with your video it will be better :)

  • @mistakek
    @mistakek Před 3 měsíci

    Yes, there are some quirks with a 2 node proxmox cluster. I run the same, but no qdevice. I just have a direct 10Gb connection between them. I did it for a similar reason as you, I wanted fast migration between the servers.
    Fun fact, I'm running PBS in HA with the virtual disk running on a 1Gb link to my Synology NAS, and the PBS datastore is on the Synology also. It works quite well considering it's going over 1Gb.
    I've backed and waiting on the Zima Cube Pro to arrive as that will become my 3rd Proxmox node running TrueNas VM configured for my HA storage with a 10Gb link and NVME storage available.

  • @DustinShort
    @DustinShort Před 3 měsíci +4

    You should also be able to set the "CPU TYPE" under your VM hardware to a version supported by both processors (like x86-64-v2/3/4 processors) to improve live migration compatibility. PVE has a ton of options for each generation of chips going back to 486, so you can really tune in the instruction sets made available to the VM this way. Might be worth testing if you have a mixed cluster environment.

  • @ivanmaglica264
    @ivanmaglica264 Před 3 měsíci +4

    For HA you definitely have to assess what you want vs. what you actually need. Having HA cluster with only single node shared storage, you are back to square one on being fully redundant.

    • @christianlempa
      @christianlempa  Před 3 měsíci +1

      True! That's why I see HA in a Homelab only as a fun/hobby/testing project, not really a prod environment.

  • @JonatanCastro
    @JonatanCastro Před 3 měsíci +6

    @christianlempa I recently created my Proxmox node and had the same problem when migrating VMs, but I fixed it by selecting on the VM configuration > Hardware > Processors > Type > x86-64-v2-AES. This works flawlessly between my Intel N100 and my i7 6700T

    • @mistakek
      @mistakek Před 3 měsíci

      I was going to mention this also but you beat me to it.
      This is the way for different CPU's. Just find out the common instruction sets and use the highest x86-64-Vx so your cpu's can use those instruction sets.

    • @christianlempa
      @christianlempa  Před 3 měsíci

      Thanks! That's a good tip, I will try that :)

  • @Excited-IT-Architect
    @Excited-IT-Architect Před 3 měsíci

    Hi Christian, thanks a lot - another great video! 👍 - Would you please also share which NIC you installed and if you are satisfied with it (for Proxmox usage)? Thank you.

  • @bumblingwelshman
    @bumblingwelshman Před 3 měsíci +1

    I was running a 3 node cluster but have put them back to standalone nopw, after ~12months it ate through the alwayson nodes OS ssd to around 97% wearout, this was a cheap consumer SSD so would deff recommend either a enterprise ssd for the OS or a hdd and configure it as a mount point for the logs and quorum db to write to.

  • @ThatNateGuy
    @ThatNateGuy Před 3 měsíci

    That's so cool! I enjoy seeing your setup grow. I'm considering building such a cluster myself. Given the date/time synchronization requirement, I wonder if it would be worth implementing PTP with something like a Time Card.

  • @Shirosak1
    @Shirosak1 Před 3 měsíci

    I'm running a truenas cm with 4x4TB NVMe drives just for VM migration. Works very for VM migration and very quickly. If you can, it's definitely worthwhile to have a look

  • @shephusted2714
    @shephusted2714 Před 3 měsíci

    really good content - take this to its ultimate conclusion is all i would say - think about 2 node 25 or 40g cluster - no switch needed - you could use the 56g dual port cards (40gbe) for cluster and then a 10g management network - the netfs is a good one to dig into - ceph/nfs/zfs/gluster? i think you are going to have to experiment - maybe try some nvme arrays? thanks for the content!

  • @ewenchan1239
    @ewenchan1239 Před 3 měsíci

    You don't actually NEED a super fast storage for the CTs/VMs for migration.
    It can help when/if you are booting up/shutting them down regularly, but if they're running pretty much 24/7, once the CT/VM is booted up, it doesn't really need much (unless you are doing I/O intensive tasks).
    I have three N95 Mini PCs and run Ceph between them over GbE, and it works just fine.
    If the CT/VM disks are actually on the shared storage, I have live-migrated my CTs/VMs between the nodes in as little as 8 seconds (because it doesn't need to move the CT/VM disks around since it is already on shared storage).

  • @Erllz
    @Erllz Před 3 měsíci

    I am running a similair setup in my homelab. 2 nucs, 1 pi zero with ethernet hat as qdevice. Also configured HA.
    Ive setup ZFS on the nucs with replication. Therefore not requiring a shared Storage. The nucs are connected to 1gbps connection but thats fast enough for my lightweight vms and containers.

  • @TheInfinitiGuy
    @TheInfinitiGuy Před 3 měsíci +4

    Great video as always. To address your shared disk with 2 nodes issue you could use Stormagic SvSAN which takes local drives from each node and presents it as a iSCSI shared disk to the cluster. They also directly support Proxmox. Give it a try.

    • @ehink2716
      @ehink2716 Před 3 měsíci +1

      Do they have a free/community edition?

    • @danilfun
      @danilfun Před 3 měsíci +3

      What are the advantages compared to CEPH?

    • @ehink2716
      @ehink2716 Před 3 měsíci

      @@danilfun I currently run CEPH on proxmox and my VM's are slow, I am using all ssd disk and think it maybe the 1gig network and how CEPH replicates the data. Reads are fine but writes are really slow get like 25meg writes.

    • @TheInfinitiGuy
      @TheInfinitiGuy Před 3 měsíci

      @@danilfun I'm pretty sure you need 3 nodes to use ceph

    • @christianlempa
      @christianlempa  Před 3 měsíci

      Thank you! Great tip :D

  • @mzaferyahsi
    @mzaferyahsi Před 3 měsíci

    I am using the cloning to make sure that my vms/containers are migrated automatically if one of the nodes are down. That way, I don’t have to deal with the network storage which brings its own problems.
    Also you can change the votes in corosync config. That’s how I ran my cluster for a while where my main node had 2 votes and slave had 1 votes.

  • @romayojr
    @romayojr Před 3 měsíci +5

    i’ve been running proxmox 3 node cluster with ceph ha for over 6 months now and it’s been stable so far. i even upgraded to proxmox 8.1 from pve 7 without any issues. curious why you decided not to go this route. maybe an opportunity to make this topic a 2 part series? 😅

    • @Darkk6969
      @Darkk6969 Před 3 měsíci +4

      Probably due to cost in electricity and hardware. He might pick up another AMD motherboard to run as third node in the future. He's also thinking of what home lab users would do. For us home labs we tend to reuse what we can find. I use mostly old Enterprise hardware from work as it was going to be e-wasted anyway. Figure I give it a second lease on life. Plus I source old gear from e-bay.

  • @parrybedi-du5vf
    @parrybedi-du5vf Před 16 dny

    Hi @Christian Lempa, would you mind sharing the model of Be Quite power supply you used in this build and confirm if you used 120mm fan on top of the CPU heating? Would greatly appreciate 😊

  • @tomegb
    @tomegb Před 3 měsíci +3

    You don't necessarily need a shared storage for HA. You can also replicate the VM disk on both nodes. Nevertheless, a very nice video for beginners!

  • @armoredstarfish921
    @armoredstarfish921 Před 2 měsíci

    so I run a two node cluster, one machine is a custom build that is used for things like media streaming / home lab and NAS (TrueNAS Scale VM with SSD's passed directly into it) and is only powered on when I need it (WOL), the second is on a Zimaboard that runs a Docker LXC for my 24-7 services and PiHole DNS LXC which is all working great. As I didn't need (or want!) HA services on Proxmox (I disabled them as they seem to wear SSD's out super fast for some reason?) and one of the nodes would be switched off most of the time I looked into getting a qdev but found out you don't need one if you set "two_node: 1" & "wait_for_all: 0" in the "quorum" section of "/etc/pve/corosync.conf". If one of the nodes is down the VMs will all boot like normal and can edit the VM configs, backups run, etc.

  • @tvojejbabkydedko
    @tvojejbabkydedko Před 2 měsíci

    how did you manage to access the cluster domain directly? where you were able to choose on which node to log in?

  • @Ekosusiloharjo
    @Ekosusiloharjo Před 3 měsíci

    Hi Christ nice tutorial, but i have question how about the case is migrate and shut down the master after migrate it

  • @user-dv5nx3wu8q
    @user-dv5nx3wu8q Před 3 měsíci

    try a vm based cluster for first steps with separate virtual netwerk segment. Works fine as a teaser.

  • @asunder1980
    @asunder1980 Před 3 měsíci

    I use replication with HA and have different sync schedules depending on the VM importance. All of my hosts have storage pools with the same name and I’ve been able to fail over pretty effortlessly. The main issue I have with HA is when I do a switch update, I have do disable HA otherwise all of my HA hosts will shutdown when a total loss of connectivity is detected bring down my whole lab even tho I have fail over connections on a separate switch. Still trying to figure that out. lol.

  • @trungnguyen6708
    @trungnguyen6708 Před 3 měsíci

    with 2 node+Qdevice, we can run HA using replicate function that buildin proxmox, set time to sync VM disk.

  • @casey1027
    @casey1027 Před 3 měsíci

    I am running a 3 node cluster which shared NFS storage on a NAS

  • @Glitch_860
    @Glitch_860 Před 3 měsíci +1

    Nice Video! How about a 5 NODE Proxmox cluster!!! Using an NAS for VM backups!

  • @smolicek90
    @smolicek90 Před 3 měsíci +1

    You can try running DRBD Linstor on your cluster for online replicated storage (block devices). Im running few clusters this way ;)

    • @Darkk6969
      @Darkk6969 Před 3 měsíci

      I've tried CEPH. So looking into using Linstor.

  • @vot4va
    @vot4va Před 3 měsíci

    You can make a replication job to node B, the migration is faster then, because only the icrement to the last replica have to transfered (+RAM Data). If node A dies, over the HA function, node B boots up the latest replica of the vm.

  • @NiceDevil
    @NiceDevil Před 3 měsíci +1

    Hey Chris, take a look at CEPH and maybe you can build a network over USB4 thunderbolt? ;) at least this is what I run on a 3 node cluster with NUCs 13ths gen.
    But even then: it takes around 2-3 min after a Node dies to run the HA migration. But not because of slow network 😂 getting around 20-30 gbit over the TB4 ports

  • @pquodling
    @pquodling Před měsícem

    Doing similar at the moment - Just with Intel NUCs a pair of NUC6i7 with a smaller NUC as the third node, all using my Synology NAS for NFS Storage...

    • @christianlempa
      @christianlempa  Před měsícem

      Nice! Do you have any performance problems over network? And do you use 10G?

  • @JirkaHarcarik
    @JirkaHarcarik Před 3 měsíci

    I tried two nodes proxmox cluster too a couple of months ago, but I ran into some weird unstable situation when one of the nodes goes off, I think the second node frees until the first goes online or something like that, so I noticed that is not a good idea.

  • @Yehok17
    @Yehok17 Před 3 měsíci

    Quick question: I have 2 hosts as well, I dont really want to get a 3rd either for HA to work better. Can a Raspberry Pi 5 per your video be used also as a Voting member and it would work a bit better spinning up the VMs on the other host with the Pi being up? The one I was gonna get is the 8GB ram and 128GB storage micro sd card. Thanks, just looking for general feedback on this idea.

  • @MRPtech
    @MRPtech Před měsícem

    My home lab is simple 3x N100 mini PCs in Proxmox HA config. Synology DS423+ (NFS) used as shared storage. This HA thing save me so many times when one node goes down.
    Even with my local 1Gbps network everything feels snappy.

    • @christianlempa
      @christianlempa  Před měsícem

      Wow, that is awesome to head! I didn't know if the 10G would be fast enough, :D Do you use an SSD on your NAS btw?

  • @RockTheCage55
    @RockTheCage55 Před 3 měsíci

    FYI I tried to do a qdevice inside of a docker container & it was a mess. I end up messing up my entire cluster. I got it added to the cluster but it was non-voting. While messing around with it i borked my entire cluster so i probably won't be doing that again.

  • @scubeedu2
    @scubeedu2 Před 3 měsíci

    So I've been wanting to give this a try, but after reading some of the comments below regarding hardware requirements, I'm thinking it may not be worth the effort. For the average home-lab guy who has a windows SFF pc (NTFS file sharing / WSL2 for linux docker stuff), an HP micro-mini pc (Proxmox) and a raspberry pi4 (docker) all making up the environment, do you think this is something that is workable or possible by adding an additional Proxmox node running on a capable Dell laptop?

  • @johannesspace3879
    @johannesspace3879 Před 3 měsíci +1

    You should definitely set the shutdown_policy to "migrate". If you try to shutdown one Node, it will automagicly copy all VMs to to remaining Nodes based on your HA-group prioritys.
    I've never used the VM replication in my prod environment because it's running ceph, but maybe it would solve your Problem with the missing disk after shutdown.

  • @GundamExia88
    @GundamExia88 Před měsícem

    So... if my two Proxmox versions are the same, but the kernel is different, might run into issues then?

  • @Paul-bm8og
    @Paul-bm8og Před 3 měsíci +1

    8:17 It is still possible to change the IP address of the server. You just have to be very careful because you have to change many different things, not just one file or one adapter!

  • @posalab
    @posalab Před 3 měsíci

    If you use a 3 node with ceph/gluster or a network filesystem, the migration is happena almost instantly cause only ram e config file need to migrate from nodes.

  • @zanewebb1853
    @zanewebb1853 Před 2 měsíci

    My first foray with proxmox has been less than fun, I migrated from esxi, first few days were great then my truenas vm started doing some weird stuff, like randomly rebooting, seriously considering fgoing back to esxi

  • @user-ct7wu1zv9e
    @user-ct7wu1zv9e Před 2 měsíci

    Das ist ein gutes T-Shirt Mandalorian!

  • @fossdom5568
    @fossdom5568 Před 3 měsíci +2

    Even I will be rebuilding my Proxmox with the hardware I recently got from work ,
    Nutanix nodes G6
    3.8TB RAM
    60 Cores
    120TB Storage
    Will be epic homelab

    • @zyghom
      @zyghom Před 3 měsíci +1

      bloody hell: 3.8TB RAM....;-)

  • @FreddieDK
    @FreddieDK Před 3 měsíci

    To make HA work. You can use replication of the VM's if the storage-name of both nodes are the same.

  • @sku2007
    @sku2007 Před 3 měsíci +1

    to get live migration work more reliably this can help: pve8 has cpu types which wheren't available before. sth like x64_x86_v3 is for example intel skylake and newer and amd epyc (i assume zen1). just klick help in cpu menu of a VM
    and as others pointed out, with replication HA is possible. but loose data up to migration interval.
    otherwise do a bulk migration before shutdown?

  • @Toparlak123
    @Toparlak123 Před 3 měsíci

    @christianlempa are you still using the same Terraform provider "Telmate" that you used before in PVE 7?

  • @MvL72
    @MvL72 Před 3 měsíci

    @christianlempa can you add a 120mm fan on top of the heat sink? Is there enough room in the case?

    • @christianlempa
      @christianlempa  Před 3 měsíci

      Yes, I just did that, but it's very little space, so I'm not sure how much it will make a difference. I have also added 2x 80mm fans at the front of the case and I believe that helps much more than the cpu fan...

  • @Olgerd96
    @Olgerd96 Před 3 měsíci

    Please tell about SDN for whole cluster

  • @JohnDoe50047
    @JohnDoe50047 Před 2 měsíci +1

    One more thing that will bite you is that your storage has to be the same on both machines. If a VM is on local-zfs on one node, you have to have a local-zfs on the second node in order to migrate.

  • @HisLoveArmy
    @HisLoveArmy Před 17 dny

    If you don't have shared storage do you still need a cluster to migrate to another node? Seems like a cluster should always have shared storage

    • @christianlempa
      @christianlempa  Před 16 dny

      I think it makes a lot of sense, but it's not a requirement.

  • @FrontLineNerd
    @FrontLineNerd Před 16 dny

    Minimum of 3 nodes + ceph cluster with 3 OSD’s = Happy high availability Proxmox manager.

  • @bryanpedini4497
    @bryanpedini4497 Před 3 měsíci

    I use application high availability btw
    I mean, I have three pfSense virtual firewalls in HA (not the recommended scenario, they go with 2 but, meh, works)
    I have three VMs running mariadb galera in a cluster, HA database
    I'm learning kubernetes just for this reason (will have 3 master nodes and 6 worker nodes, 3 VMs for each of my proxmox hosts)
    I don't need a shared storage, LOL
    I also containerize everything so, kube it is

  • @paul3151
    @paul3151 Před 3 měsíci +8

    You can use the Replication Service. So e.g. every 5min the Disk ist mirrored

    • @EBlom666
      @EBlom666 Před 3 měsíci +2

      Came here to say the same. But it requires ZFS on your Proxmox hosts.

    • @gilfreund
      @gilfreund Před 3 měsíci

      @@EBlom666 and the storage names should be the same, link the network bridges

    • @xandrios
      @xandrios Před 3 měsíci +3

      @@gilfreund That applies to any storage type. The name of the storage that was saved in the VM config is what the hypervisor will be looking for to load the disk image from. That applies to local, shared or replicated storage.

    • @mistakek
      @mistakek Před 3 měsíci +1

      Yes this works well for static type VM's where the data isn't changing very often.

    • @christianlempa
      @christianlempa  Před 3 měsíci

      Don't have zfs 🤷‍♂️

  • @BrunodeSouzaLino
    @BrunodeSouzaLino Před 3 měsíci +2

    Don't think I haven't noticed you installed the motherboard without the I/O shield plate despite your efforts to hide it.

  • @dbuko3
    @dbuko3 Před 3 měsíci

    You dont need shared storage or ceph for HA... It can be done like you have it now, but you need enabe VM replication for example to replicate vm every minute... Im not sure if you can do it only on ZFS for this because I have done it only on ZFS... Maybe also check HA groups because if you plan turning off one node periodicly you can make 2 ha groups and set in which host you preffer run VM for example grop run_on_pve1 and run_on_pve2 then you set in ha which vm use which group and select for pve1 group that it use pve1 host and pve2 group use pve2 host. Then when you want turn off or maintenance host pve2 you edit ha group run_on_pve2 and instead of selected pve2 host select pve1 and save. System will online migrate all VMs which run on pve2 host to pve1 host and you can turn off pve2 host when it finish...
    I think that this can be done also with api calls(edit ha group), check if vms are migrated and then turn off host on schedule(cronjob) on your zuma board... After host is turned off you can edit run_on_pve2 ha group and select thqt group use pve2 host and when host will be turned back VMs will be migrated back to pve 2 host...

  • @dukeseb
    @dukeseb Před 3 měsíci

    You don’t need shared storage, you can also setup replication to run ever 15 mins…or less if you want then you don’t have to worry about HA failing when you pull the plug

  • @chrisumali9841
    @chrisumali9841 Před 3 měsíci

    Thanks for the demo and info, have a great day

  • @dominick253
    @dominick253 Před 3 měsíci +3

    Two nodes plus a q device (pi 5) seems perfect for me. They can each backup to the other node. I tried ceph but my ssds were too slow.
    So i just have redundant services on each node. So a pihole lxc and a tailscale lxc on each one.

    • @rogertan1130
      @rogertan1130 Před 3 měsíci

      hi, how can you test or tell if your ssd was too slow to support ceph?

    • @hugevibez
      @hugevibez Před 3 měsíci +2

      @@rogertan1130 you want IOPS, but also consumer SSDs typically slow down as the drive fills up which is not ideal for ceph (initially a nand cell operates in SLC mode, when the drive fills up it divides that cell in to smaller chunks) and due to the way write caching works write performance falls off a cliff as well.
      However, a lot of people misconfigure ceph as well. You can have a ceph cluster with HDDs that performs well enough as long as you have enough OSDs, but you also actually need the network speeds and latency to support this. 10gbit is often recommended, but I see it more of a minimum to get any decent sort of performance with ceph. Getting ceph on it's own network should help a lot with this as well.

    • @VoklavTube
      @VoklavTube Před 3 měsíci +1

      ssd or network interface?
      a typical SSD has a speed of 3500MB/s
      1 gigabit network has 1gbps = 1000 mbps = 125 mb/s
      10 gigabit network has 10gbps = 10 000 Mbps = 1250 mb/s
      with is ~1/3 of the typical SSD speed...

  • @blinkitogaming
    @blinkitogaming Před 3 měsíci +3

    You should give CEPH a try. GO for a couple of sata SSDs, install each one on each node and configure it. You'll see the difference on HA.

    • @kevinneufeld3195
      @kevinneufeld3195 Před 3 měsíci +2

      How if you need 3 Nodes for Ceph ?

    • @blinkitogaming
      @blinkitogaming Před 3 měsíci

      As well as for HA

    • @jeanpierresaldana5794
      @jeanpierresaldana5794 Před 3 měsíci

      @@kevinneufeld3195 if it is for services that dont't need top performance (IOPS), setting ceph in VM's is more than enough. You can also have almost bare-metal performance is you passthrough the sdd's or nvme's instead of virtual disks.

  • @Glatze603
    @Glatze603 Před 2 měsíci

    Using a Proxmox-Cluster with 3 Nodes and Ceph as a common Storage-Pool for real HA.

  • @FrancescoCarucci
    @FrancescoCarucci Před 3 měsíci

    If you set the CPU type to QEMU64, live migration tends to work "most of the times". Not 100% though.

  • @serdalo5035
    @serdalo5035 Před 3 měsíci

    You dont “need” a separate device for shared storage. You “could” use CephFS or GlusterFS to create a shared storage pool from the disks/partitions in you servers.
    Im using quotes because its possible, but a seperate device is probably still beter.

  • @Labombab
    @Labombab Před 3 měsíci

    Nice video's, is it just me, or do the latest video's have a small ms audio lagg?
    Could be wrong, but I always get that feeling when watching them.

    • @christianlempa
      @christianlempa  Před 3 měsíci +1

      Maybe, I still have audio probs with my recording setup sometimes 🙈

  • @John-77777
    @John-77777 Před 2 měsíci

    Which network card did you choose?

  • @kapkanfps3694
    @kapkanfps3694 Před 25 dny

    Dude you NEED HIGH AVAILABILITY Christian! 😂 maybe Next video tutorial?

  • @ryanmalone2681
    @ryanmalone2681 Před 3 měsíci

    I've been thinking about doing something similar but have been struggling with shared storage since I don't want to use the slow HDD on my NAS and don't want to build a SAN. I didn't catch what you decided to use for shared storage. Do you mind clarifying?

    • @xConundrumx
      @xConundrumx Před 3 měsíci

      NAS with iSCSI or NFS?

    • @ryanmalone2681
      @ryanmalone2681 Před 3 měsíci

      @@xConundrumx iSCSI seems a bit dated at this point and there is very little used gear available on ebay that isn't ancient. I'm trying to avoid using my NAS because I'm using Unraid and it's not really fast enough without the right RAID configuration for a target.

    • @solverz4078
      @solverz4078 Před 3 měsíci

      ​@@ryanmalone2681 Literally every SAN uses iSCSI, not dated at all.

  • @VitePapa
    @VitePapa Před 2 měsíci +1

    I don't know why I feel Mr Lempa is Lex Luthor.

  • @DuncanUnderwood1
    @DuncanUnderwood1 Před 3 měsíci

    Which low profile dual SFP card have you found to work well with proxmox?

  • @zyghom
    @zyghom Před 3 měsíci

    the dedicated network between 2 proxmox machines: why did you set it /16 instead i.e. /31 or /30 - there are only 2 IPs there, right?

    • @christianlempa
      @christianlempa  Před 3 měsíci +2

      Good question, I've done it because in the future I might use a switch or another network instead of a direct link, but yea... it doesn't have to be that big :D

    • @zyghom
      @zyghom Před 3 měsíci

      @@christianlempa your confirmation means 1 thing for me: I am getting better in these THINGIS ;-) - thank you ;-)

    • @christianlempa
      @christianlempa  Před 3 měsíci +1

      @@zyghom awesome :D

  • @kareemschultz
    @kareemschultz Před 3 měsíci

    Proxmox has glusterfs built in to solve the shared storage issue

  • @kirksteinklauber260
    @kirksteinklauber260 Před 3 měsíci +2

    Just add dedicated HDDs to both Proxmox nodes just for VMs and then format those using ZFS. Then you can mount those drives as shared and have ZFS doing replication every XX minutes from node to the other. No need for external storage.
    Craft computing YT channel has a nice tutorial

    • @Darkk6969
      @Darkk6969 Před 3 měsíci +1

      Yep, that is currently what I do for home and at work. Yes I could have used CEPH for work but I don't have time to troubleshoot cluster issues. ZFS is easier to deal with and fix.

    • @kirksteinklauber260
      @kirksteinklauber260 Před 3 měsíci

      @@Darkk6969 I started with ZFS but I am planning to test at home Ceph and see if it worth to switch

  • @tunglamhp
    @tunglamhp Před 3 měsíci

    It will going bigger i can ensure that more clusters. I am that hole too :(

  • @cruxwire
    @cruxwire Před měsícem

    You need your own university called "Christian Lempa University".

  • @lennartjoswig6283
    @lennartjoswig6283 Před 3 měsíci +1

    why you don't use the replication feature from proxmox

  • @VoklavTube
    @VoklavTube Před 3 měsíci

    7:00
    not sure which would be better...
    direct connection between the two nodes?
    Or
    bound interface from the two 10 gigabit ports.
    through switch.
    Quick question I can think of:
    If you decide to add a third node? What do you do with direct links?
    btw
    you should look at the new settings for v8 for vlans in the datacenter/network/

  • @fasti8993
    @fasti8993 Před 3 měsíci

    I don’t understand why you would want to go for a 2U case instead of a 3U case especially if you are planning to use an ATX PSU. That’s still gonna occupy 3U in your rack because you can’t put another server directly above as the PSU needs to suck in air from above. You could put a full height PCIe card in the 3U case without additional riser cables and you get proper front to back airflow…

    • @christianlempa
      @christianlempa  Před 3 měsíci

      I wanted to have a little more space in the rack, and since Silverstone doesn't have a 3U case, I went with 2U instead of 4. But yeah it has some concerns about airflow that I didn't thought about in the first place.

  • @declanmcardle
    @declanmcardle Před 3 měsíci +3

    First things first: Do not create VMs or LXCs on node #2 _before_ it joins the cluster! 🙂

    • @Darkk6969
      @Darkk6969 Před 3 měsíci

      Correct! It also states that in the cluster docs.

    • @christianlempa
      @christianlempa  Před 3 měsíci

      True, just keep in mind prx-prod-2 is the first node in the cluster, due to historical reasons

  • @andydtoma
    @andydtoma Před 3 měsíci

    For the network aspect, I prefer configuring the bridge as a Ovs (open virtual switch) and then creating a separate vlan for the corosync link.

    • @solverz4078
      @solverz4078 Před 3 měsíci +3

      Linux bridge interfaces fully support VLANs, you don't need to use OVS for something so simple.

    • @christianlempa
      @christianlempa  Před 3 měsíci

      No I don't use OVS

  • @symbioticparasite6268
    @symbioticparasite6268 Před 3 měsíci

    Yes, maybe I have like 13 of them .....😅

  • @reneb5222
    @reneb5222 Před 3 měsíci

    One Tip do not run it on a SD Card because it writes alot of data for the Q DB. I blew up an expensive SD Card within 2 wks. You can also use a cheap OrangePi as long as it can run a linux derivate preferable Debian because Proxmox is Debian based.1 GB LAN Port works fine as well.

  • @The-Weekend-Warrior
    @The-Weekend-Warrior Před 3 měsíci +2

    Errrm... venting for a power supply on THE TOP OF A RACKMOUNTED case, where the unit above will obviously obstruct the fan?? Hmmmm, I'm not sure this is well thought out.... not at all. Looks cool, but this should be a serious argument against a full size ATX power supply in there...

    • @christianlempa
      @christianlempa  Před 3 měsíci +2

      Yeah, that's indeed a problem I haven't thought about :D I'll do some tests if it still works or if it gets too hot in the summer

    • @The-Weekend-Warrior
      @The-Weekend-Warrior Před 3 měsíci +1

      @@christianlempa No issues, it's always easier to play football from the viewing area than the field :D:D:D You can always count on the comment section to point out things :D Much love.

  • @tdragon87
    @tdragon87 Před 3 měsíci

    I think you just need to have Intel/Intel or AMD/AMD to live migrate. Says same Vendor, not Model :)

    • @christianlempa
      @christianlempa  Před 3 měsíci

      That might be true, but that's how Proxmox has it in their docs

  • @carsten612
    @carsten612 Před 3 měsíci +3

    Why dont you use a simple two node ZFS Cluster on those two guys?

    • @teyhouse
      @teyhouse Před 3 měsíci +1

      ^this
      for such a small setup, a ZFS-Replica between both Proxmox-Nodes ist perfect for HA

    • @solverz4078
      @solverz4078 Před 3 měsíci +3

      There is not such thing as a "ZFS Cluster", you'd need something like Gluster in combination with ZFS.

    • @Darkk6969
      @Darkk6969 Před 3 měsíci

      @@solverz4078 Sadly Gluster is a dead project. It's currently on life support right now.

    • @solverz4078
      @solverz4078 Před 3 měsíci +1

      @@Darkk6969 it has commercial backing and is used in a LOT of data centres. So no, it is not on "life support"

    • @christianlempa
      @christianlempa  Před 3 měsíci +2

      Just haven't tried it, since I haven't considered ZFS on a single drive, and due to performance reasons. However, I might check it out, because of the replication feature