The "New" File System in Windows: ReFS

Sdílet
Vložit
  • čas přidán 2. 06. 2024
  • Well it's not exactly new, but it's still rare!
    ⇒ Become a channel member for special emojis, early videos, and more! Check it out here: czcams.com/users/ThioJoejoin
    Better explanation of Sparse VDL: www.osr.com/nt-insider/2015-i...
    ▼ Time Stamps: ▼
    0:00 - Intro
    1:48 - How is it Resilient?
    3:32 - ReFS Features
    3:38 - Block Cloning
    4:26 - Sparse VDL
    5:28 - Mirror Accelerated Parity
    7:17 - File Level Snapshots
    8:06 - Features NOT in ReFS
    10:32 - Are You Missing Out?
    ▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
    Merch ⇨ teespring.com/stores/thiojoe
    ⇨ / thiojoe
    ⇨ / thiojoe
    ⇨ / thiojoetv
    My Gear & Equipment ⇨ kit.co/ThioJoe
    ▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
  • Věda a technologie

Komentáře • 1,4K

  • @ThioJoe
    @ThioJoe  Před rokem +426

    Correction: You CAN actually format a single drive with ReFS, it does not need to be part of a pool.

    • @Melody666
      @Melody666 Před rokem +8

      Yes, I recall using ReFS in the past on a single HDD (I think it was a Windows Server 2012 R2 installation configured to be like a client, with desktop experience and the Acceleration Level dwords to enable 3D acceleration), and then I tried installing GTA V off Steam on it. It went horribly with Steam simply going crazy, believing that the game was corrupt and restarting the download on an infinity loop (the OS was of course on a different disk, formatted as NTFS, but I created a Steam Library in the ReFS HDD).

    • @Rn-pp9et
      @Rn-pp9et Před rokem +1

      Correct, but the feature is only available from the command line in the 'diskpart' and 'format' commands.

    • @Melody666
      @Melody666 Před rokem +6

      @@Rn-pp9et In Windows 11, you can also format in ReFS filesystem from the UI.

    • @Melody666
      @Melody666 Před rokem +2

      @Proton ThioJoe explains it pretty well. I suggest using ReFS only if you're going to use its exclusive features, since NTFS generally performs better.

    • @nster3
      @nster3 Před rokem +2

      ReFS is also bootable with version 3.7 and Windows 11

  • @ItsBigManCallum
    @ItsBigManCallum Před rokem +1478

    hey babe wake up a new file system just dropped

    • @John-Smith02
      @John-Smith02 Před rokem +66

      10 years ago

    • @joen4287
      @joen4287 Před rokem +25

      @@John-Smith02 wdym?
      Edit: Just watched the vid and realized Lmao
      So nvm

    • @gallium-gonzollium
      @gallium-gonzollium Před rokem +35

      “hey what it is called”
      “reeeee-FS”
      “what”
      “don’t worry, it dropped in 2012”
      “so why is it new”
      “idk”

    • @gallium-gonzollium
      @gallium-gonzollium Před rokem +8

      @@joen4287 0:29 :)

    • @joen4287
      @joen4287 Před rokem +4

      @@gallium-gonzollium yeah just watched the vid and realized what he meant XD
      Thanks

  • @_SJ
    @_SJ Před rokem +989

    I appreciate your honesty in telling that you're not familiar on other features like Sparse VDL 👍🏻

    • @UnbreakableMJ0
      @UnbreakableMJ0 Před rokem +49

      Honesty always adds to credibility, and we appreciate it

    • @oMeGa0122
      @oMeGa0122 Před rokem +15

      That's who Joe is and that's why we love him

    • @teknerd
      @teknerd Před rokem +2

      I loved his humility too.

    • @TD-er
      @TD-er Před rokem +7

      Yep, it truly adds to the credibility of the rest he's telling.
      To give a very simple idea on "sparse" files.
      You can see it as a specific implementation of the "block deduplication" feature.
      A block with zeroes will occur quite often on virtual machine file system images. So a block with zeroes (or whatever specific content they think of to fully use this feature) will occur multiple times and a lot of data blocks can point to this same block.
      This makes creating and copying the file (on the same filesystem) very fast and using a lot less space on the storage.
      Virtual machine images often already used this concept of sparse files. But I don't know if it was supported as such on NTFS or that it was aapplication specific implementation.

    • @_SJ
      @_SJ Před rokem +2

      @@TD-er Alright Thanks for the information. Highly appreciated 🙂

  • @marinustennapel2661
    @marinustennapel2661 Před rokem +1066

    I'm still hoping that Microsoft will adopt OpenZFS (of which they are a member, if I'm not mistaken) ZFS does everything that ReFS does. And more. And better.

    • @ckingpro
      @ckingpro Před rokem +64

      Indeed, I use ZFS on Windows using a cludgy workaround. I have an Ubuntu VM that runs in the background with samba and ZFS, with the data showing up on Windows as a SMB network share. It works on my laptop (with all the nice features like resiliency (two virtual disks on two different drive) and is set to launch on startup but it is a cludgy solution)

    • @magfal
      @magfal Před rokem +62

      A production grade filesystem in Windows would be cool.

    • @brodriguez11000
      @brodriguez11000 Před rokem +32

      Locked into a pool size, and not as well for a lot of small files (BTRFS better).

    • @ckingpro
      @ckingpro Před rokem +57

      @@brodriguez11000 BTRFS is still such a buggy mess with many data corruption bugs that can cause you to lose data. I was hopeful about it but after more than a decade, I have given up on it. It’s always going to remain a buggy mess

    • @coctailrob
      @coctailrob Před rokem +15

      @@ckingpro Something I found out by accident is that Hypervisor virtual machines remember their running state, surviving a shutdown and reboot of the host machine. I was surprised to find my guest OS happily idling in the background one day. It had been weeks since I last used it!

  • @ckingpro
    @ckingpro Před rokem +322

    So NTFS already has spare files, which means files only take as much space as are written to (reading other areas of the allocated space just returns zeros). But once written, you can't undo that so if you want to clear that region, you have to actually write zeros all over. Sparse VDL allows you to essentially make those areas sparse again in a sense (allowing you to zero regions of a file without actually writing zeros) making it much faster

    • @gblargg
      @gblargg Před rokem +14

      So it's basically like TRIM does at a lower level with SSDs, letting it know there's nothing important stored there so it can reuse the space.

    • @ckingpro
      @ckingpro Před rokem +11

      @@gblargg close but unlike sparse files, that space can’t be reused. So while sparse files are only truly allocated on disk as written to (I can create a 100 TB sparse file on 1 TB and write 200 GB, the size of the file on disk is about 200 GB. With Sparse VDL, however, those allocated regions remain allocated but you don’t need to write zeros to them (just need to update metadata). Instead, the system will return zeros when read (like with sparse files if I read beyond 200 GB) because ReFS tracks what regions of file has Valid data (hence VDL). For SSDs, the TRIM information is important as otherwise it would have no idea that certain sectors contain no valid data and would copy it or keep it there allocating different cells (causing write amplification). While you could just write zeroes (and some controllers would consider the sectors reusable), it makes deletion slower (as you have to go out and zero it out), it is an implementation details that SSDs didn’t have to implement (after all, file systems don’t zero out file during delete) and wouldn’t work with transparent disk encryption which sits under file system. So TRIM is genuinely useful here (but TRIM for Shingled hard drives is just terrible. Shingled hard drives shouldn’t exist imo)

    • @gblargg
      @gblargg Před rokem +5

      @@ckingpro Ahhh, looking at Microsoft docs on VDL you are correct, it's just a way to avoiding having to zero all the data on disk. Sounds like it basically marks sections of the file as zeroed and returns zero when read, to avoid long wait times when creating huge zeroed files. I could find very little about VDL relating to ReFS. One from a MS engineer was "More on Maintaining Valid Data Length".

    • @brkbtjunkie
      @brkbtjunkie Před rokem

      Like a .sparsebundle in unix?

    • @ckingpro
      @ckingpro Před rokem

      @@brkbtjunkie aren’t sparse bundles exclusive to macOS? Regardless, they are not the same as sparse VDL. Think of them as dynamic disk images that Parallels, VMWare or Virtualbox, where they can grow as they get used up (they can often be shrunk when offline too just like sparsebundle). However, while you can just create a sparse file or use the VM disk formats (where the mappings of empty space is part of the format itself for dynamic VMs), Apple chose small files in a folder approach. The reason is sparse files may not be supported on the remote file system on a file sharing network, and some network file sharing systems would send a whole file so having one big file like VM dynamic disks would not work either. That left many small files in a folder. That said, SMB can send only part of a file (I have tested this), and I am not sure if AFS did as well. Unlike Sparse VDL, you can shrink sparse bundles and dynamic disk images to fit the allocated size after you delete and zero out the contents

  • @graealex
    @graealex Před rokem +1019

    Yeah, don't use ReFS, unless you want your data held hostage by Microsoft.
    Unfortunately, there is currently no other file system than FAT32 or exFAT that allows you universal access from all major platforms. NTFS is at least a viable compromise.

    • @ThioJoe
      @ThioJoe  Před rokem +261

      A fair point

    • @pyp2205
      @pyp2205 Před rokem +80

      Yeah pretty much you can only format drives with FAT32, exFat, NTFS on Windows. But from using Linux, you can format drives with its default file systems, the ones we've mentioned, and pretty much any file system.
      Pretty much from what I noticed, FAT file systems (mostly FAT32) are mostly used for boot partitions. And NTFS and ExFat are normally used for drives.

    • @graealex
      @graealex Před rokem +77

      @@pyp2205 The big question is what happens when you nuke your ReFS-based Windows install and want to use a Linux rescue stick to recover data. Good luck, as there is exactly one commercial file system driver for ReFS on Linux, which you can't even license.
      Oh, and your C: drive can't be exFAT either. Only NTFS or ReFS.

    • @bernardonegri5416
      @bernardonegri5416 Před rokem +133

      Linux has a pretty good NTFS implementation. But they had to black box reverse engineer it. Change a file on Windows, look at a tool that says what changed on the disk, note it down, rinse and repeat until you figure it out.

    • @graealex
      @graealex Před rokem +44

      @@bernardonegri5416 Yes, that's why I am saying that NTFS is a viable compromise.

  • @BluemediaDE
    @BluemediaDE Před rokem +74

    ReFS is primarily intended for enterprise environments. It is especially worthwhile on hypervisors for storing VM disks, as well as for backup repositories. Block cloning allows you to create synthetic full backups, or to merge incremental backups without rewriting the data, which makes it incredibly performant compared to other file systems. ReFS is also used as the basis for Storage Spaces Direct and Azure Stack HCI, where storage arrays are distributed across multiple servers.

    • @aklip
      @aklip Před 11 měsíci +1

      Sounds kinda like snapshotting on btrfs

    • @SamuelfisherSC
      @SamuelfisherSC Před 11 měsíci +1

      And even there ReFS is.. not quite as performant as one might think in comparison to other file systems such as xfs. Synthetic full backups of the same backup job in Veeam being many hours quicker on xfs than on ReFS is a common occurence, even when xfs is ran on a backup repository with inferior hardware.

    • @hiddeninthewires2308
      @hiddeninthewires2308 Před 11 měsíci

      Xfs does not have parity checks and isnt designed as a resilant file system.

    • @gustavojoaquin_arch
      @gustavojoaquin_arch Před 27 dny

      It's just Microsoft trash, Linux is for enterprise environments, don't be a soydevs
      Btw I use Arch

  • @georgeh6856
    @georgeh6856 Před rokem +123

    ReFS sounds very similar to ZFS on Linux, *BSD, etc. Except ZFS does support native compression and encryption, is bootable, and can be used with any distribution which supports it (for free!). I use ZFS encrypted root partition on my laptop and it works great.

    • @bootmii98
      @bootmii98 Před rokem +7

      Well as long as you agree to the CDDL

    • @linuxnormie
      @linuxnormie Před rokem +1

      I thought the same

    • @henryglends
      @henryglends Před rokem +12

      I'd just like to interject for a moment. What you're referring to as Linux, is in fact, GNU/Linux, or as I've recently taken to calling it, GNU plus Linux. Linux is not an operating system unto itself, but rather another free component of a fully functioning GNU system made useful by the GNU corelibs, shell utilities and vital system components comprising a full OS as defined by POSIX. Many computer users run a modified version of the GNU system every day, without realizing it. Through a peculiar turn of events, the version of GNU which is widely used today is often called "Linux", and many of its users are not aware that it is basically the GNU system, developed by the GNU Project. There really is a Linux, and these people are using it, but it is just a part of the system they use. Linux is the kernel: the program in the system that allocates the machine's resources to the other programs that you run. The kernel is an essential part of an operating system, but useless by itself; it can only function in the context of a complete operating system. Linux is normally used in combination with the GNU operating system: the whole system is basically GNU with Linux added, or GNU/ Linux. All the so-called "Linux" distributions are really distributions of GNU/Linux.

    • @georgeh6856
      @georgeh6856 Před rokem +8

      @@henryglends I did not read most of your long comment. However, I wonder how many people now want to try out Linux after all of your criticisms of me for doing the horrible crime of calling it "Linux". Nice way to turn people off from something. If we fail to abide by your harsh restrictions, then we get severely criticized. I guess I am missing your point...maybe you want to turn people away.

    • @henryglends
      @henryglends Před rokem +26

      @@georgeh6856 it was a copypasta joke

  • @exid3277
    @exid3277 Před rokem +90

    Another warning for enthusiasts!
    ReFS has different versions that are not backwards compatible. Sometimes when you upgrade a version of Windows or mount an array to a newer version of Windows, the version of the ReFS on your volume will be automatically updated without any warning. You will NOT be able to use this volume with an earlier version of Windows, even if the volume itself was created by it.
    Do not use refs if it is possible that you will be moving this volume between systems.

    • @Doso777
      @Doso777 Před rokem +4

      Pretty shure that has also been the case with NTFS. Not much of an issue these days since NTFS is mature and Microsoft hasn't really added a new things for a while.

    • @wolf2965
      @wolf2965 Před rokem +7

      @@Doso777 It was, and yes, it it was a problem, but last revision was released 20 years ago when XP came out. It has a bigger problem with how ACL and metadata works, which is why it is less-than-stellar as a removable drive.

    • @temp50
      @temp50 Před rokem +2

      "ReFS has different versions" Yeah, not to mention that Storage Spaces itself also has different versions across Desktop and Server OSes!
      Once I've created a Storage Space and pools inside of it (using the latest pool version available) on a Workstation and when I put it into a server 2019, Windows can did not even see the Storage Space on the drives! So, beware.

    • @Steamrick
      @Steamrick Před rokem +1

      Additionally, old ReFS 1.0 partitions on Server 2012 (R2) will shit themselves if you install this years' security updates and read as RAW until you uninstall the update.

    • @nash984954
      @nash984954 Před 8 měsíci

      Wholly f**k don't I know it?......NOW, and FWIW, you're an added verification, so thanks for that.

  • @the-niker
    @the-niker Před rokem +90

    You can use ReFS on a single drive in enterprise, it's in the format dialog. But if you do that you put your data at risk, if there is a catastrophic failure and the partition metadata get corrupted, there is currently no free way to recover your data and chkdsk won't save you. Lost entire 2TB drive during a power outage and eventually had to give up the (non-critical) data and wipe it.

    • @00and
      @00and Před rokem +1

      On Windows 10 Pro for Workstations you can do that too, in disk management while creating new volume or formatting existing one.

    • @Mikesco3
      @Mikesco3 Před rokem +10

      I would never recommend using it until we can trust it, and we won't be able to trust it until the open-source community reverse engineers, it.
      I would be screaming at people NOT to use it on every chance I get.
      Microsoft should put some effort into adopting tried and true systems that are being developed in the open. Rather than try to play the apple game and hold people hostage over their data.

    • @Mobin92
      @Mobin92 Před rokem +3

      This is why you should have backups of ryour important data. Basically assume that *every* storage device that you use is shit and has a 1% chance of just dying next time you want to use it.

    • @the-niker
      @the-niker Před rokem +2

      @@Mobin92 Yup, I have two backup levels of all critical data. This was a drive with VMs used to spawn new systems, too unwieldy to backup with big frequent changes. I was able to restore the most important ones by retro-cloning live systems and cleaning the images.

    • @bagamax
      @bagamax Před rokem

      It’s like singe disk raid array, why anyone will want to do that besides experiments? And if it an experiment so the data must be invaluable.

  • @markgoodrich4916
    @markgoodrich4916 Před rokem +99

    I thought perhaps the mention of RAID (Redundant Array of Independent Disks, and their levels/variations) might have helped to explain things in this particular subject area of file systems. Again, just a thought. Love your channel, Joe!

    • @amak1131
      @amak1131 Před rokem +7

      Was going to say, it sounds a lot like RAID with some extra stuff tacked on.

    • @bagamax
      @bagamax Před rokem +3

      @@amak1131 you can percept it like a software raid done right. ReFS appeared when more and more servers were going softraid on chipset driver level and MS was like “wtf are you doing guys?”

    • @Zeero3846
      @Zeero3846 Před rokem +3

      @@amak1131 Yep, it sounds like RAID directly implemented in the file system.

    • @dano612s
      @dano612s Před 11 měsíci +1

      right. this sounds a lot like software level raid, which has been around for decades.

    • @user-ee6lk1pb6w
      @user-ee6lk1pb6w Před 11 měsíci +2

      "This video is sponsored by RAID SHADOW LEGENDS"

  • @waryinzero
    @waryinzero Před rokem +1

    What was important to me was your telling me about how removable drives use a particular file format. Thank you

  • @bazoo513
    @bazoo513 Před rokem +3

    5:05 - If I understand correctly, ReFS is able to _present_ allocated, but newer written to, clusters as containing zeroes. So, when you create, say, a 200 GB file that will contain a VM disk volume, it will be allocated, but not overwritten. However, if the VM reads a "virgin" cluster, the FS will return all zeroes, not whichever leftover from previous content was actually there, thus dramatically speeding up creation of huge empty files without compromising security.

  • @bradallen8909
    @bradallen8909 Před rokem +19

    Haven't Microsoft realised they can't design a decent filesystem yet? There's many existing solutions that are far better than anything they could come up with. They should have just used one of those.

  • @omgogmgomgomg
    @omgogmgomgomg Před rokem

    Kudos man. You kept it very simple and helped make the first steps in soft soft. Very Helpfull! Thanks!

  • @therealdebater
    @therealdebater Před měsícem

    9:40 Yes, that's a very good way to describe the swap file ('overflow for memory'). In general, trying to explain this kind of stuff to the layman is pretty hard, so good job!

  • @U.S.A.
    @U.S.A. Před rokem +12

    9:57 On Windows 11 I was able to format a single drive to ReFS so I don't think so that there is such a limitation, you can also format external HDDs to ReFS because Windows sees them as non-removable disks, but you really can't format pendrives and micro SD cards to ReFS.

    • @ThioJoe
      @ThioJoe  Před rokem +7

      Maybe my info was outdated, I read several places that it can’t be on one drive, but they may have added that feature.

    • @Mikesco3
      @Mikesco3 Před rokem +1

      Well placing it on a single drive defeats the whole notion.
      People should be warned against ReFS, they should be pointed towards OpenZFS, BTRFS or Ceph

  • @Doso777
    @Doso777 Před rokem +3

    ReFS is pretty useful for some server applications. For example block cloning helps to save a lot of space and processing time in backup repositories. The space savings can also be huge on things like VDI (Virtual Desktop Infrastructure).

    • @JSLEnterprises
      @JSLEnterprises Před 11 měsíci

      block cloning is 'deduplication' from a different perspective, what it does however is the same thing.

  • @psa110
    @psa110 Před rokem

    Thanks, Joe. This type of video helps those who have maybe heard of ReFS and that it might be great, or not, to know why or why not we might want to look into it.

  • @mrjoshmtz97
    @mrjoshmtz97 Před rokem +1

    Thanks for the tech info Joe! This is pretty helpful! 👍

  • @enart
    @enart Před rokem +3

    I didn't even know there is a new file system, nice video

  • @reiniernn9071
    @reiniernn9071 Před rokem +9

    Add to this video.
    This Re FS will give you a possibility to recover from hardware failures. NOT from software failures (because all dat on all 3 disks will be corrupted the same).
    Also ...It does not protect you from the effects of malicious hackers encrypting your data....all copies will be encrypted (only a backup will help you at that moment.)
    And that is where a snapshot will be valuable....take that snapshot and keep it as backup.....but NOT directly connected to your computer to keep it safe from hackers. (Better also keep it stored in another location....helpfull when one of your location gets destroyed....maybe fire.)
    I've years ago told my brother in law some of these tips....about having a safe backup.
    It has saved him many thousands of dollars because he could refuse the offer from hackers for the decripting key...(costed him one day of work restoring from his safe backup)

    • @fat_pigeon
      @fat_pigeon Před rokem

      Right; always remember that RAID is for performance or uptime, but *RAID is not a backup*! Also remember to test your backups.

  • @sufration
    @sufration Před rokem

    Great and up-to-date information 👍🏻 Thank you

  • @sethlin01
    @sethlin01 Před 2 měsíci

    Great video, you made this really easy to understand.
    Just to add some information, when Joe explaining Copy-On-Write the main point was missed. COW isn't just for multiple file locations (that point to the same actual location) but how some of the new "hip" filesystems make changes to data by always writing to new blocks on the disk for all data. This is for making data more resilient to errors, and also gives the data its own snapshots.
    This is why it has file level snapshots, because it is built in to how the file system works. Windows also does file snapshots with NTFS since windows Vista, but this from manually making snapshot copies of data, where as COW file systems (like ReFS) do this natively. Functionally they work the same for you because Windows makes it so, but internally ReFS will do this faster because it doesn't have to manually make a new copy, it is part of how the file system normally works. This also makes the file system better at freeing up the space from the snapshot copies because it naturally overwrites the oldest data when space is needed.

  • @Poldovico
    @Poldovico Před rokem +19

    Sounds like a specialized FS for use with RAID arrays. Probably wouldn't make sense in general use.

    • @darksill
      @darksill Před rokem +2

      that and lets be honest your regullar user dosent know what raid is let alone how to set one up it would be just confusing to most people

    • @Poldovico
      @Poldovico Před rokem +5

      @@darksill Also there are tons of open standard RAID-optimized filesystems, if Microsoft ever felt like working with everyone else for once.

  • @ckingpro
    @ckingpro Před rokem +43

    What you are referring to Copy-On-Write refers to RAM (where memory pages (fancy word which is a unit of memory. On x86 and x86_64, it is 4 KiB of memory) can be shared between processes until a process tries to write to a shared page, in which case, the system copies it instead for the process). File system Copy-On-Write is different. Let me give you a brief overview of how file systems remain consistent (corruption-free). So, in the old days, file systems basically had little checks. If the power goes out mid-write, the file system has no way of checking what went wrong. So, you would run chkdsk or fsck (Linux/macOS/Unix equivalent) and it would have to check every single thing to look for any corruption to fix. This could take days. Then came journaling. Typically, journaling is only enabled for filesystem metadata (though some filesystems like the linux ext3 and ext4 do allow you to enable data journaling. But since your data has to be written twice, expect abysmal write performance). This means that the file system first writes what is is going to do/change to a journal (not your data, but say it is going to rename or allocate space, etc) and then perform it. If the power goes out and comes back again, the file system first checks the journal and finishes the operation. This means no more lengthy checks. However, data consistency is not ensured. To ensure that, we have copy-on-write. In copy-on-write, you don't write the data in place, but in free space, before updating all references to point to the new space, and updating references to that reference and so on until the superblock (think of it as the main block of the file system. And typically there are multiple superblocks so when all are updated, the operation is done). In this case, if the power goes out, you can be sure either the entire write went through or none of it did, ensuring consistency. All of this is done without writing data twice. Now, when it was first introduced, it was meant for enterprises or businesses (or anywhere where data integrity matters) with file systems like ZFS as it increased fragmentation (think about it: writing within a file will have to be placed somewhere else where there is free space rather than in place). But now that we have SSDs, it does not matter.

    • @ezluci
      @ezluci Před rokem

      Yeah

    • @Qwerty-uiop
      @Qwerty-uiop Před rokem +2

      Lots of text (I read it all)

    • @ruwn561
      @ruwn561 Před rokem +4

      That is NOt what copy on write means. COW based file systems are not what you have described.

    • @ckingpro
      @ckingpro Před rokem

      @@ruwn561 it is though

    • @jeschinstad
      @jeschinstad Před rokem

      @@ckingpro: No, it isn't.

  • @cmilkau
    @cmilkau Před rokem +1

    Thanks for the link about Sparse VDL! Boy, I'd never have found that. So hard to find information about this feature!

  • @edwinrosales6322
    @edwinrosales6322 Před 8 měsíci

    Great video, very good explanation!

  • @snarkykat
    @snarkykat Před rokem +9

    I was wondering about this file system - thanks for covering it. 😄 By the way, there is something strange with the audio equalization for this video compared to your other videos. Your S's aren't coming out as crisply as before. Either that or someone stuck meat probes in my ears while I was sleeping. [Edit: The problem traced to the fact that every time there's a Windows feature update, it wipes out my equalizer settings 🤬]

  • @stultuses
    @stultuses Před rokem +3

    Most of this stuff was around in ODS-5 on OpenVMS 20+ years ago, although OpenVMS needs updating in terms of storage capacity now (it's planned, they have just been busy the past few years porting the OS itself to x86)
    There;s a lot of very good file systems out there, ZFS would be my pick although there are technically better ones

  • @RayMak
    @RayMak Před rokem +1

    Really never heard of this. Thanks so much

  • @darrelbryant8632
    @darrelbryant8632 Před rokem

    Thank you ThioJoe, I found this fascinating.

  • @Woodzta
    @Woodzta Před rokem +15

    We are finally moving some of our main storage to ReFS at my workplace. Our use case is backup storage and virtual machines and from all of our reading, it's going to save us a lot of time for many large, write-heavy operations!

    • @gda86pl
      @gda86pl Před rokem

      Until some Windows update don't wipe clean your drives, as it was the case with refs early this year.

    • @lawrencedoliveiro9104
      @lawrencedoliveiro9104 Před rokem

      Microsoft seems to be backing away from promoting it, even removing support for it from newer versions of some OS products.

  • @danieljonsson8095
    @danieljonsson8095 Před rokem +5

    For the features not in refs that you mention. For booting, that's not actually a refs limitation, but because most modern UEFI boot systems, simply do not have a driver for it. So instead you have to give it a driver on the EFI partition, same as for ntfs for a lot of systems. Now windows installer does that for you with ntfs, but it won't do that for refs. This is because MS does not consider refs to be ready for this yet. They have published a roadmap for refs which is in three stages, and refs is currently in stage 2. MS is not going to install the drivers to the UEFI prior to this, but there are some third party methods you can use to do it if you decide you really really really want to... For file system level compression, this is not entirely true. You see, compression on refs is tightly integrated into the deduplication, exactly because they sort of need to be for optimal usage of either. So you enable compression, by enabling the deduplication. Encryption however is not available, again due to its negative impact on deduplication. As for page file, well, it's not that you technically couldn't, but first of all, you're using refs through storage spaces, and you're not allowed to place the pagefile on a storage space volume. You can however do refs on a single drive (even though you say you can't and you absolutely do not need to use storage spaces for it), in which case you could do pagefile for it. BUT, windows GUI will not allow you to do so. And this has to do with the Copy on Write, which I might add, you explained incorrectly. Copy on Write means that if you have a file opened, you make a change in it and save that file, then it will write out the full block that the changes are made in, in a completely new, separate block, then move the file reference over. While it does have the effect that if two programs open the same file, one writes to the file, the other program will now still be reading the old data, because it opened the old reference and has not been told to reload the reference which now points to new data. CoW filesystems are incredibly good for storage that rarely change. It is however incredibly inefficient and slow for data that change rapidly, such as a pagefile. Hence why Windows won't allow you in the GUI to use it that way. You can force it, but you will have a very VERY bad time from it, even if you're nowhere near to running out of ram simply because of how a pagefile is NOT just "overflow ram" which isn't how page or swapfiles have worked for over 20 years now. Anyway... Next you bring up not for removable drives. This is again sort of true but also sort of not. First of all, there's nothing stopping you from adding a removable drive to a storage space pool and having the pool formatted refs. Secondly, the reason it's normally not shown is because refs does not work well with the quick removal that is the standard on modern Windows. If you instead go in and enable caching and optimizing for performance on a removable drive, you can how using powershell force it to format refs (the gui still won't let you). Be warned though that this drive will not work in systems that is not configured the same and you will likely corrupt it simply by plugging it in to such a system.
    As for the mirror accelerated parity. Since you as you admit didn't understand it, I'll try to explain it. So because parity calculations are slow, but more efficient. What it does is that it writes a data block to the two parity drives, creating a mirrored set of that block. Now this is ofc inefficient for storage amount. It will however report it as if it was written with parity rather than a mirrored set. It will then when it either has some free idle time or when it's starting to run out of real space, do the parity calculations and rewrite that block as a parity block, at a later time. It's very good for when you have infrequent writes that you want to complete fast. It's bad if you have constant writing as it actually has to write data twice.

    • @ChrisMaster2
      @ChrisMaster2 Před 11 měsíci

      Can anyone agree that they would not want to see who is faster at typing against this guy?

  • @wildmanjeff42
    @wildmanjeff42 Před rokem

    I did not know about this fs...
    Thanks for the video !

  • @defective1up
    @defective1up Před 11 měsíci

    Thank you for this, I had no idea what ReFS was for before this.

  • @ringtailedfox
    @ringtailedfox Před rokem +7

    My biggest issue with ReFS is that there are no data recovery tools for the file system, unlike FAT32, NTFS, and so on. If the partition simply becomes too full, the data becomes inaccessible and irretrievable. This also holds true if the partition becomes corrupt for whatever reason (it sometimes can't correct for all errors automatically and fails in a non-graceful manner, preventing data recovery). That's a hard pass for me!

    • @Lofote
      @Lofote Před rokem +2

      Yeah, that why it never really took off. :(

    • @JSLEnterprises
      @JSLEnterprises Před 11 měsíci

      Its basically the modern version of spanned volumes on dynamic disks. Its also the reason why Microsoft stopped supporting dynamic disks & spanned volumes when they introduced ReFS.

  • @michaelrichardson8467
    @michaelrichardson8467 Před rokem +4

    I believe the reason refs failed so hard was because it doesn't compete with open source file systems like zfs or btrfs. Most enterprise solutions are virtualizing Windows with a Linux like hypervisor anyways. It's honestly very rare to find a Windows server instance on bare metal in a datacenter. Not like it's impossible to find for very specific use cases, but rare nonetheless. Btrfs and zfs do everything refs does but WAY better. Storage spaces just doesn't compete in performance, flexibility and management.

  • @kantraa
    @kantraa Před rokem +1

    I accidentally read "Mirror Accelerated Parity" as "Mirror Accelerated Party" at 5:34 and I thought "that exists?!"
    Great vid btw!

    • @UnifiedFriends
      @UnifiedFriends Před rokem +1

      Yes the mirror accelerated party exists.
      It's called disco ball.

  • @daniellundin8543
    @daniellundin8543 Před 4 měsíci

    Thanks for explaining! Subbed

  • @donciak
    @donciak Před rokem +7

    Important: cant’use refs on different win server edition. Refs have versions so if you plan to attack disk to another system (like for recovery ) , must be same version

    • @UnbreakableMJ0
      @UnbreakableMJ0 Před rokem +1

      Why would you attack it though 😅 Just kidding, I know you mean attach 😉

    • @how_to_lol_u
      @how_to_lol_u Před rokem +2

      @@UnbreakableMJ0 ALL DRIVES ADVANCE ON ALL FRONTS, WE HAVE THEM!

    • @UnbreakableMJ0
      @UnbreakableMJ0 Před rokem

      @@how_to_lol_u That is hilarious 😂🤣 Good one 👌🏻

  • @davidmarquesneves
    @davidmarquesneves Před rokem +8

    BTRFS, ZFS, in Linux with similar features.

    • @Mikesco3
      @Mikesco3 Před rokem +3

      and waaaay more secure and dependable than this commercial attempt at ransomware

  • @langeludo
    @langeludo Před rokem +2

    About parity: as said it does affect performance (at least write wise). I don't know for ReFS, but it should improve read performances for large files since the data can be read from at least 3 drives together (if parity is mixed accross drives eg. block 1 is stored on drive A & B parity on C, block 2 is stored on drive B & C parity on A etc…).

  • @antochristus9046
    @antochristus9046 Před rokem

    Thank you. TNice tutorials is really helpful. Much respect

  • @ChrisAzure
    @ChrisAzure Před rokem +7

    If you like to experiment with your computer don't use Refs, I tell you from personal experience. If you go to a higher version of Windows (Insider Preview in my case, even Release Preview which is the slowes ring of Insider Program) and you downgrade, you'll no longer be able to use it or access it until you format or go to the same version or newer than you had.

  • @MithicSpirit
    @MithicSpirit Před 11 měsíci +5

    Would be interesting to compare it to file systems oft used on servers, like ext4, btrfs, and zfs.

  • @ultraali453
    @ultraali453 Před 3 měsíci

    Thank you for the video. It is very useful!

  • @the__man
    @the__man Před rokem

    didnt know about this, good to know! i love the position you take with the videos, somewhere between average tech user and programmer. it allows a lot more depth and meaningful information to be given about the topic, while still being in a condensed and easily digestable format

  • @arturm4558
    @arturm4558 Před rokem +6

    Actually, you CAN run ReFS on a single drive not in a pool, have done so on both win10 enterprise and hyper-v core 2019.
    Not certain how I did it and if it was as intended by Microsoft, but I believe I set it up using Windows admin center.

    • @Mikesco3
      @Mikesco3 Před rokem

      can, but should NEVER until you can trust it , and you can't trust it, until it is open source or properly reverse engineered by the open-source community.
      We should be pointing people towards, OpenZFS, BTRFS or Ceph

    • @arturm4558
      @arturm4558 Před rokem

      @@Mikesco3
      This is the kind of elitism we don't need.
      Yes, btrfs and ceph are by far superior, but not an option in Windows. So they're out of the equation. And compared to ntfs, it is better in some usecases.

  • @whitebeartigtig
    @whitebeartigtig Před rokem +9

    Interesting. I was able to format a single partition on my secondary SSD as ReFS. It was formatted on Windows 10 LTSC 2021.
    Probably not that useful on a single disk though. When I eventually reformat that SSD, I'll probably be sticking with NTFS in future.
    I don't really have a real reason to be using ReFS, so probably best that I don't until I need to.

  • @CliffCarmichael
    @CliffCarmichael Před rokem

    your titles are so attractive man but the vid length to explain a concept or subject that could have taken half that time really doesnt let me click, i clicked this one just to convey this comment

  • @agent-qm7oe
    @agent-qm7oe Před rokem

    Great explanation, thanks

  • @Zhixalom
    @Zhixalom Před rokem +9

    Parity is often easily done in hardware with dedicated logical circuitry... taking the load off the CPU.
    - Ben Eater has made a great video on error detection and parity checking... It is really a simple, ingenious and well established concept.

    • @Mikesco3
      @Mikesco3 Před rokem +6

      look into bitrot, that's why there are projects like OpenZFS, BTRFS or Ceph.
      Hardware raid is known for the potential to introduce silent corruption and or locking people into proprietary solutions that become a problem once the manufacturer doesn't want to support that version of the hardware.
      CPU time is not as expensive as it was way back.

    • @Zhixalom
      @Zhixalom Před rokem

      ​@@Mikesco3 I was referring to what parity checking is and how it functions. It has been around for ages, long before RAID was even a concept. I remember reading about it as a teen back in the '80s.
      Parity checking isn't going anywhere or being developed any further. It is not like with compression algorithms, where the latest one is able to compress data even harder than it's predecessor. Parity checking is what it is, the more parity bits you add to your circuit, the larger any portions of potentially corrupted data can be reconstructed/recovered. And yes, it can be emulated in software as a lot of circuits can today. But it is a way of building an error-correction circuit with bitwise logical gate IC chips (or in logic arrays like PLAs, GALs, or FPGAs, etc.) completely without the need to wait for a CPU or even a MPU to finish running any code. The parity data is ready the very instance the transmission has been received. This operates at the "bare metal level" as we old school computer nerds used to say (even though "bare silicon level" would probably have made more sense)... drivers or software are much much further up the "food-chain"... along side such phenomena as compatibility issues. Done correctly the OS doesn't even need to know that it exists.
      - Sadly YT won't let me post any links... But go find and checkout Ben Eaters videos... you will see what I mean.

    • @DanielSmedegaardBuus
      @DanielSmedegaardBuus Před rokem +1

      @@Zhixalom It sounds like you're talking about some kind of hardware accelerated parity calculations? It sounds useful for stuff like server farms or the like, but it doesn't solve the issue of end-to-end data integrity checking and recovery. Hard drives actually already have error correction via ECC data for each sector, these days 4k each. This will work - for data integrity on the platter only - so long as the corruption that has occurred on-disk is not greater than what the ECC data can repair. Then we have RAID parity setups, hardware or software based to deal with more massive damage, all the way up to whole disks dying altogether. The point Michael is trying to make is that all of these approaches, hardware or software, and including the product you're speaking of if I've understood it correctly, so long as they're not integrated into the filsystem itself, won't be able to detect or repair damage that happens in-flight or in-memory. Data can be corrupted in memory (unless it's ECC memory, as in servers), by a faulty CPU, or while in transit either from or to storage. Only a checksumming filesystem where integrity checks are performed in-memory post retrieval such as ZFS, btrfs or ReFS (APFS promised this, but AFAIK they still haven't fully delivered (only metadata is checksummed)) can detect such a corruption. As a personal example I have, I had a massive ZFS array running on a Linux box with SATA PMPs that had a kernel driver bug when running in SATA-300 mode, which caused data transferred to have thousands of errors every few minutes when fully saturated. I had no idea for almost a year, until I mirrored a SMART faulting drive for replacement, and discovered that I was completely incapable of copying even a single megabyte off the drive repeatedly getting the same hash sums even twice. I then started checking the other drives and found that each and every one of 15 drives running off these PMPs where producing error-filled data when read from them raw. I then debugged ZFS and discovered the torrent of failed reads that it was experiencing - after having read blocks successfully from the drive with no crc errors and no controller errors reported - and silently retrying until it got back what it know was good data. I "fixed" the bug by forcing SATA-150 speeds, and ZFS performance increased massively as a result, as reads where now almost always good instead of almost always bad, and it no longer had to retry until receiving good data. Same for writes, which are by default read back and confirmed in ZFS, then rewritten if bad. Had I had a regular filesystem here, perhaps even running RAID with parity, all my data would have been destroyed. I'd have parity to ensure that it would remain destroyed in exactly the same way going forward, but any software RAID couldn't have prevented the data corruption that happened afterwards on the SATA channel, ECC on the disk neither, and neither can hardware RAID know if the arbitrary blocks sent to it were actually damaged since the fs sent a write request.

    • @jhgvvetyjj6589
      @jhgvvetyjj6589 Před 11 měsíci +1

      Xor parity is an extremely simple bitwise operation anyway. Recover any disk by xor of the other two and the runtime of the xor will be vastly outweighed by disk read time anyway.

  • @gpturismo
    @gpturismo Před rokem +5

    So I wonder if this is salvaging of WinFS from longhorn/vista. If you remember that was suppose to be more a full journaling system they ended up shelving around 2005/2006

    • @zoomosis
      @zoomosis Před rokem +1

      No, WinFS was a metadata database store on top of NTFS. ReFS serves a different purpose.

    • @gpturismo
      @gpturismo Před rokem

      @@zoomosis Ah I forgot. Thanks for the clarification.

    • @lawrencedoliveiro9104
      @lawrencedoliveiro9104 Před rokem

      WinFS was supposed to be like a relational database built into a filesystem. That was one of the three major technologies that were promised for Longhorn/Vista, all of which were abandoned before release.

  • @ddinfant679
    @ddinfant679 Před rokem

    I'm just happy ThioJoe showed up on my recommendation Feed.

  • @jarsky
    @jarsky Před rokem +1

    I used to have a Windows "NAS" which used ReFS on Server 2012 back in ~2012-2013. It was an array with 8 x 2TB disks. I had so many issues with early Storage Spaces and ReFS. I ended up building a new RAID6 array on an LSI MegaRAID, migrated the data, and never looked back.
    I think of ReFS not so much as a replacement file system, but really as a specialist file system like ZFS. I'm surprised that ReFS is really still in development...I think the biggest benefit and reason it was originally built was for Hyper-V, and well as we know On Premise Hyper-V got EOL'd with Hyper-V 2019. I wouldn't be surprised if ReFS got canned as well, I cant imagine theres much use of it when theres much better solutions like NetApp, StorServ, 3PAR, hell even ZFS solutions like TrueNAS Enterprise.

  • @Yves_Cools
    @Yves_Cools Před rokem +3

    @ThioJoe : I think that it's pronounced "Ree-F-S", not "R-E-F-S" (hence the smaller letter "e" instead of capital "E")

  • @yocobicus
    @yocobicus Před rokem +4

    ReFS is a neat type only for programs that are used for backing up files. The files that land on ReFS should already be compressed, duplicated, and encrypted by the program that uses that volume. You want an error correcting, resilient file location. Then set the Block Size to 256-512 to match the program so when to program backs up each block there is not any wasted space at the end of each physical block saving you Gigabytes to hundreds of Gigabytes of space. Also matching the Block Size will allow you to have better OI speeds.

  • @zanbalek13
    @zanbalek13 Před 11 měsíci

    Great I watched this video before formating my flash drive as ReFS ..thanks

  • @JustPyroYT
    @JustPyroYT Před rokem

    Interesting Video as Always 👍👍

  • @funtechu
    @funtechu Před rokem +4

    The parity system is actually pretty simple, and just exploits a neat property of the XOR operator.
    If you have three sets of binary data of equal length, A, B, and C, then with the XOR operator, if we set C = A XOR B, then we can recover A, B, or C regardless of which one fails from the other two:
    A = B XOR C
    B = A XOR C
    C = A XOR B

  • @GunnerSiIva
    @GunnerSiIva Před rokem +25

    WTF the like button got an animation

  • @olafschermann1592
    @olafschermann1592 Před rokem

    I tried out ReFS back in 2012. But then it was disabled by default for hyper-v unless you set a bit on each vhdx file. Also i needed deduplication and it was not supported in 2012. First time those issues where adressed was in winSrv 2016 or 2019. But i didn’t want to mess up and rebuild my storage. But it would have been a good choice because dedup on ntfs corrupted a large vhdx file every 3 years on 2 of my systems. VMs still running, but unable to copy the vhdx file as a backup. Time to try it out again. Thanks for the video. Also tested out repair tools like rstudio which successfully could restore refs on storage spaces back in 2012 - both brand new stuff back then.

  • @cyndacat76
    @cyndacat76 Před rokem

    i remember when you made the parody how to videos. i thought that shit was so funny. glad to see you're still kickin it with youtube.

  • @gamingthunder6305
    @gamingthunder6305 Před rokem +21

    fun fact ReFS was in a earlier version of windows 10 but MS decided to remove the feature and reserve it only for higher versions of windows
    also storage pools and parity are so soooo slow that its not worth it. i setup a truenas vm in hyper-v on my windows 10 pc that boots up automatically when my PC starts. I then added my 5 4tb drives directly to it and use ZFS which is the superior file system.
    there is a beta out there to bring zfs to windows but its far from prime time ready.
    if MS ever get parity writing speed fixed (27mb/s on 5 4tb drives on a ryzen 1700x) then i might consider switching back but storage pools and parity has been awful sines 2012. so i have very low hopes that ever happening.

    • @Rn-pp9et
      @Rn-pp9et Před rokem +1

      I was going to say the same. ReFS in Storage Spaces in any configuration other than 'Mirror' is slow. TrueNAS in VM is what I'm doing too. Got a zvol presented using iSCSI LUN to my PC. Works great, but would be very hard for non-nerds to set up.

    • @gamingthunder6305
      @gamingthunder6305 Před rokem

      @@Rn-pp9et funny that you commented this very thing as i was thinking about changing it from a mapped network share to iSCSI a few days ago.
      some programs cant access a mapped network drive. how complicated is it to setup iSCSI in truenas?

    • @Rn-pp9et
      @Rn-pp9et Před rokem

      @@gamingthunder6305 Easy, there's plenty of guides on YT itself. Just a few clicks.

  • @pyp2205
    @pyp2205 Před rokem +3

    Really a new file system?

  • @DanielSmedegaardBuus
    @DanielSmedegaardBuus Před rokem +2

    It absolutely can be used on a single drive. Just use /fs:refs with the format command in command prompt. Also, scrubbing can only be done on pools with redundancy. On non-redundant ReFS, it'll simply fail when reading a corrupted piece of data, which is still preferable to not knowing that you just read invalid data and get corruption or a crash.

  • @stealthzi7465
    @stealthzi7465 Před rokem

    You say you won't talk technical and your a tech channel...god give me hope

  • @kevinm3751
    @kevinm3751 Před rokem +3

    Lets bottom line it... Its NOT new, Its NOT a replacement, nor is it even a viable alternative, or a next gen for NTFS and unless you have a specific need that takes advantage of it.

  • @Serpensin
    @Serpensin Před rokem +3

    So it's more or less Microsofts attempt for RAID? 😕

  • @TheIvicask
    @TheIvicask Před rokem +1

    You can format external/removable USB drives to refs, veeam backup even recommends it for example.

  • @laz288
    @laz288 Před rokem +2

    ReFS shines in enterprise backup systems. The Data De-Deuplication feature saves a ton of space.
    I have a few drive arrays ranging from 60TB to over 100TB and NTFS works the best. I have lost data using ReFS setup via a Storage Pool where the meta data was missing on boot. I only use it now for my backup server. Would not recommend regular users to use this.

  • @MorganSullivan
    @MorganSullivan Před rokem +3

    Nothing beats Btrfs

  • @shubham8550
    @shubham8550 Před rokem +8

    is it only me who think this so called new files systems is cheap ripoff of RAID ?

  • @ShydenPierce
    @ShydenPierce Před rokem

    dude I remember when I was like 8 I was talking to my dad about the batteries on ethernet cables and didnt listen to him even though at the time he would have had 30ish years of experience with computers. Nice to see you're making "real" content now

  • @tamodolo
    @tamodolo Před rokem

    I'm surprised. Last time I got recomended to this channel it was somewhat an "absurd as true" funny type channel. This is actually legit.

  • @vanodon2257
    @vanodon2257 Před rokem +3

    Sad to see shitdows hasnt switched to ext4 yet. More proprietary junk that isnt half as good. 💩 The joys of a system administerator having to deal with proprietary junk.

    • @AchmadBadra
      @AchmadBadra Před 7 měsíci

      EXT4 and linux stuff is junk 💩

    • @vanodon2257
      @vanodon2257 Před 7 měsíci

      @@AchmadBadra you say that when even enterprise standard for a stable 100% uptime filesystem are unix only. ZFS is the defactor standard and is For Linux / BSD. Windows spyware is too inferior to support such reliable options

    • @vanodon2257
      @vanodon2257 Před 7 měsíci

      @@AchmadBadra I'd just like to interject for a moment. What you're referring to as Linux, is in fact, GNU/Linux, or as I've recently taken to calling it, GNU plus Linux. Linux is not an operating system unto itself, but rather another free component of a fully functioning GNU system made useful by the GNU corelibs, shell utilities and vital system components comprising a full OS as defined by POSIX. Many computer users run a modified version of the GNU system every day, without realizing it. Through a peculiar turn of events, the version of GNU which is widely used today is often called "Linux", and many of its users are not aware that it is basically the GNU system, developed by the GNU Project. There really is a Linux, and these people are using it, but it is just a part of the system they use. Linux is the kernel: the program in the system that allocates the machine's resources to the other programs that you run. The kernel is an essential part of an operating system, but useless by itself; it can only function in the context of a complete operating system. Linux is normally used in combination with the GNU operating system: the whole system is basically GNU with Linux added, or GNU/Linux. All the so-called "Linux" distributions are really distributions of GNU/Linux.

  • @Blarnix
    @Blarnix Před rokem +4

    Yay, yet another file system that won’t be supported by anything but windows!

  • @SirFaceFone
    @SirFaceFone Před rokem

    Gotta love how this channel features lesser-known Windows features.

  • @GYTCommnts
    @GYTCommnts Před rokem

    Ultra interesting!!!!!!!! Thank you very much! 💪

  • @smith4591
    @smith4591 Před rokem +3

    So basically, what Microsoft has done was, rename RAID 1, 5 and 10 with their own names and then introduce it as ReFS.

    • @jeschinstad
      @jeschinstad Před rokem +1

      No, they add COW and a lot of other things, so that is not a proper representation.

  • @JamesNashWalker
    @JamesNashWalker Před rokem

    Really a helpful video 😄

  • @GameIT.
    @GameIT. Před rokem

    Thx for the info tech master!!

  • @TechLord79
    @TechLord79 Před rokem

    From the little I read, file-level snapshots are more about time and space efficiency - as you showed Previous Versions, recovering a single file from a snapshot (shadow copy) has been possible for a long time on NTFS with VSS as a system service in the background. But true, you had to explicitly take out folders and files from the snapshot and restoring a whole snapshot restores the whole volume with data loss of all stuff since.
    ReFS file-level snapshots seem to target one specific use case in particular (although not limited to), snapshotting virtual disks of virtual machines spot-on; this prevents having to snapshot a whole volume in the terabytes of a production Hyper-V server and the hassle of mounting a snapshot of such a huge volume somewhere (assuming some enterprise storage) and then copying out (recovering) some VHDXs -- which may also take a lot of time. Hyper-V has snapshots of its own but not without some implications and gotchas -- so for those in datacenters I see why this ReFS feature might be very handy.

  • @deilusi
    @deilusi Před rokem

    5:40 if you make VM it reserves set amount of disc (if you pick full reservation, not "grow as you go") When you make saves of VM it will save all "reserved" space. If you use compression on that, which is standard to avoid 20GB backups/images, every bit that is not zeroed, causes issues for compression.
    while creating images of VM's, it is a good practice to zero-out whole free disc space, it can make 3.7 GB image/backup into 2.1 GB one, just because compression algorithm don't have random strings in free space. This fs does that to any unused but still reserved space, so any VM images or operations on the whole thing, as backups/restores/loads will work with much smaller images, and that speeds up a lot of things.

  • @ReivecS
    @ReivecS Před rokem +2

    You don't have a "parity drive", people often say this to make visualization easier but parity bits exist on all the drives and striped across with the actual data bits. Also the calculation of how parity works isn't nearly as complex as you might think, it is just XOR calculations. This is very easy in a 3 drive RAID 5 example because if you take 8 bits (to make it easy to write out) and think of each 4 bit chunk as being what is striped to a single drive, then you get 2 data bits, if you XOR those you will get your parity bit. If you lose any of your data bits you just XOR the remaining one with your parity and you will get your missing piece of data. This gets more complex after 3 drives but the basic concepts are the same.

    • @fat_pigeon
      @fat_pigeon Před rokem

      Technically true for actual parity (e.g. RAID 5 on any number of drives). But for RAID 6 (double-parity)-type systems, the "parity" is not actually parity, but something more complex. I think they use either Galois fields or some sort of Reed-Solomon code (not very familiar).
      That said: as far as I understand, the main performance issue with parity RAID is not computational cost, but rather fragmentation due to having to treat every write as its own stripe to compute parity. The "mirror-accelerated parity" feature sounds like it mitigates precisely that problem by computing the parity asynchronously, likely after a larger amount of written data has been accumulated. I believe Bcachefs uses the same technique for its parity RAID support.

    • @ReivecS
      @ReivecS Před rokem

      @@fat_pigeon You basically just went in slightly more depth on what I already said: "This gets more complex after 3 drives but the basic concepts are the same". Reed-solomon is commonly used and allows for much more complex striping and parity variability for systems like ceph, but at the end of the day it is all somewhat based on the same ideas. They just get more complex and build on each other more and more until it gets a little too hard to explain without breaking out math proofs. I have done a little work with ceph and the reed-solomon algorithm but I wouldn't attempt to break it down any further on something like youtube (Plus I am far from an expert on the minute details)

  • @KeithGraves
    @KeithGraves Před rokem

    If I remember correctly when ReFS came out is was really only meant to be used for storage on Hyper-V hosts. It wasn’t really meant to replace NTFS for normal workstations or VMs.

  • @duet_1959
    @duet_1959 Před rokem +1

    So I want to ask something, the Block Cloning feature suggests a pointer to the original file, what if the drive containing the pointer file needs to be taken away, will the file be actually stored? Also, the same for the drive containing the original file and it gets removed, will the other drive persist it’s original copy or the pointer?

    • @fat_pigeon
      @fat_pigeon Před rokem

      This filesystem isn't meant for removable media at all, but for an array of internal drives that are generally removed only when they fail. In the scenario you describe, if you pull one of the disks, the filesystem would treat that as a drive failure. CoW is separate (it would work on a single-drive volume); when used together with mirroring, *both* drives would have a copy of *both* the original and the modified file.

  • @mire873
    @mire873 Před rokem

    How did you enable the fancy drive paths for powershell ? I've seen people do that for the Linux terminal before I just don't know how to do it.

  • @mishortProductions
    @mishortProductions Před rokem +1

    I think the limitations are there for a reason. In modern IT it‘s considered a best practice to keep the Production data and configuration data separately. So the server would boot on a NTFS drive and the user data, Databases, VMs are stored on ReFS.
    If following best practices the configuration would be stored as Ansible playbook (or other deployment toolchain). Because restoring a machine from a snapshot is always a pain in the …
    Easiest is to just make a automated fresh install, connecting the user data later.

    • @KC-rd3gw
      @KC-rd3gw Před 10 měsíci

      Using zfs combined with sanoid/syncoid for VMS and lxcs allows near instant rollback compared to using more traditional backups. My setup has automatic replication of VM zvols every hour taking up very little space on my pool.

  • @6alyssar
    @6alyssar Před rokem +1

    Hello :D havent watched the video but already know its gonna be good

  • @stanleybowman-hood6194

    Cool your talking about this as I have know about it for a while

  • @UNICORNSF3ProgameplayProRACER

    Great sharing!

  • @rayoflight62
    @rayoflight62 Před rokem

    VDL is a dream with there are many small files, smaller than the block size. Having the remainder of the block filled with 0s instead of random file fragments make data recovery and disk-level maintenance a very clean and safe operation.
    The ReFS is mostly intended for enterprise and data centres...

  • @Kata_Mae
    @Kata_Mae Před rokem

    the parity drive is easy to do : compare bits from disk 1 and 2, if the same put 1 otherwise put 0. The parity drive wil basically say "is it the same bit".
    So if you lose data on 1 disk, you can find the other with this "is it the same".

  • @Mazxlol
    @Mazxlol Před rokem

    It's always so fascinating someone comes up with all these new technologies... Some people are insanely smart.

    • @jeschinstad
      @jeschinstad Před rokem +1

      Yeah, they're called Sun Microsystems and they really were quite smart back in those days. From the limitations, ReFS seems to be a very cheap copy of the original though. Everything that ReFS can do, is possible with Ubuntu with both BtrFS and ZFS and it has none of the limitations listed in the end. But in addition, on Ubuntu, you not only can use them on single drives and boot from them, but you can also put them in files to use as internet-movable drives with full disk encryption.

    • @fat_pigeon
      @fat_pigeon Před rokem

      @@jeschinstad To be fair, the "mirror-accelerated parity" mentioned seems like a genuinely new feature. Bcachefs recently added parity RAID support, and from what I understand it's using the same technique.

    • @jeschinstad
      @jeschinstad Před rokem

      @@fat_pigeon tell me whats new under the sun. :)

    • @jeschinstad
      @jeschinstad Před rokem

      @@fat_pigeon: ZFS and Btrfs were designed for this from the get go, though?

  • @michaeldavis9772
    @michaeldavis9772 Před rokem

    Good to know. Just going to store this in my side bits tool belt.. Got it, just in case.

  • @logancrosier3298
    @logancrosier3298 Před rokem

    What did you do to get the settings that changed the appearance of your powershell command line?

  • @demonx7735
    @demonx7735 Před rokem +1

    learned something new today 😍

  • @jsrodman
    @jsrodman Před 2 dny

    File metadata does not include the location. In all modem filesystems, the location is not a property of the file at all, but a consequence of the directory structures that reference it and that referencev this directories and so on. When a file is repositioned (moved) within a filesystem, its metadata might not change at all or even be touched.
    On windows, certain caveats apply because of weird legacy logic around short filenames,depending on version.