these compression algorithms could halve our image file sizes (but we don't use them)

JentGent

zhlédnutí 168 409

Přidat do
- Můj playlist
- Přehrát později
Sdílet

Sdílet

Vložit

Velikost videa:

Zobrazit ovladače přehrávání

Automatické přehrávání

Přehrát

čas přidán 11. 09. 2024

Komentáře • 506

@JentacularGent Před 25 dny ⁺²⁵³
TO CLARIFY: don't be afraid to use these algorithms! the patents for arithmetic coding are all expired, and the only justifiably patentable material in Microsoft's patent are specific implementations of adaptive-probability rANS (which JPEG XL actually doesn't use). Microsoft has also given anyone permission to use it in open source, royalty-free codecs
at around 7:20, i put "NH(X) = log_2 X" ... H(X) is the entropy of a symbol, not of the code! X on the left is a symbol, and X on the right is the code ... sorry! let me know of any other corrections. and thank you for watching!!
I saw some mixed reactions to the music. I made a poll about this for future reference, but I don't know how to link it here, so please check it out in my channel
@JorgetePanete Před 25 dny ⁺³
Please reupload it fixed.
@gardian06_85 Před 24 dny ⁺³
for the purpose of encoding files even the full ASCII set wouldn't either of these methods fall down because instead of being base 10 we are base 255 converted into base 2. granted the the opcode values would have very low occurrence for a probabilistic approach (unless the files contains assembly) then how do these algorithms deal with character sets like UTF16 or UTF32? wouldn't that imply that even the arithmetic approach would need a given char value to be 32 bits at which point we are in almost non compressed state?
@wirelyre Před 24 dny ⁺³
@@gardian06_85
UTF-8, UTF-16, and UTF-32 are all encodings for the same character set. So you could compress a UTF-32 file by encoding it as UTF-8 and then using one of these methods.
You could also use 64-bit integers, or even greater bases.
But most compression formats don't encode bytes directly. They encode symbols that are instructions to the decoder, like a little machine. "Copy the next three bytes to the output stream." "Look back 40 bytes in the output and copy 25 again."
@mehmetdemir-lf2vm Před 22 dny ⁺³
please remove background music because it distracts. we can always listen to music while watching videos, but we cannot remove them from videos.
@meneldal Před 19 dny
You can also get the code from the h264 or hevc reference encoder (which both should only use expired patents by now) for how they handle the variable probabilities and how they implement it. Basically there's a bunch of bins that store probabilities and they have parameters for a starting probability and how much it should be updated each time.
@leeroyjenkins0 Před 25 dny ⁺¹³¹⁷
0:47 intellectual property law is a completely fair and balanced system with no exploits whatsoever
@KillFrenzy96 Před 25 dny ⁺²⁹⁷
Why the hell can you patent something from the public domain? It should never be enforceable, because anyone can claim they obtained it from the original public domain source itself.
@ryansullivan3085 Před 25 dny ⁺⁸⁶
One of the most frustrating things I've ever seen
@Gelatinocyte2 Před 25 dny ⁺⁵⁴
Spiffing Brit reference detected
@JarheadCrayonEater Před 24 dny ⁺⁶
@@Gelatinocyte2someone gets it
@viliml2763 Před 24 dny ⁺⁷⁷
@@KillFrenzy96 They didn't patent it directly, they patented some straightforward expansions and optimizations that virtually everyone would need to use in practice.
@purplenanite Před 25 dny ⁺⁵⁶⁶
If there were enough algorithms to do this with, I could imagine making a series of "Algorithms you can't use"
@au-lit Před 25 dny ⁺⁴¹
There totally is a lot of them in this situation, though usually there is good alternatives.
@Xankill3r Před 25 dny
There is a whole book of these and even that just barely scratches the surface. Look up Math You Can't Use by Ben Klemens. It's over 2 decades old now so I'm sure we could easily write several new editions of it with newer patent BS like this one.
@beeverfeever4930 Před 25 dny ⁺⁵
If I had to guess, there most likely would be plenty of algorithms you can't use
@arossfelder Před 24 dny ⁺⁵
I would not recommend watching this video, it's very dangerous for a developer because it makes you liable if you end up using one of them
@DwAboutItManFr Před 24 dny
You can use in FOSS projects no?
@timseguine2 Před 25 dny ⁺⁴⁷⁰
Patents suffer from the problem of trivial extension. The typical thing I have noticed is that you can take a well known thing, not change it at all, and use it in a context it hasn't really been used before. Then patent that.
I can't patent Huffman Coding? What about Huffman Coding... in space?
@TreeLuvBurdpu Před 24 dny ⁺³
You can't patent things that are in common usage. You have to be the inventor.
@anonymousalexander6005 Před 24 dny ⁺⁷⁹
@@TreeLuvBurdpunope, as long as you can prove it’s a novel idea, there’s a possibility of getting a patent. It all depends on how good your lawyer is…
@timseguine2 Před 24 dny ⁺⁶²
@@TreeLuvBurdpu tell that to the patent officer that approved some of the patents I have been directly or indirectly involved in.
In theory, completely agree with you. In my experience that is not the case in practice, and the concrete application is more important.
I haven't read the actual patent myself, but I found out one of the teams I worked with successfully got a patent for using CNNs for image classification in a certain medical context. They didn't invent using CNNs for image classification.
@thisisabsolutelystup Před 24 dny ⁺²²
Part of the problem is that the standards of whether something can be patented or not is completely variable. In theory it has to be original and demonstrate an inventive step - something not obvious, something the inventor thought of or did.
So Microsoft taking Huffman encoding and simply applying it to a few common critical structures and patenting that should not pass the inventive step threshold, because it's an obvious application that anyone using Huffman encoding would do.
Patent law, especially in the software space, needs real reform somehow.
@timseguine2 Před 24 dny ⁺⁸
@@thisisabsolutelystup The huffman encoding in space example wasn't real. I was referencing the "recycled in space" trope. Maybe I am too chronically online.
@LuveelVoom Před 24 dny ⁺⁵⁶
jarek duda: check out this really cool compression algorithm I made! it’s free and unpatented and twice as good as Huffman coding.
Microsoft: check out this really cool compression algorithm I totally didn’t steal from jarek duda! it’s proprietary and patented and twice as good as Huffman coding.
patent office: yeah, we don’t have it on file. Here’s your patent, Microsoft
jarek duda:
@jb31842 Před 24 dny ⁺¹²
The "I made this" meme would have encoded the same information but vastly shorter
@LuveelVoom Před 24 dny ⁺²²
@@jb31842 Unfortunately it was patented by Microsoft.
@safebox36 Před 24 dny ⁺¹⁸⁶
It's actually being used in JPEG XL.
The problem is so few programs support it despite its backing from every major web browser, photography company, open source organisation, and cybersecurity group.
Cuts the size of PNGs in half in most cases while still retaining lossless quality.
@keisisqrl Před 24 dny ⁺⁷³
Google pulled JPEG XL from Chromium in favor of their own webp which effectively killed it as a web format, Mozilla and WebKit (Safari) followed suit.
@JohnDoe-jk3vv Před 24 dny ⁺⁴
JPEG XL is not lossless
@darthjaffacake8573 Před 24 dny ⁺⁴⁴
It can be, just depends on settings
@JohnDoe-jk3vv Před 24 dny
@@darthjaffacake8573 I was wrong
@connor_santoyo Před 24 dny
@@keisisqrl Google killed JPEG XL in favor of AVIF, not WebP. WebP is no longer being developed. Mozilla did not kill off JXL either, it is still available as a nightly flag. And Apple has full support for JXL across all their OSs including WebKit.
@Manabender Před 24 dny ⁺⁵⁸
6:13 Took me a second to figure out how clever this was.
Assuming you're playing a game of hangman in which the word to be guessed is an actual, English word, it could be plane, plant, plank, or plans. In that case, saying that the last letter is a vowel is worth two bits of information (as it divides the space of valid possibilities by 2^ *2* ). Saying that the last letter is a letter is worth zero bits, given assumptions.
@wikiPika Před 25 dny ⁺¹⁷⁵
1:25 "hello, how are you? fine, thank you"
@gamerdomain6618 Před 25 dny ⁺³⁸
oh my gaah!
@puppergump4117 Před 25 dny ⁺²
I know that from a meme someone sent and now wonder how relevant it is to the world
@froxcey Před 24 dny ⁺¹⁹
Hello everynyan
@4.0.4 Před 24 dny ⁺¹³
@@puppergump4117the anime was surprisingly influential in that it also coined the word "waifu" from one teacher saying "mai waifu".
@xhbirohx2214 Před 24 dny ⁺¹
im out of the loop, can someone fill me in?
@jeanf6295 Před 23 dny ⁺³³
This a neat video, and an interesting subject, however, it was a bit too fast paced for me. A few could be done differently :
- Sometimes, you list goals, but do not put those in written format, which makes things harder to follow.
- Having a synthetic representation of the algorithm as a whole could also help a lot, seeing the states of the variables evolve in real time is neat, but insufficient.
- Getting the meaning of abstract notations takes some time, using a more explicit naming convention could help.
@Anon-do7lo Před 22 dny ⁺¹⁹
My data compression professor last semester made really good points about how Huffman still has a place. It’s just a substitution thing and you can simply read the encoded text if you have the scheme. It’s a lot more performant when it comes to decoding.
@jursamaj Před 19 dny ⁺¹
Yes, the best compression schemes have multiple algorithms, and choose the appropriate one. Nobody is saying to just throw Huffman away.
@anon_y_mousse Před 24 dny ⁺²⁵
I'm of the opinion that software shouldn't be patentable, but also that copyright should expire after no more than 28 years, although 14 would be preferable. Years ago, back when I was in middle school, I remember playing with a compression program written in QBasic which used arithmetic coding. I thought it was pretty cool, but at the time didn't know how to use it. That was significantly longer than 17 years ago now, and even if the idea were capable of being patented, it definitely should no longer be as 17 years is the limit for a patent. Of course, I also think that our patent system needs to be fixed in more ways than one, including that people with technical knowledge should be the ones to review patents in technical fields and the law, which gives us the criteria that they should be novel, should actually be applied and prevent patent squatting as MS, and many other corporations acting in bad faith, have done.
@Xankill3r Před 25 dny ⁺²⁴²
We need another edition of Math You Can't Use by Ben Clemens just to cover this BS. Well, I mean we probably need a reconsideration of the patent regime first. Software - being just maths - should *not* be granted patents at all. If the whole idea behind patents was to foster innovation then "we've been sending completely unnecessary amounts of data over the internet because patent trolls" is definitely evidence that it doesn't work that way.
@mytech6779 Před 24 dny ⁺⁵
Most of the time software cannot be simply patented, however source code is covered by copywrite. They are not the same thing. There are a few exceptions but you need to dig into the specific cases to have any meaningful discussion of that area.
@Xankill3r Před 24 dny ⁺²⁰
@@mytech6779 copyright doesn't matter here since we are talking about patents. Copyright also only applies to the specific implementation - or expression - of an idea. Therefore although there are problems with copyright as well it doesn't at least give someone a complete right over an idea.
@wb3904 Před 24 dny ⁺¹
Maths and algorithms are fundamentally different. Computers have more in common with logic than maths.
@Turalcar Před 24 dny ⁺¹³
@@wb3904 logic is maths
@wb3904 Před 24 dny ⁺²
@@Turalcar no maths is based on logic not the other way around
@denislamarche224 Před 24 dny ⁺²⁹
I can't tell you how far above my head this video is....but it approaches infinity.
@aceman0000099 Před 23 dny ⁺⁴
You could probably understand it if it was something physical like sorting books and you did it as a job for about an hour straight
@Chalkadoo Před 24 dny ⁺⁸⁵
So what I got from this video is that we need patent law reforms and also Microsoft needs to get its shit kicked in REALLY hard?
@DiThi Před 24 dny ⁺³⁰
Better: software patent abolition. We don't have them in Europe and we don't want them.
@i-am-linja Před 24 dny ⁺¹
@@DiThi Nice, but is there a way to _prevent_ an American company from patenting your idea?
@i-am-linja Před 24 dny ⁺¹
@@DiThi Or failing that, to prevent said patent being enforced in your jurisdiction?
@DiThi Před 24 dny ⁺⁶
@@i-am-linja I thought prior art would be enough, and maybe it's still enough and the patent for ANS wouldn't really hold in court, who knows. Patents are not enforced in my jurisdiction, they can't. However we live in a global world and using US patented algorithms would prevent me from publishing my software in the US.
@kristianTV1974 Před 25 dny ⁺⁴⁴¹
Typical fucking Microsoft, trying to patent something in the public domain.
God how I hate that company.
@crix_h3eadshotgg992 Před 24 dny
@@koghslook up the Microsoft hate website. All tech corporations are dog shit evil.
@serkandevel7828 Před 24 dny ⁺⁴⁹
@@koghs Microsoft is notorious for being patent-trolls towards FOSS
@PefectPiePlace2 Před 24 dny ⁺⁷³
the problem is copyright/trademark/IP law, Microsoft is a symptom
you should blame the root cause if you want the problem to be fixed
@4.0.4 Před 24 dny ⁺¹³
It's usually said of journalists but, "you don't hate Microsoft enough" 😂
@AlexGeek Před 24 dny ⁺⁵
They used to do that a lot before. But now they try to compete more with online services.
@Flourish38 Před 24 dny ⁺⁴⁹
Saw some comments complaining about the music, and just wanted to say that I disagree completely. Having some music is nice, and the choices you made are very low energy and out of the way anyways.
@yuriiknapik7241 Před 24 dny ⁺⁴
Yeah i can see your point.Maybe he could sidechain the music so when he speaks the music dives in volume more.
@el_quba Před 25 dny ⁺⁸⁵
This is an excellent explanation no doubt, but I must be honest I find it very hard to follow and understand
@kitamashi Před 24 dny ⁺³⁰
It is only a good explanation if you are familiar with the jargon, in other words, it is a bad explanation.
@jmsether Před 23 dny ⁺¹²
It's basically an academic paper in video form. I found the information to be well laid out and easy to follow.
@atiedebee1020 Před 23 dny ⁺⁸
If its hard to follow and understand, its not a good explanation
@arduano Před 24 dny ⁺⁸
Thank you so much for sharing this! I was implementing LZMA earlier this year (for fun, and to learn how it works), and when I got to LZMA's range encoding component I realized how genius this approach is. It took me days to understand, but your video would've helped so much if I was around haha
@dumdum7099 Před 25 dny ⁺⁶²
This is way above my paygrade. I need more readings before I watch this video.
@05degrees Před 24 dny ⁺⁹
Haven’t watched yet but there’s a neat thing: arithmetic coding, or a closely related thing, allows one to convert a sequence of fair N-sided coin outcomes to any other discrete uniform distribution without rejection, though if you don’t want to reject you’ll need unbounded memory, and if you want to flush time to time, you’re effectively rejecting some information you’re getting from the initial sequence. But at least you can fine-tune how much you want to reject/save and change it over time!
Maybe it’s already in the video but if it’s not, here. Also it’s as if there’s some duality about entropy generated by rejection, non-uniformness introduced if we decide to reject only a given number of times and then be complacent, working memory etc.. I don’t know in what kind of inequality this can be written but I hope information/coding theorists have it already discovered and generalized similarly to what happened to the uncertainty principle (in QM and Fourier analysis-now there exists a general operator algebra approach IIRC).
@DumToasty Před 24 dny ⁺⁸
1:23 Azumanga Daioh reference 「I wish I were a bird.」
@dl5244 Před 18 dny ⁺⁵
I remember seeing a demo for fractal encoding instead of JPG in the early 1990's in the days where 14.4kbps modems were still very expensive so I only had 2400 baud dialup. The demo was a real-estate website with pictures of a houses. Interlaced jpg images took a few minutes to download and view, where as the new fractal format was
@leeroyjenkins0 Před 25 dny ⁺³⁷
Slight gripe, at 9:00 it was a bit confusing that you described the operations as "outputting a bit representing which half we're in" and "centering and renormalizing" instead of just saying we want to find a number within the range, that halving is just how decimal binary works, and briefly pointing out why the "centering and adding 0s" works.
Otherwise interesting video, and cool subject, thanks!
@MAD-SKILLZ Před 25 dny ⁺⁶
Yeah, he lost me at that part, too.
@k1ll3trs86 Před 25 dny ⁺²
I can see the intention of the video
But what he doesn't realize is that thinking in whole probability is not functional, that's not how machine works and it's ultimately extremely inefficiency.
It's a classic "looks good on paper" but in practice you are trying to force something that doesn't fit
@Kynatosh Před 24 dny ⁺¹
Took me some time wonderi'g what he was getting at but it works I guess
I understood at the end of his explanation that it was just writing the number in base 2
@anonymousanon4822 Před 24 dny ⁺¹
@@k1ll3trs86I don't understand what you're trying to say. This is not only good on paper but good in practice too. It is literally strictly better than huffman and huffman is used everywhere.
@ZeroPlayerGame Před 24 dny ⁺¹
@@anonymousanon4822 I think what's meant is that keeping the whole real number in memory and then writing it out in binary is a bad idea (because doing math on long real numbers is a bad idea).
@sniper9143 Před 24 dny ⁺⁶
I recently used an adaptive Arithmetic encoder and a LSTM for a custom compression algo, the arithmetic encoder was cool to work with
@Psychx_ Před 24 dny ⁺⁹
There is a FUSE file system called PiFS. It can store data using some sort of arithmetic coding where the resulting number itself gets compressed as an offset + length into the unlimited digits of Pi.
@vurpo7080 Před 24 dny ⁺⁴
Relying on the assumption that pi is a normal number... which is probably true but not proven!
@BridgeBum Před 24 dny
@@vurpo7080That would be one hell of a bug to run into some day.
@ThylineTheGay Před 24 dny
@@vurpo7080 even if it's not, you can just split the data you want to add, might even act as a kind of data de-duplication
@ThylineTheGay Před 24 dny
i would genuinely like to see someone make a semi-serious version of that
@rizqiv2 Před 19 dny ⁺⁸
i dont understand anything :(
@colonthree Před 23 dny ⁺⁸
1:14 Hellooooo everynyan! :3
@randomlegodev Před 25 dny ⁺²⁰
love me a good encoding algorithm breakdown 🔥
animations lookin snazzier
@4.0.4 Před 24 dny ⁺¹³
I'll be honest it was hard to follow, maybe I'm just dumb 😅
The music was fine imo, but if people complain you can "duck" it (the volume lowers on speech).
@jeffreychandler8418 Před 24 dny ⁺⁵
no you're not dumb. This video seemed to be made for an audience that already knows compression algorithms and information theory at a competent level.
@ArnaldurBjarnason Před 21 dnem ⁺²
@@jeffreychandler8418 Not even, I'm pretty familiar with many of the concepts but he's just running through the ideas with no space to breathe or re-emphasis on the critical aspects.
@jeffreychandler8418 Před 20 dny ⁺¹
@@ArnaldurBjarnason That's validating to hear xD I usually ascribe misunderstandings as a personal issue than a communicator issue (unless it's REALLY bad like the Zion NPS instagram or something like that).
As I think about this video again, honestly a two minute succint summary of existing algos would be great, then trim the fat on the new algos to make those simpler, and allow more breathing space.
@LovelyBozo Před 24 dny ⁺⁹
stuff like this is why I'm glad I took AP stats so I can come back and understand what I'm listening to later on this year
@tandemdwarf745 Před 24 dny ⁺³
I got a 5 on AP stats and I'm pretty lost lol. This video is less stats heavy, at least the stuff you learn in AP, and more computer science jargon heavy. It also just moves uncomfortably fast for the stuff I have some handle on.
@jazzerbyte Před 25 dny ⁺⁶
I wonder why Apple rolled out HEIC instead of just using arithmetic compression, considering it would already be incompatible with standard JPG?
@vylbird8014 Před 24 dny ⁺⁹
There is a technical and a business reason.
The technical is that HEIC - like the rival AVIF - goes far beyond just arithmetic compression. It's really an adaptation of techniques developed for video compression, and will wipe the floor with JPEG, even arithmetic-compressed JPEG.
The business reason is that HEIC, being based on the h265 video codec, is a heavily patented format - and one of the contributors to that patent pool happens to be Apple. Not only do they get to use it at a reduced licencing cost, but whenever another company wants to use HEIC, Apple gets a cut of the royalties. It's clear that JPEG is going to decline, and there are a number of formats competing to replace it - it's in Apple's best interests to make sure that the format they stand to profit from is the one to achieve dominance, and making HEIC the default format for taking photos on iPhones and iPads gives it a huge advantage - every photo-sharing service needs to at least support uploading HEIC, or else Apple users will find their photos won't share.
There's a rival format, AVIF, which is supposed to be patent-free (you can never be sure) and pushed by a rival consortium, with Google as the major backer. They aren't trying to score any patent money, but that doesn't mean it's entirely altruistic: Their aim is to promote an open and unpatented format in order to avoid having to pay up royalties to the likes of Apple.
@liam3284 Před 8 dny ⁺¹
because they have a financial interest in it
@JTCF Před 24 dny ⁺⁸
I am usually not in the "wow this is so cool but I didn't understand anything" group, but I think I might be with this vid. The video is paced pretty fast, if I wanted I would probably rewatch it. Currently I just feel like falling into a rabbit hole (which happens to me quite a bit) but not keeping up with the falling.
@picksalot1 Před 21 dnem ⁺²
The music is pretty, but distracting. Imagine trying to look at and analyze a Painting while large, pretty birds fly between it and your eyes.
If you're you feel it's necessary to have music, I suggest you lower its volume substantially.
@jb2170-mathmo Před 24 dny ⁺²
This video is like the perfect counterpart to my #SoME entry :D
Huffman Trees are the one part I left out of my vid, this is awesome
@dadamaldasstuff1816 Před 25 dny ⁺⁴⁵
You can use the JUSDTB compression (JUSDTB stands for Just Use Smaller Data Types Bro)
@dadamaldasstuff1816 Před 25 dny ⁺¹³
This doesn't actually exist, it's a joke.
@wall-e5869 Před 25 dny ⁺⁶
this is buhdj
bro
ur
hillarious
dad
jokes
@dadamaldasstuff1816 Před 24 dny ⁺¹⁴
I'm pointing at YOU, database users! You should know the data types and use them accordingly.
You don't need a string to store one of 3 states, use an enum.
You don't need an int for a number between 0 and 100, use a byte. It's 4x smaller.
You should always use unsigned if you don't need negatives. It doubles the positive range.
Also use decimal, it can store decimal places more precisely than float. If you don't have a horrendously huge range, just use it.
@ThylineTheGay Před 24 dny ⁺¹
I prefer πFS personally
@Kynatosh Před 24 dny
@@dadamaldasstuff1816wdym? A string can be two bytes of it holds just one char (and the null char appended). An enum is in most languages an integer, which can be 32 or 64 bytes and use up more space. I'm sure the db stores it at one byte though
I still agree that you should always use the right type though. Even helps you catch bugs early when you get screamed at for having the wrong type
@WiseWeeabo Před 25 dny ⁺¹⁴
literally anything can be interpreted as "a single number"..
@ZeroPlayerGame Před 24 dny
that's pretty much the point of the video yea
@quintopia Před 24 dny ⁺³
When i implemented arithmetic coding a few years back, i had no idea of the fraught patent history. Nice bit of info.
@lnee Před 21 hodinou
I realize I came up with the arithmetic numbering system algorithm when I was 14 while creating a chess compressor. Literally that algorithm.
@fedang Před 21 dnem ⁺¹
Imagine leaving your hard work public for the sake of humanity and greedy corporation takes credit for it and ends up forgetting it in their patent drawer
@TreeLuvBurdpu Před 24 dny ⁺⁵
I'm stuck on the variable length encoding in your explanation. "Shorter codes encode more frequent symbols". How do you indicate the length of bits that should be decoded into a single symbol? Presumably, short codes would exist inside longer encodings, for instance "101” is inside " 1011”, and "10” is inside ”101”.
@inanefool8781 Před 24 dny ⁺¹⁵
In the Huffman algorithm, the codes used are designed as such that no shorter code is inside a longer code.
So in the example you've provided, 1011 would mean you have the symbol "101" and then you have another symbol starting with "1". The way huffman is usually taught in school, this involves following a tree, where 1 is right and 0 is left, and the decoder traverses that tree with each bit you read until you eventually hit a symbol, output that symbol, and then go back to the top of the tree.
with that explanation, the symbol encoded for 101 would mean you went right, left, right in the tree, hit a symbol, and went back to the top.
Hopefully this explanation is easy to understand, it would be better with diagrams but those are hard to make in youtube comments.
@tandemdwarf745 Před 24 dny ⁺²
@@inanefool8781 You can pause the video at 1:48 to see this effect; all the shorter codes are not contained within the longer ones.
@TreeLuvBurdpu Před 24 dny ⁺³
@@inanefool8781 that is actually a very good explanation that is ready to understand. Thanks. Hopefully i can remember it this time.
@antoine2571 Před 23 dny ⁺²
By the way a coding algorithm where this property holds (no word is prefix of another one) is called a prefix code.
Iirc from reading CLRS, a non-prefix code can always be transformed into a prefix-code, so usually textbooks focus on prefix-code.
Though I have very few knowledge about codes so I might be wrong.
@Cl0udWolf Před 23 dny ⁺¹
Great manim animations. It’s a shame ANS isn’t as widely used. There are similar patent issues with lossy compression algorithms as well, but those are even more complex since there are two vectors of optimization (compression and reconstruction accuracy) where the second does not have a concrete way to measure it.
@Aera223 Před 22 dny
There are some synthetic benchmarks, like 7zip used to show programs compressed with zip vs 7zip. A standardised benchmark may be useful, though compressors could overfit for that benchmark and look better than they really are
@Cl0udWolf Před 22 dny
@@Aera223 well zip and 7zip are still lossless, but as for lossy a benchmark doesn’t exist for that reason. Some of the reconstruction quality will be subjective. PSNR vs Compression Rate is the most common metric but PSNR can be high while having a very noisy output, or a singular spike of error, as long as it averages out. Some situations lossy compression will have rules for the application such as not deviating more than X amount, or always producing smoothed results
@Shirani007 Před 2 dny
What a wonderful presentation and justification of ANS you have posted. I loved watching it, although speech is a bit faster, I used .75 speed 😊. I would reccomend you create more videos explaining other compression algorithms too, which in fact have arithmetic roots. Highly recommend and highly appreciated dear !
Onw video I would want can be on different methods of approximation, and second on almost all, mostly forgotten bitcodes generation methods in 1 go. Also error correction, hamming codes, reed Soliman codes too , etc.
Looking forward to more content ❤
@joshuabrown4952 Před 24 dny ⁺³
I hope some google/amazon data centre engineers are taking notes, if they can hear the video over the water pumps out there in the Nevada desert.
@mind_11 Před 24 dny ⁺¹
I appreciate the many funny example words you used. Very epic video! Awesome visuals and interesting explanation!
@simonwillover4175 Před 24 dny ⁺¹
It was a pretty serious video. None of the examples are funny.
@Martin-km4gb Před 23 dny
ayaya
@koka3243 Před 19 dny
Entropy coding explained in 5 mins. Must be a new record 🎉
@pistachos4868 Před 24 dny ⁺⁷
1:12 azumanga daioh reference😺
@idehenebenezer Před 24 dny ⁺²
Revelation 3:20
Behold, I stand at the door, and knock: if any man hear my voice, and open the door, I will come in to him, and will sup with him, and he with me.
HEY THERE 🤗 JESUS IS CALLING YOU TODAY. Turn away from your sins, confess, forsake them and live the victorious life. God bless.
Revelation 22:12-14
And, behold, I come quickly; and my reward is with me, to give every man according as his work shall be.
I am Alpha and Omega, the beginning and the end, the first and the last.
Blessed are they that do his commandments, that they may have right to the tree of life, and may enter in through the gates into the city.
@MadocComadrin Před 24 dny ⁺¹
Oh my Gah!
@MadocComadrin Před 24 dny
For the folks interested in functional programming or calculating correct programs, Richard Bird's "Pearls of Functional Algorithm Design" has two chapters on arithmetic coding, one on the rational number variant and one on the integer variant.
@thedeemon Před 18 dny
I actually used both algorithms in my lossless video codec - ScreenPressor. Initially it was range coding (arithmetic coding with integers) but it involved division which isn't very fast. Then later version used rANS which is faster but requires compressing data in reverse order. The end result was pretty great: best in class compression and good speed (when the content is not too complicated, as in typical screen data).
@AhrkFinTey Před 22 dny ⁺¹
Nice to see someone who appreciates the second Arabesque!
@BGBTech Před 11 dny
A downside with a lot of these is that they are slow: Arithmetic coding = Very slow, Bitwise Range Coding = Rather slow (but faster than traditional arithmetic coding), LZMA / 7zip uses this. Huffman can be made semi-fast (and a little faster if one limits symbols to 12 or 13 bits vs 15 or 16; with a lower limit the decoder's lookup table can fit into the L1 cache).
Rice coding can also be made fast (more so when a similar length-limited is applied), but only works well with particular probability distributions; but can be turned into an adaptive scheme with relatively little added cost. One trick I have used is to combine Adaptive Rice with a "swap towards front" table (each symbol is encoded as an index into a table, and when encoded, the symbol at index J is swapped with the symbol at index (J*15)/16, which is used the next time this symbol is encoded). This scheme is adaptive, can be almost as fast as Huffman, and almost as good (in terms of compression) while needing less memory footprint (sometimes useful) and less code (also sometimes useful), and (unlike Huffman) the decoder lookup tables can be made read-only. But, a lot depends on what one needs...
@fuzzy-02 Před 13 dny
I understood about 25% of the video on my first try!
@marcfruchtman9473 Před 20 dny
It is important to understand that the Huffman Encoding requires that it build a "dictionary" (Huffman Tree) with the encoding and that must be included in the file so it increases the size of the compressed file. Additionally, the "encoding" used by Huffman is "Prefix-Free", meaning that no symbolic encoding can be a prefix for any other encoding used in the compression dictionary. It might have been one of the reasons why people were having trouble following. Maybe add the tree and prefix free quality in your next revision.
@steveunderwood3683 Před 19 dny
Arithmetic coding is widely used. It was the only option for some standards, even before the patents expired. So, it got used, patents or not. I've had to implement it for things like JBIG coding for FAX, where its specifically there to improve on the huffman coding in the previous compression methods used for FAX.
@scj643 Před 10 dny
AMR-WB+ is the same place for audio encoding/decoding. It's a very specific codec used in talking books that had its patent expire in 2020
@uis246 Před 24 dny ⁺⁵
You made Jarek happy. He looks pleased in JXL discord server.
@__MINT_ Před 23 dny
Quite interesting subject, I didn't treat probability as something related to my interest field, until now. A year ago I started writing my own lossy audio codec, which is done for now, but I didn't implement any kind of entropy coding. I tried Huffman, RLE, but only thing I got was bigger file size and increased complexity. The problem with these algorithms is that there must be a defined set of symbols and a strong pattern in encoded message, otherwise it won't work. Using raw binary sequences as symbols doesn't really work, since there is quite a lot of them and probabilities for each other sorted in decreasing order are far away from shape of 1/x function. But the only pattern I saw were groups of 0's with less 1's in between. I still can't understand how MP3 pre-processes values so that they are suitable for entropy coding. By saying values I mean quantized frequency coefficients after transform from time to frequency domain, MDCT in my case. After psychoacoustic analyzis I know which coefficients can be discarded and what precision to use for the rest of them. Coefficients are divided into subbands with fixed size, let's say 16, and then they are quantized with different precisions. Single frame can have subbands quantized with both 2 bits and 16 bits, although now I know that 16 bits was way too much even for almost lossless compression. Each subband has its scalefactor field and a precison field, saying how many bits are used per coefficient and what scalefactor was used to divide values to quantize them. Quantized values can be basically anything, I didn't see any strong pattern there. From what I heard, MP3 uses more or less fixed precisions for coefficients, and maybe that helps to entropy code them. But in my codec everything is dynamic and bitrate can vary from for example 20 kbps to 400 kbps for the same file and same quality setting, 20 kbps would be used to encode a single tone, while 400 kbps would be needed for a drum hit. Even quantized values are not fixed-length, for example with 3-bit precision, 0 would be just 0 in binary, 1 would be 100, 2 is 101, -1 is 110, -2 is 111. Scalefactors and precison fields also have variable lengths and are differential-coded. There are also magic values for those fields which can specify number of empty subbands, so few tens of empty coefficients can be stored on a few bits. And these aren't all clever mechanisms that I used instead of entropy coding. But I'd like to understand how to implement this coding for audio / how to prepare values for that coding. With that and some other changes I could create even better codec, which already outperforms MP3 a bit. It's called DAC (Dynamic Audio Codec), and I have a video about it. Unfortunately in polish, but today it shouldn't be a problem. I also shared the working codec (browser it all you need to run it), so take a look if you want. Anyways, thanks for the video and I'm waiting for the next one!
@__MINT_ Před 23 dny
Oh, and there is a link to a video about DAC. YT sometimes removes such comments, so I'll share just the video ID: Qy_AinciI9Y
@blacky7801 Před 24 dny ⁺²
Just do a binary search on the list of all words? brilliant!
@Kynatosh Před 24 dny ⁺²
Before 11:11, I was like great, this is cool but doesn't seem that helpful
After that simple change, everything makes so much sense
@galewallblanco8184 Před 19 dny ⁺¹
"this is one of the way your jpegs [...] compress data without losing information"
jpeg: lossless compression
jpeg does lose information, but it was optimized to not lose too much *meaningful* perceptible information (that is until you zoom in)
@beardymcbeardface69 Před 10 dny
JPEG uses both lossy and lossless compression. The lossy compression first, then lossless compression of that as a last step. Obviously with the final result being lossy.
The lossless last step is a variant of Huffman.
So to say that JPEG compresses data without losing information is a half truth. Badly worded on his part, I'd say, without clarifying that it uses both and the final outcome is indeed lossy.
@2false637 Před 24 dny ⁺¹
I still don’t quite understand why in the 10-symbol alphabet Huffman encoding does worse than the binary conversion (which outperforms the entropy?). Is it because, by treating the sequence as an entire number, we extract some hidden correlation/information? This is really fascinating.
@JentacularGent Před 24 dny ⁺³
entropy is a lower bound for _average_ code length, but there can (and should) still be messages with shorter codes. In particular, this message lent itself well for the binary conversion algorithm because its first digit was a 1 (i.e. it was small). Huffman performs worse in general because the entropy of a symbol is log2(10) ≈ 3.22 bits per symbol, which isn't a whole number
@Neptoid Před 20 dny ⁺¹
What about AI? It predicts the probability of the next tokens, why not the Hofphman thingy given those priors
@yuantan9292 Před 15 dny
The probability prediction and the encoding method are two separate steps, and the video just focused on the latter and not the former.
In most of the video, the probability is fixed (or at least known) and independent, so there's no need to predict anything.
In benchmarks using real world data, AI is indeed used for the probability prediction; for example, the top 23 (highest compression ratio, lowest file size) compressors of the enwik9 dataset all used some sort of AI and the current best (nncp v3.2) used a Transformer model.
In real life applications though, AI is too slow compared to simple, non-AI predictors, so the smaller file size is hardly worth the extra time and power cost. While nncp v3.2 can compress enwiki9 to 49% of that of zstd (zstd 0.6.0 level 22 ultra), it takes 9.6x as much memory and 683x as much time to compress and then decompress the dataset.
@Mega-wt9do Před 25 dny ⁺⁶
This channel is a hidden gem :O
@flameofthephoenix8395 Před 24 dny
2:40 Certainly! Small amounts of extra precision come about if you store each digit separately, this extra precision can be used in other places to be more efficient which is just what you're doing here, though, it doesn't work if each digit is capable of being a power of two different characters because there is no extra precision left over meaning that it will give the same result.
@RichardBronosky Před 12 dny
11:56 took me [I stopped counting at 6] times listening before I could hear "extra digit".
@isidor37 Před 17 dny ⁺¹
This is an interesting topic and discussion, but the presentation and explanations were really hard to follow. The music was *mostly* unobtrusive, but there were a couple points where it really came into the foreground due to the tonal content and became distracting
@Verrisin Před 21 dnem ⁺⁴
ah yes, patents - the bane of all good. Can we use cancel culture for good for once, and CANCEL all patent laws around the world? Thank you.
@Jonas_Meyer Před 15 dny ⁺¹
Maybe its just me but I find the music a bit distracting when following the video.
@SupaKoopaTroopa64 Před 20 dny
AV1, the codec I'm watching this video in right now, uses non-binary arithmetic coding. I think the patents only apply to binary arithmetic encoding.
@teltubes1 Před 24 dny
Great animations. I understood nothing about the proof but the examples are nice. Is there a specific software you are using for this presentation?
@Jedermeister Před 24 dny ⁺³
I am a maths person. I like and understand maths and have a masters degree but all of this went straight over my head. I feel genuinely lost.
@NorthWay_no Před 20 dny
Some day I will grok how you output a Huffman file with the weights included, or possibly how they are inferred from an adaptive version - and not taking up masses of space, relatively, for small files.
@sapiosuicide1552 Před 24 dny
This is the plot of HBO's Silicon Valley. The main protagonist Richard Hendricks creates a more efficient method of storing data and they talk about Huffman codes. Everything in this video went over my head but you should watch that show.
@juancarlospizarromendez3954 Před 24 dny
I suggest it for the research, to use nibbles 0..9A-F instead of decimal digits or binary digits (bytes as digits are too symbols).
@flameofthephoenix8395 Před 24 dny
0:28 While of course it is stored as a number, any piece of data on a computer is a series of 0 or 1 bits, there are not themselves numbers, but any series of them can also be represented by a number.
@liliangimenez4461 Před 24 dny
Thank you this helps me to understand arithmetic encoding and ANS.
Both (+b Huffmann) assume the symbols are independant. Could Markov chains improve compression rates, or techniques based on repeated occurrences makes Markov chains irrelevant ?
@arunkennedy9267 Před 24 dny ⁺³
3 blue 1 brown background music
@prdoyle Před 24 dny ⁺⁴
Patents tend to get granted to companies like Microsoft or IBM because they have more lawyers than the patent office. They grant them "by default" but I imagine it should still be possible to challenge the patent on the basis of prior art here, if someone wants to put in the effort.
@uis246 Před 24 dny ⁺⁶
Chalanging rANS patent requires 100k£(it was granted in UK)
@vurpo7080 Před 24 dny ⁺³
The problem is that Microsoft didn't patent ANS directly, they just took it and added some extremely general context to it, and patented it (imagine "you can't claim prior art, we didn't claim to invent ANS, we only invented the idea of using ANS to compress files on a computer!")
@uis246 Před 24 dny ⁺¹
@@vurpo7080 Embrace, Extend, Extinguish. Except without Extend part.
@CSMHD Před 19 dny
thank you great insight - good explanation - there were some wow moments - thanks
@Iamaduck888 Před 23 dny ⁺³
I really like this new CZcams algorithm
@brockobama257 Před 24 dny ⁺⁴
“Bits… or shannons” was funny
@uis246 Před 24 dny ⁺²
If you bit four times, you will nibble. Nibbling twice is called byting.
@idehenebenezer Před 24 dny
@@uis246
Revelation 3:20
Behold, I stand at the door, and knock: if any man hear my voice, and open the door, I will come in to him, and will sup with him, and he with me.
HEY THERE 🤗 JESUS IS CALLING YOU TODAY. Turn away from your sins, confess, forsake them and live the victorious life. God bless.
Revelation 22:12-14
And, behold, I come quickly; and my reward is with me, to give every man according as his work shall be.
I am Alpha and Omega, the beginning and the end, the first and the last.
Blessed are they that do his commandments, that they may have right to the tree of life, and may enter in through the gates into the city.
@albatroshd7945 Před 24 dny ⁺²
Funny words magic man
I'll come back when my personal memory is stored properly
@lainwired3946 Před 24 dny ⁺¹
Not finished the video, but are they as efficient at decoding? I realise theres a lot of overheads on modern devices, but decoding and rendering images happens so often in a normal user session, it does make some sense to trade minimal disk space in the large picture for faster decoding. Just an idea on another reason they may not have moved on either?
@arxalier2956 Před 24 dny ⁺²
Even if its slower to decode, just like zip files take a while to open, it would be worth doing it. Transferring compressed data through a network is worth the cost of computation and time it takes to encode/decode at the sender or receivers end, because transferring uncompressed data takes even more time and money than that
@lainwired3946 Před 24 dny
@@arxalier2956 not all images get transferred through networks though and webmasters have to make tradeoffs between optimization on their end and load speed on the users end. You lose something like 50% of your visits if the page still is loading after 3-5 seconds
@uis246 Před 24 dny ⁺¹
ANS is super fast at decoding. Especially tANS.
@lainwired3946 Před 24 dny
@@uis246 faster than Huffman? If so, interesting. Was just an idea that came to mind.
@uis246 Před 24 dny ⁺²
@@lainwired3946 oh. I thought you was asking ANS relative to arithmetic coding. Derp. Arithmetic coding is slower than Huffman and ANS I think is faster than Huffman.
@apricotcomputers3943 Před 21 dnem
blipblipblirip, you're jobs taken😂
@mullanalle4318 Před 13 hodinami
Really well done video. Thank you!
@romanhimmes171 Před 24 dny ⁺¹
I thought that x.264 uses arithmetic compression.
@puppergump4117 Před 25 dny
I think you'd make a great youtuber, depending on how long it took you to make this. If you keep the style and make some in-depth videos on things where online resources suck donkey like shaders or common library implementations you'd probably hit 100k before the year ends.
@user-ek7ow8sf4y Před 24 dny ⁺¹
That was quite interesting!
Also does anyone knows the song at 10:35?
@martinb.770 Před 24 dny ⁺⁹
Another title ignoring reality: Why talk of "halving image file sizes"? Most image formats are lossy and Huffman/Arithmetic provide lossless compression. Yes, they are/can be used at some stage of multimedia compression, but the major part to reduce the needed data is done by lossy means of representing and quantising the data.
@Akreson Před 24 dny ⁺³
yes title been made for get attention but you also not quite right
you use entropy coding as a back-end for you model, if your model output symbol with prob close to power of two or there so many of them that deviation from non power of two i terms of code len will be very small that you will not benefit from entropy coding that can handle fractions of bits, so it's more about modeling your data
@_soundwave_ Před 24 dny
What if someone discovers these patented algorithms? Can they still be sued ? Are they legally allowed to use it?
@The_Pariah Před 10 dny
Middle out?
Do we have a new contender for TechCrunch Disrupt??
@Rudxain Před 20 dny
I wonder if P-adic numbers could help improve these algorithms
@kebman Před dnem
Yes, I've implemented Huffman coding for an exam. Yay.... But it is interesting tho :D
@flameofthephoenix8395 Před 24 dny
13:13 I'm not a big fan of the probability part of this but I suppose it is likely a good thing overall seeing as it is beneficial to your average image or text. Unless the entire thousand letter long text document is just repeated "As" you'll have no problem.
@Subbestionix Před 21 dnem
Holy.... This is dense *pun intended*
@abd_cheese7353 Před 25 dny ⁺⁵
This channel is boutta blow up
@eugene1317 Před 24 dny ⁺⁴
I wish more people described coding more like math. It is so much easier to understand.
@notu483 Před 25 dny ⁺¹
3:10 Why is there a factor of 1/N if it approaches infinity? Won’t that just lead to 0
@jacobschweiger5897 Před 25 dny ⁺²
1/n approaches 0 as n goes to infinity but the sum of the series of 1/n diverges as n goes to infinity.
@dimihi1173 Před 25 dny ⁺¹
1/N represents the dividing by N involved when calculating the average(mean)
@asdfqwerty14587 Před 24 dny
I don't really know what all the terms/notation are in that equation so I can't really analyze it properly, but probably the sum is an infinite value, so it becomes a 0*infinity indeterminate form.
@vedantneema Před 24 dny
The summation scales up linearly as N increases so it'll just diverge to infinity. If your message is twice as long, on average you'll need twice as many bits to encode it. So, the 1/N is there to give us a more useful number to measure coding efficiency.
@user-fj9hf4bu9f Před 24 dny ⁺²
probably a good idea to either get rid of the background music entirely or to signficantly reduce its volume - as when listening to this at 1.25x or greater it becomes very distracting.
@i-am-linja Před 24 dny
If I had skill, I'd release an application infringing as many obviously frivolous patents as possible, then start a GoFundMe for the legal fees. Set a damn precedent.
@98danielray Před 23 dny
how brave
@Rudxain Před 20 dny
"Arithmetic Coding" sounds like the joke "Pi compression" algorithm:
It assumes Pi is Normal (every digit has the same probability) and Pseudo-Random, in at least one radix, such as base 2. It consists of finding the position (index) (within the expanded Pi) where there's an exact match of the input data, and including its input size (as the decoder won't know how long the sub-string is).
So if ASCII "Hello world" is found at index 0xBF2482801, then that message can be compressed as a ~33bit pointer + 7bit sub-str-size (ignoring the delimiter overhead) instead of the original 88bits!
To decompress, simply compute *all bits of pi* up to the index `0xBF2482801 + 88`, then slice from the pointer.
This algorithm is horribly slow, and may *not even halt within the lifetime of the universe.* But hey! at least you'll get a "guaranteed" high-information- density 2-tuple that represents your precious data!

Další v pořadí

Automatické přehrávání

A 90-year-old Unsolved Telephone Question