going fast is about doing less

leddoo

zhlédnutí 169 912

Přidat do
- Můj playlist
- Přehrát později
Sdílet

Sdílet

Vložit

Velikost videa:

Zobrazit ovladače přehrávání

Automatické přehrávání

Přehrát

čas přidán 16. 05. 2024
optimization isn't always about multi-threading and optimizing hardware utilization. in fact, most performance work is about simply doing less.
tantan's video: • Rust multi-threading c...
philosophies of optimization (highly recommend): • Refterm Lecture Part 1...
simple code high performance: • Simple Code, High Perf...
slides & code: github.com/leddoo/vids/tree/m...
discord: / discord
handmade network: handmade.network/ (there's a discord link on the homepage)
chapters:
00:00-00:33 - intro
00:33-01:53 - doing less
01:53-02:40 - problem description
02:40-06:34 - initial solution
06:34-08:31 - caching
08:31-10:09 - more caching!
10:09-13:45 - using abstractions well
13:45-17:53 - more domain knowledge!
17:53-19:40 - a beautiful ending

Komentáře • 281

@NStripleseven Před 11 měsíci ⁺¹⁴⁴⁴
Funny how the video started with “alright so let’s do the first thing you’d probably think of, just caching some states in a hashmap” and ended with “so now we get rid of the hashmap.”
@leddoo Před 11 měsíci ⁺²⁵⁰
spoilers!!! 😆
but yeah, when i discovered that, i knew i had to make this video :D
@entcraft44 Před 11 měsíci ⁺⁴³
That's actually a good example on why you should always test your optimizations. They might not always actually work, and worse, they can stop working if you tweak something else.
@spfab3429 Před 11 měsíci ⁺⁴⁷
@@entcraft44 I'd argue it's not so much an example of "test your optimizations" and more an example of "ensure your earlier assumptions/optimizations are still valid". He did test the hash map when he implemented it and it did improve the performance, but he changed so much of the code and the algorithm afterwards, that by the end, the code that he originally implemented the hash map for simply doesn't exist anymore. And the new code, that exists now, has therefor a basically untested hash map.
@linebreaker8751 Před 11 měsíci
Actually it may be needed to make one more clay robot than needed for geode robot becouse of the obsidian robot. And at 16:30 he just missed that fact! Actulally he should check if there is enough of obsidian robots first before than checking geode robots! The except of that this optimazition was amazing!
@NStripleseven Před 11 měsíci ⁺³
@@linebreaker8751 I don’t remember the exact code that was displayed but I think it accounts for the maximum possible amount of clay you could need in one minute, which just happens to usually be that of the geode bot.
@matt-dx1jo Před 11 měsíci ⁺⁴⁴²
I can verify this rule works. Whenever I do less myself (i.e., copy other people's code more) the code ends up being much faster.
@notnullnotvoid Před 11 měsíci ⁺²²
Sounds like it's high time to get good.
@batlin Před 11 měsíci ⁺³²
@@notnullnotvoid that's a pretty unpleasant comment.
@korn6657 Před 11 měsíci ⁺¹⁵
@@notnullnotvoid harsh but fair
@hhjkfvbu5846 Před 11 měsíci
Hhg
@gianni50725 Před 10 měsíci ⁺³
@@batlinits a joke just like the original comment
@Avighna Před 11 měsíci ⁺³¹⁹
The fact that you ended with removing the hashmap ... simply amazing!
You started with the slowest possible approach and then worked your way up to the best where you were exploring so less that the hashmap became the overhead! That's some optimization!
Don't be too scared to start with the slowest approach or use hashmaps, you might end up discarding them at the end like this guy. Amazing video!
@Inirit Před 11 měsíci ⁺¹⁵⁵
Incredible video. I've been a professional software engineer for nearly 10 years but I don't often have to deal with optimization problems and as such my low-level optimization skills are relatively poor. Walking through all your steps so clearly and concisely really helps me understand some of the different techniques that can be used and is a great addition to my general skills as a programmer. The final optimization in this particular scenario being to merely remove the naive first pass at optimization (i.e. the hash map) was especially revealing.
@kanuos Před 11 měsíci ⁺²³⁵
The last optimization was just the perfect icing on the cake. I was thinking along the lines of "Oh, now is the time for multi-threading!" but never in a million years I would have guessed THAT. Subscribed!
@AndreasHontzia Před 11 měsíci ⁺⁶⁶
The number of times where I could just slam dunk a programming problem with math is quite high. Knowing that you are in the right complexity class is key. I like your theoretical analysis of the game.
@pseudo_goose Před 11 měsíci ⁺⁶³
This was definitely one of the most fun problems to optimize this season.
I used a similar approach, but it looks pretty different (in particular, non-recursive) because I've had a lot of experience implementing search algorithms for these kinds of problems. They are basically pathfinding problems, where you try to find the path of moves (or sequence of actions) with the least cost (or the largest score).
In my case, I just used BFS, where my score at each state is the "minimum achievable score": geodes_collected + (time_remaining * num_geode_robots) . I also added some simple best-case pruning - if it can't beat the best "minimum achievable score" so far, even if it built geode robots in every future turn, then it's not worth searching that branch.
I also noticed that you can remove "waiting" from the list of actions, which significantly reduces the number of states. My logic goes: Waiting is a redundant state, because you always have a specific robot you want to build (unless you are running out of time), and you always want to build that robot as soon as possible. So my action becomes the question "what is the next robot I want to build?" and the waiting is built into the state transition, it fast-forwards until either time runs out or it can build that robot.
12:40: Another way to make this safe is to use the bytemuck crate, which encapsulates the unsafe transmute using a bunch of checks on the type layout - requiring #[repr(C)], ensuring there isn't any padding that would be uninitialized, etc. I use bytemuck a lot for shader bindings in graphics programming; since they (mostly) use the C memory layout, I can just specify all the shader variables in a struct and then easily convert it to a byte slice to write to a GPU buffer.
@leddoo Před 11 měsíci ⁺¹⁷
thanks for mentioning bytemuck, i'll definitely recommend that next time!
@devtadeo Před 11 měsíci ⁺¹¹⁸
5 months ago CZcams recommended me your video of why python is slower than your language, it's impressive to see someone that really puts effort in videos with those animations and good explanation of the topics, one of the best channels I've ever subscribed ❤
@leddoo Před 11 měsíci ⁺¹⁵
aww, thanks!
if you haven't seen them already, i think you'd also love the videos by @strager_ (who i've stolen the tip/exercise thing from 😁)
@kissinger2867 Před 11 měsíci
@@leddoo CZcams just recommended me this video and I'm already sold. It's so rare to find a computer science channel with true computer scientist who is passionate and knowledgeable about the subject of computation and not a tech influencer.
@devtadeo Před 11 měsíci
@@leddoo yeah i know @strager_ i love his videos, i pretty much follow every programmer that explains with animations/slides like tantan altough i dont understand rust but the videos are enjoyable anyways
@OrtinFargo Před 10 měsíci ⁺¹²
I find myself continuously returning to this video because it is such a well-done video. it doesn't cover naive but commonly thought optimizations like multi-threading, which could lead to poor performance on older devices, nor does it delve into the process of optimizing the hashmap to the point of oblivion! What sets this video apart is that it demonstrates how to approach a problem and generalize it to enhance the algorithm. Many other videos that merely provide the memoization solution and call it a day, however this video goes the extra mile by presenting clear and concise steps for general optimization that could be used in any application. In my opinion, this video is an invaluable resource and explain in an entertaining manner that you have gained yourself a subscriber.
@gingeas Před 11 měsíci ⁺¹⁵
incredible explanation of branch-and-bound. I notice every year there are quite a few questions that don't have a fancy solution, but just require optimizing an okay one; for example the 2022 day 16 problem ("Proboscidea Volcanium") which is another branch-and-bound, or the 2020 day 15 ("Rambunctious Recitation") which just requires a slightly-optimized brute force solution (it's the Van Eck sequence and has no optimal solution)
these sorts of questions i really like for aoc as it helps break the stigma to those i teach programming to in that you don't have to memorize 75 algorithms to be good at competitive programming: just need a propensity to sit down and solve a good puzzle!
looking forward to your later videos
@Rin-qj7zt Před 11 měsíci ⁺⁸
That cache removal took me off guard but it makes so much sense and is a perfect example of unintuitive optimization processes. There will be solutions implemented that were only solutions at the time they were implemented.
@mc4ndr3 Před 11 měsíci ⁺¹¹
Fantastic final point! Always remember to *benchmark*, and then remove, any optimizations that for one reason or another, turn out to be premature optimizations overall.
@felixjohnson3874 Před 6 měsíci
It always annoys be that youtube MD formatting requires spaces, I mean just _*WHY*_‽
@SciDiFuoco13 Před 11 měsíci ⁺³²
Very cool video! I remember bruteforcing this in 5 minutes and getting on the global leaderboard just to then spend 5 hours trying to optimize it down to milliseconds. I used most of the techniques used in the video, although it took me a while to realize especially the second to last one (although I implemented it in a slightly different way, i.e. for every kind of robot wait until you can build it and then do it; this means that exploring a node doesn't coindice with a single turn though). In the end I spent a lot of type trying to find even tigher upper bounds (it's surprising how many more nodes you can skip with that!)
@leddoo Před 11 měsíci ⁺⁴
oh, that sounds really interesting! do you have your solution on GH?
@matroqueta6825 Před 11 měsíci ⁺²³
I had a very similar experience with AoC 2019 day 18, Many Worlds Interpretation (except that problem took me about 4 or 5 days to complete). Likewise went from a recursive solution that would DNF, to using a cache to getting it to finish in a couple of mins, and then improving it gradually by finding ways to identify clearly terrible strategies (like walking over a key but not picking it up in the 2019 case) and removing them from evaluation.
Many of the advanced AoC problems are really great at teaching the "do less = go fast" principle in a very natural way
@thekwoka4707 Před 11 měsíci ⁺⁵
Yeah, in 2022, there is one related to movement between caves to turn valves.
You can actually step through caves, but you actually only need to be concerned with the order of valves to turn, since any steps in caves not getting you directly to the next valve is wasted.
My fastest solution actually just bruteforced every possible valve order end value since it was so quick compared to any kind of simulation.
@spicybaguette7706 Před 11 měsíci ⁺⁶⁰
One low-hanging fruit optimization is to use a faster hash function, like fnv. SipHash is only really necessary in cases where you have untrusted/user input as your key material, because it's HashDOS resistant
@thekwoka4707 Před 11 měsíci ⁺⁵
Or even a really naive stringification of the state.
Just the number values of each section (time, ore, bots? With a hyphen in between lol
But removing the cache entirely is faster
@professortrog7742 Před 11 měsíci ⁺¹¹
@@thekwoka4707 seeing as all those numbers are 8-bit ints i would say that it is quite obvious how to glue them together without falling back on strings.
@leddoo Před 11 měsíci ⁺²¹
good point!
using fnv made the initial version about 30% faster
@kintrix007 Před 11 měsíci ⁺⁵
@@thekwoka4707 I am failing to see how that would bring any benefit? You have eight 8-bit integers in a struct. That means you have 8 bytes. You have it in a struct, which is to say that memory is grouped together. Thus, if you read 8 bytes, you just got all of the data in a non-ambiguous way as a u64, an 8 byte integer.
What you are proposing is to instead allocate up to 8 times 3 bytes for the numbers and 7 bytes for the dashes. You made 8 bytes of data grouped together in one place into 31 bytes of data grouped together in another place. I have severe doubts that would lead to speedup. You just made the hashing function slower by having to hash more data.
@gwentarinokripperinolkjdsf683 Před 11 měsíci ⁺¹
@@leddoo was it enough to justify using the hash in the final version or no?
@Aidiakapi Před 11 měsíci ⁺¹³
Props for this video, this was a really interesting problem, and you explained your steps and reasoning behind them very well. For fun, I went back and reviewed my own code, and what I did for this day, and there's some more interesting optimizations:
- I implemented "no idling" differently, and instead calculate the time until one robot could be produced with the current resources, and skip ahead to that frame. This cuts out all the "waiting states".
- I set up a much tighter upper bound, by having two assumptions, and then calculating the time until we can start producing geodes, and how many it'd produce. The assumptions are:
--- After the first clay robot has been produced, ore is infinite.
--- After the first obsidian robot has been produced, clay is infinite.
- I also compute: lower bound = owned geodes + geode robots * remaining time.
- Instead of recursion, it uses a binary heap, with states ordered by highest upper bound first, and then highest lower bound. Once the upper bound is >= any previously seen lower bound, we know we are done, and the previously seen lower bound is the actual maximum.
So we visit a state, to test paths where we produce any of the new robots, and then compute new bounds on the geode count for those states. For the example in the video, on part one (24 minutes), it visits 343, and computes bounds for 1246 states, which runs in 29µs. For part two (32 minutes), it visits 5373, and computes bounds for 21259 states, which runs in 865µs.
@leddoo Před 11 měsíci ⁺⁶
those numbers are sick :D
gonna have to give your approach a try at some point, sounds super fun!
@duncathan_salt Před 11 měsíci ⁺⁴
are those assumptions about infinite resources sufficiently robust? would it not be possible to construct a blueprint which breaks those assumptions?
my initial thought would be to treat resources as infinite as soon as you have as many robots for that resource as the highest cost of that resource. is that too conservative?
@Aidiakapi Před 11 měsíci ⁺²
@@duncathan_salt They are an overestimation, but the upper bound is never used as an answer, it is only used to cull the search space, because if using these overly optimistic assumptions, you couldn't even beat a previously seen result, it could never be a solution worth visiting.
The real answer could *never* produce more than ore/clay than infinite, so the real amount of geodes produced could *never* be higher than this upper bound.
The key is the interaction with the lower bound, which is just "how much would I have if I idled the rest of the time". Once a state we've visited, where it just idles the remainder of the time, can produce more or the same amount of geodes, than our optimistic upper bound with infinite resources, we know that the solution with this upper bound could never be better than any previously seen.
Because we pick off the item with the highest upper bound, any other states we might have to visit will also never be able to beat a previously seen state.
Hope that explains it a bit better.
@duncathan_salt Před 11 měsíci ⁺¹
@@Aidiakapi ah! I understand now. that's very clever! tyvm for the detailed explanation, it's much appreciated.
@lewismassie Před 11 měsíci ⁺⁶
Realising what you had to do for that last step is the true wizardry of programming. I certainly wouldn't have figured that out
@sanjsahayam5271 Před 11 měsíci ⁺¹
Love the step-by-step approach to optimisations used here. Great job! Love to see more of these types of videos. 🙌🏾
@teofilciobanu Před 11 měsíci ⁺³
I just discovered your channel and this is such a great video. Very fun to watch and very inspiring. Great work, I'm looking forward for your future videos.
@taukakao Před 11 měsíci ⁺²⁵
I think what's important to see here is that every optimization has it's cost, either code readability, complexity, or overhead (as with the HashMap). Balancing them is the most important task in my opinion.
I think in many cases having easier to understand code can be more important than the speed increase. (And sometimes a seemingly better solution will result in a worse execution time, especially hard to notice in garbage collected languages)
@Roy-K Před 11 měsíci ⁺¹
Absolutely incredible, your explanation as you went really helped to guide me through the thought process you used - hope to take advantage of this soon!
@CottidaeSEA Před 11 měsíci ⁺⁸
I wish I got to do more stuff like this at work. This is the kind of stuff that really excites me. There's something in me that just goes kind of crazy when I make something blazingly fast.
I made a function go from taking 3 minutes to just below 2 seconds before, I could've done even more, but I couldn't really justify it. It was also fast enough by a long shot even if it had taken 30 seconds.
@jimmyhirr5773 Před 11 měsíci
Not sure what you do at work, but I've heard that these industries put a high priority on performance:
Video games (particularly ones with high graphical quality)
High frequency trading firms
@CottidaeSEA Před 11 měsíci ⁺¹
@@jimmyhirr5773 Yeah, they do. I primarily work with e-shops. Performance is a big part of what we do to stay competitive, but it's very much just until it's "good enough" and nothing more.
@zbynekjurica Před 11 měsíci ⁺¹
This is a really great video! So much information, learnt a lot. Thank you! Would love to see more aoc optimization videos in the future!
@CWunderA Před 10 měsíci ⁺¹
This was fantastic! What a great example of how to think about and tackle a complex problem systematically. How you walked through each step, the intuition behind each change, and built the solution up from parts was perfect. I would love to see more videos like this.
Před 11 měsíci ⁺²
Best software video I've seen lately with a great ending, subscribed.
@iliedemian8461 Před 11 měsíci ⁺²
PLEASE UPLOAD MORE OF THESE VIDS
this is the best someone has explained a DP problem ever i love this video❤❤
@cyanoswag Před 11 měsíci ⁺⁴
Your video style and clarity in the explanation is awesome! Thank you.
@leddoo Před 11 měsíci
thank You!
@skyslycer Před 11 měsíci ⁺¹⁰
This is so cool. I didn't understand anything after a certain point but still interesting.
@sebaztian18 Před 11 měsíci ⁺¹
Incredible explanations! Keep it up!
@filipenicoli_ Před 9 měsíci
that was just beautiful
@bigjukebox3370 Před 11 měsíci ⁺¹
this is brilliant. Well done and very interesting example
@raylopez99 Před 11 měsíci ⁺¹¹
Nice, it's like checking the min-max score on a chess tree and then truncating the branches that will not give a higher score. How chess engines work. My solution was to simply randomly try different combinations and see what works, then store the learned preferred combination in the final production code. I wonder if this is "almost as good" as the binary tree approach here; of course it will by definition be suboptimal but it's easier to code for sure.
@thekwoka4707 Před 11 měsíci ⁺¹
The chances you would get the right thing that way are extremely low with the context of the problems.
You have to check a lot of different blue prints that have different robot resource costs to find the best.
@lucasa8710 Před 11 měsíci ⁺¹
this is a masterpiece video. incredible
@yash1152 Před 11 měsíci ⁺¹
6:40 yes, i was wondering that too
and it's awesome. the terminology used - hits/misses matches the terms used in paging and page replacement algorithms pretty well :)
@Razvvannn Před 10 měsíci
Just stumbled upon your channel. Great video! Explained gorgeous the ideas! Instant subscribe
@Jankoekepannekoek Před 11 měsíci ⁺²
Never expected that last optimisation! Subbed!
@leddoo Před 11 měsíci ⁺¹
i didn't see it coming either, that's why i made this vid :D
@glevco Před 11 měsíci ⁺²
Great video, keep it up! Would love to see more like this, specially in Rust
@RoryDavidWatts Před 11 měsíci ⁺²
This is an excellent video, thank you for putting the effort into making it.
@leddoo Před 11 měsíci
glad you enjoyed it! :)
@sinom Před 11 měsíci ⁺³
The camera didn't die. It was just doing less
@ItsLaxe Před 11 měsíci ⁺¹
incredible video, thanks a lot
@jukkajylanki4769 Před 10 měsíci ⁺²
Great video! Here is one further state space optimization: remove 'idling' as an option. That is, instead of each child node being the parent node + 1 turn passes, make each child node be the result of a "what do I choose to build next?" choice. For each choice, calculate immediately how many turns it will take to build the thing (how many turns are needed to wait for resources + do the build) before recursing to the child. That is a generalization that should obviate the awkward can_build booleans and removes all "I idled" child nodes from the graph.
@JoQeZzZ Před 3 měsíci ⁺¹
Yep, this + caching is how I solved the problem when I did it.
It's not a matter of idling, it's a matter of deciding what to build next and waiting the required number of turns to do so.
There are definitely further easy optimisations, like not building a producing robot if you're already producing enough as was stated in the video, but simply removing all idling steps was enough for me to get it into acceptable (making a cup of coffee abd coming back) running range for me.
@TaiwoMegbope Před 10 měsíci
Thank you very much! This is profoundly useful.
@henkolsonpietersen2242 Před 11 měsíci
I loved this, great job!
@TheRealBrenny Před 10 měsíci ⁺¹
I don't usually comment, but that ending was the most satisfying ending I've seen in a long time.
@protheu5 Před 10 měsíci ⁺¹
It's like a magician's performance, that twist at the end. Brilliant!
@romanstingler435 Před 11 měsíci ⁺¹
Definitively worth a follow
@kjaamor2057 Před 11 měsíci ⁺¹³
I'm a novice programmer who has never used Rust but I watched this from start to finish. Superb piece of presenting, aside from anything else.
I'm probably several years off implementing something like this, but as a novice its a nice way of being introduced to concepts.
@gawwad4073 Před 10 měsíci
That ending was just beautiful
@theghost9362 Před 10 měsíci
Simply amazing
@girishpoojari2910 Před 11 měsíci ⁺¹
Great video, I didn't understand everything but still some of it made sense. Thank you,
@meowoasdgjoiagjoi Před 10 měsíci
wow, I wish I could like a video twice, that ending was amazing
@dudusami89 Před 11 měsíci ⁺¹
amazing content, thank you for your time!
@leddoo Před 11 měsíci
thanks for taking the time to leave such a nice comment! 😁
@Teflora Před 11 měsíci ⁺¹
That was a satisfying ending! Tbh did not expect that, but it makes a lot of sense!
@codesymphony Před 11 měsíci
why didn't he show code for last step?
@jeremysollars5922 Před 11 měsíci
Continue this style of video and you got yourself a long term viewer
@prestonharrison2140 Před 11 měsíci ⁺⁵
This is an insanely cool video.
@awfultrash888 Před měsícem
Pretty good video man!
@chrissalgaj4111 Před 11 měsíci ⁺¹
Absolutely awesome vid!
@leddoo Před 11 měsíci
thanks!
@tunafllsh Před 11 měsíci ⁺¹
This is amazing! Less is more until more is less.
@Noitcereon Před 10 měsíci ⁺⁴
Tips in the video:
#1 (1:40): Try doing less
#2 (4:13): Start with an algorithm that is definitely correct
#3 (6:34): Use caches to avoid expensive work multiple times
#4 (8:23): dynamic programming is just recursion + caching
#5 (13:30): Understand how abstractions work, so you can use them effectively
@CWunderA Před 10 měsíci
#6 Benchmark/test each optimization and remove earlier optimizations if they are no longer beneficial
@colox97 Před 10 měsíci
the last 2 software project i started are paused cuz i couldn't figure out a way to solve the problems i was having.. it's so nice to see someone reasoning trough a problem and solving it efficiently
@zachleavitt87 Před 10 měsíci
THANK YOUUU
@feeelix Před 11 měsíci
this is brilliant
@denversupermarket7484 Před 11 měsíci ⁺¹
That was beautiful
@HippieInHeart Před 10 měsíci ⁺¹
I'm not even gonna pretend that I understood most of what you said, since I'm pretty much still at the very beginning of learning programming, but it was pretty interesting regardless. Thanks for making this video. Maybe it will help me one day in the future.
@thoperSought Před 11 měsíci
that was a lot of fun!
I don't use Rust, but I've been curious about it for a while, and I really enjoyed this
@ataraxianAscendant Před 11 měsíci ⁺⁵
babe wake up new leddoo video
@SWinxyTheCat Před 11 měsíci ⁺⁴
The best way is to just return the correct answer immediately
@bonenintomatensaus Před 11 měsíci ⁺²
Found the mathematician
@DB-Barrelmaker Před 11 měsíci ⁺³
I think I now understand the point of memorization. Man what a round about way of realizing something
@brytonsalisbury6223 Před 11 měsíci
Memoization* :)
@DB-Barrelmaker Před 11 měsíci
@@brytonsalisbury6223 autocorrect
@jonarmani8654 Před 11 měsíci ⁺⁵
so much was deleted by the end, even your camera
doing more with less
@isaiahhonor991 Před 11 měsíci
Wonderful video!
@eboatwright_ Před 10 měsíci ⁺¹
whoa. subscribed 👍
@oysteinmb1 Před 11 měsíci
This is a great video! Enjoy your other content too, but would love to get some more "tutorial"/"think like a programmer" videos!
@jm-alan Před 11 měsíci ⁺¹
This is exactly the kind of progressive optimization I love.
You have me reconsidering my Rust solutions for Project Euler, wondering if there are more unseen optimizations to be had...
@orbital1337 Před 11 měsíci
As someone who has worked quite a bit on optimization problems like this, I also enjoyed this particular problem of AOC and your video about it! Branch and bound is a classic technique and this is a nice and simple example of how powerful it can be. Just for fun I benchmarked my own solution that I wrote a couple months ago against yours. Averaging over 10 runs I get:
Part 1 | Part 2
Mine | 1.3 ms | 47 ms
Yours | 6.3 ms | 107 ms
Our approaches are a bit different. You use DFS whereas I use BFS with a lower bound heuristic (build the most expensive robot possible). These are both okay (usually best-first search ala A* is better but I was too lazy for that). You use some domain knowledge to prune like only building geode robots when possible but I didn't bother with any pruning outside of comparing the lower and upper bounds. On the other hand, I put in more work into my upper bound which often gives the biggest bang per buck for branch and bound algorithms.
Usually a good strategy for an upper bound is to relax the problem to something simpler that you can solve exactly. The relaxation I came up with is the following. Given a starting state of resources, create a copy of the resources for each type of robot - lets call this their personal inventory. Building a robot takes resources from their personal inventory. But when a robot *produces* a resource, that resource is added to *each* robot type's personal inventory. Each minute, you're allowed to build up to one robot of *each* type at once.
This is clearly a relaxation because each robot type's personal inventory is an upper bound on the resources that you would have if they all shared resources. And since you can always decide to just build one robot, a solution to the original problem is a feasible solution to the relaxation. On the other hand, it is easy to construct an optimal solution to this relaxation: simply build robots whenever possible. So that is the upper bound that I used for branch and bound.
Less is more, but sometimes more is less! In this case, doing a lot more work per state reduces the number of states to ~2300 in part 1 and ~81600 in part 2.
Code: gist.github.com/t-troebst/466cb7017a295ab4d4a571238876f0a8
@graysoncroom Před 11 měsíci
great video. thanks.
@senzmaki4890 Před 11 měsíci ⁺¹
there's no reason for a programming video to go this hard 🐐🐐🐐
@thegreatdissector Před 11 měsíci ⁺¹
I watched this video a few days ago and decided to hold off on watching the rest so I could have a go at solving the problem on my own time. I spent too much time trying to do so and eventually caved to see what "the solution" was.
Turns out, there is no "the solution"! That was exactly my problem. I was trying to find an elegant algo that would get the job done. But this video revealed this is nothing more than simply beginning with brute force, followed by a series of clever prunes to the space. Moral of the story - just start with the brute force!
@abanoubha Před 11 měsíci ⁺¹
great video 🥇
well done 👍
@1vader Před 11 měsíci ⁺⁵
I'm pretty sure the unsafe transmute is undefined behavior if you don't specify repr(C). As you said, the field layout is undefined, which also means there can be padding in between and reading padding is undefined behavior.
@leddoo Před 11 měsíci ⁺³
yeah, i should have probably mentioned padding.
but in this case, there's no UB, cause the struct consists of 8x u8 values, and those are transmuted to a u64. a u64 is of course 8 bytes large, and neither u64s nor u8s have any invalid bit patterns.
if the compiler did insert padding (which would be sub-optimal here), the size of the struct wouldn't be 8 bytes, so the transmute would raise an error.
@KaneYork Před 11 měsíci
The fact that it's "either a compile error or not UB" is actually mind blowing to me, someone who is used to bad tools that let you do bad things
@lesterdelacruz5088 Před 11 měsíci ⁺³
My approach to optimization is always dfs and memoize and hope for the best haha. Been programming for 10+ years haha and clearly I haven’t progressed since year one.
@keokawasaki7833 Před 11 měsíci
it is so satisfying to see those numbers drop
@atlasxatlas Před 9 měsíci ⁺¹
i was struggling with aoc22 d19 for months now.
ofc i thought of brute forcing it but instantly disregarded this method cause i was determined to fine some more logic way to solve this.
i tunnlevisioned this ideal solution and at the end couldn't find it.
finally i watched this video, which i had saved for a month after i watched it a little and realized it's about the same problem which i was stuck on.
this is incredible stuff. im so glad i stopped being stubborn. i let go of my self assigned goal of 'if i do this it will be by my own force'.
the _smartest brute force_ is an incredible method and i am so glad to have learned it.
thank you
@Sammi84 Před 11 měsíci ⁺¹
That ending was wild.
@kylebowles9820 Před 11 měsíci
Nice content! Minecraft textures are a bonus :)
@JohnB5290 Před 11 měsíci
That's really cool.
@jeffreysun7983 Před 10 měsíci
A good lesson to take from this is to solve as much of a problem as possible before letting the machine take over. I can imagine someone working on this one long enough that they can practically solve it with pen and paper, then the final algorithm can be in the μs range
@alssoldat7185 Před 11 měsíci ⁺¹
Nice work, please consider making more like this, I think it is a category of coding information that is under represented on CZcams.
If anyone has suggestions for other sources of such information, please do share :D
@leddoo Před 11 měsíci ⁺¹
i put some more links into the description. you may also enjoy strager_'s work!
@alssoldat7185 Před 11 měsíci ⁺¹
@@leddoo Hehe yeah, I already discovered Casey and tantan, they are both great! I have also stumbled across strager, but only a video at a time, I'll give the channel a look :D Thanks!
@sabihass5361 Před 10 měsíci
awesome!
@chezlonian Před 11 měsíci ⁺²
This man talks like the engineer I want to be. He has an understanding of low-level processes that make me embarrassed to claim to be an engineer. How do I learn this kind of stuff? Is there a community?
@leddoo Před 11 měsíci
there is, in fact!
i have a discord, link in the description.
but the real deal is the handmade network! (handmade.network/ discord.gg/hmn )
other than that, i always followed my curiosity, when i wondered "how does this thing actually work underneath?"
@chezlonian Před 10 měsíci
@@leddoo Looks like the discord link in the description isn't working :(
@dariogifc0 Před 11 měsíci ⁺²
You forgot to mention that 12.7 million is approximately the population of Bavaria. Unforgettable omission.
@Turalcar Před 10 měsíci ⁺¹
I would usually put "memory layout" under "doing less" as in doing less memory fetching.
@silviogames Před 11 měsíci ⁺¹
How cool is that please,
Wie cool ist das bitte
@curtmack Před 10 měsíci
One of the big reasons I love Lisp languages is that I can write a macro that memoizes functions for me.
@minuskelvin3619 Před 11 měsíci
Branch-and-bound and symmetry breaking are overpowered
@nanahiiragi723 Před 11 měsíci ⁺¹¹
This is excellent. I could watch 1000 of these. I was so bummed when I saw there weren't more. No pressure though (^・ω・^ )
@leddoo Před 11 měsíci ⁺³
thanks! in the meantime, you could check out the links in the description 😁
@reprC Před 11 měsíci ⁺²
At 12:06, technically it is safe to transmute this into a u64, but Rust doesn’t guarantee preserving field order for structs, only drop order. You’d ideally want to annotate Pack with #[repr(C)] to prevent rustc from reordering fields. Also, it may be a good idea to explicitly align the struct too ( with #[repr(C, align(8))] ). Struct alignment is calculated as the max alignment of each of its fields. Since Pack is a struct of u8’s, it has an alignment of 1, but u64 has an alignment of 8. mem::transmute()’s docs says the compiler will take care of ensuring compatible alignment, but I am not 100% sure if adjusting the unaligned value adds any overhead so explicitly aligning couldn’t hurt.
EDIT: unpaused and saw you addressed the field ordering and repr. Hopefully this comment is still helpful for anyone interested
@leddoo Před 11 měsíci ⁺¹
hmm yeah, repr(align) could be a good idea 🤔
@gfasterOS Před 11 měsíci ⁺¹
@@leddoo I think you need more than just alignment, it's conceivable that a profile-guided optimization would introduce padding within the struct. Without repr(C), it is reasonable to assume that the transmute is undefined behavior.
Irrational aversion to unsafe isn't good, but there are a ton of potential footguns and it really should be treated as a "going faster" optimization reserved for when you know the safe variant is actually *much* slower.
@jole0 Před 11 měsíci
Why does it matter if the fields are reordered?
@yjlom Před 11 měsíci
@@jole0 it doesn't, since they're never communicated between different versions (or even instances) of the program
@BarsDemirdelen Před 11 měsíci ⁺¹
One way I optimized this problem was building a better state tree. Instead of having actions like if can build robot, build robot, or wait, I changed my actions to be: Either build a x robot at t2 or build a y robot at t4. There is no wait action at all. I believe your no idling pruning is in the end equivalent to this, but I am mentioning this as just a different way of thinking.
@leddoo Před 11 měsíci ⁺¹
> a different way of thinking.
yup, a few others have done it that way too, and i love it :D
@laubannenberg5446 Před 10 měsíci
This is really cool, it's a very nice demonstration of how using a couple of key techniques in a spiral of improvements can whittle down a problem. But after seeing this, I wonder if you can't draw out this approach to optimizing even further?
The rules of this game say you can build only one robot per turn. So once you hit time X the point where you can make one geode robot every turn, everything else after that should be expressible as a simple function of Y time -> Z geodes.
And it only takes a finite number of steps to get to that point in the fastest possible way. Essentially, each moment in time until the takeoff point is a special case with a knowable geode value.
So you could also just hard-code all those special cases for time < X, and the general function for time >= X.
That's a bit banal solution, but then, the problem itself has no real randomness or surprises that require dynamic adaptation; the only variable is how many time steps you get. And it's in keeping with the general way of optimization here: aggressively use domain knowledge to do less. It's just kinda the big brother to optimizing a fixed-length loop by unrolling it.
@syncrossus Před 10 měsíci
My first thought was to explore the tree using A*. I'd have to actually work through it more thoroughly to be sure, but think it would be equivalent to all of your optimizations except "u8" and "enough bots".
@ExCyberino Před 11 měsíci ⁺¹
Pretty good, you'll probably get big soon
@leddoo Před 11 měsíci ⁺²
thanks! i've been working out for ~2 months now 🌚
@ExCyberino Před 11 měsíci
@@leddoo I actually meant your channel will get big.
But I'll believe your body is gonna get big as well. Keep at it
@Gunth0r Před 11 měsíci
@@leddoo oh, so you do gardening too?
@mastershooter64 Před 11 měsíci
@@leddoo Keep going for 2 more years you'll definitely be big, building muscle is no joke man you gotta get good sleep every night, eat well and lots of protein and hit the gym regularly and of course don't overtrain. It's hard work but if you keep at it for 2 or 3 years it's really worth it. sleep is very important.
@leddoo Před 11 měsíci
@@mastershooter64 hehe, thanks for the tips!
eating enough is definitely the hardest part for me. i'm the kind to get lost in a coding problem & forget to eat the entire day. (though dw that doesn't happen too often :D)
and while i like to track how i spend my time, tracking calories has never really worked for me. though i recently got a new, more accurate scale, so now i at least have some objective feedback loop going on.

Další v pořadí

Automatické přehrávání