Komentáře •

  • @loipesmas
    @loipesmas Před měsícem +20

    Very interesting insight! I wouldn't consider that GC can make some things simpler, although now it totally makes sense. Thanks for sharing!

    • @stercorarius
      @stercorarius Před měsícem +7

      The entire point of gc is to make things simpler

    • @loipesmas
      @loipesmas Před měsícem +2

      @@stercorarius right, I meant simpler for the compiler/CPU, as in it has to do less work (at given time, because obviously it will have to do the work later)

  • @jonas1ara
    @jonas1ara Před měsícem +15

    I have been preparing a series of posts on my blog about memory and this helps me a lot to understand and complement it, it is great.

    • @maxrinehart4177
      @maxrinehart4177 Před měsícem +2

      would help a lot if you tell us what your blog name or url

    • @anasssoulimani9288
      @anasssoulimani9288 Před měsícem +2

      Hey, can you share your blog?

    • @contextfree
      @contextfree Před měsícem +2

      I chatted a bit with Jonas and confirmed that this is his blog. It's still brand new, but he has some additional posts planned, such as he mentioned above. jonas1ara.github.io/

  • @Nerthym
    @Nerthym Před měsícem +17

    Nice and to the point comparison. Usually though you are not supposed to pass shared_ptr unless you actually want to share ownership. In adequate cpp codebases about 95% of ptrs are unique (or bound to an arena) and parameters passed by ptr or ref most of the time.
    Plus shared_ptr's ref counter is atomic, which adds overhead. While ago I had to optimize iphone 4 game where like 20% of frametime was spent copying pointers arround 'cause people who developed it were used to garbage collected languages. If used naively RC can absolutely affect performance negatively.

    • @von_nobody
      @von_nobody Před měsícem

      Yes, in some cases if we have predefined lifetime, `ptr*` will be best solution (probably in some wrapper).

  • @sanderbos4243
    @sanderbos4243 Před měsícem +1

    Very cool

  • @japedr
    @japedr Před měsícem +16

    So I take that your argument is that multi-owner semantics is easier to describe in C# than C++ (and perhaps Rust). This is of course true, given that there is no concept of ownership in GC languages. But I would argue that multi-owner is something that should be avoided if possible, as there are many pitfalls; and in particular shallow copies are more surprising and difficult to understand than deep copies (e.g. why is the int is different but the list isn't?).
    I don't buy though the performance argument though: shared ptr needs to do a lot more stuff so that 1. it is thread-safe to some extent, and 2. it does need to keep track of the weak ptr bookkeeping. The performance hit is translated to the GC with its own set of tradeoffs (you probably implied this), so there's no free lunch.
    That's not to dismiss the amazing work of the C# compiler devs, of course.

    • @user-uf4rx5ih3v
      @user-uf4rx5ih3v Před měsícem +1

      Data is ownership is a thing in GC languages, including C#, they're just not exposed to the programmer, but the GC algorithm needs to know who owns the data and for how long so that it can `safely` drop objects. This process can be quite fast and efficient, the problem is that reference counting and data dropping is not done eagerly for a lot of reasons, so you end up with these performance dips that can be unpredictable.

    • @japedr
      @japedr Před měsícem

      @@user-uf4rx5ih3v Oh sure, you're right. I should have specified *user-facing* existence of it. Of course that will matter in implementation details, for performance, etc.

    • @insertpenguin8443
      @insertpenguin8443 Před měsícem +1

      ​@@japedrEven as a user you still have to think about ownership sometimes. For example when you have to ask yourself, should this be a regular HashMap or a WeakHashMap? And if you guess wrong your objects either spontaneously disappear or start hogging up your memory.

    • @xelaxander
      @xelaxander Před měsícem +1

      I disagree. Shared ownership and mutability should be used very sparingly. Shared ownership itself is absolutely fine and often a nice tool to keep track of context.

    • @Hazanko83
      @Hazanko83 Před měsícem +3

      The comparison shouldn't come close though... Shared pointer is literally just an atomic size_t variable that gets incremented and decremented, and deletes the pointer to the held object if the reference hits 0; that's incredible cheap to the point of being almost zero-cost. Garbage collection has MUCH more work to do with walking through the list of items and checking to see if an object is still "reachable" from the programs root.
      I've personally built my own custom "shared_ptr" type that offers additional functionality from the standard library, and it's much much easier to implement than even a custom container type would be, let alone writing anything resembling competent garbage collection.

  • @overloader7900
    @overloader7900 Před měsícem +1

    Fun fact: writing is cheaper than reading. Also modifying has nearly same cost as writing; higher if anything because you need to read a cache line first to modify it. However, the problem comes if the new value should be seen by 2+ objects simultaneously.

  • @spicybaguette7706
    @spicybaguette7706 Před měsícem +2

    I really like the copy/clone distinction in Rust. Is there any particular language that this idea came from or is it new?

    • @contextfree
      @contextfree Před měsícem +2

      One place I've seen `clone` before Rust is Java, where they say that "by convention" the cloned object "should be independent" of the original object, meaning deep copy of anything mutable, and I suspect the Java method name may have influenced Rust's naming, but I don't know what predates Java in that, nor have I dug up conversations from Rust development. And of course, Java still has no value/copy types outside of primitives quite yet, although possibly coming soonish.

  • @krystofjakubek9376
    @krystofjakubek9376 Před měsícem +4

    **I was wrong here its really is displaying the actual x86 assembly, appologies**
    In the C# case you are not looking at x86 assembly. Instead its .NET interpreted code. They just happen to look very similar in this case.

    • @contextfree
      @contextfree Před měsícem

      Are you sure? I also don't really know CIL, but .NET has AOT compilation sometimes, and those registers and instructions look like x86-64 and don't look like this example of cil here: en.wikipedia.org/wiki/Common_Intermediate_Language#Example

    • @contextfree
      @contextfree Před měsícem +2

      Or rather .NET supports AOT compilation these days, and I suspect that's what they've configured on Compiler Explorer. See also: github.com/compiler-explorer/compiler-explorer/pull/3168

    • @krystofjakubek9376
      @krystofjakubek9376 Před měsícem +1

      @@contextfree oh nice didnt know that!

    • @JorgetePanete
      @JorgetePanete Před měsícem

      it's*

    • @viktorstojanovic9007
      @viktorstojanovic9007 Před měsícem +1

      ​@@contextfreeThose are assembly instructions, but you don't need aot for assembly instructions

  • @MrNathanShow
    @MrNathanShow Před měsícem +2

    Nice to see the Godot ;)

  • @logangrosz6529
    @logangrosz6529 Před měsícem +1

    Very nice video!

  • @init1508
    @init1508 Před měsícem +2

    nice nice nice

  • @kamertonaudiophileplayer847
    @kamertonaudiophileplayer847 Před měsícem +1

    It looks like VScode wants to update.

  • @sporefergieboy10
    @sporefergieboy10 Před měsícem +3

    luv me float * ‘ate reference counting and GC simple as

  • @cole-nyc
    @cole-nyc Před měsícem +2

    This is not a perfect comparison. In your C++ example, there's no need for the Seq struct to be copied when it can be passed as a reference. The refcount increment op of shared_ptr is expensive because it is atomic, therefore passing it to every function that operates on the underlying object will slow things down considerably. Generally, the use of shared_ptr is relatively rare, but if you do use it, it shouldn't be passed by value to any entity that does not require ownership of the memory.

    • @contextfree
      @contextfree Před měsícem +1

      Thanks for the comment! Yeah, I purposely passed by reference to the main printing function in both C++ and Rust to have examples of pass by reference here in addition to the examples of passing by copy/clone.

    • @contextfree
      @contextfree Před měsícem +1

      I just didn't mention it in verbal discussion, and maybe I should have.

  • @deltapi8859
    @deltapi8859 Před měsícem +4

    I used to say for a while GC languages solved some memory and reference issues decades ago that rust and c++ try to find sophisticated solutions for. However the concept of GC is elegant compared to what you do in these other languages. It just doesn't have optimal control and runtime performance. I think though that if people were serious about performant GC code then they would find ways to make it 90% the speed with some exceptions.

    • @contextfree
      @contextfree Před měsícem +5

      Trade-offs exist for most things.

    • @vlc-cosplayer
      @vlc-cosplayer Před měsícem +1

      Semi-serious question: would it be possible to provide hardware acceleration for GC?
      ("Semi-serious" because I don't know exactly how GC works, and if it would make sense at all to do this. But I'm guessing that if it hasn't been done yet, especially given how common GC languages are, then it's probably a silly idea. Or maybe it's not a silly idea, and we can get by fine by just throwing more powerful generic hardware at the problem... 🤔)

    • @contextfree
      @contextfree Před měsícem

      @@vlc-cosplayer No immediate answer from me. I'd have to study that for a while.

    • @deltapi8859
      @deltapi8859 Před měsícem +2

      @@vlc-cosplayer good question, I think the overhead wouldn't be worth it, because the best solution is to keep as much memory as possible for as long as you don't need it for something else. Really memory management is only necessary for Operating Systems with concurrent applications that need your memory. If you manage memory for your own application you can just keep the memory address to a large piece of memory and return the last the end of the piece of memory and assign on top of that. That is how arenas do it. So all this memory management issue is largely a multiprocess issue.

    • @andrewgr144
      @andrewgr144 Před měsícem +2

      Speed isn't necessarily the issue with GC languages, and in fact I think it might not even be the issue the majority of the time. The other issues are predictability and constancy of execution time. For example, say you're writing the back end for a massively multiplayer game of some sort. It could be the case that the mean, median, and modal performance averages are all plenty fast enough, yet your decision to use GC dooms your game to poor reviews and player retention, because some small percentage of the time, the GC runs and sees that something can be deleted, but by deleting that thing, it now sees that some other stuff can be deleted, etc., in a cascading effect that leads to a noticeable lag in the game. The conditions for creating this lag may be incredibly rare as a percentage of the number of times the GC runs, but the GC may run often enough that the lag is still occurring every five or ten minutes of real time. (I think maybe a high-volume day-trading stock system might have made a more compelling illustration, but my mind immediately goes to games, and I'm sure anyone who reads this comment can see how performance lags due to GC would be disastrous in that scenario-- if the GC is fast enough for 999/1000 trades to get executed flawlessly, but 1/1000 times there's a half-second delay, allowing the many competing auto-trading bots to make their trades first, those delayed trades might completely wipe out the profit margin made by the other 999.)

  • @metaltyphoon
    @metaltyphoon Před měsícem +1

    Why dotnet 7? Should try this with dotnet 8 AOT.

    • @contextfree
      @contextfree Před měsícem +2

      It's what's currently handy on Compiler Explorer. They also have an unclear "main" option. I suspect 8.0 is coming there sometime.

    • @metaltyphoon
      @metaltyphoon Před měsícem +1

      @@contextfree aaa nice to know

    • @Liphi
      @Liphi Před měsícem +1

      dotnet 7 also supports AOT

    • @metaltyphoon
      @metaltyphoon Před měsícem

      @@Liphi yes but codegen for 8 is much better as libraries get more annotated for AOT

    • @Liphi
      @Liphi Před měsícem

      @@metaltyphoon I mean, that's just a basic struct manipulation. Doubt that .net 8 can change something here

  • @JasonL220
    @JasonL220 Před měsícem +1

    why are you invoking the toolchain as part of a benchmark, just run the outputted binary

    • @contextfree
      @contextfree Před měsícem +1

      When I remember, I always measure cli time in my videos. My videos are edited, but I try not to edit command execution, and keeping the time there is part of that. I'm also interested in compiler speed. I'm not benchmarking here. But a viewer did do some benchmarking, even if this still is just a micro benchmark. I've linked to that in the video description.

  • @sirhenrystalwart8303
    @sirhenrystalwart8303 Před měsícem +1

    Turning off optimizations makes zero sense. It also seems odd to say the c++ was less efficient when your own timing shows it's 3x faster.

    • @contextfree
      @contextfree Před měsícem +3

      The timing includes compile time, because I care about that also. One viewer did some actual timing analysis with fuller optimizations, though, using Rust, C++, and D (in multiple configurations). See the link above in the description.

    • @contextfree
      @contextfree Před měsícem

      Worth noting that even the benchmarking linked in the description is still limited to this simple program, which is a micro benchmark. Hopefully the measurements manage to exclude IO time as well, but I didn't ask that.

  • @stevenhe3462
    @stevenhe3462 Před měsícem +2

    Makes little sense to me… All examples are single-threaded.

    • @contextfree
      @contextfree Před měsícem +1

      What adjustments would you like to make for a multithreaded look at the topic?

    • @stevenhe3462
      @stevenhe3462 Před měsícem +3

      @@contextfree What happens when you mutate your struct from different threads? In Cpp it works (except it rarely causes data races?), in Rust it doesn't (Rc is not Send, have to use Arc instead), in C# I don't know… Also, C# tracing GC stops all (most) threads when it runs but the other two guys don't?

  • @OCTAGRAM
    @OCTAGRAM Před měsícem

    Not structs, but records

    • @contextfree
      @contextfree Před měsícem

      Mind explaining more?

    • @OCTAGRAM
      @OCTAGRAM Před měsícem

      @@contextfree We don't put structs into database tables. We put records into database tables. They are records

    • @contextfree
      @contextfree Před měsícem

      So you would prefer if these languages had used the term "record" instead of "struct", if I understand correctly. Interestingly, C# has a language feature called "record" these days, and it means something different than "struct" there. More to what seems to be your main point, long ago I learned C before learning Pascal, so I learned the term "struct" first. Although I think you're right that "record" has a longer history in such usage.

    • @OCTAGRAM
      @OCTAGRAM Před měsícem +1

      @@contextfree Yes. Ada, Modula, Oberon, Oxygene, Icon, they all have records. But whatever Haskell you take from outside the family, it also has records

  • @RoughB8
    @RoughB8 Před měsícem +1

    Interesting video but I am missing the point of the video. Your comparison of C#'s CLR instructions with C++'s x86 assembly is just unfair. It seems you dislike the extra syntax that rust adds on you when you want to copy an object. It would have also been interesting to compare a 'real' application (in the 3 languages) with full optimization instead of a minimal program that you manually need to prevent inlining.

    • @contextfree
      @contextfree Před měsícem +3

      Yeah, I rarely express opinions directly in my videos (although I actually like Rust clone vs C++ implicit copy constructors, but don't tell anyone). I usually just explore stuff. And RC optimizations are definitely an interesting topic that I avoided discussing here.

    • @contextfree
      @contextfree Před měsícem

      One of many interesting video topics could be exploring heavily compiler-optimized C++ and/or Rust and/or Swift RC in simple cases vs say Perceus RC such as in Roc or Koka. Haven't explored that yet, though.

    • @RoughB8
      @RoughB8 Před měsícem +2

      @@contextfree After a few years of C++ i love to remove implicitly as this leads to the worst errors. At some point I created a hardware interface implicitly from string which caused havoc :)

    • @RoughB8
      @RoughB8 Před měsícem +1

      @@contextfree It could be as simple as reading numbers from file, adding them up and printing the sum to terminal. With godbolt its great that you can compare different optimization levels and see when the compiler starts using SSE instructions.

    • @Liphi
      @Liphi Před měsícem +2

      C# app was also in x86 assembly (aot or after jit). IL looks different

  • @yuan.pingchen3056
    @yuan.pingchen3056 Před měsícem +1

    I prefer Rust, the Rust language is not only a memory-safe language, but it simplifies the complexity of C++ syntax that is overextended due to modern language features, but what about C#? It's a copycat of Microsoft's special version of Java, it runs in a VM, and garbage collection cannot be removed...

    • @andrewgr144
      @andrewgr144 Před měsícem

      I abandoned learning Rust because there are so few jobs in it, and getting a job as a programmer at a quality company when you're in your mid-late 50s is hard enough as it is. I didn't get to the point in my Rust journey where it ever felt like I wasn't fighting the language, but I truly believe that I would have, eventually. The main thing that I love about Rust is the way enums are first-class data objects, and the way the compiler makes sure you're covering every possible value that can be returned (if the return type is an enum). It feels so natural, and I wound up writing functions that looked a lot like finite state machines, which I find particularly easy to understand and reason about correctly. It is my understanding that this feature is common in functional languages; even if that's the case (pun not intended but duly noted), Rust is the first time I remember encountering it in a C-derived language. (Actually, now that I think about it, I also really miss that attribute system, which took a while to wrap my head around, but has profoundly changed how I think about many problems, to the point that I've become really interested in tagging systems in contexts other than programming. I know Rust isn't the first language, or even C-derived language, to use something like attributes, but it's the first time I encountered it as a core language feature.)

    • @yuan.pingchen3056
      @yuan.pingchen3056 Před měsícem

      @@andrewgr144 You are right, Rust is not a good choice for work reasons, but I only learned it because of its modern features. I worked for a company that made set-top boxes, and the original code of the company's product had a lot of unused annotations. And a large number of global static variables, and the inexplicable insistence on using the ANSIC standard makes the original code messy and difficult to maintain. I don’t understand why the company insists on using the C language. Obviously the tool chain provided by the set-top box supplier provides g++ related files. There is nothing missing. Using C++'s data encapsulation function and construction and destruction function can enhance the readability of the code and simplify a lot of complexity. Of course, this is a story later. The company's programmers do not have the ability to C++. they can't even understand the construction and deconstruction functions provided by the C++ language. This is a company where engineers insist on only doing their own part to clean up the trouble. After many years, I learned that the internal state of this kind of company is very important to a China mainland company. The company is the norm. Fortunately, I left my job early. At that time, I still had the enthusiasm for new technologies and the insistence that the code should be beautifully designed. I was not suitable to stay in such an aging and decadent company.