Unix system calls (1/2)

Sdílet
Vložit
  • čas přidán 28. 08. 2024
  • Part of a larger series teaching programming. See codeschool.org

Komentáře • 192

  • @R3negade638
    @R3negade638 Před 4 lety +22

    9 years later and this is the best resource on the topic. Really great.

  • @DragisaBoca
    @DragisaBoca Před 10 lety +190

    Very well made, you made the world a better place.

  • @BlackM3sh
    @BlackM3sh Před 7 lety +215

    At 31:50 you talk about environment variables. However there are some mistakes worth correcting for future viewers. First, although the environment variables are stored in the process' memory, it is stored as zero-terminated strings and not as one big string separated by new-line characters. It is also is not stored on the heap, nor is there a global variable in the data section pointing to it. The environment is actually stored entirely on the stack and is a part of the initial process stack that is set up before the program starts running. The first value on the stack is the argument count followed by an array of the addresses of the different arguments, then address 0 marking the end of the argument array. Right after that there is a second array of addresses which each point to a zero-terminated string which would be the environment variables, this array is also terminated by having address 0 at the end. There is actually a third array of auxiliary vectors but after that there is an unspecified amount of bytes before the information block starts. It's generally inside this block the command line arguments and environment variables are stored, as in the actual string values. You can confirm this by dumping the stack of pretty much any program and you typically find all the environment variables at the very end (highest memory address). If you are on Linux you can do this by first reading the '/proc//maps' file for any process, just replace with that process' PID. This file contains the ranges of memory mapped to the process and what they are mapped to. Near the bottom you'll see one line with the range mapped to [stack]. Take note of the start address and calculate how big it is in bytes. Then run 'sudo xxd -s -l /dev//mem', example 'sudo xxd -s 0x7fff182bd000 -l 0x22000 /dev/14950/mem'. And the environment variables should get printed out together with their hex values and address location.
    To illustrate this further I've written a small c program that prints all the environment variables using the argv array pointer. As you can see the environment variable pointers are stored pretty much right after argv.
    #include
    int main(int argc, char **argv)
    {
    for (int i = argc + 2; argv[i] != NULL; i++)
    {
    printf("%s
    ", argv[i]);
    }
    return 0;
    }
    You can of course make it less stupid by using the full version of main which includes a pointer to the first element in the environment pointer array.
    #include
    int main(int argc, char **argv, char **envp)
    {
    for (int i = 0; envp[i] != NULL; i++)
    {
    printf("%s
    ", envp[i]);
    }
    return 0;
    }
    This is all defined as a part of the ABI (application binary interface) for both the x86 and x86_64 architecture, so 32 and 64 bit desktop computers.
    tl;dr: The environment is not a single long string separated by new-line characters. The environment variables and the pointers to them are both stored on the stack or just before it.

    • @TheNullBox
      @TheNullBox Před 5 lety +24

      Wow man! Thank you :) Both the video and your comment. Amazing stuff!

    • @payloadartist
      @payloadartist Před 4 lety +11

      Thanks. This comment should be pinned!

    • @crptc5707
      @crptc5707 Před 4 lety +3

      Thank you sir just tried your code in online c ide and it runs exactly as you described, but other than than video is great!

    • @sangramjitchakraborty7845
      @sangramjitchakraborty7845 Před 3 lety

      Incredibly informative. Thank you for commenting this.

    • @leaharrington4472
      @leaharrington4472 Před 3 lety +2

      While not part of the kernel ABI, as you point out, glibc provides a global variable (char ** environ) pointing to the environment, and may relocate the environment to the heap in setenv() as needed.

  • @sodapopinski9922
    @sodapopinski9922 Před 5 lety +38

    so many levels of abstraction,by the time people are clicking on their GUI's it is a symphony of perfect timed and executed processes, but listening to this I can really imagine year after year problems develop and more complicated solutions come into play stepping up a level of abstraction, I mean 60 years is so impressive to see how for we come, from logic gates, MOSFETS to EEPROMS to insane clock speeds to RISC to now but it all started at a level of someone feeding an electronic impulse into a JK FLIP FLOP and trapping that high or low impulse.... it is truly baffling!!!!

    • @automatenmark5051
      @automatenmark5051 Před 4 měsíci +1

      it's mind blowing but also magical to see how all comes down to some tiny switches

  • @eshgholah
    @eshgholah Před 10 lety +83

    I have loved every second of your videos. Specially the Unix system call series. Could you please kindly do an advanced series of Linux Internals. I know it is too much to ask but obviously you are the right person to do it. I have never seen anyone else describing things so clear and nicely. Thanks a million.

  • @nikkehtine
    @nikkehtine Před 2 lety +3

    High quality presentation and commentary. We get such a long, interesting and informative video for the great low price of free. Thank you so much.

  • @Synchr0nix
    @Synchr0nix Před 6 lety +10

    I've been watching this video series every day, for the last 3 days, and I learn a little more from it each time, lol. Thanks man. This is one of the most professional lessons I've ever found on CZcams. I can tell you know what you're talking about.

  • @fouzaialaa7962
    @fouzaialaa7962 Před 4 lety +3

    i studied this in my engineering class and it took them almost 4 months to teach us this ...this 45 minute lecture made it so easy and simple !!! in uni they stretched it so much that you forget about it start questioning everything again every lecture
    thx for the upload

  • @AdamOutler
    @AdamOutler Před 9 lety +66

    This is a very informative Linux/Unix System Calls series.

    • @JoePhilipps
      @JoePhilipps Před 9 lety +1

      Applications are ever more security aware, and one caveat with malloc(3) and mmap(2) is to determine what is sensitive in what has been allocated (e.g., storage for passphrases, cached encryption keys, etc.), and that should be zeroed before calls to free(3) or munmap(2). There may be no guarantee by the OS that newly malloc'ed or mmapped regions have been thus scrubbed, so it's up to the process, as best as it can, to sanitize such regions before handing them back.

    • @AdamOutler
      @AdamOutler Před 9 lety +1

      *****​ seems like there should be a system call to handle sensitive data. I wonder if it would be possible to somehow fill the memory with memory requests to just scan for random strings. In what context, though, do you mean apps are more security aware? Are you speaking of just this?

    • @JoePhilipps
      @JoePhilipps Před 9 lety

      One should always consult the manual page for the system call you want to use, and about the system for which you wish to program. This guy obviously had to remain generic to cover SysV, Linux, *BSD, OS X, etc., but each system can have other restrictions or features. For example, if you want to write a utility for an SELinux system, you will have contexts to deal with, and such things operate additionally in system calls (e.g., I think child processes after a fork(2) also inherit SELinux contexts).

    • @JoePhilipps
      @JoePhilipps Před 9 lety +1

      _I wonder if it would be possible to somehow fill the memory with memory requests to just scan for random strings. In what context, though, do you mean apps are more security aware?_
      Exactly that, Adam Outler . I just wanted people to start thinking more securely if they want to program at the system call level. It's worth a look at a particular OS's manpages or equivalent to see if such conditions are specified, such as [s]brk(2) (upon which malloc(3) is based) zeroing memory pages before they're returned to the process.
      In fact, because a process can be killed at any time, it is wise in more security minded apps to scrub storage (whether variables/RAM or parts of files) as soon as they're not needed. To a certain extent, you can control this by locking the pages into RAM if you have enough priviledge to the process, so that such RAM will never be written to the swap partition. That represents another potential security threat, the superuser (or anyone with enough access to the underlying device node) sifting through the swap space for such nuggets.

  • @pouryamehdinejad8124
    @pouryamehdinejad8124 Před 4 lety +2

    The best explanation of system calls I could find on the internet.
    Thanks to Brian Will

  • @rzathamesmer
    @rzathamesmer Před 9 lety +21

    More Unix videos please! You're an excellent teacher, and the slides are very well done. :))))))))

  • @penisafotza4807
    @penisafotza4807 Před 3 lety +10

    1:46 Oh God! I cant get away from it...

  • @kshahkshah
    @kshahkshah Před 8 lety +2

    This is a truly excellent, informative and well laid out video. Thank you so much. I've been coding for 15+ years and got a lot out of this, especially having mostly dealt in interpreted dynamic languages and not having to ever manage memory myself

  • @leonbishop7404
    @leonbishop7404 Před 3 lety +16

    2:00 kinda sus abbreviation you got there

  • @REOsama
    @REOsama Před 2 měsíci

    My head is spinning, this is the most meaty video I've ever watched, you are an unsung hero

  • @ben2258
    @ben2258 Před 4 lety +2

    I just discovered your channel and can't stop watching your videos! They're incredibly helpful and clear.
    Just wanted to say it seems to me this and the next video should be added to your Operating Systems playlist.

  • @crptc5707
    @crptc5707 Před 4 lety

    It's a fantastic tutorial! I've been baffled with kernel space and user space for quite a long time and misunderstood that system call incurs context switch between user process and kernel process, until I watched this video... million thanks!

  • @mbigras
    @mbigras Před 5 lety +4

    I’ve searched for a video like yours for a long time, thank you so much for this work!

  • @amarnathp4560
    @amarnathp4560 Před 3 lety

    I am seeing this after 9 years. Nice content

  • @aborkar
    @aborkar Před 4 lety +1

    Sir, you may be the reason I get my dream job

  • @valdasadomaitis719
    @valdasadomaitis719 Před 9 lety

    Came here through codeschool.org while googling for a system call and i'm really enjoying the rest of the content.

  • @tigeruppercut7
    @tigeruppercut7 Před 9 lety +1

    One of the most solid videos I've seen on Linux. Great job. Thanks.

  • @PauloConstantino167
    @PauloConstantino167 Před 3 lety

    Your content is Gold. Sad to see you inactive........

  • @cyrilemeka6987
    @cyrilemeka6987 Před 6 měsíci

    Very informative. I needed this to better understand low level details for the program I am currently writing in C++ and llvm. Thanks

  • @srenh-p3798
    @srenh-p3798 Před rokem +1

    Great video Brian

  • @briantwill
    @briantwill  Před 12 lety +2

    I'm no expert issue on this issue, but my investigation at the time concluded that brk/sbrk are actually archaic, as the concept of a data segment barrier is outmoded in paged-memory environments. Yes, mapping /dev/mem is not the way to go, but mmap in modern Unixes can do 'annonymous mapping,' which maps to swap-backed memory pages rather than any file. The Wikipedia entry on mmap mentions this. I believe this is what most allocation routines use today, not brk/sbrk.

  • @Yazan_Majdalawi
    @Yazan_Majdalawi Před rokem

    Wow, where have you been, this is a treasure!

  • @stefanvoicu6484
    @stefanvoicu6484 Před 2 lety

    I have a class on Operating Systems and this has been very helpful! thank you

  • @bob-ny6kn
    @bob-ny6kn Před 3 lety +2

    The depth of the information you cover is what scared me away from Computer Science. I call myself a programmer, but obviously I am more a Code Groupie. Your video is very nice, and still relevant so many years after it was made. I wonder if ever architecture will significantly change?

  • @greymind0072
    @greymind0072 Před 4 lety

    Thanks for making this gem of a video. Your content is lucid and enriching at the same time

  • @patrickmullen2914
    @patrickmullen2914 Před rokem

    Thank you for taking the time to make this video. A thumbs up 👍
    I'll be viewing more of your videos 🙂 including part 2 of this one

  • @briantwill
    @briantwill  Před 11 lety +4

    Yeah, I should have phrased this better. To my understanding, user groups were created solely with actual groupings of humans in mind, a use case of diminished importance today in most settings.

  • @WeightlessFlex
    @WeightlessFlex Před 3 lety

    Using this to study up for my interview as a production engineer. Best videos resource I’ve found besides certain books. Thank you. Maybe can you come up with a practice problem series?

  • @shaunmorgan2202
    @shaunmorgan2202 Před 6 lety

    Good video, just a small point. Linux is a kernel, Debian and others are the Unix derivatives that use the Linux kernel.

  • @pdr0663
    @pdr0663 Před 4 lety

    Brian, I’m learning to program on Research Unix V6 in C, so I love your tutorial. I have a small nitpick. Plurals of nouns which end in “s” are pronounced “es” like any other noun ending, unlike (I think) the only exception, which is those which end in “x” which have a different ending containing the pronounciation you are using. Seems to be a recent US thing, and particularly in IT. Love the tutorials, thanks!

  • @JethroYSCao
    @JethroYSCao Před 4 lety

    When I first learned about permissions on directories, it was said that 'x' allows one to cd into it, even if that might not be the most precise explanation, I think it's a good enough proxy to give users the intuition.

  • @0xrgg965
    @0xrgg965 Před 6 lety +5

    laughed at "we have a process that is forking itself"

  • @MaxCoplan
    @MaxCoplan Před 5 lety +3

    It says this is one part of a larger series. What is the larger series?

  • @ravisaraswat2452
    @ravisaraswat2452 Před 7 lety +1

    you are the real Guru, thanks a lot , really appreciate your help and videos. :)

  • @llama1917
    @llama1917 Před 13 lety

    Just stating the obvious, 90% of why i'm watching this particular video on the plane is "speaker's voice does not make me ashamed to be an engineer", good job

  • @landro3552
    @landro3552 Před 4 lety

    0:00 ~ UNIX-like systems
    1:46 ~ UNIX standards
    2:57 ~ System calls
    5:21 ~ Process states

  • @vishals9353
    @vishals9353 Před 5 lety

    It is an excellent video which covers hell lot of things with great clarity in a short time.Thanks for the tutorial.

  • @NeelSandellISAWESOME
    @NeelSandellISAWESOME Před 2 lety

    BSD actually stands for Berkley Software Distribution

  • @samarthtandale9121
    @samarthtandale9121 Před rokem +2

    This is a superb playlist! Though this is 11 yrs old, can you please tell where can I find the subsequent videos cauz the link provided in description is out-dated i think ... Please tell where can I get the subsequent videos of this series or upload them in the same playlist on youtube. This is a very kind request of mine ... Thank You though for whatever you have put on the channel for free !!!

    • @ziaahmad8738
      @ziaahmad8738 Před 2 měsíci

      idk if you still need it but: czcams.com/video/2DrjQBL5FMU/video.html

  • @algoshalgo2682
    @algoshalgo2682 Před 2 lety

    37:14, that is why UNIX are secure and safe

  • @XavierMJames
    @XavierMJames Před 4 lety

    Wow ! that's one helpful easy to understand lecture on CZcams

  • @ko95
    @ko95 Před 4 lety

    So a system call is some service that the operating system makes available from hardware and application/processes use these services to function...?

  • @ooo000ps8
    @ooo000ps8 Před 4 lety

    great video, Thank you very much

  • @mohammadalhyari4272
    @mohammadalhyari4272 Před 4 lety +2

    seems the site is down ?

  • @Occcc12
    @Occcc12 Před 7 lety

    What an excellent and clear explanation. thanks a lot for the upload!

  • @kami-brawlstars9635
    @kami-brawlstars9635 Před 4 lety +7

    System call:
    Generate luminous element!
    Discharge!

  • @Alex2Buzz
    @Alex2Buzz Před 6 lety +11

    Technically, mmap maps memory at a specified address, while malloc finds free address space and allocates there. Of course, you know this, but I'll just note the correct usage in the comments.

    • @crateim
      @crateim Před 6 lety +5

      mmap is how a userspace program requests memory from the kernel (with an anonymous mapping, and it can, but does not have to specify the address it gets mapped at, it can leave it up to the kernel to choose).
      `malloc` is not a system call, but a libc function that will use `mmap` under the hood.

    • @Alex2Buzz
      @Alex2Buzz Před 6 lety +1

      Aaron Miller malloc can be implemented using an anonymous mmap, but it can also be implemented with sbrk.
      I do see your point, but simplifying mmap as just “the memory allocator” is a bit of a misrepresentation.

    • @GarrickHe
      @GarrickHe Před 6 lety

      I thought malloc uses sbrk() or was is mmap a recent change? Thanks.

    • @crateim
      @crateim Před 6 lety

      Garrick He depends on your libc/malloc, recent ones mostly mmap I believe, but when in doubt strace and find out :)

  • @TheDerHeld
    @TheDerHeld Před 6 lety

    great content with awesome sidenotes to give you the big picture - thank you!

  • @yesterdaysguy
    @yesterdaysguy Před 11 lety +2

    Actually it's Berkeley SOFTWARE Distribution.
    -Nice first pic though gotta love Jurassic Park.

  • @waiwinglam8541
    @waiwinglam8541 Před 7 lety

    This is a very quality tutorial. Thanks a lot!

  • @00chiuppi
    @00chiuppi Před 11 lety

    BIll - great work here. very helpful for me in understanding issues I'm dealing with on some servers at work. thanks

  • @manojambakkat
    @manojambakkat Před 13 lety

    Thanks a ton. This is very nice video on system calls and process address space.

  • @GideonMaina
    @GideonMaina Před 3 lety

    Great content, thanks for sharing the knowledge.

  • @Larock-wu1uu
    @Larock-wu1uu Před 2 lety

    This is amazing!

  • @lanhsunsiingh4898
    @lanhsunsiingh4898 Před 5 lety

    Great Work Brian!

  • @gavalinilesh80186
    @gavalinilesh80186 Před 10 lety +1

    Thank you Brian.

  • @mehdikerdoud6139
    @mehdikerdoud6139 Před rokem

    thank you so much for thee great explanation

  • @rangapavankurapati2557

    Thank you Sir. Great explained.

  • @briantwill
    @briantwill  Před 12 lety

    Thanks, this is good to know. Still think it's a bit too in depth at this point. I'm already glossing over a lot here, though I don't think I say anything out-and-out false. Do you think there's something misleading?

    • @aborkar
      @aborkar Před 4 lety +1

      No such thing as too in depth mate. Share as much as you know!

  • @shear-watts
    @shear-watts Před 2 měsíci

    💯💯💯❤❤ great playlist

  • @Jonathan-od5xc
    @Jonathan-od5xc Před 5 lety

    This is incredible, thank you.

  • @rajusakthitube
    @rajusakthitube Před 7 lety

    Thank you so much Brain. very useful video.

  • @ChandraSekhar-ur1so
    @ChandraSekhar-ur1so Před 8 lety +2

    Thanks a lot for the video.

  • @Aemilindore
    @Aemilindore Před 3 lety

    you are crazy good!

  • @romaincoupin7929
    @romaincoupin7929 Před 3 lety

    does anyone know a similar course, but fresher? I mean, 10 years passed away, probably pretty much of things had changed up to nowdays

    • @storyls
      @storyls Před 3 lety +2

      Nope, everything is mostly the same. It’s been this way since the 70s, we haven’t come up with a better way to do all this.
      We have of course added more system calls, but all these old calls are there are usually still what are used.

    • @romaincoupin7929
      @romaincoupin7929 Před 3 lety

      @@storyls ok, thx!

  • @cyberst0rm
    @cyberst0rm Před 2 lety +1

    The part about syscall using the user stack of the process is blatantly wrong

  • @yurigansmith
    @yurigansmith Před rokem

    Thanks for this insightful playlist.
    Btw: Can you recommend a good book (or lecture notes) on this topic?

  • @fuanka1724
    @fuanka1724 Před 6 lety

    Excellent explanation, thank you!

  • @benwhite3938
    @benwhite3938 Před 9 lety +7

    Fantastic video!
    Small nitpick:
    It's actually called "The University of California, Berkeley", not "Berkeley University."

  • @BlackJar72
    @BlackJar72 Před 6 lety

    Interesting to note: I have found the trying to access a deallocated C++ object crashes with a seg-fault, but accessing a deallocated C-string with C code results in code that runs with no complaints.
    (Why I did? Well, it was a challenge experiment, not something I'd ever do in a real program.)

    • @briantwill
      @briantwill  Před 6 lety +4

      I assume this C string was a literal in code, right? String-literal strings get put in a permanent memory area that exists at program start and never goes away. I'm curious what platform you did this on? I believe the behavior of free'ing addresses that aren't valid alloc addresses is undefined. Perhaps on your platform invalid free calls just fail silently.

  • @PurnachandMedisetty
    @PurnachandMedisetty Před 5 lety +1

    BSD -Berkeley Software Distribution

  • @Feninou
    @Feninou Před 4 lety

    Good job man !

  • @m3hdim3hdi
    @m3hdim3hdi Před 4 lety

    Thank you so much you helped me a lot

  • @goldibollocks
    @goldibollocks Před 2 lety

    The init process be like "I just wanna fork"

  • @rafnaegels8913
    @rafnaegels8913 Před 6 lety +4

    Opensource is not necessarily free software.

  • @aquapurity
    @aquapurity Před 7 lety

    An amazingly helpful video on the subject. Thank you very much.

  • @WilcoVerhoef
    @WilcoVerhoef Před 2 lety

    30:44 the "wait" system call.
    What if the child process ends very quickly, and another unrelated process starts using the same PID. Will this cause this "wait" call to fail? Is there a way to prevent this with certainty?

  • @cindycindy5980
    @cindycindy5980 Před 3 lety

    how does the process split memory into pages? at 19:00 if we can get away with using 8000 bytes of memory when allocating 5000, does this mean a memory error (17:00) only triggers by the page, unless otherwise specified, eg. when a second process allocated the other 3000 bytes?

  • @echoptic775
    @echoptic775 Před 2 lety +7

    SUS

  • @JosefdeJoanelli
    @JosefdeJoanelli Před 4 lety

    Unix system calls casually explained

  • @jonasfelix7700
    @jonasfelix7700 Před 4 lety

    How does this compare to modern Windows Systems? Would be interesting to see a comparison video.

  • @varunh93
    @varunh93 Před 6 lety +1

    Does the copy on write flag gets unset after the copy is made on the first write?

  • @petevenuti7355
    @petevenuti7355 Před 2 lety

    I've heard of 'single system image' Linux versions, how does that work? How many of them are there?

  • @jjpcondor
    @jjpcondor Před 11 lety

    Fine job, Brian!

  • @miraserban4350
    @miraserban4350 Před rokem

    Allah bless you! As a self taught software monkey now I have elvolved into software homo sapiens

  • @Stakkato98
    @Stakkato98 Před 12 lety

    good work, very straight forward.

  • @subee128
    @subee128 Před 6 měsíci

    Thank you very much

  • @chandanrock4802
    @chandanrock4802 Před 5 lety

    Hello Sir, thank you for your knowledge, all the videos are really great and understandable, Sir I wanted to know name of the book from where I can get this information, thank you

  • @MichaelDCBowen
    @MichaelDCBowen Před 12 lety

    what is your presentation app? these slides are perfect.

  • @ranjirhodes
    @ranjirhodes Před 4 lety

    Excellent info , though i had to reduce to 0.75x speed . Thank you for the explanation 👍 can u give info on relation between smaps and mmap and about virtual memory and resident set size

  • @DaLakersFan24
    @DaLakersFan24 Před 5 lety

    GREAT VIDEO, THANKS FOR MAKING THIS

  • @microto
    @microto Před 9 lety +1

    awesome! keep up the good work

  • @shashank88
    @shashank88 Před 9 lety

    Thank you ! crisp and to the point!

  • @Crux161
    @Crux161 Před 12 lety

    this is great, and i'm all for the smaller chunks however, I feel this video has some volume normalization issues.. That could be something to double check before creating your final version. :D Still Very Awesome

  • @Crux161
    @Crux161 Před 11 lety

    However, I should say that the captions seem to work exceptionally well. :D