ASCII and Unicode Character Sets

  • Added 22. 07. 2024
  • This video describes the fundamental principles of character sets, character encoding, ASCII and Unicode. In particular, it covers the limitations of ASCII and the plethora of extended ASCII code pages. It also covers the design goals of Unicode, and describes the way control bits are allocated in the variable length, multi-byte character encoding of the Unicode Transformation Format, namely UTF-8.
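The control-bit layout mentioned in the description can be sketched in a few lines of Python. This is an illustrative hand-rolled encoder, not anything taken from the video; the function name `utf8_encode` is made up for this example, and the result is checked against Python's built-in codec.

```python
def utf8_encode(code_point: int) -> bytes:
    """Encode one Unicode code point as UTF-8 by hand (illustrative only)."""
    if not 0 <= code_point <= 0x10FFFF or 0xD800 <= code_point <= 0xDFFF:
        raise ValueError("not a Unicode scalar value")
    if code_point < 0x80:        # 0xxxxxxx: 7 payload bits, identical to ASCII
        return bytes([code_point])
    if code_point < 0x800:       # 110xxxxx 10xxxxxx: 11 payload bits
        return bytes([0xC0 | (code_point >> 6),
                      0x80 | (code_point & 0x3F)])
    if code_point < 0x10000:     # 1110xxxx 10xxxxxx 10xxxxxx: 16 payload bits
        return bytes([0xE0 | (code_point >> 12),
                      0x80 | ((code_point >> 6) & 0x3F),
                      0x80 | (code_point & 0x3F)])
    # 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx: 21 payload bits
    return bytes([0xF0 | (code_point >> 18),
                  0x80 | ((code_point >> 12) & 0x3F),
                  0x80 | ((code_point >> 6) & 0x3F),
                  0x80 | (code_point & 0x3F)])


for ch in ("A", "é", "€", "😍"):
    encoded = utf8_encode(ord(ch))
    assert encoded == ch.encode("utf-8")   # agrees with the built-in codec
    print(f"U+{ord(ch):04X}", " ".join(f"{b:08b}" for b in encoded))
```

The leading bits of the first byte announce the sequence length, and every continuation byte starts with `10`, which is what lets a decoder resynchronise in the middle of a stream.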

Comments • 117

  • @nanibanani7560 4 years ago +47

    No music, no insulting or idiotic speeches, a great, clean way of explaining and a nice English accent - thumbs up and subscribed

  • @siefeldenabualsoud9021

    The definition of a great teacher is you, thank you so much for your explanation.

  • @MarkIv49 1 year ago +25

    Awesome explanation, high quality content, no disturbing music. This is the example of how educational content should be.
    Thank you for this great video.

  • @asharneyaz7 2 months ago +1

    It was one of the most well-structured, in-depth explanations I had witnessed. Enjoyed every second of it. Thank you!

  • @rahulbalan 3 years ago +8

    I came here from Computerphile's Tom Scott video explaining the idea of Unicode. This video perfectly explains the system in appropriate depth. I would always be thankful for these channels.

  • @axedyson 5 years ago +49

    This channel is amazing. The explanation is very good and easy to understand!

  • @AkashVerma-sq8eq 3 years ago +10

    One of the best videos on character sets! Explained very well!!

  • @hichemguenfaf8814 5 years ago +5

    fascinating work and amazing video, well done.

  • @cantstandya3761 5 years ago +8

    Extremely informative, well done - AND thanks!

  • @PurXion_777 2 years ago +1

    GOD BLESS YOU, I FELT SO BEHIND IN MY CLASS WHEN I COULDN'T UNDERSTAND THIS!!! 7seconds in and already I know what this is

  • @lanceleavitt7472 1 year ago +1

    Perfectly explained. I am not a programmer and do not want to write code, but needed to understand the basics. --- Thank you for this presentation. ---

  • @johnwick-m4m 4 years ago +4

    Awesome video, man. Loved it, and it answered all my questions about ASCII and Unicode

  • @illustriousinc8608 5 years ago +3

    This was just great! Very well done

  • @ravense8337 3 years ago +1

    Thank you very much, I've checked multiple explanations but the wording used in those made me confused and not sure whether I understood it correctly, this one explains everything perfectly!

  • @janhorcicka4988 5 years ago +8

    Perfectly explained. Thank you!

    • @md_ez 2 years ago

      I have a question about the keyboard and coding systems

  • @caiomoura9620 2 years ago +1

    Best explanation I've seen so far. Thank you

  • @knseeker1524 3 years ago +1

    the explanation was superb, completely understandable!!!!

  • @shravanighadge5406 1 year ago

    Best video to understand ASCII and Unicode.

  • @sahilsharmahere 2 years ago +2

    Fabulous work !

  • @prakashrnair 1 year ago +1

    Thank you. Very easy and clear explanation!

  •  3 years ago +1

    Explained very well. Thank you!

  • @omerfarukmerey 3 years ago +3

    You saved my day. I really appreciate it, thank you :)

  • @oseyedian 1 year ago +1

    WoW, You are incredible 👏

  • @gyangaha109 3 years ago

    Perfectly explained.

  • @mcnole25 3 years ago +5

    Hexadecimal: There are 10000 characters in Unicode.
    Decimal: no there are 65536 characters
    Octal: There's actually 200000!
    Binary: THERE ARE 10000000000000000 CHARACTERS IN UNICODE. THAT'S A LOT.

    • @md_ez 2 years ago

      I have a question about the keyboard and coding systems ?

    • @Robotomy101 1 year ago

      ​@@md_ez what is the question

  • @skullwise 1 year ago +1

    Excellent explanation.

  • @urvashimendhe7056 2 years ago +1

    nicely explained!
    it was really very helpful.. THANK YOU

  • @dragods1516 3 years ago +1

    Another amazing video, thank you!
    It would be useful to mention the other Unicode encoding forms too (UTF-16, UTF-32).

    • @ComputerScienceLessons 3 years ago +5

      Thank you, and you are most welcome. I mention UTF-16 and UTF-32 in my video 'Unicode and Byte Order'. You might also find my video on 'Byte Order (Endianess)' interesting. :)KD

  • @VishalSharma-oj3rr 3 years ago +1

    you are awesome..great teaching style..thx

  • @nsansa9234 2 years ago +1

    great explanation. Thank you 😊

  • @casparharte 4 years ago +2

    I wonder why ASCII was 7 bits in the first place? In the end this played well with UNICODE UTF-8 fitting into a byte format. Was it because of the teletype selector vanes?

    • @ComputerScienceLessons 4 years ago +2

      I think it was because 7 was enough to cover all of the characters on a (teletype) keyboard and it left a spare bit to be used for parity checking. A lucky accident perhaps? :) KD
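The parity idea in this reply can be illustrated with a short Python sketch. Even parity stored in the eighth (top) bit is an assumption made for the example, and the function name `with_even_parity` is hypothetical; real links used various parity conventions.

```python
def with_even_parity(ch: str) -> int:
    """Return the 7-bit ASCII code of ch with an even-parity bit in bit 7."""
    code = ord(ch)
    if code > 0x7F:
        raise ValueError("not a 7-bit ASCII character")
    ones = bin(code).count("1")      # number of 1 bits in the 7-bit code
    parity_bit = ones % 2            # 1 only if the count of ones is odd
    return (parity_bit << 7) | code  # total number of 1 bits is now even


for ch in "AC":
    print(ch, f"{ord(ch):07b}", "->", f"{with_even_parity(ch):08b}")
```

A receiver that counts an odd number of 1 bits in a byte knows that at least one bit was flipped in transit.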

  • @_hydrogelic 4 months ago +2

    What happens when we theoretically run out of all the space we had to encode characters? Also, how are new characters added to Unicode?

    • @ComputerScienceLessons 4 months ago +1

      An organisation called the Unicode Consortium decides which new characters to include. Once or twice a year, new characters are introduced. It's unlikely we are going to run out of space soon but, theoretically, the same system could be scaled up again and again. home.unicode.org/ :)KD

  • @abhishekghosh5550 2 years ago +1

    Thanks a lot. Subscribed.

  • @bzdv5220 4 years ago +1

    Very clear, thanks.

  • @manhkhang7732 2 years ago

    superb lecture

  • @toyoashihara6242 3 years ago +2

    Thank you!
    ありがとう!

  • @deeptich5537 3 years ago

    #Question
    How does the CPU differentiate between the binary representation of, say, A (whose ASCII code is 65) and the number 65 itself?

    • @tobleramone 3 years ago

      The ASCII code for the string '65' would be two separate bytes.
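A small Python sketch may help with the question in this thread. It shows that the letter A and the integer 65 are the same byte, while the text "65" is two different bytes; which meaning applies is decided by the program reading the byte, not by the byte itself. The variable names are made up for the example.

```python
letter = "A".encode("ascii")       # one byte holding the value 65
digits = "65".encode("ascii")      # two bytes: '6' is 54 and '5' is 53
number = (65).to_bytes(1, "big")   # the integer 65 stored in one byte

print(list(letter), list(digits), list(number))

# The stored bit pattern is identical; only the interpretation differs.
assert letter == number
print(chr(number[0]), int.from_bytes(letter, "big"))
```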

  • @mariiasafonova5510 1 year ago +1

    awesome...thank you

  • @gamingindustries403 2 years ago +1

    Thank you very much
    😀😀😀

  • @DisfigurmentOfUs 5 years ago +1

    How can a Netflix representative be in the Unicode Consortium if Netflix was founded in 1997?

  • @davidcabreramartinez4892

    BRO YOU ARE THE COOLEST DUDE

  • @EducaLK 3 years ago +1

    Is the same organization managing the ASCII and Unicode systems? Please give an answer.

  • @nowhere7438 3 years ago +2

    Amazing

  • @lostcarpark 2 years ago

    Good video, but it implies that Unicode and UTF-8 are synonyms, which they are not. It would be good to include some mention of UTF-16 in particular.

  • @JoseGonzalezUwU 2 years ago +1

    Very well explained

  • @MrEfraimfc 5 years ago +2

    I don't understand much English, but I got the content well

  • @coreymagin 3 years ago

    12:12 is where it all came together for me ;)

  • @justcurious1940 2 years ago

    U+0041: did we use 1 byte or 2 bytes? C++ says that we only used 1 byte, and you are saying that we only used 1 byte, but it looks like we are using 2 bytes, 4 bits for each digit, right? What am I missing?
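One way to look at this question: "U+0041" is only a written notation for the code point number (hexadecimal 41, decimal 65), and the number of bytes used depends on the encoding form chosen. The sketch below uses Python's built-in codecs; the big-endian variants are picked only so that no byte order mark is added.

```python
ch = "\u0041"                       # the character 'A', code point 65

utf8 = ch.encode("utf-8")           # 1 byte:  41
utf16 = ch.encode("utf-16-be")      # 2 bytes: 00 41
utf32 = ch.encode("utf-32-be")      # 4 bytes: 00 00 00 41

for name, data in (("UTF-8", utf8), ("UTF-16", utf16), ("UTF-32", utf32)):
    print(f"{name}: {len(data)} byte(s): {data.hex(' ')}")
```

The four hex digits in "U+0041" are padding for readability; leading zeros do not have to be stored.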

  • @lazarussam77 3 months ago +1

    nice explanation

  • @nihalkhan7435 4 years ago +1

    This is great. Also, how many colors can be saved with 5 bits per pixel?

    • @ComputerScienceLessons 4 years ago +1

      2^5 = 32. Each pixel can be one of 32 possible colours.
      Take a look at my playlist on bitmaps
      czcams.com/video/0KmimFoalTI/video.html
      :)KD

    • @nihalkhan7435 4 years ago +1

      @ComputerScienceLessons Thank you so much

  • @flipflop1758 4 hours ago

    👏🏻👏🏻

  • @michaelglinsky2614 2 months ago

    When you show the number of possible encodings in UTF-8 for the 2-byte scenario, why don't you add 1024 to 937?
    The same question goes for the 3-byte scenario. For some reason you don't add the possible combinations for the last 2 "free" bits.
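Without assuming which figures the video shows, the capacity of each UTF-8 sequence length can be worked out from its payload bits. The sketch below counts only the code points that each length adds, because values that already fit a shorter form must not be re-encoded in a longer one (such "overlong" encodings are invalid). The names `PAYLOAD_BITS` and `new_code_points_per_length` are made up for this example.

```python
UNICODE_MAX = 0x10FFFF                      # highest Unicode code point
PAYLOAD_BITS = {1: 7, 2: 11, 3: 16, 4: 21}  # data bits left after control bits


def new_code_points_per_length() -> dict:
    """Map each sequence length to the number of code points it adds."""
    counts = {}
    first = 0
    for length, bits in PAYLOAD_BITS.items():
        last = min((1 << bits) - 1, UNICODE_MAX)
        counts[length] = last - first + 1
        first = last + 1                    # next length starts where this ends
    return counts


# Note: the 3-byte count still includes the 2048 surrogate values
# U+D800..U+DFFF, which are not valid characters on their own.
for length, count in new_code_points_per_length().items():
    print(f"{length}-byte sequences add {count} code points")
```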

  • @piotrz9546 4 years ago +2

    9:37 The 3-byte form of UTF-8 (there is no separate "UTF-24") is used mostly for Asian symbols, and without the control bits it carries 16 bits

  • @c0mbat612 5 years ago +1

    You should make a video on how to replace or add more RAM to your PC, or other parts.

    • @ComputerScienceLessons 5 years ago +1

      Funny you should say that - I plan to do more practical hardware stuff in the future.

  • @unkit07 3 years ago +2

    Great explanation U+1F60D

  • @vegekou100 3 years ago +1

    Final tutorial.

  • @Laughing_Cat_Meme 3 years ago +1

    Sir 🙇‍♂️🙇‍♂️🙇‍♂️

  • @PersonalXZ 3 years ago

    coded character sets, history and development / Ascii Art / Emoji, smile, emoticon.

  • @charlesklein7232 2 years ago

    A very professional video, but it does not explain how to get them! For example, the SKULL AND CROSSBONES is --- echo -e "\xE2\x98\xA0" ---, and while I have found lists, they're in binary and hex and not in whatever format this is. Is there a list of them?
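The `\xE2\x98\xA0` form in this comment is simply the UTF-8 bytes of the character written as hexadecimal escapes, so it can be derived from any code point list. A minimal Python sketch, assuming the character is U+2620 (SKULL AND CROSSBONES); the helper name `utf8_shell_escape` is hypothetical.

```python
import unicodedata


def utf8_shell_escape(ch: str) -> str:
    """Return the UTF-8 bytes of one character as \\xNN escapes."""
    return "".join(f"\\x{byte:02X}" for byte in ch.encode("utf-8"))


skull = "\u2620"
print(unicodedata.name(skull), utf8_shell_escape(skull))
```

The printed escape string can be pasted into `echo -e "..."` in a shell whose terminal uses UTF-8.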

  • @md_ez 2 years ago

    I have a question , please

    • @ComputerScienceLessons 2 years ago +1

      Please ask - I'll try my best :)KD

    • @md_ez 2 years ago

      @ComputerScienceLessons I have an idea and I want to do it in Java, C, or Python. Could you help me, please?
      Do you have a social media account?

  • @fullfungo 2 years ago +1

    The goals starting from 6:41 are actually not entirely correct.
    Unicode is “kinda” compatible with ASCII, in the sense that Unicode codepoint 65 (A) corresponds to ASCII code 65(A), for example.
    However, Unicode does not mandate that codepoint 65 is *necessarily* stored as 1-byte code 65. This is a goal of UTF-8 instead.
    UTF-16 and UTF-32 comply with Unicode standard but don’t store codepoint 65 as a 1-byte value.
    This video makes the same mistake multiple times, conflating Unicode and UTF-8, which are broadly speaking unrelated to each other and serve completely different purposes.

    • @justcurious1940 1 year ago

      Yeah brother, I watched this video a few days ago, before I knew anything about this topic, but now that I have learned more about it I can see how much wrong information is in it.
      Unicode is the collection of code points that represent characters.
      UTF-8, UTF-16, or UTF-32 is the representation of these code points in memory.
      So UTF-8 is a way to encode Unicode using a multibyte encoding scheme.
      UTF-32 is fixed size in memory, I think.
      I'm still reading about UTF-16 to understand how it is represented in memory ....
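For the open question about UTF-16 in this thread, here is a minimal Python sketch of how a code point becomes 16-bit code units. Code points below U+10000 fit in one unit; anything above is split into a surrogate pair. The function name `utf16_units` is made up for the example, and the result is checked against Python's built-in codec.

```python
def utf16_units(code_point: int) -> list:
    """Return the UTF-16 code units for one code point (illustrative only)."""
    if not 0 <= code_point <= 0x10FFFF or 0xD800 <= code_point <= 0xDFFF:
        raise ValueError("not a Unicode scalar value")
    if code_point < 0x10000:
        return [code_point]                  # one 16-bit unit
    value = code_point - 0x10000             # 20 bits remain
    high = 0xD800 | (value >> 10)            # top 10 bits -> high surrogate
    low = 0xDC00 | (value & 0x3FF)           # bottom 10 bits -> low surrogate
    return [high, low]                       # two 16-bit units


for ch in ("A", "€", "😍"):
    units = utf16_units(ord(ch))
    expected = ch.encode("utf-16-be")
    assert b"".join(u.to_bytes(2, "big") for u in units) == expected
    print(f"U+{ord(ch):04X}", " ".join(f"{u:04X}" for u in units))
```

So UTF-16 is variable length too (2 or 4 bytes per code point), while UTF-32 always uses 4 bytes.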

  • @narendrajayram1317 2 years ago

    lucid explanation!

  • @piotrz9546 4 years ago +4

    8:47 The 2-byte form of UTF-8 (not UTF-16) is used for European and Arabic languages, and without the control bits it carries 11 bits

  • @gachaghost7996 4 years ago +1

    I don't understand. This is confusing, and I need to do my homework, but it is going too fast. SO CONFUSING

    • @ComputerScienceLessons 4 years ago +2

      It goes quickly but took several hours to make. If you are new to this, you should take your time with it and perhaps make a few notes. Make sure you understand the basics of binary integers first.

  • @lokisingh9944 7 months ago

    Your channel is amazing and I'm really interested in learning more about computers, can you please recommend books, websites or other CZcams channels from where I can learn about computers with this depth. Praise from India 🇮🇳

  • @clever472 6 months ago

    6:06 Unicode PUA
    6:09 sup PUA A

  • @vijaykumar-jf7fs 3 years ago

    How to convert ASCII to Tamil font EX:ÂáÇè ¸ÛÆèÂÝÂá ÔÚÇÌè / ѽ Tamil word