ASCII and Unicode Character Sets
- Added 22 July 2024
- This video describes the fundamental principles of character sets, character encoding, ASCII and Unicode. In particular, it covers the limitations of ASCII and the plethora of extended ASCII code pages. It also covers the design goals of Unicode, and describes the way control bits are allocated in the variable length, multi-byte character encoding of the Unicode Transformation Format, namely UTF-8.
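As a rough illustration of the control-bit layout mentioned in the description, here is a minimal Python sketch of a hand-written UTF-8 encoder. The function name `utf8_encode` is made up for this example and is not taken from the video; the result is checked against Python's built-in encoder.

```python
def utf8_encode(code_point: int) -> bytes:
    """Encode one Unicode code point as UTF-8 by hand (illustrative helper)."""
    if code_point < 0 or code_point > 0x10FFFF or 0xD800 <= code_point <= 0xDFFF:
        raise ValueError("not a valid Unicode scalar value")
    if code_point < 0x80:
        # 1 byte: 0xxxxxxx (same bit pattern as 7-bit ASCII)
        return bytes([code_point])
    if code_point < 0x800:
        # 2 bytes: 110xxxxx 10xxxxxx (11 payload bits)
        return bytes([0xC0 | (code_point >> 6),
                      0x80 | (code_point & 0x3F)])
    if code_point < 0x10000:
        # 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx (16 payload bits)
        return bytes([0xE0 | (code_point >> 12),
                      0x80 | ((code_point >> 6) & 0x3F),
                      0x80 | (code_point & 0x3F)])
    # 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx (21 payload bits)
    return bytes([0xF0 | (code_point >> 18),
                  0x80 | ((code_point >> 12) & 0x3F),
                  0x80 | ((code_point >> 6) & 0x3F),
                  0x80 | (code_point & 0x3F)])


for ch in ("A", "é", "€", "😍"):
    encoded = utf8_encode(ord(ch))
    # The hand-rolled result should agree with the built-in encoder.
    assert encoded == ch.encode("utf-8")
    print(f"U+{ord(ch):04X}", " ".join(f"{b:08b}" for b in encoded))
```

The leading bits of the first byte say how many bytes the sequence has, and every continuation byte starts with `10`.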
No music, no insulting or idiotic speeches, a great and clean way of explaining, and a nice English accent. Thumbs up and subscribed.
Glad you liked it. Thanks for the comment. :)KD
The definition of a great teacher is you, thank you so much for your explanation.
Awesome explanation, high quality content, no disturbing music. This is the example of how educational content should be.
Thank you for this great video.
You are most kind. Thank you :)KD
It was one of the most well-structured, in-depth explanations I had witnessed. Enjoyed every second of it. Thank you!
Thank you. You are most welcome :)KD
I came here from Computerphile's Tom Scott video explaining the idea of Unicode. This video perfectly explains the system in appropriate depth. I would always be thankful for these channels.
I love Tom Scott too. :)KD
This channel is amazing. The explanation is very good and easy to understand!
I agree
One of the best videos on character sets! Explained very well!!
You are very kind :)KD
fascinating work and amazing video, well done.
Extremely informative, well done - AND thanks!
GOD BLESS YOU, I FELT SO BEHIND IN MY CLASS WHEN I COULDN'T UNDERSTAND THIS!!! 7 seconds in and already I know what this is
Delighted to help. You are most welcome :)KD
Perfectly explained. I am not a programmer and do not want to write code, but needed to understand the basics. --- Thank you for this presentation. ---
You're most welcome :)KD
Awesome video, man. Loved it, and it answered all my questions about ASCII and Unicode
Glad to help. Thanks for the comment - they keep me going. :) KD
This was just great! Very well done
Thank you very much, I've checked multiple explanations but the wording used in those made me confused and not sure whether I understood it correctly, this one explains everything perfectly!
Delighted to help. K.I.S.S :)KD
Perfectly explained. Thank you!
I have a question about the keyboard and coding systems
Best explanation I've seen so far. Thank you
Thank you :)KD
the explanation was superb, completely understandable!!!!
Thank you for saying so :)KD
Best video to understand ASCII and Unicode.
Fabulous work !
Thank you :)KD
Thank you. Very easy and clear explanation!
You're welcome - thanks :)KD
Explained very well. Thank you!
You're welcome. :)KD
You saved my day. I really appreciate it, thank you :)
WoW, You are incredible 👏
Thank you. You are very kind :)KD
Perfectly explained.
Hexadecimal: There are 10000 characters in Unicode.
Decimal: no there are 65536 characters
Octal: There's actually 200000!
Binary: THERE ARE 10000000000000000 CHARACTERS IN UNICODE. THAT'S A LOT.
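The joke above works because all four numbers are the same value, 2^16, written in different bases. A quick Python check, with the caveat that 2^16 is only the range of a 16-bit value; the full Unicode code space runs from U+0000 to U+10FFFF.

```python
n = 2 ** 16  # number of values a 16-bit field can hold

print(format(n, "x"))  # hexadecimal -> 10000
print(n)               # decimal     -> 65536
print(format(n, "o"))  # octal       -> 200000
print(format(n, "b"))  # binary      -> 10000000000000000

# The whole Unicode code space is larger than 16 bits.
code_space = 0x10FFFF + 1
print(code_space)      # -> 1114112
```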
I have a question about the keyboard and coding systems.
@@md_ez what is the question
Excellent explanation.
Thank you :)KD
nicely explained!
it was really very helpful.. THANK YOU
You are most welcome :)KD
Another amazing video, thank you!
It would be useful to mention the other Unicode encoding forms as well (UTF-16, UTF-32).
Thank you, and you are most welcome. I mention UTF-16 and UTF-32 in my video 'Unicode and Byte Order'. You might also find my video on 'Byte Order (Endianess)' interesting. :)KD
you are awesome..great teaching style..thx
Thank you :)KD
great explanation. Thank you 😊
You're very welcome :)KD
I wonder why ASCII was 7 bits in the first place? In the end this played well with UNICODE UTF-8 fitting into a byte format. Was it because of the teletype selector vanes?
I think it was because 7 was enough to cover all of the characters on a (teletype) keyboard and it left a spare bit to be used for parity checking. A lucky accident perhaps? :) KD
What happens when we theoretically run out of all the space we had to encode character? Also how are new characters added to unicode?
An organisation called the Unicode Consortium decides which new characters to include. Once or twice a year, new characters are introduced. It's unlikely we are going to run out of space soon but, theoretically, the same system could be scaled up again and again. home.unicode.org/ :)KD
Thanks a lot. Subscribed.
You're very welcome. Tnx for the sub :)KD
Very clear, thanks.
You're welcome. :)KD
superb lecture
Thank you!
ありがとう!
You're welcome :)KD
#Question
How does the CPU differentiate between the binary representation of, say, A (whose ASCII code is 65) and the number 65 itself?
The ASCII code for the string '65' would be two separate bytes.
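To expand on the reply: the stored bit pattern for the character A and the number 65 is the same, and it is the program, not the CPU, that decides how to interpret it. A small Python sketch of that idea (variable names are only for illustration):

```python
value = 65
as_byte = bytes([value])  # a single byte with bit pattern 01000001

# Same byte, two interpretations chosen by the program.
print(as_byte.decode("ascii"))         # read as a character -> A
print(int.from_bytes(as_byte, "big"))  # read as an integer  -> 65

# The text "65" is a different thing: two characters, two bytes.
digits = "65".encode("ascii")
print(list(digits))                    # -> [54, 53], the codes for '6' and '5'
```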
awesome...thank you
You're very welcome :)KD
Thank you very much
😀😀😀
You're welcome :)KD
How can a Netflix representative be in the Unicode Consortium if Netflix was founded in 1997?
I guess members of the consortium have come and gone over the years.
BRO YOU ARE THE COOLEST DUDE
You are very kind. :)KD
Is the same organization managing the ASCII and Unicode systems? Please give an answer.
No. I'm just a teacher :)KD
Amazing
Thanks :)KD
Good video, but it implies that Unicode and UTF-8 are synonyms, which they are not. It would be good to include some mention of UTF-16 in particular.
muy bien explicado
De nada :)KD
I don't understand much English, but I got the content well
12:12 is where it all came together for me ;)
U+0041: did we use 1 byte or 2 bytes? C++ says that we only used 1 byte, and you are saying that we only used 1 byte, but it looks like we are using 2 bytes, with 4 bits for each hex digit, right? What am I missing?
Is this (U+0041) UTF-8 or UTF-16?
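One way to look at this question: U+0041 is neither UTF-8 nor UTF-16. It is notation for a code point (a number written in hex), and the number of bytes only appears once an encoding is chosen. A short Python sketch, using the big-endian codecs so that no byte order mark is added:

```python
ch = "\u0041"  # the code point U+0041, i.e. the letter A

for encoding in ("utf-8", "utf-16-be", "utf-32-be"):
    data = ch.encode(encoding)
    print(encoding, len(data), data.hex(" "))

# utf-8     1  41
# utf-16-be 2  00 41
# utf-32-be 4  00 00 00 41
```

The four hex digits in the notation are just padding for readability, which is why a C++ `char` holding UTF-8 needs only one byte for it.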
nice explanation
Thank you :)KD
This is great. Also, how many colors can be saved with 5 bits per pixel?
2^5 = 32. Each pixel can be one of 32 possible colours.
Take a look at my playlist on bitmaps
czcams.com/video/0KmimFoalTI/video.html
:)KD
@@ComputerScienceLessons Thank you soo much
👏🏻👏🏻
When you show the number of possible encodings in UTF-8 for the 2-byte scenario, why don't you add 1024 to 937?
The same question goes for the 3-byte scenario. For some reason you don't add possible combinations for the last 2 "free" bits.
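I can't tell which exact figures the video shows at that point, so the following is only a sketch of the general arithmetic: how many bit patterns each UTF-8 sequence length can hold, compared with how many code points are actually assigned to that length by the standard ranges.

```python
# Payload bits left after the control bits, per sequence length in bytes.
PAYLOAD_BITS = {1: 7, 2: 11, 3: 16, 4: 21}

# First and last code point encoded with each sequence length.
RANGES = {
    1: (0x0000, 0x007F),
    2: (0x0080, 0x07FF),
    3: (0x0800, 0xFFFF),   # includes the surrogates U+D800..U+DFFF, which are never encoded
    4: (0x10000, 0x10FFFF),
}

for length in sorted(PAYLOAD_BITS):
    patterns = 2 ** PAYLOAD_BITS[length]
    first, last = RANGES[length]
    used = last - first + 1
    print(f"{length} byte(s): {patterns} bit patterns, "
          f"U+{first:04X}..U+{last:04X} = {used} code points")
```

The counts differ because shorter forms take priority: a value that fits in fewer bytes must not be written with more (so-called overlong forms are invalid), and the 4-byte form is capped at U+10FFFF.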
9:37 3-byte UTF-8 sequences are used mostly for Asian symbols; without the control bits, 16 bits are left for the code point
You should make a video on how to replace or add more RAM to your PC, or other parts.
Funny you should say that - I plan to do more practical hardware stuff in the future.
Great explanation U+1F60D
U+1F633 :)KD
Final tutorial.
Sir 🙇♂️🙇♂️🙇♂️
coded character sets, history and development / Ascii Art / Emoji, smile, emoticon.
A very professional video, but it does not explain how to get them! For example, the SKULL AND CROSSBONES is --- echo -e "\xE2\x98\xA0" ---, and while I have found lists, they're in binary and hex and not in whatever format this is. Is there a list of them?
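The `\xE2\x98\xA0` format in that command is simply the character's UTF-8 bytes written as hex escapes, so it can be derived from any list of code points (the Unicode Consortium publishes code charts). A minimal Python sketch; `shell_escape` is a hypothetical helper name, not a standard function:

```python
import unicodedata


def shell_escape(ch: str) -> str:
    """Return the UTF-8 bytes of ch as \\xHH escapes (hypothetical helper)."""
    return "".join(f"\\x{b:02X}" for b in ch.encode("utf-8"))


# Look the character up by its official Unicode name.
skull = unicodedata.lookup("SKULL AND CROSSBONES")
print(f"U+{ord(skull):04X}", shell_escape(skull))  # -> U+2620 \xE2\x98\xA0
```

Any other character can be passed to the same helper, for example `shell_escape("€")`.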
I have a question , please
Please ask - I'll try my best :)KD
@@ComputerScienceLessons I have an idea and I want to do it by java or c or python , could you help me, please ?
Do you have an account on social media?
The goals starting from 6:41 are actually not entirely correct.
Unicode is “kinda” compatible with ASCII, in the sense that Unicode codepoint 65 (A) corresponds to ASCII code 65(A), for example.
However, Unicode does not mandate that codepoint 65 is *necessarily* stored as 1-byte code 65. This is a goal of UTF-8 instead.
UTF-16 and UTF-32 comply with the Unicode standard but don’t store codepoint 65 as a 1-byte value.
This video makes the same mistake multiple times, conflating Unicode and UTF-8, which are different things and serve different purposes: Unicode assigns code points to characters, while UTF-8 is one way of storing those code points as bytes.
Yeah brother, I watched this video a few days ago, before I knew anything about this topic, but after learning more about it I can see how much wrong information is in it.
Unicode is the collection of code points that represent characters.
UTF-8, UTF-16 or UTF-32 is the representation of these code points in memory.
So UTF-8 is a way to encode Unicode using a multibyte encoding scheme.
UTF-32 is fixed size in memory, I think (4 bytes per code point).
I'm still reading about UTF-16 to understand how it is represented in memory ....
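On the open question about UTF-16: it uses 16-bit code units, one unit for code points up to U+FFFF and a surrogate pair (two units) for anything above. A rough Python sketch, where `utf16_units` is an illustrative name and the result is checked against the built-in codec:

```python
def utf16_units(ch: str) -> list[int]:
    """Return the UTF-16 code units for one character (illustrative helper)."""
    cp = ord(ch)
    if cp < 0x10000:
        return [cp]                   # fits in a single 16-bit unit
    cp -= 0x10000                     # 20 bits remain
    return [0xD800 | (cp >> 10),      # high surrogate: top 10 bits
            0xDC00 | (cp & 0x3FF)]    # low surrogate: bottom 10 bits


for ch in ("A", "€", "😍"):
    units = utf16_units(ch)
    # Compare with Python's own big-endian UTF-16 encoder.
    assert b"".join(u.to_bytes(2, "big") for u in units) == ch.encode("utf-16-be")
    print(f"U+{ord(ch):04X}",
          "UTF-16:", " ".join(f"{u:04X}" for u in units),
          "| UTF-32 bytes:", len(ch.encode("utf-32-be")))
```

So UTF-16 is variable length (2 or 4 bytes per code point), while UTF-32 always uses 4 bytes.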
lucid explanation!
Thank you :)KD
8:47 2-byte UTF-8 sequences are used for European and Arabic languages; without the control bits, 11 bits are left for the code point
I don't understand. This is confusing and I need to do my homework, but it is going too fast. SO CONFUSING
It goes quickly but took several hours to make. If you are new to this, you should take your time with it and perhaps make a few notes. Make sure you understand the basics of binary integers first.
Your channel is amazing and I'm really interested in learning more about computers, can you please recommend books, websites or other CZcams channels from where I can learn about computers with this depth. Praise from India 🇮🇳
6:06 Unicode PUA
6:09 Supplementary PUA-A
How do I convert ASCII to a Tamil font? Example: ÂáÇè ¸ÛÆèÂÝÂá ÔÚÇÌè / ѽ (Tamil word)