Given the following UTF-8 byte stream:
0x63 0xec 0xa7 0x93 0x33 0xd3 0xa9
what are the unicode character code points encoded by the above stream?
Solution:
Each unicode character is represented by a unicode code
point which is an integer value. The code point integer values
go from 0
to 10FFFF
(in hexadecimal
encoding).. A code point is an integer value that uniquely
identifies the given character. Unicode characters can be encoded
using different encodings, like UTF-8 . Below is the table showing
the same:
UTF-8 byte stream | Character | Decimal code point | Hex UTF-8 bytes |
0X63 | c | 99 | 63 |
0Xec | i | 236 | C3 AC |
0Xa7 | § (SECTION SIGN) | 167 | C2 A7 |
0x93 | " | 147 | C2 93 |
0x33 | 3 | 51 | 33 |
0xd3 | Ó(O WITH ACUTE) | 211 | C3 93 |
0xa9 | © | COPYRIGHT SIGN |
C2 A9 |
Get Answers For Free
Most questions answered within 1 hours.