What bit string is a character (or string of characters) encoded to in each character set?
Enter a single character or a short string and click "Convert."
Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??f??T??f | 001111110011111101100110001111110011111101010100001111110011111101100110 | 3f3f663f3f543f3f66 |
SJIS-WIN | ??f??T??f | 001111110011111101100110001111110011111101010100001111110011111101100110 | 3f3f663f3f543f3f66 |
EUC-JP | ??f??T??f | 001111110011111101100110001111110011111101010100001111110011111101100110 | 3f3f663f3f543f3f66 |
UTF-8 | 숰쉃f숰쉃T숰쉃f | 111011001000100010110000111011001000100110000011011001101110110010001000101100001110110010001001100000110101010011101100100010001011000011101100100010011000001101100110 | ec88b0ec898366ec88b0ec898354ec88b0ec898366 |
UHC | 숰쉃f숰쉃T숰쉃f | 100110100100100010011010010110000110011010011010010010001001101001011000010101001001101001001000100110100101100001100110 | 9a489a58669a489a58549a489a5866 |
SJIS-WIN, EUC-JP: classic character sets used mainly for encoding Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
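
The conversion shown in the table can be reproduced roughly as follows. This is a minimal sketch, not the converter's actual implementation: the function name `to_bit_strings` and the mapping from the charset labels above to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions, and Python's codec tables may differ from the converter's in edge cases. Characters that a charset cannot represent are replaced with `?` (0x3f), which is what the ISO-8859-1, SJIS-WIN, and EUC-JP rows show for the Hangul characters.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return bits, data.hex()


if __name__ == "__main__":
    # Same input as the table: U+C230 U+C243 "f" U+C230 U+C243 "T" U+C230 U+C243 "f"
    sample = "\uc230\uc243f\uc230\uc243T\uc230\uc243f"
    for label, codec in CODECS.items():
        bits, hexa = to_bit_strings(sample, codec)
        print(f"{label} | {bits} | {hexa}")
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.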