To what bit string is a character (or a short string) encoded in each character set?
Enter a single character or a short string and click "Convert."
Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??????E | 00111111001111110011111100111111001111110011111101000101 | 3f3f3f3f3f3f45 |
SJIS-WIN | 嚥??嚥?、E | 10011010100010110011111100111111100110101000101100111111100000010100000101000101 | 9a8b3f3f9a8b3f814145 |
EUC-JP | 嚥??嚥?、E | 11010011111010110011111100111111110100111110101100111111101000011010001001000101 | d3eb3f3fd3eb3fa1a245 |
UTF-8 | 嚥좂왎嚥좄、E | 11100101100110101010010111101100101000101000001011101100100110011000111011100101100110101010010111101100101000101000010011100011100000001000000101000101 | e59aa5eca282ec998ee59aa5eca284e3808145 |
UHC | 嚥좂왎嚥좄、E | 11100110101111111010000011100111100111101011111111100110101111111010000011101000101000011010001001000101 | e6bfa0e79ebfe6bfa0e8a1a245 |
SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
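
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's `cp932` codec corresponds to SJIS-WIN and `cp949` to UHC, and that characters a charset cannot represent are replaced by `?` (0x3f), which is consistent with the `3f` bytes in the table. The names `CHARSETS`, `encode_to_bits`, and `convert` are hypothetical.

```python
# Mapping from the charset labels used in the table to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_to_bits(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("、E").items():
        print(f"{name} | {binary} | {hexadecimal} |")
```

For the input `、E`, this sketch yields the hexadecimal strings `814145` for SJIS-WIN and `e3808145` for UTF-8, matching the trailing bytes of the corresponding rows in the table.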