What bit string is a character (or short string) encoded to in each character set?
Enter a single character or a short string and click "Convert."
Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ???A?? | 001111110011111100111111010000010011111100111111 | 3f3f3f413f3f |
SJIS-WIN | ?嬋先A線禪 | 00111111100110110110100010010000111001100100000110010000111111001110001001010111 | 3f9b6890e64190fce257 |
EUC-JP | 瑄嬋先A線禪 | 100011111100110010111001110101011100100111000000111010000100000111000000111111101110001110111000 | 8fccb9d5c9c0e841c0fee3b8 |
UTF-8 | 瑄嬋先A線禪 | 11100111100100011000010011100101101011001000101111100101100001011000100001000001111001111011011110011010111001111010011010101010 | e79184e5ac8be5858841e7b79ae7a6aa |
UHC | 瑄嬋先A線禪 | 1110000011000101111000001011110111100000101110110100000111100000110010101110000011001001 | e0c5e0bde0bb41e0cae0c9 |
SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
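
A conversion like the one in the table can be approximated in a few lines of Python. This is only a sketch, not the code behind this page: the helper name `to_bit_strings` is hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes in the ISO-8859-1 and SJIS-WIN rows. For the legacy charsets, Python's codec tables may differ from this page's in edge cases.

```python
def to_bit_strings(text, charset):
    """Encode text in the given charset; return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset lacks.
    data = text.encode(charset, errors="replace")
    # Render each byte as 8 binary digits and concatenate them.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("瑄嬋先A線禪", codec)
        print(f"{label} | {binary} | {hexadecimal}")
```

Calling `to_bit_strings("A", "utf-8")` returns `("01000001", "41")`, the same single byte that appears for `A` in every row of the table.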