To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ?????? | 001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f |
SJIS-WIN | 誤▲?岳?─ | 10001100111010111000000110100011001111111000101001111000001111111000010010011111 | 8ceb81a33f8a783f849f |
EUC-JP | 誤▲?岳?─ | 10111000111011011010001010100101001111111011001111011001001111111010100010100001 | b8eda2a53fb3d93fa8a1 |
UTF-8 | 誤▲굤岳귟─ | 111010001010101010100100111000101001011010110010111010101011010110100100111001011011001010110011111010101011011110011111111000101001010010000000 | e8aaa4e296b2eab5a4e5b2b3eab79fe29480 |
UHC | 誤▲굤岳귟─ | 111010001010011010100001111000111000001010001010111001001011111110000010111010001010011010100001 | e8a6a1e3828ae4bf82e8a6a1 |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)