To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ????? | 0011111100111111001111110011111100111111 | 3f3f3f3f3f |
SJIS-WIN | ??醇沼? | 00111111001111111000111110000110100011111100000000111111 | 3f3f8f868fc03f |
EUC-JP | ?宬醇沼? | 001111111000111110111010110100111011110111100110101111101100001000111111 | 3f8fbad3bde6bec23f |
UTF-8 | 葉宬醇沼敾 | 111011111010010110101110111001011010111010101100111010011000011010000111111001101011001010111100111001101001010110111110 | efa5aee5aeace98687e6b2bce695be |
UHC | 葉宬醇沼敾 | 11100000111100011110000011110100111000101111010111100001101110111110000011000000 | e0f1e0f4e2f5e1bbe0c0 |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)