To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??[??[^ | 00111111001111110101101100111111001111110101101101011110 | 3f3f5b3f3f5b5e |
SJIS-WIN | 耿耿[耿耿[^ | 1110001111010100111000111101010001011011111000111101010011100011110101000101101101011110 | e3d4e3d45be3d4e3d45b5e |
EUC-JP | 耿耿[耿耿[^ | 1110011011010110111001101101011001011011111001101101011011100110110101100101101101011110 | e6d6e6d65be6d6e6d65b5e |
UTF-8 | 耿耿[耿耿[^ | 111010001000000010111111111010001000000010111111010110111110100010000000101111111110100010000000101111110101101101011110 | e880bfe880bf5be880bfe880bf5b5e |
UHC | 耿耿[耿耿[^ | 1100110011101010110011001110101001011011110011001110101011001100111010100101101101011110 | cceaccea5bcceaccea5b5e |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)