What bit string is a character (or short string) encoded to in each character set?
Enter a single character or a short string and click "Convert."
Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??¨??¨B | 00111111001111111010100000111111001111111010100001000010 | 3f3fa83f3fa842 |
SJIS-WIN | 猶?¨猶?¨B | 1001011101010000001111111000000101001110100101110101000000111111100000010100111001000010 | 97503f814e97503f814e42 |
EUC-JP | 猶?¨猶?¨B | 1100110110110001001111111010000110101111110011011011000100111111101000011010111101000010 | cdb13fa1afcdb13fa1af42 |
UTF-8 | 猶뽯¨猶뽯¨B | 1110011110001100101101101110101110111101101011111100001010101000111001111000110010110110111010111011110110101111110000101010100001000010 | e78cb6ebbdafc2a8e78cb6ebbdafc2a842 |
UHC | 猶뽯¨猶뽯¨B | 11101011101000101001011011101011101000011010011111101011101000101001011011101011101000011010011101000010 | eba296eba1a7eba296eba1a742 |
SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
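
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function names `bit_strings` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (hex 3f), which is consistent with the ISO-8859-1 row above.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" turns unencodable characters into b"?" (0x3f).
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one table-style row per charset for a short sample string.
    for label, (binary, hexadecimal) in convert("猶B").items():
        print(f"{label} | {binary} | {hexadecimal} |")
```

Each byte is formatted as eight binary digits so that the binary column always has exactly four times as many characters as the hexadecimal column, as in the table.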