What bit string is a character (or short string) encoded to in each character set?
Enter a single character or a short string and click "Convert."
Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
---|---|---|---|
ISO-8859-1 | OeOêTéáj | 10010111010011110110010110010111010011111110101001010100111010011110000101101010 | 974f65974fea54e9e16a |
SJIS-WIN | ?Oe?O?T??j | 00111111010011110110010100111111010011110011111101010100001111110011111101101010 | 3f4f653f4f3f543f3f6a |
EUC-JP | ?Oe?OêTéáj | 00111111010011110110010100111111010011111000111110101011101101000101010010001111101010111011000110001111101010111010000101101010 | 3f4f653f4f8fabb4548fabb18faba16a |
UTF-8 | OeOêTéáj | 110000101001011101001111011001011100001010010111010011111100001110101010010101001100001110101001110000111010000101101010 | c2974f65c2974fc3aa54c3a9c3a16a |
UHC | ?Oe?O?T??j | 00111111010011110110010100111111010011110011111101010100001111110011111101101010 | 3f4f653f4f3f543f3f6a |
SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
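
The page does not say how the conversion is implemented. As a rough sketch only, the same kind of table can be produced in Python with `str.encode`. The helper name `to_bit_strings` and the mapping from the charset labels above to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC, and so on) are assumptions made for this illustration. Characters that a charset cannot represent are replaced with `?` (0x3f) here, and Python's codecs may not agree byte for byte with the converter above for every input.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Hypothetical helper: return (binary, hexadecimal) strings for text in codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, concatenated without separators.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


for name, codec in CHARSETS.items():
    binary, hexadecimal = to_bit_strings("é", codec)
    print(f"{name} | {binary} | {hexadecimal}")
```

For example, `to_bit_strings("é", "utf-8")` yields the two-byte sequence `c3a9`, while `to_bit_strings("é", "latin-1")` yields the single byte `e9`, matching the bytes shown for é in the UTF-8 and ISO-8859-1 rows.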