To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | èEèE[èEèE[^ | 1110100001000101111010000100010101011011111010000100010111101000010001010101101101011110 | e845e8455be845e8455b5e |
SJIS-WIN | ?E?E[?E?E[^ | 0011111101000101001111110100010101011011001111110100010100111111010001010101101101011110 | 3f453f455b3f453f455b5e |
EUC-JP | èEèE[èEèE[^ | 10001111101010111011001001000101100011111010101110110010010001010101101110001111101010111011001001000101100011111010101110110010010001010101101101011110 | 8fabb2458fabb2455b8fabb2458fabb2455b5e |
UTF-8 | èEèE[èEèE[^ | 110000111010100001000101110000111010100001000101010110111100001110101000010001011100001110101000010001010101101101011110 | c3a845c3a8455bc3a845c3a8455b5e |
UHC | ?E?E[?E?E[^ | 0011111101000101001111110100010101011011001111110100010100111111010001010101101101011110 | 3f453f455b3f453f455b5e |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)