To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??L??L^ | 00111111001111110100110000111111001111110100110001011110 | 3f3f4c3f3f4c5e |
SJIS-WIN | 渦?L渦?L^ | 100010010101000100111111010011001000100101010001001111110100110001011110 | 89513f4c89513f4c5e |
EUC-JP | 渦?L渦?L^ | 101100011011001000111111010011001011000110110010001111110100110001011110 | b1b23f4cb1b23f4c5e |
UTF-8 | 渦캱L渦캱L^ | 111001101011100010100110111011001011101010110001010011001110011010111000101001101110110010111010101100010100110001011110 | e6b8a6ecbab14ce6b8a6ecbab14c5e |
UHC | 渦캱L渦캱L^ | 1110100010111110101100000101000101001100111010001011111010110000010100010100110001011110 | e8beb0514ce8beb0514c5e |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)