What bit string is a character (or string of characters) encoded to in each character set?
Enter a single character or a short string and click "Convert."
Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??D??D^ | 00111111001111110100010000111111001111110100010001011110 | 3f3f443f3f445e |
SJIS-WIN | ウ牘Dウ牘D^ | 1000001101000101111000001010111101000100100000110100010111100000101011110100010001011110 | 8345e0af448345e0af445e |
EUC-JP | ウ牘Dウ牘D^ | 1010010110100110111000001011000101000100101001011010011011100000101100010100010001011110 | a5a6e0b144a5a6e0b1445e |
UTF-8 | ウ牘Dウ牘D^ | 111000111000001010100110111001111000100110011000010001001110001110000010101001101110011110001001100110000100010001011110 | e382a6e7899844e382a6e78998445e |
UHC | ウ牘Dウ牘D^ | 1010101110100110110101001011101001000100101010111010011011010100101110100100010001011110 | aba6d4ba44aba6d4ba445e |
SJIS-WIN, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
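
The conversion shown in the table can be approximated outside this page. The following is a minimal Python sketch, not the converter's actual implementation. The helper `encode_row` and the mapping from the table's charset labels to Python codec names (for example `cp932` for SJIS-WIN and `cp949` for UHC) are assumptions made for illustration. Characters a charset cannot represent are replaced with `?` (0x3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unrepresentable characters.
    data = text.encode(codec, errors="replace")
    # Render every byte as eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        bits, hex_string = encode_row("ウ牘Dウ牘D^", codec)
        print(f"{label} | {bits} | {hex_string}")
```

Each byte contributes eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.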