To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ?????H | 001111110011111100111111001111110011111101001000 | 3f3f3f3f3f48 |
SJIS-WIN | 闌蛾ァ亥香H | 11101000100011001000100111101001101001111000100011100101100011011000000101001000 | e88c89e9a788e58d8148 |
EUC-JP | 闌蛾ァ亥香H | 1110111111101100101100101110101110001110101001111011000011100111101110011110000101001000 | efecb2eb8ea7b0e7b9e148 |
UTF-8 | 闌蛾ァ亥香H | 11101001100101111000110011101000100110111011111011101111101111011010011111100100101110101010010111101001101001101001100101001000 | e9978ce89bbeefbda7e4baa5e9a69948 |
UHC | ?蛾?亥香H | 001111111110010010110110001111111111101010100100111110101100010101001000 | 3fe4b63ffaa4fac548 |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)