To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ?????? | 001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f |
SJIS-WIN | 幼?誣?擺り | 10010111011000110011111111100110011101110011111110011101101100101000001011101000 | 97633fe6773f9db282e8 |
EUC-JP | 幼?誣?擺り | 11001101110001000011111111101011110110000011111111011010101101001010010011101010 | cdc43febd83fdab4a4ea |
UTF-8 | 幼쯩誣뤈擺り | 111001011011100110111100111011001010111110101001111010001010101010100011111010111010010010001000111001101001001110111010111000111000001010001010 | e5b9bcecafa9e8aaa3eba488e693bae3828a |
UHC | 幼쯩誣뤈擺り | 111010101110101011000010111011011101100111110100100011111011100011110111111011001010101011101010 | eaeac2edd9f48fb8f7ecaaea |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)