To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??}??{^ | 00111111001111110111110100111111001111110111101101011110 | 3f3f7d3f3f7b5e |
SJIS-WIN | 貶?}貶?{^ | 111001101100100000111111011111011110011011001000001111110111101101011110 | e6c83f7de6c83f7b5e |
EUC-JP | 貶?}貶?{^ | 111011001100101000111111011111011110110011001010001111110111101101011110 | ecca3f7decca3f7b5e |
UTF-8 | 貶줻}貶줻{^ | 111010001011001010110110111011001010010010111011011111011110100010110010101101101110110010100100101110110111101101011110 | e8b2b6eca4bb7de8b2b6eca4bb7b5e |
UHC | 貶줻}貶줻{^ | 1111100010111111101000100110111001111101111110001011111110100010011011100111101101011110 | f8bfa26e7df8bfa26e7b5e |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)