What bit string is a character (or a short string) encoded to in each character set?
Enter a single character or a short string and click "Convert."
Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??[??[^ | 00111111001111110101101100111111001111110101101101011110 | 3f3f5b3f3f5b5e |
SJIS-WIN | 癇爛[癇爛[^ | 1110000110010010111000001010001101011011111000011001001011100000101000110101101101011110 | e192e0a35be192e0a35b5e |
EUC-JP | 癇爛[癇爛[^ | 1110000111110010111000001010010101011011111000011111001011100000101001010101101101011110 | e1f2e0a55be1f2e0a55b5e |
UTF-8 | 癇爛[癇爛[^ | 111001111001100110000111111001111000100010011011010110111110011110011001100001111110011110001000100110110101101101011110 | e79987e7889b5be79987e7889b5b5e |
UHC | ?爛[?爛[^ | 001111111101010110110100010110110011111111010101101101000101101101011110 | 3fd5b45b3fd5b45b5e |
SJIS-WIN, EUC-JP: Legacy charsets used mainly for Japanese text, SJIS-WIN (= CP932) on Windows and EUC-JP on UNIX.
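
The conversion in the table can be roughly reproduced offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the charset names above to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and `CHARSETS`, `to_bits`, and `encode_row` are hypothetical names chosen for illustration. Characters a charset cannot represent are replaced with `?` via `errors="replace"`, which appears to match the `?` entries in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset names to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(data: bytes) -> str:
    """Render each byte as eight binary digits, concatenated."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_row(text: str, codec: str) -> tuple[str, str, str]:
    """Return (text as the charset sees it, binary string, hex string)."""
    # Unencodable characters become "?" instead of raising an error.
    data = text.encode(codec, errors="replace")
    shown = data.decode(codec, errors="replace")
    return shown, to_bits(data), data.hex()


if __name__ == "__main__":
    sample = "癇爛[癇爛[^"
    for name, codec in CHARSETS.items():
        shown, bits, hexed = encode_row(sample, codec)
        print(f"{name} | {shown} | {bits} | {hexed} |")
```

Codec tables can differ slightly between implementations, so output for the legacy charsets may not match this page byte for byte in every case.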