What bit string is a character (or short string) encoded to in each character set?
Enter a single character or a short string and click "Convert."
Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ?o[?o[^ | 00111111011011110101101100111111011011110101101101011110 | 3f6f5b3f6f5b5e |
SJIS-WIN | 達o[達o[^ | 100100100100001001101111010110111001001001000010011011110101101101011110 | 92426f5b92426f5b5e |
EUC-JP | 達o[達o[^ | 110000111010001101101111010110111100001110100011011011110101101101011110 | c3a36f5bc3a36f5b5e |
UTF-8 | 達o[達o[^ | 1110100110000001100101000110111101011011111010011000000110010100011011110101101101011110 | e981946f5be981946f5b5e |
UHC | 達o[達o[^ | 110100111011100101101111010110111101001110111001011011110101101101011110 | d3b96f5bd3b96f5b5e |
SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
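
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The function name `encode_table` and the `CODECS` mapping are hypothetical, and the Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumed to be close equivalents of the charsets listed above. Characters that a charset cannot represent are replaced with `?`, which mirrors the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for `text`."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (binary, data.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexa) in encode_table("達").items():
        print(f"{label} | {binary} | {hexa}")
```

Calling `encode_table` with the same input as the table should give comparable rows, although edge cases (vendor extensions, unmappable characters) may differ between Python's codecs and the converter.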