To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 闕オ諤懷す髮包スオ 111010001000110110110101111001101000000010011100111001011000001010110111111010011001101110010101111011111011110110110101 e88db5e6809ce582b7e99b95efbdb5
EUC-JP 闕オ諤懷す髮包スオ 111011111110110110001110101101011110101111100000110110001110011110100100101110011111000111111011110010101111000110001110101111011000111010110101 efed8eb5ebe0d8e7a4b9f1fbcaf18ebd8eb5
UTF-8 闕オ諤懷す髮包スオ 111010011001011110010101111011111011110110110101111010001010101110100100111001101000011110110111111000111000000110011001111010011010101110101110111001011000110010000101111011111011110110111101111011111011110110110101 e99795efbdb5e8aba4e687b7e38199e9abaee58c85efbdbdefbdb5
UHC 闕??懷す髮包?? 1100111111110100001111110011111111111100111000111010101010111001110110111010010111111000110100000011111100111111 cff43f3ffce3aab9dba5f8d03f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)