To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 耳?珥?衆??? 1000111010101000001111111110000011100000001111111000111101001111001111110011111100111111 8ea83fe0e03f8f4f3f3f3f
EUC-JP 耳?珥?衆??? 1011110010101010001111111110000011100010001111111011110110110000001111110011111100111111 bcaa3fe0e23fbdb03f3f3f
UTF-8 耳렲珥렮衆縷렮렪 111010001000000010110011111010111010000010110010111001111000111110100101111010111010000010101110111010001010000110000110111011111010010110010000111010111010000010101110111010111010000010101010 e880b3eba0b2e78fa5eba0aee8a186efa590eba0aeeba0aa
UHC 耳렲珥렮衆縷렮렪 11101100101111001000111010111111111011001011010010001110101110111111000111101011110100101110101010001110101110111000111010111000 ecbc8ebfecb48ebbf1ebd2ea8ebb8eb8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)