What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 淀?昱???? 100101111000010000111111111110100110001100111111001111110011111100111111 97843ffa633f3f3f3f
EUC-JP 淀?昱???? 11001101111001000011111110001111110000101010110100111111001111110011111100111111 cde43f8fc2ad3f3f3f3f
UTF-8 淀렯昱계렊쯔렡 111001101011011110000000111010111010000010101111111001101001100010110001111010101011001110000100111010111010000010001010111011001010111110010100111010111010000010100001 e6b780eba0afe698b1eab384eba08aecaf94eba0a1
UHC 淀렯昱계렊쯔렡 1110111111100011100011101011110011101001111100001011000011101000100011101010000111000010111010101000111010110010 efe38ebce9f0b0e88ea1c2ea8eb2

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent are shown as "?" (0x3f).
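
A conversion like the one in the table can be approximated with a short Python sketch. This is not the converter's actual implementation: the function name encode_row and the CODECS mapping are made up for illustration, and the mapping assumes that Python's cp932 and cp949 codecs correspond to SJIS-WIN and UHC. Byte sequences may differ from the table for characters that the two implementations map differently.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, charset):
    """Return (binary string, hex string) for text encoded in charset."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset
    # cannot represent, which mirrors the 3f bytes seen in the table.
    data = text.encode(CODECS[charset], errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    for name in CODECS:
        bits, hex_string = encode_row("淀", name)
        print(name, bits, hex_string)
```

For example, encode_row("淀", "UTF-8") returns the three bytes e6 b7 80, the same prefix as the UTF-8 row above, while encode_row("淀", "ISO-8859-1") returns a single 3f because Latin-1 has no such character.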