What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 庸??已??蚓 10010111011001100011111100111111100110111101111100111111001111111110010101101101 97663f3f9bdf3f3fe56d
EUC-JP 庸??已??蚓 11001101110001110011111100111111110101101110000100111111001111111110100111001110 cdc73f3fd6e13f3fe9ce
UTF-8 庸뉕퍜已뜹룛蚓 111001011011101010111000111010111000100110010101111011011000110110011100111001011011011110110010111010111001110010111001111010111010001110011011111010001001101010010011 e5bab8eb8995ed8d9ce5b7b2eb9cb9eba39be89a93
UHC 庸뉕퍜已뜹룛蚓 1110100110111100100001111110101010111011100100111110110010101011101101101110010110001111100101111110110011100010 e9bc87eabb93ecabb6e58f97ece2

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
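
The same kind of conversion can be roughly reproduced offline. The following is a minimal sketch in Python, not the code behind this page. The function name `encode_table` and the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions, and Python's codecs may differ from this converter for some edge-case characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes seen in the table.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("庸"):
        print(label, bits, hexstr)
```

For the single character 庸, this sketch gives `9766` for SJIS-WIN, `cdc7` for EUC-JP, and `e5bab8` for UTF-8, the same leading bytes as in the table rows above.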