What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 瓮??野??鴉? 1110000101000100001111110011111110010110111011000011111100111111111010011110101100111111 e1443f3f96ec3f3fe9eb3f
EUC-JP 瓮??野??鴉? 1110000110100101001111110011111111001100111011100011111100111111111100101110110100111111 e1a53f3fccee3f3ff2ed3f
UTF-8 瓮뚦츍野듣툑鴉곫 111001111001001110101110111010111001101010100110111011001011100010001101111010011000011110001110111010111001001110100011111011011000100010010001111010011011010010001001111010101011001110101011 e793aeeb9aa6ecb88de9878eeb93a3ed8891e9b489eab3ab
UHC 瓮뚦츍野듣툑鴉곫 11101000101101111000110011100101101011101000100011100101101011111011010111101000101110001000100011100100101111001000000111100110 e8b78ce5ae88e5afb5e8b888e4bc81e6

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
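
The same kind of conversion can be sketched in Python. This is a rough illustration, not the code behind this page: the mapping from the charset labels above to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the helper names to_bit_strings and convert are made up for the example. Characters a charset cannot represent are replaced with "?" (hex 3f), which resembles the 3f bytes in the table above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 as implemented by Python
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as code page 949
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CHARSETS."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("A").items():
        print(label, binary, hexadecimal)
```

Because all five charsets are ASCII-compatible, a plain "A" comes out as 01000001 (hex 41) in every row; the rows only differ once the input contains non-ASCII characters.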