To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????h? 0011111100111111001111110011111100111111001111110110100000111111 3f3f3f3f3f3f683f
SJIS-WIN 瓮??鈺??h瓮 1110000101000100001111110011111111111011110001000011111100111111011010001110000101000100 e1443f3ffbc43f3f68e144
EUC-JP 瓮??鈺??h瓮 111000011010010100111111001111111000111111100011110101010011111100111111011010001110000110100101 e1a53f3f8fe3d53f3f68e1a5
UTF-8 瓮쏉쉘鈺롳슁h瓮 11100111100100111010111011101100100011111000100111101100100010011001100011101001100010001011101011101011101000011011001111101100100010101000000101101000111001111001001110101110 e793aeec8f89ec8998e988baeba1b3ec8a8168e793ae
UHC 瓮쏉쉘鈺롳슁h瓮 111010001011011110011011111011111011110110101001111010001010110110001110111011111011110110110011011010001110100010110111 e8b79befbda9e8ad8eefbdb368e8b7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)