To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 淏」貉ソ跂懃明 111110110100001010100011111001101011100110111111111001101110001110011100111001111001011010111110 fb42a3e6b9bfe6e39ce796be
EUC-JP 淏」貉ソ跂懃明 100011111100011111011001100011101010001111101100101110111000111010111111111011001110010111011000111010011100110011000000 8fc7d98ea3ecbb8ebfece5d8e9ccc0
UTF-8 淏」貉ソ跂懃明 111001101011011110001111111011111011110110100011111010001011001010001001111011111011110110111111111010001011011110000010111001101000011110000011111001101001100010001110 e6b78fefbda3e8b289efbdbfe8b782e68783e6988e
UHC 淏????懃明 11111011110010000011111100111111001111110011111111010000110001001101100110100101 fbc83f3f3f3fd0c4d9a5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)