To what bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f |
| SJIS-WIN | 陬俶焔顋瑚オュ | 111010001010001110011000111001101000100110001011111010001111100110001100111010001011010110101101 | e8a398e6898be8f98ce8b5ad |
| EUC-JP | 陬俶焔顋瑚オュ | 1111000010100101110100001110100010110001111010111111000011111011101110001110101010001110101101011000111010101101 | f0a5d0e8b1ebf0fbb8ea8eb58ead |
| UTF-8 | 陬俶焔顋瑚オュ | 111010011001100110101100111001001011111110110110111001111000010010010100111010011010000110001011111001111001000110011010111011111011110110110101111011111011110110101101 | e999ace4bfb6e78494e9a18be7919aefbdb5efbdad |
| UHC | ????瑚?? | 0011111100111111001111110011111111111011110100010011111100111111 | 3f3f3f3ffbd13f3f |

SJIS-Win, EUC-JP: Classic charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
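
The page does not say how the conversion is implemented. As a rough illustration only, the same kind of result can be produced with a minimal Python sketch. The codec names in `CODECS` and the helper `to_bit_strings` are assumptions made for this example (in particular, mapping SJIS-WIN to `cp932` and UHC to `cp949`); they are not taken from the tool. Characters a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes seen in the ISO-8859-1 and UHC rows.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win treated as CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as CP949
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for `text` encoded with `codec`."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("あ", codec)
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight bits, so the binary column is always eight times as long as the byte count and four times as long as the hexadecimal column.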