To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 韈、莨應鬟雎「 111010001110011110100100111001001011110010011100111001001110100110100011111010001011000110100010 e8e7a4e4bc9ce4e9a3e8b1a2
EUC-JP 韈、莨應鬟雎「 1111000011101001100011101010010011101000101111101101100011100110111100101010010111110000101100111000111010100010 f0e98ea4e8bed8e6f2a5f0b38ea2
UTF-8 韈、莨應鬟雎「 111010011001111110001000111011111011110110100100111010001000111010101000111001101000011110001001111010011010110010011111111010011001101110001110111011111011110110100010 e99f88efbda4e88ea8e68789e9ac9fe99b8eefbda2
UHC ???應?雎? 001111110011111100111111111010111110101100111111111011101101000100111111 3f3f3febeb3feed13f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)