To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 與??霓??認 11100100011011110011111100111111111010001011110100111111001111111001010001000110 e46f3f3fe8bd3f3f9446
EUC-JP 與??霓??認 11100111110100000011111100111111111100001011111100111111001111111100011110100111 e7d03f3ff0bf3f3fc7a7
UTF-8 與잙젫霓뉐굄認 111010001000100010000111111011001001111010011001111011001010000010101011111010011001110010010011111010111000100110010000111010101011010110000100111010001010101010001101 e88887ec9e99eca0abe99c93eb8990eab584e8aa8d
UHC 與잙젫霓뉐굄認 1110011010101000100111111110101110100000101000111110011111100111100001111110010110110001101011111110110011100011 e6a89feba0a3e7e787e5b1afece3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)