What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | ????hぜ猶 | 00111111001111110011111100111111100000101000100010000010101110101001011101010000 | 3f3f3f3f828882ba9750
EUC-JP | ????hぜ猶 | 00111111001111110011111100111111101000111110100010100100101111001100110110110001 | 3f3f3f3fa3e8a4bccdb1
UTF-8 | 閱묐떯璘hぜ猶 | 111010011001011010110001111010111010110010010000111010111001011010101111111011111010011110101111111011111011110110001000111000111000000110011100111001111000110010110110 | e996b1ebac90eb96afefa7afefbd88e3819ce78cb6
UHC | 閱묐떯璘hぜ猶 | 1110011011110011100100011110101110001011101111111110110011011110101000111110100010101010101111001110101110100010 | e6f391eb8bbfecdea3e8aabceba2

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
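
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the tool's own code: the codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper names `encode_row` and `convert` are assumptions chosen for illustration, and Python's codecs may map a few characters differently from the tool. Characters a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` runs in the table.

```python
# Charset label -> Python codec name. These pairings are assumptions,
# not something the page confirms.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hexstr) in convert("ぜ").items():
        print(name, bits, hexstr)
```

Each byte is formatted as eight binary digits, so the binary column is always eight times the length of the byte sequence and twice as many hex digits appear in the last column.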