To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鈺????????娃 111110111100010000111111001111110011111100111111001111110011111100111111001111111000100010100001 fbc43f3f3f3f3f3f3f3f88a1
EUC-JP 鈺????????娃 10001111111000111101010100111111001111110011111100111111001111110011111100111111001111111011000010100011 8fe3d53f3f3f3f3f3f3f3fb0a3
UTF-8 鈺됧넪溜곕젺燎뽯젪娃 111010011000100010111010111010111001000010100111111010111000010010101010111011111010011110001011111010101011001110010101111011001010000010111010111011111010011110000000111010111011110110101111111011001010000010101010111001011010100010000011 e988baeb90a7eb84aaefa78beab395eca0baefa780ebbdafeca0aae5a883
UHC 鈺됧넪溜곕젺燎뽯젪娃 1110100010101101100010011110010110000110101010101110101011111110101100001110101110100000101011011110100011111011100101101110101110100000101000101110100011011111 e8ad89e586aaeafeb0eba0ade8fb96eba0a2e8df

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)