To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
SJIS-WIN 淨??拒v淨??拒vB 100111111100010000111111001111111000101110010001011101101001111111000100001111110011111110001011100100010111011001000010 9fc43f3f8b91769fc43f3f8b917642
EUC-JP 淨?饔拒v淨?饔拒vB 11011110110001100011111110001111111010001110111110110101111100010111011011011110110001100011111110001111111010001110111110110101111100010111011001000010 dec63f8fe8efb5f176dec63f8fe8efb5f17642
UTF-8 淨렠饔拒v淨렠饔拒vB 111001101011011110101000111010111010000010100000111010011010010110010100111001101000101110010010011101101110011010110111101010001110101110100000101000001110100110100101100101001110011010001011100100100111011001000010 e6b7a8eba0a0e9a594e68b9276e6b7a8eba0a0e9a594e68b927642
UHC 淨렠饔拒v淨렠饔拒vB 11101111111001001000111010110001111010001011110111001011110111100111011011101111111001001000111010110001111010001011110111001011110111100111011001000010 efe48eb1e8bdcbde76efe48eb1e8bdcbde7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)