To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 淨???工製?虞??耿??? 10011111110001000011111100111111001111111000110101001000100100001011101100111111100010111111000100111111001111111110001111010100001111110011111100111111 9fc43f3f3f8d4890bb3f8bf13f3fe3d43f3f3f
EUC-JP 淨???工製?虞?饔耿??橒 1101111011000110001111110011111100111111101110011010100111000000101111010011111110110110111100110011111110001111111010001110111111100110110101100011111100111111100011111100010110101101 dec63f3f3fb9a9c0bd3fb6f33f8fe8efe6d63f3f8fc5ad
UTF-8 淨렠易쇠工製렩虞렧饔耿렟렩橒 111001101011011110101000111010111010000010100000111011111010011110100000111011001000011110100000111001011011011110100101111010001010001110111101111010111010000010101001111010001001100110011110111010111010000010100111111010011010010110010100111010001000000010111111111010111010000010011111111010111010000010101001111001101010100110010010 e6b7a8eba0a0efa7a0ec87a0e5b7a5e8a3bdeba0a9e8999eeba0a7e9a594e880bfeba09feba0a9e6a992
UHC 淨렠易쇠工製렩虞렧饔耿렟렩橒 11101111111001001000111010110001111011001010111110111100111010001100110111101111111100001011001010001110101101111110100111100101100011101011011011101000101111011100110011101010100011101011000010001110101101111110100111111000 efe48eb1ecafbce8cdeff0b28eb7e9e58eb6e8bdccea8eb08eb7e9f8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)