To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 縡?淨?鬱屯??畯陌???畯 111000110111000100111111100111111100010000111111100111110101010010010011110101000011111100111111111110110110111111101000100110010011111100111111001111111111101101101111 e3713f9fc43f9f5493d43f3ffb6fe8993f3f3ffb6f
EUC-JP 縡?淨?鬱屯??畯陌汶??畯 11100101110100100011111111011110110001100011111111011101101101011100011011010110001111110011111110001111110011011011101111101111111110011000111111000110111001010011111100111111100011111100110110111011 e5d23fdec63fddb5c6d63f3f8fcdbbeff98fc6e53f3f8fcdbb
UTF-8 縡렕淨렠鬱屯렕렟畯陌汶履렰畯 111001111011100010100001111010111010000010010101111001101011011110101000111010111010000010100000111010011010110010110001111001011011000110101111111010111010000010010101111010111010000010011111111001111001010110101111111010011001100110001100111001101011000110110110111011111010011110011111111010111010000010110000111001111001010110101111 e7b8a1eba095e6b7a8eba0a0e9acb1e5b1afeba095eba09fe795afe9998ce6b1b6efa79feba0b0e795af
UHC 縡렕淨렠鬱屯렕렟畯陌汶履렰畯 11101110101011011000111010101010111011111110010010001110101100011110101010100110110101001110101010001110101010101000111010110000111100011110000111011000111010001101101010100001111011001010101010001110101111011111000111100001 eead8eaaefe48eb1eaa6d4ea8eaa8eb0f1e1d8e8daa1ecaa8ebdf1e1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)