To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??宜??儀??齬???癲??魏??日 111000011001111100111111001111111000101101011000001111110011111110001011010101100011111100111111111010101001011100111111001111110011111111100001100111110011111100111111111010011011000000111111001111111001001111111010 e19f3f3f8b583f3f8b563f3fea973f3f3fe19f3f3fe9b03f3f93fa
EUC-JP 癲??宜??儀??齬???癲??魏??日 111000101010000100111111001111111011010110111001001111110011111110110101101101110011111100111111111100111111011100111111001111110011111111100010101000010011111100111111111100101011001000111111001111111100011011111100 e2a13f3fb5b93f3fb5b73f3ff3f73f3f3fe2a13f3ff2b23f3fc6fc
UTF-8 癲덈챶宜룝슭儀뤿겱齬읪됰즴癲쒖슜魏뚦봅日 111001111001100110110010111010111000110110001000111011001011000110110110111001011010111010011100111010111010001110011101111011001000101010101101111001011000010010000000111010111010010010111111111010101011001010110001111010011011110110101100111011001001110110101010111010111001000010110000111011001010011010110100111001111001100110110010111011001001001010010110111011001000101010011100111010011010110110001111111010111001101010100110111010111011010010000101111001101001011110100101 e799b2eb8d88ecb1b6e5ae9ceba39dec8aade58480eba4bfeab2b1e9bdacec9daaeb90b0eca6b4e799b2ec9296ec8a9ce9ad8feb9aa6ebb485e697a5
UHC 癲덈챶宜룝슭儀뤿겱齬읪됰즴癲쒖슜魏뚦봅日 11101111101001101000100011101011101010101000001111101011111100011011011111100100101111011011111011101011111100001000111111101011100000011011110111100101111000011001111111010001100010011110101110100011100001101110111110100110100111001110110010011010101010011110101011100000100011001110010110111010101111101110110011101101 efa688ebaa83ebf1b7e4bdbeebf08feb81bde5e19fd189eba386efa69cec9aa9eae08ce5babeeced

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)