To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????×??????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111111101011100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3fd73f3f3f3f3f3f3f
SJIS-WIN 艾??誼??碎??夜?×釉ο?臾??筌 11100100100010000011111100111111100010110110001000111111001111111110000111101010001111110011111110010110111010010011111110000001011111101110011111010110100000111100110100111111111001000110101100111111001111111110001010100011 e4883f3f8b623f3fe1ea3f3f96e93f817ee7d683cd3fe46b3f3fe2a3
EUC-JP 艾??誼??碎??夜?×釉ο?臾??筌 11100111111010000011111100111111101101011100001100111111001111111110001011101100001111110011111111001100111010110011111110100001110111111110111011011000101001101100111100111111111001111100110000111111001111111110010010100101 e7e83f3fb5c33f3fe2ec3f3fcceb3fa1dfeed8a6cf3fe7cc3f3fe4a5
UTF-8 艾싲챶誼숂뜲碎몄젌夜껋×釉ο쭓臾먯젌筌 11101000100010011011111011101100100010111011001011101100101100011011011011101000101010101011110011101100100010001000001011101011100111001011001011100111101000101000111011101011101010101000010011101100101000001000110011100101101001001001110011101010101110111000101111000011100101111110100110000111100010011100111010111111111011001010110110010011111010001000011110111110111010111010100010101111111011001010000010001100111001111010110110001100 e889beec8bb2ecb1b6e8aabcec8882eb9cb2e7a28eebaa84eca08ce5a49ceabb8bc397e98789cebfecad93e887beeba8afeca08ce7ad8c
UHC 艾싲챶誼숂뜲碎몄젌夜껋×釉ο쭓臾먯젌筌 1110010011110101100110101110101110101010100000111110101111111110100110011110011110001101101100001110000111101111101110001110110010100000100011011110010110101000100000111110110010100001101111111110101110111000101001011110111110100111100010111110101110101100100100001110110010100000100011011110111110100111 e4f59aebaa83ebfe99e78db0e1efb8eca08de5a883eca1bfebb8a5efa78bebac90eca08defa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)