To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??攸??茹?????純??榮??吟? 1110000110011111001111110011111110011101101111110011111100111111111001001010010100111111001111110011111100111111001111111000111110000011001111110011111110011110110001000011111100111111100010111110000100111111 e19f3f3f9dbf3f3fe4a53f3f3f3f3f8f833f3f9ec43f3f8be13f
EUC-JP 癲??攸??茹?????純??榮??吟? 1110001010100001001111110011111111011010110000010011111100111111111010001010011100111111001111110011111100111111001111111011110111100011001111110011111111011100110001100011111100111111101101101110001100111111 e2a13f3fdac13f3fe8a73f3f3f3f3fbde33f3fdcc63f3fb6e33f
UTF-8 癲ㅺ슝攸놁굛茹띾맧杻길략純앷뎃榮붾낄吟큚 111001111001100110110010111000111000010110111010111011001000101010011101111001101001010010111000111010111000011010000001111010101011010110011011111010001000110010111001111010111001110110111110111010111010011110100111111011111010011110001000111010101011100010111000111010111001111010110101111001111011010010010100111011001001010110110111111010111000111010000011111001101010011010101110111010111011011010111110111010111000001010000100111001011001000010011111111011011000000110011010 e799b2e385baec8a9de694b8eb8681eab59be88cb9eb9dbeeba7a7efa788eab8b8eb9eb5e7b494ec95b7eb8e83e6a6aeebb6beeb8284e5909fed819a
UHC 癲ㅺ슝攸놁굛茹띾맧杻길략純앷뎃榮붾낄吟큚 11101111101001101010010011101010101111011011100111101010111100101000011011101100100000101000001111100110101010101000110111101011100100001011000011101010111101001011000111100110101101111010101111100010111011011001110111101010101101011010101111100111101101001001010011101011101100111010010111101011111000011011010001101000 efa6a4eabdb9eaf286ec8283e6aa8deb90b0eaf4b1e6b7abe2ed9deab5abe7b494ebb3a5ebe1b468

SJIS-Win, EUC-JP: Legacy charsets mainly used for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
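
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that the page's charset labels correspond to Python's built-in codecs "iso-8859-1", "cp932", "euc_jp", "utf-8", and "cp949" (for UHC), and that characters a charset cannot represent are replaced with "?" (0x3f), as the ISO-8859-1 row above suggests. The names CHARSETS and encode_table are hypothetical.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return a list of (label, binary string, hex string) per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Render each byte as 8 binary digits, concatenated without spaces.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

Under these assumptions, a character missing from a charset appears as 00111111 / 3f in that row, while each remaining row shows the byte sequence that charset assigns to the input.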