To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣?㎝濡??檍?????碎l?魚?? 00111111001111110011111110001011100000110011111110000111011100001001010001000111001111110011111110011110111110000011111100111111001111110011111100111111111000011110101010000010100011000011111110001011100110110011111100111111 3f3f3f8b833f877094473f3f9ef83f3f3f3f3fe1ea828c3f8b9b3f3f
EUC-JP ???泣??濡??檍?????碎l?魚?? 001111110011111100111111101101011110001100111111001111111100011110101000001111110011111111011100111110100011111100111111001111110011111100111111111000101110110010100011111011000011111110110101111110110011111100111111 3f3f3fb5e33f3fc7a83f3fdcfa3f3f3f3f3fe2eca3ec3fb5fb3f3f
UTF-8 嶺뚭램泣숋㎝濡⑸룏檍용쑑溜잒첑碎l몚魚좊찆 111011111010011010101011111010111001101010101101111010111001111010101000111001101011001110100011111011001000100010001011111000111000111010011101111001101011111110100001111000101001000110111000111010111010001110001111111001101010101010001101111011001001101010101001111011001001000110010001111011111010011110001011111011001001111010010010111011001011001010010001111001111010001010001110111011111011110110001100111010111010101010011010111010011010110110011010111011001010001010001010111011001011000010000110 efa6abeb9aadeb9ea8e6b3a3ec888be38e9de6bfa1e291b8eba38fe6aa8dec9aa9ec9191efa78bec9e92ecb291e7a28eefbd8cebaa9ae9ad9aeca28aecb086
UHC 嶺뚭램泣숋㎝濡⑸룏檍용쑑溜잒첑碎l몚魚좊찆 111001111010110110001100111010101011011110100101111010111110100010011001111011111010011110101111111010111010000110101001111010111000111110001101111001011110010110111111111010111001110010110000111010101111111010011111111010001010101010011110111000011110111110100011111011001001000110001000111001011110000010100000111010111010100110001010 e7ad8ceab7a5ebe899efa7afeba1a9eb8f8de5e5bfeb9cb0eafe9fe8aa9ee1efa3ec9188e5e0a0eba98a

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)