Which bit string is a character (or short string) encoded as in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???鍮?ぜ飮??阿??違??怨??沃??肄 001111110011111100111111111010000100101000111111100000101011101010011111010110100011111100111111100010001010001000111111001111111000100011100001001111110011111110001001100001010011111100111111100101111000000000111111001111111110001111100101 3f3f3fe84a3f82ba9f5a3f3f88a23f3f88e13f3f89853f3f97803f3fe3e5
EUC-JP ???鍮?ぜ飮??阿??違??怨??沃??肄 001111110011111100111111111011111010101100111111101001001011110011011101101110110011111100111111101100001010010000111111001111111011000011100011001111110011111110110001111001010011111100111111110011011110000000111111001111111110011011100111 3f3f3fefab3fa4bcddbb3f3fb0a43f3fb0e33f3fb1e53f3fcde03f3fe6e7
UTF-8 捻꿸낯鍮뽬ぜ飮곸숱阿숆퀬違긺솾怨몃퉾沃쇱늸肄 111011111010011010100100111010101011111110111000111010111000001010101111111010011000110110101110111010111011110110101100111000111000000110011100111010011010001110101110111010101011001110111000111011001000100010110001111010011001100010111111111011001000100010000110111011011000000010101100111010011000000110010101111010101011100010111010111011001000011010111110111001101000000010101000111010111010101010000011111011011000100110111110111001101011001010000011111011001000011110110001111010111000101010111000111010001000001010000100 efa6a4eabfb8eb82afe98daeebbdace3819ce9a3aeeab3b8ec88b1e998bfec8886ed80ace98195eab8baec86bee680a8ebaa83ed89bee6b283ec87b1eb8ab8e88284
UHC 捻꿸낯鍮뽬ぜ飮곸숱阿숆퀬違긺솾怨몃퉾沃쇱늸肄 1110011011110111101100101110101010110011101110001110101110111001100101101110100010101010101111001110101111100110100000011110110010111101101000101110010010111001100110011110101010110011101000001110101011011110101100011110011110011001101100101110101010110011101110001110101110111001100101101110100010101010101111001110110010001000100000011110110010111101 e6f7b2eab3b8ebb996e8aabcebe681ecbda2e4b999eab3a0eadeb1e799b2eab3b8ebb996e8aabcec8881ecbd

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
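
The conversion shown in the table can be approximated with a short Python sketch. It is not this page's implementation: the codec names below are assumptions about which Python codecs correspond to each charset label (cp932 for SJIS-WIN, cp949 for UHC), and encode_row and encode_table are hypothetical helper names. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the ISO-8859-1 row above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win treated as CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as CP949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {name: encode_row(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in encode_table("ぜ").items():
        print(name, binary, hexadecimal)
```

For the single character "ぜ", this sketch yields 82ba for SJIS-WIN, a4bc for EUC-JP, and e3819c for UTF-8, the same byte sequences that appear for that character in the rows above.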