To what bit string is a character (or string of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 塋ょク秧?ぐ娃??沃??癌? 10011010110010001000001011100101100000110100111011100010010111100011111110000010101011101000100010100001001111110011111110010111100000000011111100111111100010101110000000111111 9ac882e5834ee25e3f82ae88a13f3f97803f3f8ae03f
EUC-JP 塋ょク秧?ぐ娃??沃??癌? 11010100110010101010010011100111101001011010111111100011101111110011111110100100101100001011000010100011001111110011111111001101111000000011111100111111101101001110001000111111 d4caa4e7a5afe3bf3fa4b0b0a33f3fcde03f3fb4e23f
UTF-8 塋ょク秧믦ぐ娃쒏쓳沃곈걶癌퀮 111001011010000110001011111000111000001010000111111000111000001010101111111001111010011110100111111010111010111110100110111000111000000110010000111001011010100010000011111011001001001010001111111011001001001110110011111001101011001010000011111010101011001110001000111010101011000110110110111001111001100110001100111011011000000010101110 e5a18be38287e382afe7a7a7ebafa6e38190e5a883ec928fec93b3e6b283eab388eab1b6e7998ced80ae
UHC 塋ょク秧믦ぐ娃쒏쓳沃곈걶癌퀮 11100111101010111010101011100111101010111010111111100100111010111001001011101000101010101011000011101000110111111001110011100110100111011001000111101000101010101011000011101001100000011001110011100100110111111011010001000001 e7abaae7abafe4eb92e8aab0e8df9ce69d91e8aab0e9819ce4dfb441

SJIS-Win, EUC-JP: Legacy character encodings for Japanese, mainly used on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
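
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the charset labels to Python codec names (`CODECS`) and the helper `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is how the ISO-8859-1 row above comes out as a run of 3f bytes.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
# These names are Python's, not necessarily what the page itself uses.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, concatenated without spaces.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

For example, the hiragana "あ" is a single byte-pair 82a0 in CP932 and a4a2 in EUC-JP, but three bytes (e38182) in UTF-8.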