What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 塋ょク秧?ぐ娃??D塋ょク秧?ぐ娃??D^ 100110101100100010000010111001011000001101001110111000100101111000111111100000101010111010001000101000010011111100111111010001001001101011001000100000101110010110000011010011101110001001011110001111111000001010101110100010001010000100111111001111110100010001011110 9ac882e5834ee25e3f82ae88a13f3f449ac882e5834ee25e3f82ae88a13f3f445e
EUC-JP 塋ょク秧?ぐ娃??D塋ょク秧?ぐ娃??D^ 110101001100101010100100111001111010010110101111111000111011111100111111101001001011000010110000101000110011111100111111010001001101010011001010101001001110011110100101101011111110001110111111001111111010010010110000101100001010001100111111001111110100010001011110 d4caa4e7a5afe3bf3fa4b0b0a33f3f44d4caa4e7a5afe3bf3fa4b0b0a33f3f445e
UTF-8 塋ょク秧믦ぐ娃쒍퓱D塋ょク秧믦ぐ娃쒍퓱D^ 111001011010000110001011111000111000001010000111111000111000001010101111111001111010011110100111111010111010111110100110111000111000000110010000111001011010100010000011111011001001001010001101111011011001001110110001010001001110010110100001100010111110001110000010100001111110001110000010101011111110011110100111101001111110101110101111101001101110001110000001100100001110010110101000100000111110110010010010100011011110110110010011101100010100010001011110 e5a18be38287e382afe7a7a7ebafa6e38190e5a883ec928ded93b144e5a18be38287e382afe7a7a7ebafa6e38190e5a883ec928ded93b1445e
UHC 塋ょク秧믦ぐ娃쒍퓱D塋ょク秧믦ぐ娃쒍퓱D^ 111001111010101110101010111001111010101110101111111001001110101110010010111010001010101010110000111010001101111110011100111001001011111110010111010001001110011110101011101010101110011110101011101011111110010011101011100100101110100010101010101100001110100011011111100111001110010010111111100101110100010001011110 e7abaae7abafe4eb92e8aab0e8df9ce4bf9744e7abaae7abafe4eb92e8aab0e8df9ce4bf97445e

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
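
The same kind of table can be reproduced offline. The following is a minimal sketch in Python, not the code behind this page. It assumes that the page's charset labels correspond to Python's standard codecs (`cp932` for SJIS-Win, `cp949` for UHC) and that characters with no representation in a charset are replaced by `?` (byte 0x3f), which is consistent with the 3f bytes in the table above. The names `CODECS` and `encode_row` are hypothetical.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = Windows code page 949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "あD^"  # any short string works here
    for label, codec in CODECS.items():
        bits, hexstr = encode_row(sample, codec)
        print(label, bits, hexstr)
```

For example, `encode_row("D^", "utf-8")` returns `("0100010001011110", "445e")`, matching the trailing `445e` in every row of the table.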