What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 塋ょク??ぐ娃??也ら?央?ぐ娃??塋ゆ 1001101011001000100000101110010110000011010011100011111100111111100000101010111010001000101000010011111100111111100101101110011110000010111001110011111110001001100110110011111110000010101011101000100010100001001111110011111110011010110010001000001011100100 9ac882e5834e3f3f82ae88a13f3f96e782e73f899b3f82ae88a13f3f9ac882e4
EUC-JP 塋ょク??ぐ娃??也ら?央?ぐ娃??塋ゆ 1101010011001010101001001110011110100101101011110011111100111111101001001011000010110000101000110011111100111111110011001110100110100100111010010011111110110001111110110011111110100100101100001011000010100011001111110011111111010100110010101010010011100110 d4caa4e7a5af3f3fa4b0b0a33f3fcce9a4e93fb1fb3fa4b0b0a33f3fd4caa4e6
UTF-8 塋ょク呂잒ぐ娃쒍퓱也ら걶央뉓ぐ娃쒑큹塋ゆ 111001011010000110001011111000111000001010000111111000111000001010101111111011111010011010000000111011001001111010010010111000111000000110010000111001011010100010000011111011001001001010001101111011011001001110110001111001001011100110011111111000111000001010001001111010101011000110110110111001011010010010101110111010111000100110010011111000111000000110010000111001011010100010000011111011001001001010010001111011011000000110111001111001011010000110001011111000111000001010000110 e5a18be38287e382afefa680ec9e92e38190e5a883ec928ded93b1e4b99fe38289eab1b6e5a4aeeb8993e38190e5a883ec9291ed81b9e5a18be38286
UHC 塋ょク呂잒ぐ娃쒍퓱也ら걶央뉓ぐ娃쒑큹塋ゆ 11100111101010111010101011100111101010111010111111100101111110111001111111101000101010101011000011101000110111111001110011100100101111111001011111100101101001011010101011101001100000011001110011100100111001111000011111101000101010101011000011101000110111111001110011101000101101001000100011100111101010111010101011100110 e7abaae7abafe5fb9fe8aab0e8df9ce4bf97e5a5aae9819ce4e787e8aab0e8df9ce8b488e7abaae6
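
The conversion shown in the table can be approximated outside this page. The following is a minimal sketch in Python, not the code behind this tool. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, `euc_jp` for EUC-JP) and the helper `to_bit_strings` are assumptions chosen for illustration, and Python's codec tables may differ from this page's for some characters. Characters a charset cannot represent are replaced with "?" (0x3f), which matches the runs of `3f` in the table above.

```python
# Charset labels from the table mapped to Python codec names.
# These mappings are assumed equivalents, not taken from the tool itself.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) bit strings for text in the given codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("あ", codec)
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.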

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
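
The note that SJIS-Win is CP932 matters because CP932 is Microsoft's extended variant of Shift_JIS and covers characters that plain Shift_JIS does not. The sketch below illustrates this with Python's `cp932` and `shift_jis` codecs, which are assumed here to correspond to the two variants. The helper `encode_or_none` is hypothetical, and the circled digit "①" is just one example of a vendor-extension character.

```python
def encode_or_none(text, codec):
    """Encode text with codec, or return None if a character is not representable."""
    try:
        return text.encode(codec)
    except UnicodeEncodeError:
        return None


if __name__ == "__main__":
    # Compare a plain hiragana letter with a vendor-extension character.
    for char in ("あ", "①"):
        for codec in ("cp932", "shift_jis"):
            encoded = encode_or_none(char, codec)
            result = encoded.hex() if encoded is not None else "not encodable"
            print(char, codec, result)
```

Ordinary kana and kanji are expected to encode identically under both codecs. Differences show up only for extension characters such as the circled digits.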