What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 塋ょク秧?ぐ娃??塋ゆ?餓?ぐ娃??塋ゆ? 10011010110010001000001011100101100000110100111011100010010111100011111110000010101011101000100010100001001111110011111110011010110010001000001011100100001111111000100111101100001111111000001010101110100010001010000100111111001111111001101011001000100000101110010000111111 9ac882e5834ee25e3f82ae88a13f3f9ac882e43f89ec3f82ae88a13f3f9ac882e43f
EUC-JP 塋ょク秧?ぐ娃??塋ゆ?餓?ぐ娃??塋ゆ? 11010100110010101010010011100111101001011010111111100011101111110011111110100100101100001011000010100011001111110011111111010100110010101010010011100110001111111011001011101110001111111010010010110000101100001010001100111111001111111101010011001010101001001110011000111111 d4caa4e7a5afe3bf3fa4b0b0a33f3fd4caa4e63fb2ee3fa4b0b0a33f3fd4caa4e63f
UTF-8 塋ょク秧녘ぐ娃쒏뜆塋ゆ땯餓뽬ぐ娃쒏뜆塋ゆ춲 111001011010000110001011111000111000001010000111111000111000001010101111111001111010011110100111111010111000010110011000111000111000000110010000111001011010100010000011111011001001001010001111111010111001110010000110111001011010000110001011111000111000001010000110111010111001010110101111111010011010010010010011111010111011110110101100111000111000000110010000111001011010100010000011111011001001001010001111111010111001110010000110111001011010000110001011111000111000001010000110111011001011011010110010 e5a18be38287e382afe7a7a7eb8598e38190e5a883ec928feb9c86e5a18be38286eb95afe9a493ebbdace38190e5a883ec928feb9c86e5a18be38286ecb6b2
UHC 塋ょク秧녘ぐ娃쒏뜆塋ゆ땯餓뽬ぐ娃쒏뜆塋ゆ춲 111001111010101110101010111001111010101110101111111001001110101110110011111010001010101010110000111010001101111110011100111001101000110110001001111001111010101110101010111001101000101110000101111001001011101110010110111010001010101010110000111010001101111110011100111001101000110110001001111001111010101110101010111001101010110110001110 e7abaae7abafe4ebb3e8aab0e8df9ce68d89e7abaae68b85e4bb96e8aab0e8df9ce68d89e7abaae6ad8e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).