What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 艾??竊??宥?????肉ε?循??易?? 11100100100010000011111100111111111000101000011000111111001111111001011101000111001111110011111100111111001111110011111110010011111101111000001111000011001111111000111101111010001111110011111110001000110101010011111100111111 e4883f3fe2863f3f97473f3f3f3f3f93f783c33f8f7a3f3f88d53f3f
EUC-JP 艾??竊??宥?????肉ε?循??易?? 11100111111010000011111100111111111000111110011000111111001111111100110110101000001111110011111100111111001111110011111111000110111110011010011011000101001111111011110111011011001111110011111110110000110101110011111100111111 e7e83f3fe3e63f3fcda83f3f3f3f3fc6f9a6c53fbddb3f3fb0d73f3f
UTF-8 艾쎈끏竊뽨틠宥닿쿅列룔깺肉ε쑵循뗰폊易뀀넞 1110100010001001101111101110110010001110100010001110101110000001100011111110011110101011100010101110101110111101101010001110110110001011101000001110010110101110101001011110101110001011101111111110110010111111100001011110111110100110100111001110101110100011100101001110101010111001101110101110100010000010100010011100111010110101111011001001000110110101111001011011111010101010111010111001011110110000111011011000111110001010111001101001100010010011111010111000000010000000111010111000010010011110 e889beec8e88eb818fe7ab8aebbda8ed8ba0e5aea5eb8bbfecbf85efa69ceba394eab9bae88289ceb5ec91b5e5beaaeb97b0ed8f8ae69893eb8080eb849e
UHC 艾쎈끏竊뽨틠宥닿쿅列룔깺肉ε쑵循뗰폊易뀀넞 111001001111010110111101111010111000010110111111111011111011110010010110111001001011101010001100111010101110100110110100111010101011001010011010111001101110101010110111111000111000001110100110111010111011111110100101111001011011111010101010111000101110000010001011111011111011110010010101111001101011011010110010111010111000011010100010 e4f5bdeb85bfefbc96e4ba8ceae9b4eab29ae6eab7e383a6ebbfa5e5beaae2e08befbc95e6b6b2eb86a2

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
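The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the implementation behind this page: the function name `encode_bits` and the `CHARSETS` list are hypothetical, and the Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumed to correspond to the charsets listed above. It also assumes that characters a charset cannot represent are replaced by "?" (0x3f), which would match the ISO-8859-1 row.

```python
def encode_bits(text, charset):
    """Return (binary string, hex string) for `text` encoded in `charset`."""
    # errors="replace" substitutes "?" (0x3f) for characters that the
    # charset cannot represent; this is an assumption about the tool.
    raw = text.encode(charset, errors="replace")
    # Render every byte as 8 binary digits, concatenated without separators.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


# Python codec names assumed to match the table's charsets.
CHARSETS = ["iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949"]

if __name__ == "__main__":
    for name in CHARSETS:
        binary, hexadecimal = encode_bits("艾", name)
        print(name, binary, hexadecimal)
```

For example, `encode_bits("艾", "utf-8")` returns the hexadecimal string `e889be`, the same three bytes that open the UTF-8 row, while `encode_bits("艾", "iso-8859-1")` returns `3f` because ISO-8859-1 has no code for that character.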