To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 艾??竊??宥?????肉ε?循??癲?? 11100100100010000011111100111111111000101000011000111111001111111001011101000111001111110011111100111111001111110011111110010011111101111000001111000011001111111000111101111010001111110011111111100001100111110011111100111111 e4883f3fe2863f3f97473f3f3f3f3f93f783c33f8f7a3f3fe19f3f3f
EUC-JP 艾??竊??宥?????肉ε?循??癲?? 11100111111010000011111100111111111000111110011000111111001111111100110110101000001111110011111100111111001111110011111111000110111110011010011011000101001111111011110111011011001111110011111111100010101000010011111100111111 e7e83f3fe3e63f3fcda83f3f3f3f3fc6f9a6c53fbddb3f3fe2a13f3f
UTF-8 艾쎈끏竊뽨틠宥닿쿅列룔깺肉ε쑵循뗰폊癲놃뇫 1110100010001001101111101110110010001110100010001110101110000001100011111110011110101011100010101110101110111101101010001110110110001011101000001110010110101110101001011110101110001011101111111110110010111111100001011110111110100110100111001110101110100011100101001110101010111001101110101110100010000010100010011100111010110101111011001001000110110101111001011011111010101010111010111001011110110000111011011000111110001010111001111001100110110010111010111000011010000011111010111000011110101011 e889beec8e88eb818fe7ab8aebbda8ed8ba0e5aea5eb8bbfecbf85efa69ceba394eab9bae88289ceb5ec91b5e5beaaeb97b0ed8f8ae799b2eb8683eb87ab
UHC 艾쎈끏竊뽨틠宥닿쿅列룔깺肉ε쑵循뗰폊癲놃뇫 111001001111010110111101111010111000010110111111111011111011110010010110111001001011101010001100111010101110100110110100111010101011001010011010111001101110101010110111111000111000001110100110111010111011111110100101111001011011111010101010111000101110000010001011111011111011110010010101111011111010011010000110111011011000011110010001 e4f5bdeb85bfefbc96e4ba8ceae9b4eab29ae6eab7e383a6ebbfa5e5beaae2e08befbc95efa686ed8791

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)