To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 艾??竊??宥?????肉ε?循????? 111001001000100000111111001111111110001010000110001111110011111110010111010001110011111100111111001111110011111100111111100100111111011110000011110000110011111110001111011110100011111100111111001111110011111100111111 e4883f3fe2863f3f97473f3f3f3f3f93f783c33f8f7a3f3f3f3f3f
EUC-JP 艾??竊??宥?????肉ε?循????? 111001111110100000111111001111111110001111100110001111110011111111001101101010000011111100111111001111110011111100111111110001101111100110100110110001010011111110111101110110110011111100111111001111110011111100111111 e7e83f3fe3e63f3fcda83f3f3f3f3fc6f9a6c53fbddb3f3f3f3f3f
UTF-8 艾쎈끏竊뽨틠宥닿쿅列룔깺肉ε쑵循뗰폊療먰닽 1110100010001001101111101110110010001110100010001110101110000001100011111110011110101011100010101110101110111101101010001110110110001011101000001110010110101110101001011110101110001011101111111110110010111111100001011110111110100110100111001110101110100011100101001110101010111001101110101110100010000010100010011100111010110101111011001001000110110101111001011011111010101010111010111001011110110000111011011000111110001010111011111010011110000001111010111010100010110000111010111000101110111101 e889beec8e88eb818fe7ab8aebbda8ed8ba0e5aea5eb8bbfecbf85efa69ceba394eab9bae88289ceb5ec91b5e5beaaeb97b0ed8f8aefa781eba8b0eb8bbd
UHC 艾쎈끏竊뽨틠宥닿쿅列룔깺肉ε쑵循뗰폊療먰닽 111001001111010110111101111010111000010110111111111011111011110010010110111001001011101010001100111010101110100110110100111010101011001010011010111001101110101010110111111000111000001110100110111010111011111110100101111001011011111010101010111000101110000010001011111011111011110010010101111010001111111010010000111011011000100010101011 e4f5bdeb85bfefbc96e4ba8ceae9b4eab29ae6eab7e383a6ebbfa5e5beaae2e08befbc95e8fe90ed88ab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)