What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 艾??竊??宥?????肉ε????餓μ? 11100100100010000011111100111111111000101000011000111111001111111001011101000111001111110011111100111111001111110011111110010011111101111000001111000011001111110011111100111111001111111000100111101100100000111100101000111111 e4883f3fe2863f3f97473f3f3f3f3f93f783c33f3f3f3f89ec83ca3f
EUC-JP 艾??竊??宥?????肉ε????餓μ? 11100111111010000011111100111111111000111110011000111111001111111100110110101000001111110011111100111111001111110011111111000110111110011010011011000101001111110011111100111111001111111011001011101110101001101100110000111111 e7e83f3fe3e63f3fcda83f3f3f3f3fc6f9a6c53f3f3f3fb2eea6cc3f
UTF-8 艾쎈끏竊뽨틠宥닿쿅列룔깺肉ε쑵栒삼폊餓μ퉱 11101000100010011011111011101100100011101000100011101011100000011000111111100111101010111000101011101011101111011010100011101101100010111010000011100101101011101010010111101011100010111011111111101100101111111000010111101111101001101001110011101011101000111001010011101010101110011011101011101000100000101000100111001110101101011110110010010001101101011110011010100000100100101110110010000010101111001110110110001111100010101110100110100100100100111100111010111100111011011000100110110001 e889beec8e88eb818fe7ab8aebbda8ed8ba0e5aea5eb8bbfecbf85efa69ceba394eab9bae88289ceb5ec91b5e6a092ec82bced8f8ae9a493cebced89b1
UHC 艾쎈끏竊뽨틠宥닿쿅列룔깺肉ε쑵栒삼폊餓μ퉱 111001001111010110111101111010111000010110111111111011111011110010010110111001001011101010001100111010101110100110110100111010101011001010011010111001101110101010110111111000111000001110100110111010111011111110100101111001011011111010101010111000101110001110111011111011111011110010010101111001001011101110100101111011001011100110001001 e4f5bdeb85bfefbc96e4ba8ceae9b4eab29ae6eab7e383a6ebbfa5e5beaae2e3bbefbc95e4bba5ecb989

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
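
A similar conversion can be sketched in Python. This is only an illustration, not the tool's actual implementation: the function name `encode_to_bits` and the mapping of the charset labels to Python codec names (SJIS-WIN as `cp932`, UHC as `cp949`) are assumptions. Characters that a charset cannot represent are replaced with "?" (byte 0x3f), which is consistent with the runs of 3f in the results above.

```python
# Assumed mapping from the charset labels on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Encode text with the given codec and return (binary string, hex string).

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return bits, data.hex()


if __name__ == "__main__":
    sample = "あ"  # hypothetical input; any short string works
    for label, codec in CHARSETS.items():
        bits, hexstr = encode_to_bits(sample, codec)
        print(label, bits, hexstr)
```

Each byte is rendered as exactly eight bits, so the binary column is always four times as long as the hexadecimal column.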