To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 艾??竊??宥?????肉ε?巡??鸚?? 11100100100010000011111100111111111000101000011000111111001111111001011101000111001111110011111100111111001111110011111110010011111101111000001111000011001111111000111110000100001111110011111111101010010111110011111100111111 e4883f3fe2863f3f97473f3f3f3f3f93f783c33f8f843f3fea5f3f3f
EUC-JP 艾??竊??宥?????肉ε?巡??鸚?? 11100111111010000011111100111111111000111110011000111111001111111100110110101000001111110011111100111111001111110011111111000110111110011010011011000101001111111011110111100100001111110011111111110011110000000011111100111111 e7e83f3fe3e63f3fcda83f3f3f3f3fc6f9a6c53fbde43f3ff3c03f3f
UTF-8 艾쎈끏竊뽨틠宥닿쿅列룔깺肉ε쑵巡볩폊鸚쒖걖 1110100010001001101111101110110010001110100010001110101110000001100011111110011110101011100010101110101110111101101010001110110110001011101000001110010110101110101001011110101110001011101111111110110010111111100001011110111110100110100111001110101110100011100101001110101010111001101110101110100010000010100010011100111010110101111011001001000110110101111001011011011110100001111010111011001110101001111011011000111110001010111010011011100010011010111011001001001010010110111010101011000110010110 e889beec8e88eb818fe7ab8aebbda8ed8ba0e5aea5eb8bbfecbf85efa69ceba394eab9bae88289ceb5ec91b5e5b7a1ebb3a9ed8f8ae9b89aec9296eab196
UHC 艾쎈끏竊뽨틠宥닿쿅列룔깺肉ε쑵巡볩폊鸚쒖걖 111001001111010110111101111010111000010110111111111011111011110010010110111001001011101010001100111010101110100110110100111010101011001010011010111001101110101010110111111000111000001110100110111010111011111110100101111001011011111010101010111000101101111010010011111011111011110010010101111001011010010010011100111011001000000110000001 e4f5bdeb85bfefbc96e4ba8ceae9b4eab29ae6eab7e383a6ebbfa5e5beaae2de93efbc95e5a49cec8181

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)