To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 仲陌??泣紗??滓?仲陌??泣紗??滓?^ 10010010100001111110100010011001001111110011111110001011100000111000111011010001001111110011111110011111111001100011111110010010100001111110100010011001001111110011111110001011100000111000111011010001001111110011111110011111111001100011111101011110 9287e8993f3f8b838ed13f3f9fe63f9287e8993f3f8b838ed13f3f9fe63f5e
EUC-JP 仲陌??泣紗??滓?仲陌??泣紗??滓?^ 11000011111001111110111111111001001111110011111110110101111000111011110011010011001111110011111111011110111010000011111111000011111001111110111111111001001111110011111110110101111000111011110011010011001111110011111111011110111010000011111101011110 c3e7eff93f3fb5e3bcd33f3fdee83fc3e7eff93f3fb5e3bcd33f3fdee83f5e
UTF-8 仲陌렭렩泣紗렠렗滓쁩仲陌렭렩泣紗렠렗滓쁠^ 11100100101110111011001011101001100110011000110011101011101000001010110111101011101000001010100111100110101100111010001111100111101101001001011111101011101000001010000011101011101000001001011111100110101110111001001111101100100000011010100111100100101110111011001011101001100110011000110011101011101000001010110111101011101000001010100111100110101100111010001111100111101101001001011111101011101000001010000011101011101000001001011111100110101110111001001111101100100000011010000001011110 e4bbb2e9998ceba0adeba0a9e6b3a3e7b497eba0a0eba097e6bb93ec81a9e4bbb2e9998ceba0adeba0a9e6b3a3e7b497eba0a0eba097e6bb93ec81a05e
UHC 仲陌렭렩泣紗렠렗滓쁩仲陌렭렩泣紗렠렗滓쁠^ 1111000111101010110110001110100010001110101110101000111010110111111010111110100011011110111010011000111010110001100011101010110011101110101010111011101111011110111100011110101011011000111010001000111010111010100011101011011111101011111010001101111011101001100011101011000110001110101011001110111010101011101110111101110001011110 f1ead8e88eba8eb7ebe8dee98eb18eaceeabbbdef1ead8e88eba8eb7ebe8dee98eb18eaceeabbbdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)