What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 哀??應ц?蒻??潁リ?哀??應ц?蒻?? 10001000101000110011111100111111100111001110010010000100100010000011111111100100111010000011111100111111100111111111000110000011100010100011111110001000101000110011111100111111100111001110010010000100100010000011111111100100111010000011111100111111 88a33f3f9ce484883fe4e83f3f9ff1838a3f88a33f3f9ce484883fe4e83f3f
EUC-JP 哀??應ц?蒻??潁リ?哀??應ц?蒻?? 10110000101001010011111100111111110110001110011010100111111010000011111111101000111010100011111100111111110111101111001110100101111010100011111110110000101001010011111100111111110110001110011010100111111010000011111111101000111010100011111100111111 b0a53f3fd8e6a7e83fe8ea3f3fdef3a5ea3fb0a53f3fd8e6a7e83fe8ea3f3f
UTF-8 哀잂꽒應ц삏蒻앸젔潁リ린哀잂꽒應ц삏蒻앸젔 11100101100100111000000011101100100111101000001011101010101111011001001011100110100001111000100111010001100001101110110010000010100011111110100010010010101110111110110010010101101110001110110010100000100101001110011010111101100000011110001110000011101010101110101110100110101100001110010110010011100000001110110010011110100000101110101010111101100100101110011010000111100010011101000110000110111011001000001010001111111010001001001010111011111011001001010110111000111011001010000010010100 e59380ec9e82eabd92e68789d186ec828fe892bbec95b8eca094e6bd81e383aaeba6b0e59380ec9e82eabd92e68789d186ec828fe892bbec95b8eca094
UHC 哀잂꽒應ц삏蒻앸젔潁リ린哀잂꽒應ц삏蒻앸젔 111001001110111010011111111000101000010010100001111010111110101110101100111010001001100010010110111001011011011010011101111010111010000010010010111001111011100010101011111010101011100010110000111001001110111010011111111000101000010010100001111010111110101110101100111010001001100010010110111001011011011010011101111010111010000010010010 e4ee9fe284a1ebebace89896e5b69deba092e7b8abeab8b0e4ee9fe284a1ebebace89896e5b69deba092

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
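
The conversion in the table above can be approximated with a short Python sketch. This is not the code behind this page. The function name encode_to_bits and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-Win, cp949 for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (byte 3f), which matches the 3f bytes seen in the ISO-8859-1, SJIS-WIN and EUC-JP rows.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Encode text with the given codec and return (binary string, hex string).

    Unencodable characters are replaced with "?" instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "哀"
    for label, codec in CHARSETS.items():
        bits, hexstr = encode_to_bits(sample, codec)
        print(label, sample, bits, hexstr)
```

For the single character 哀, this yields e59380 in UTF-8, 88a3 in CP932 and b0a5 in EUC-JP, in agreement with the first bytes of the corresponding rows above; in ISO-8859-1 the character is not representable and becomes 3f.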