To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 歟??揖??釉?????揖ζ?諭?????泣 1001111101100010001111110011111110010111010010110011111100111111111001111101011000111111001111110011111100111111001111111001011101001011100000111100010000111111100101110100000000111111001111110011111100111111001111111000101110000011 9f623f3f974b3f3fe7d63f3f3f3f3f974b83c43f97403f3f3f3f3f8b83
EUC-JP 歟??揖?ł釉?????揖ζ?諭?????泣 11011101110000110011111100111111110011011010110000111111100011111010100111001000111011101101100000111111001111110011111100111111001111111100110110101100101001101100011000111111110011011010000100111111001111110011111100111111001111111011010111100011 ddc33f3fcdac3f8fa9c8eed83f3f3f3f3fcdaca6c63fcda13f3f3f3f3fb5e3
UTF-8 歟㏐랬揖좂ł釉앹댅黎싰쒀揖ζ썫諭꾩뒛嶪용뵃泣 11100110101011011001111111100011100011111001000011101011100111101010110011100110100011111001011011101100101000101000001011000101100000101110100110000111100010011110110010010101101110011110101110001100100001011110111110100110100010011110110010001011101100001110110010010010100000001110011010001111100101101100111010110110111011001000110110101011111010001010101110101101111010101011111010101001111010111001001010011011111001011011011010101010111011001001101010101001111010111011010110000011111001101011001110100011 e6ad9fe38f90eb9eace68f96eca282c582e98789ec95b9eb8c85efa689ec8bb0ec9280e68f96ceb6ec8dabe8abadeabea9eb929be5b6aaec9aa9ebb583e6b3a3
UHC 歟㏐랬揖좂ł釉앹댅黎싰쒀揖ζ썫諭꾩뒛嶪용뵃泣 1110011010100010101001111110101010110111101010001110101111100111101000001110011110101001101010011110101110111000100111011110110010001000101011111110011010110001100110101110101010111110101011001110101111100111101001011110011010011011100111001110101110110001100001001110110010001010100110001110010111110101101111111110101110010100100010011110101111101000 e6a2a7eab7a8ebe7a0e7a9a9ebb89dec88afe6b19aeabeacebe7a5e69b9cebb184ec8a98e5f5bfeb9489ebe8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)