What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 簇???足?宰???症?蔚?止?裁???B 11100010110001100011111100111111001111111001000110101011001111111000110111001001001111110011111100111111100011111100011100111111100010010101010100111111100011100111111000111111100011011101100100111111001111110011111101000010 e2c63f3f3f91ab3f8dc93f3f3f8fc73f89553f8e7e3f8dd93f3f3f42
EUC-JP 簇???足?宰???症?蔚?止?裁???B 11100100110010000011111100111111001111111100001010101101001111111011101011001011001111110011111100111111101111101100100100111111101100011011011000111111101110111101111100111111101110101101101100111111001111110011111101000010 e4c83f3f3fc2ad3fbacb3f3f3fbec93fb1b63fbbdf3fbadb3f3f3f42
UTF-8 簇펠렍렯足렪宰희렰렏症렊蔚렮止렑裁肋렰렏B 11100111101100001000011111101101100011101010000011101011101000001000110111101011101000001010111111101000101101101011001111101011101000001010101011100101101011101011000011101101100111011010110011101011101000001011000011101011101000001000111111100111100101111000011111101011101000001000101011101000100101001001101011101011101000001010111011100110101011011010001011101011101000001001000111101000101000111000000111101111101001011001001111101011101000001011000011101011101000001000111101000010 e7b087ed8ea0eba08deba0afe8b6b3eba0aae5aeb0ed9daceba0b0eba08fe79787eba08ae8949aeba0aee6ada2eba091e8a381efa593eba0b0eba08f42
UHC 簇펠렍렯足렪宰희렰렏症렊蔚렮止렑裁肋렰렏B 1111000011101010110001101110011110001110101000111000111010111100111100001110101110001110101110001110111010100101110010001111000110001110101111011000111010100101111100011111100010001110101000011110101010100101100011101011101111110010101011011000111010100110111011101010111011010010111100011000111010111101100011101010010101000010 f0eac6e78ea38ebcf0eb8eb8eea5c8f18ebd8ea5f1f88ea1eaa58ebbf2ad8ea6eeaed2f18ebd8ea542

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
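
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `to_bit_strings` and the `CHARSETS` mapping are hypothetical, and the Python codec names are assumed to be close equivalents of the charsets listed above (for example `cp932` for SJIS-WIN and `cp949` for UHC). Characters that a charset cannot represent are replaced with `?` (0x3f), which mirrors the runs of `3f` in the table; edge-case mappings may differ from the tool's.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hex) strings.

    Unencodable characters become '?' instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "足B"  # one kanji plus one ASCII letter
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

Using `errors="replace"` is a design choice made here so that every charset yields a row, as in the table; with the default strict mode, `encode` would raise `UnicodeEncodeError` for characters outside the charset.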