To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 淙?蔚?足?裁???症?蔚?衆?裁???B 1001111111001000001111111000100101010101001111111001000110101011001111111000110111011001001111110011111100111111100011111100011100111111100010010101010100111111100011110100111100111111100011011101100100111111001111110011111101000010 9fc83f89553f91ab3f8dd93f3f3f8fc73f89553f8f4f3f8dd93f3f3f42
EUC-JP 淙?蔚?足?裁???症?蔚?衆?裁???B 1101111011001010001111111011000110110110001111111100001010101101001111111011101011011011001111110011111100111111101111101100100100111111101100011011011000111111101111011011000000111111101110101101101100111111001111110011111101000010 deca3fb1b63fc2ad3fbadb3f3f3fbec93fb1b63fbdb03fbadb3f3f3f42
UTF-8 淙렊蔚렯足렪裁肋렰렏症렊蔚렯衆렪裁肋렰렏B 11100110101101111001100111101011101000001000101011101000100101001001101011101011101000001010111111101000101101101011001111101011101000001010101011101000101000111000000111101111101001011001001111101011101000001011000011101011101000001000111111100111100101111000011111101011101000001000101011101000100101001001101011101011101000001010111111101000101000011000011011101011101000001010101011101000101000111000000111101111101001011001001111101011101000001011000011101011101000001000111101000010 e6b799eba08ae8949aeba0afe8b6b3eba0aae8a381efa593eba0b0eba08fe79787eba08ae8949aeba0afe8a186eba0aae8a381efa593eba0b0eba08f42
UHC 淙렊蔚렯足렪裁肋렰렏症렊蔚렯衆렪裁肋렰렏B 1111000011111000100011101010000111101010101001011000111010111100111100001110101110001110101110001110111010101110110100101111000110001110101111011000111010100101111100011111100010001110101000011110101010100101100011101011110011110001111010111000111010111000111011101010111011010010111100011000111010111101100011101010010101000010 f0f88ea1eaa58ebcf0eb8eb8eeaed2f18ebd8ea5f1f88ea1eaa58ebcf1eb8eb8eeaed2f18ebd8ea542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)