What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 齊???姐?蔚???????爭?園?珥?B 111010101000111000111111001111110011111110001000101101110011111110001001010101010011111100111111001111110011111100111111001111110011111111100000101001010011111110001001100000000011111111100000111000000011111101000010 ea8e3f3f3f88b73f89553f3f3f3f3f3f3fe0a53f89803fe0e03f42
EUC-JP 齊???姐?蔚???焌???爭?園?珥?B 1111001111101110001111110011111100111111101100001011100100111111101100011011011000111111001111110011111110001111110010011110100000111111001111110011111111100000101001110011111110110001111000000011111111100000111000100011111101000010 f3ee3f3f3fb0b93fb1b63f3f3f8fc9e83f3f3fe0a73fb1e03fe0e23f42
UTF-8 齊골렰렑姐렒蔚멸렱렩焌찔렰렑爭렦園렋珥렲B 11101001101111011000101011101010101100111010100011101011101000001011000011101011101000001001000111100101101001111001000011101011101000001001001011101000100101001001101011101011101010011011100011101011101000001011000111101011101000001010100111100111100001001000110011101100101100001001010011101011101000001011000011101011101000001001000111100111100010001010110111101011101000001010011011100101100111001001001011101011101000001000101111100111100011111010010111101011101000001011001001000010 e9bd8aeab3a8eba0b0eba091e5a790eba092e8949aeba9b8eba0b1eba0a9e7848cecb094eba0b0eba091e788adeba0a6e59c92eba08be78fa5eba0b242
UHC 齊골렰렑姐렒蔚멸렱렩焌찔렰렑爭렦園렋珥렲B 1111000010111010101100001111000110001110101111011000111010100110111011101011101110001110101001111110101010100101101110001110101010001110101111101000111010110111111100011110000011000010111100011000111010111101100011101010011011101110101100111000111010110101111010101010111010001110101000101110110010110100100011101011111101000010 f0bab0f18ebd8ea6eebb8ea7eaa5b8ea8ebe8eb7f1e0c2f18ebd8ea6eeb38eb5eaae8ea2ecb48ebf42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (hex 3f) in the table above.
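
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The helper names `encode_row` and `encode_table` are hypothetical, and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption. Python's codecs may differ from this page's converter for some characters, so individual rows might not match byte for byte.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Characters the charset cannot represent are replaced with "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip in this charset.
    shown = raw.decode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def encode_table(text):
    """Encode the same text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hex_string) in encode_table("齊B").items():
        print(label, shown, bits, hex_string)
```

For example, `encode_row("B", "utf-8")` returns `("B", "01000010", "42")`, which matches the final byte of every row in the table above.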