To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 涇ス螟ア趙シ鮨奇スゥ﨟ス螟ア辟シ鮨奇スゥB 11111010111110111011110111100101101001001011000111100110111000101011110011101001101111011000101011101111101111011010100111111011100111011011110111100101101001001011000111100111100001001011110011101001101111011000101011101111101111011010100101000010 fafbbde5a4b1e6e2bce9bd8aefbda9fb9dbde5a4b1e784bce9bd8aefbda942
EUC-JP 涇ス螟ア趙シ鮨奇スゥ?ス螟ア辟シ鮨奇スゥB 1000111111000111110001111000111010111101111010101010011010001110101100011110110011100100100011101011110011110010101111111011010011110001100011101011110110001110101010010011111110001110101111011110101010100110100011101011000111101101111001001000111010111100111100101011111110110100111100011000111010111101100011101010100101000010 8fc7c78ebdeaa68eb1ece48ebcf2bfb4f18ebd8ea93f8ebdeaa68eb1ede48ebcf2bfb4f18ebd8ea942
UTF-8 涇ス螟ア趙シ鮨奇スゥ﨟ス螟ア辟シ鮨奇スゥB 11100110101101101000011111101111101111011011110111101000100111101001111111101111101111011011000111101000101101101001100111101111101111011011110011101001101011101010100011100101101001011000011111101111101111011011110111101111101111011010100111101111101010001001111111101111101111011011110111101000100111101001111111101111101111011011000111101000101111101001111111101111101111011011110011101001101011101010100011100101101001011000011111101111101111011011110111101111101111011010100101000010 e6b687efbdbde89e9fefbdb1e8b699efbdbce9aea8e5a587efbdbdefbda9efa89fefbdbde89e9fefbdb1e8be9fefbdbce9aea8e5a587efbdbdefbda942
UHC 涇?螟?趙??奇????螟????奇??B 110011001101110000111111110110011010110100111111111100001110000100111111001111111101000011110100001111110011111100111111001111111101100110101101001111110011111100111111001111111101000011110100001111110011111101000010 ccdc3fd9ad3ff0e13f3fd0f43f3f3f3fd9ad3f3f3f3fd0f43f3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)