To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 源?障?垣企??趙孟∧源?障?垣企??趙孟∧B 10001100101110010011111110001111111000010011111110001010010111111000101011101001001111110011111111100110111000101001011011010000100000011100100010001100101110010011111110001111111000010011111110001010010111111000101011101001001111110011111111100110111000101001011011010000100000011100100001000010 8cb93f8fe13f8a5f8ae93f3fe6e296d081c88cb93f8fe13f8a5f8ae93f3fe6e296d081c842
EUC-JP 源?障?垣企??趙孟∧源?障?垣企??趙孟∧B 10111000101110110011111110111110111000110011111110110011110000001011010011101011001111110011111111101100111001001100110011010010101000101100101010111000101110110011111110111110111000110011111110110011110000001011010011101011001111110011111111101100111001001100110011010010101000101100101001000010 b8bb3fbee33fb3c0b4eb3f3fece4ccd2a2cab8bb3fbee33fb3c0b4eb3f3fece4ccd2a2ca42
UTF-8 源렰障렚垣企렕렟趙孟∧源렰障렚垣企렕렟趙孟∧B 11100110101110101001000011101011101000001011000011101001100110101001110011101011101000001001101011100101100111101010001111100100101111001000000111101011101000001001010111101011101000001001111111101000101101101001100111100101101011011001111111100010100010001010011111100110101110101001000011101011101000001011000011101001100110101001110011101011101000001001101011100101100111101010001111100100101111001000000111101011101000001001010111101011101000001001111111101000101101101001100111100101101011011001111111100010100010001010011101000010 e6ba90eba0b0e99a9ceba09ae59ea3e4bc81eba095eba09fe8b699e5ad9fe288a7e6ba90eba0b0e99a9ceba09ae59ea3e4bc81eba095eba09fe8b699e5ad9fe288a742
UHC 源렰障렚垣企렕렟趙孟∧源렰障렚垣企렕렟趙孟∧B 111010101011100110001110101111011110111010100001100011101010110111101010101011111101000011101010100011101010101010001110101100001111000011100001110110001110101110100001111111001110101010111001100011101011110111101110101000011000111010101101111010101010111111010000111010101000111010101010100011101011000011110000111000011101100011101011101000011111110001000010 eab98ebdeea18eadeaafd0ea8eaa8eb0f0e1d8eba1fceab98ebdeea18eadeaafd0ea8eaa8eb0f0e1d8eba1fc42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)