Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 簇?∧趙孟?趙孟∧B 1110001011000110001111111000000111001000111001101110001010010110110100000011111111100110111000101001011011010000100000011100100001000010 e2c63f81c8e6e296d03fe6e296d081c842
EUC-JP 簇?∧趙孟?趙孟∧B 1110010011001000001111111010001011001010111011001110010011001100110100100011111111101100111001001100110011010010101000101100101001000010 e4c83fa2caece4ccd23fece4ccd2a2ca42
UTF-8 簇쯤∧趙孟놉趙孟∧B 11100111101100001000011111101100101011111010010011100010100010001010011111101000101101101001100111100101101011011001111111101011100001101000100111101000101101101001100111100101101011011001111111100010100010001010011101000010 e7b087ecafa4e288a7e8b699e5ad9feb8689e8b699e5ad9fe288a742
UHC 簇쯤∧趙孟놉趙孟∧B 11110000111010101100001011101011101000011111110011110000111000011101100011101011101100111111000111110000111000011101100011101011101000011111110001000010 f0eac2eba1fcf0e1d8ebb3f1f0e1d8eba1fc42

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
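
The conversion in the table above can be approximated with a short Python sketch. This is not the page's own implementation, which is not shown here. The function names `encode_row` and `encode_table` are hypothetical, and the mapping from the table's charset labels to Python codec names is an assumption (`cp932` for SJIS-WIN, `cp949` for UHC). Characters that a charset cannot represent are replaced with `?` (hex `3f`), which appears to match the `3f` bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text in the given codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in encode_table("∧B").items():
        print(name, bits, hex_string)
```

Calling `encode_table` with the same input as the table should give comparable rows, although codec tables can differ slightly between implementations for a few vendor-specific characters.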