Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 仲?∧畯孟?畯孟∧B 1001001010000111001111111000000111001000111110110110111110010110110100000011111111111011011011111001011011010000100000011100100001000010 92873f81c8fb6f96d03ffb6f96d081c842
EUC-JP 仲?∧畯孟?畯孟∧B 11000011111001110011111110100010110010101000111111001101101110111100110011010010001111111000111111001101101110111100110011010010101000101100101001000010 c3e73fa2ca8fcdbbccd23f8fcdbbccd2a2ca42
UTF-8 仲쯤∧畯孟놉畯孟∧B 11100100101110111011001011101100101011111010010011100010100010001010011111100111100101011010111111100101101011011001111111101011100001101000100111100111100101011010111111100101101011011001111111100010100010001010011101000010 e4bbb2ecafa4e288a7e795afe5ad9feb8689e795afe5ad9fe288a742
UHC 仲쯤∧畯孟놉畯孟∧B 11110001111010101100001011101011101000011111110011110001111000011101100011101011101100111111000111110001111000011101100011101011101000011111110001000010 f1eac2eba1fcf1e1d8ebb3f1f1e1d8eba1fc42

SJIS-WIN, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP) respectively.
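
A table like the one above can be approximated offline. The following is a minimal Python sketch, not the converter used by this page. The names `CODECS`, `to_bits`, `encode_row`, and `encode_table` are hypothetical, and mapping SJIS-WIN to Python's `cp932` codec, EUC-JP to `euc_jp`, and UHC to `cp949` is an assumption. Python's codecs may disagree with the page for some characters. `errors="replace"` substitutes `?` (3f) for characters a charset cannot represent, which resembles the `?` entries in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-WIN
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def to_bits(data: bytes) -> str:
    """Render each byte as eight binary digits, concatenated."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_row(text: str, codec: str) -> tuple[str, str]:
    """Return (binary string, hex string) for text in the given codec.

    Characters the codec cannot represent are replaced with '?'.
    """
    data = text.encode(codec, errors="replace")
    return to_bits(data), data.hex()


def encode_table(text: str) -> dict[str, tuple[str, str]]:
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexadecimal) in encode_table("仲B").items():
        print(label, bits, hexadecimal)
```

For the input `仲B`, the UTF-8 row comes out as `e4bbb242` and the ISO-8859-1 row as `3f42`, because ISO-8859-1 has no code for 仲.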