To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????B 00111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f42
SJIS-WIN 小?堯?舌屑B 1000111110101100001111111110101010011111001111111001000011100011100010111111101101000010 8fac3fea9f3f90e38bfb42
EUC-JP 小?堯炤舌屑B 10111110101011100011111111110100101000011000111111001001110100101100000011100101101101101111110101000010 beae3ff4a18fc9d2c0e5b6fd42
UTF-8 小숞堯炤舌屑B 11100101101100001000111111101100100010001001111011100101101000001010111111100111100000101010010011101000100010001000110011100101101100011001000101000010 e5b08fec889ee5a0afe782a4e8888ce5b19142
UHC 小숞堯炤舌屑B 11100001101100111001100111111011111010001110101111100001101111111110000011011111111000001101101001000010 e1b399fbe8ebe1bfe0dfe0da42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)