To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 奛讌蠑常蟋蠑冗赧咒 111110101010000111100110101001011110010110111100100011111110110111100101101001111110010110111100100011111110011111100110110111011001100111101110 faa1e6a5e5bc8fede5a7e5bc8fe7e6dd99ee
EUC-JP 奛讌蠑常蟋蠑冗赧咒 10001111101110001111011111101100101001111110101010111110101111101110111111101010101010011110101010111110101111101110100111101100110111111101001011110000 8fb8f7eca7eabebeefeaa9eabebee9ecdfd2f0
UTF-8 奛讌蠑常蟋蠑冗赧咒 111001011010010110011011111010001010111010001100111010001010000010010001111001011011100010111000111010001001111110001011111010001010000010010001111001011000011010010111111010001011010110100111111001011001001010010010 e5a59be8ae8ce8a091e5b8b8e89f8be8a091e58697e8b5a7e59292
UHC ???常??冗?? 0011111100111111001111111101111111001000001111110011111111101001101101110011111100111111 3f3f3fdfc83f3fe9b73f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)