To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 曜??唯?????奧??茹??B 1001011101101010001111110011111110010111010000100011111100111111001111110011111100111111100110101111101000111111001111111110010010100101001111110011111101000010 976a3f3f97423f3f3f3f3f9afa3f3fe4a53f3f42
EUC-JP 曜??唯?????奧??茹??B 1100110111001011001111110011111111001101101000110011111100111111001111110011111100111111110101001111110000111111001111111110100010100111001111110011111101000010 cdcb3f3fcda33f3f3f3f3fd4fc3f3fe8a73f3f42
UTF-8 曜쒑븥唯끾챼溜볣슌奧롫늼茹됰젗B 11100110100110111001110011101100100100101001000111101011101110001010010111100101100101001010111111101011100000011011111011101100101100011011110011101111101001111000101111101011101100111010001111101100100010101000110011100101101001011010011111101011101000011010101111101011100010101011110011101000100011001011100111101011100100001011000011101100101000001001011101000010 e69b9cec9291ebb8a5e594afeb81beecb1bcefa78bebb3a3ec8a8ce5a5a7eba1abeb8abce88cb9eb90b0eca09742
UHC 曜쒑븥唯끾챼溜볣슌奧롫늼茹됰젗B 11101000111110001001110011101000100101011000111011101010111001101000010111100110101010101000100111101010111111101001001111101001100110101001110011100111111100111000111011101011100010001000010111100110101010101000100111101011101000001001001101000010 e8f89ce8958eeae685e6aa89eafe93e99a9ce7f38eeb8885e6aa89eba09342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)