To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 堯??戌?堯??戌?^ 111010101001111100111111001111111001110011111010001111111110101010011111001111110011111110011100111110100011111101011110 ea9f3f3f9cfa3fea9f3f3f9cfa3f5e
EUC-JP 堯??戌濚堯??戌濚^ 11110100101000010011111100111111110110001111110010001111110010011010000111110100101000010011111100111111110110001111110010001111110010011010000101011110 f4a13f3fd8fc8fc9a1f4a13f3fd8fc8fc9a15e
UTF-8 堯덀룗戌濚堯덀룗戌濚^ 11100101101000001010111111101011100011011000000011101011101000111001011111100110100010001000110011100110101111111001101011100101101000001010111111101011100011011000000011101011101000111001011111100110100010001000110011100110101111111001101001011110 e5a0afeb8d80eba397e6888ce6bf9ae5a0afeb8d80eba397e6888ce6bf9a5e
UHC 堯덀룗戌濚堯덀룗戌濚^ 111010001110101110001000111000111000111110010011111000101111100111100111101110011110100011101011100010001110001110001111100100111110001011111001111001111011100101011110 e8eb88e38f93e2f9e7b9e8eb88e38f93e2f9e7b95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)