To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN ??裨???洸曹 0011111100111111111001011110100100111111001111110011111110011111101010011001000110000010 3f3fe5e93f3f3f9fa99182
EUC-JP 灝?裨???洸曹 10001111110010011011111100111111111010101110101100111111001111110011111111011110101010111100000111100010 8fc9bf3feaeb3f3f3fdeabc1e2
UTF-8 灝흗裨뤱횓따洸曹 111001111000000110011101111011011001110110010111111010001010001110101000111010111010010010110001111011011001101010010011111010111001010010110000111001101011010010111000111001101001101110111001 e7819ded9d97e8a3a8eba4b1ed9a93eb94b0e6b4b8e69bb9
UHC 灝흗裨뤱횓따洸曹 11111011110011101100100011101001110111101010010110001111110111111100001110001110101101011111101111001110110010001111000011000111 fbcec8e9dea58fdfc38eb5fbcec8f0c7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)