To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????^ 0011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f5e
SJIS-WIN □??療??ィ^ 1000000110100000001111110011111110010111110000110011111100111111100000110100001001011110 81a03f3f97c33f3f83425e
EUC-JP □??療??ィ^ 1010001010100010001111110011111111001110110001010011111100111111101001011010001101011110 a2a23f3fcec53f3fa5a35e
UTF-8 □▩룶療룶혧ィ^ 11100010100101101010000111100010100101101010100111101011101000111011011011100111100110011000001011101011101000111011011011101101100110001010011111100011100000101010001101011110 e296a1e296a9eba3b6e79982eba3b6ed98a7e382a35e
UHC □▩룶療룶혧ィ^ 101000011110000010100010110011001000111110101011110101101111101110001111101010111100001010001111101010111010001101011110 a1e0a2cc8fabd6fb8fabc28faba35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)