What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN シ、鉙フ艟シ、鉙フ艝^ 101111001010010011111011110010001100110011100100011111011011110010100100111110111100100011001100111001000111101101011110 bca4fbc8cce47dbca4fbc8cce47b5e
EUC-JP シ、鉙フ艟シ、鉙フ艝^ 1000111010111100100011101010010010001111111000111110001110001110110011001110011111011110100011101011110010001110101001001000111111100011111000111000111011001100111001111101110001011110 8ebc8ea48fe3e38ecce7de8ebc8ea48fe3e38ecce7dc5e
UTF-8 シ、鉙フ艟シ、鉙フ艝^ 11101111101111011011110011101111101111011010010011101001100010011001100111101111101111101000110011101000100010011001111111101111101111011011110011101111101111011010010011101001100010011001100111101111101111101000110011101000100010011001110101011110 efbdbcefbda4e98999efbe8ce8899fefbdbcefbda4e98999efbe8ce8899d5e
UHC ??????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f5e

SJIS-Win, EUC-JP: Classic character sets mainly used for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
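
A conversion like the one in the table can be approximated with a short Python sketch. The codec names below (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the function name `bit_strings` are assumptions chosen for illustration, not what this page uses internally. Python's codecs may differ from this tool for some rarely used characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the `3f` bytes in the ISO-8859-1 and UHC rows above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text):
    """Return {charset label: (binary string, hex string)} for text."""
    result = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Render each byte as 8 binary digits, most significant bit first.
        binary = "".join(f"{byte:08b}" for byte in raw)
        result[label] = (binary, raw.hex())
    return result


if __name__ == "__main__":
    for label, (binary, hexadecimal) in bit_strings("あ^").items():
        print(label, binary, hexadecimal)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.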