To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f42
EUC-JP ????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f42
UTF-8 렺씻렔렻매렮렺씻렔렻매렮B 11101011101000001011101011101100100101001011101111101011101000001001010011101011101000001011101111101011101001111010010011101011101000001010111011101011101000001011101011101100100101001011101111101011101000001001010011101011101000001011101111101011101001111010010011101011101000001010111001000010 eba0baec94bbeba094eba0bbeba7a4eba0aeeba0baec94bbeba094eba0bbeba7a4eba0ae42
UHC 렺씻렔렻매렮렺씻렔렻매렮B 10001110110000101011111011000100100011101010100110001110110000111011100011000101100011101011101110001110110000101011111011000100100011101010100110001110110000111011100011000101100011101011101101000010 8ec2bec48ea98ec3b8c58ebb8ec2bec48ea98ec3b8c58ebb42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)