To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鴦?????鎖??^ 111010011111000100111111001111110011111100111111001111111000110110111101001111110011111101011110 e9f13f3f3f3f3f8dbd3f3f5e
EUC-JP 鴦?????鎖??^ 111100101111001100111111001111110011111100111111001111111011101010111111001111110011111101011110 f2f33f3f3f3f3fbabf3f3f5e
UTF-8 鴦볛뮩留롥럳鎖노쭖^ 11101001101101001010011011101011101100111001101111101011101011101010100111101111101001111000110111101011101000011010010111101011100111111011001111101001100011101001011011101011100001011011100011101100101011011001011001011110 e9b4a6ebb39bebaea9efa78deba1a5eb9fb3e98e96eb85b8ecad965e
UHC 鴦볛뮩留롥럳鎖노쭖^ 11100100111011001001001111100010100100101011001111101011101001111000111011100101100011101001001111100001111100001011001111101011101001111000111001011110 e4ec93e292b3eba78ee58e93e1f0b3eba78e5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)