What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????zh???????z 0011111100111111001111110011111100111111001111110011111101111010011010000011111100111111001111110011111100111111001111110011111101111010 3f3f3f3f3f3f3f7a683f3f3f3f3f3f3f7a
SJIS-WIN セュ茱鹿セ、篠zhセュ茱鹿セ、篠z 1011111010101101111001001010001110001110101011011011111010100100100011101100001001111010011010001011111010101101111001001010001110001110101011011011111010100100100011101100001001111010 beade4a38eadbea48ec27a68beade4a38eadbea48ec27a
EUC-JP セュ茱鹿セ、篠zhセュ茱鹿セ、篠z 10001110101111101000111010101101111010001010010110111100101011111000111010111110100011101010010010111100110001000111101001101000100011101011111010001110101011011110100010100101101111001010111110001110101111101000111010100100101111001100010001111010 8ebe8eade8a5bcaf8ebe8ea4bcc47a688ebe8eade8a5bcaf8ebe8ea4bcc47a
UTF-8 セュ茱鹿セ、篠zhセュ茱鹿セ、篠z 111011111011110110111110111011111011110110101101111010001000110010110001111010011011100110111111111011111011110110111110111011111011110110100100111001111010111110100000011110100110100011101111101111011011111011101111101111011010110111101000100011001011000111101001101110011011111111101111101111011011111011101111101111011010010011100111101011111010000001111010 efbdbeefbdade88cb1e9b9bfefbdbeefbda4e7afa07a68efbdbeefbdade88cb1e9b9bfefbdbeefbda4e7afa07a
UHC ??茱鹿??篠zh??茱鹿??篠z 0011111100111111111000101011110011010110111000110011111100111111111000011100011001111010011010000011111100111111111000101011110011010110111000110011111100111111111000011100011001111010 3f3fe2bcd6e33f3fe1c67a683f3fe2bcd6e33f3fe1c67a
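
The following is a minimal sketch of how rows like the ones above could be computed. It is not the converter's actual implementation, which this page does not describe. It assumes that Python's standard codecs `cp932` and `cp949` correspond to the labels SJIS-WIN and UHC, and that characters a charset cannot represent are replaced with `?` (0x3f), as seen in the ISO-8859-1 row. The names `CODECS`, `encode_row`, and `encode_table` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# The cp932 and cp949 choices are assumptions, not taken from the tool.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("z").items():
        print(label, bits, hexa)
```

For a plain ASCII character such as `z`, every charset in this sketch yields the same single byte, 7a. Differences between the rows appear only for characters outside ASCII.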

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).