To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????z???????zB 0011111100111111001111110011111100111111001111110011111101111010001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f7a42
SJIS-WIN 上シシキシォz上シシキシォzB 100011111110001110111100101111001011011111110010111010101011110010101011011110101000111111100011101111001011110010110111111100101110101010111100101010110111101001000010 8fe3bcbcb7f2eabcab7a8fe3bcbcb7f2eabcab7a42
EUC-JP 上シシキ?シォz上シシキ?シォzB 1011111011100101100011101011110010001110101111001000111010110111001111111000111010111100100011101010101101111010101111101110010110001110101111001000111010111100100011101011011100111111100011101011110010001110101010110111101001000010 bee58ebc8ebc8eb73f8ebc8eab7abee58ebc8ebc8eb73f8ebc8eab7a42
UTF-8 上シシキシォz上シシキシォzB 111001001011100010001010111011111011110110111100111011111011110110111100111011111011110110110111111011101000100010100001111011111011110110111100111011111011110110101011011110101110010010111000100010101110111110111101101111001110111110111101101111001110111110111101101101111110111010001000101000011110111110111101101111001110111110111101101010110111101001000010 e4b88aefbdbcefbdbcefbdb7ee88a1efbdbcefbdab7ae4b88aefbdbcefbdbcefbdb7ee88a1efbdbcefbdab7a42
UHC 上??????z上??????zB 11011111101111100011111100111111001111110011111100111111001111110111101011011111101111100011111100111111001111110011111100111111001111110111101001000010 dfbe3f3f3f3f3f3f7adfbe3f3f3f3f3f3f7a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)