To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 襄玖濤襄矩椛襄区価 111001011111010110001011111010001001111110110111111001011111010110001011111010011000101010010001111001011111010110001011111001101000100110111111 e5f58be89fb7e5f58be98a91e5f58be689bf
EUC-JP 襄玖濤襄矩椛襄区価 111010101111011110110110111010101101111010111001111010101111011110110110111010111011001111110001111010101111011110110110111010001011001011000001 eaf7b6eadeb9eaf7b6ebb3f1eaf7b6e8b2c1
UTF-8 襄玖濤襄矩椛襄区価 111010001010010110000100111001111000111010010110111001101011111110100100111010001010010110000100111001111001111110101001111001101010010010011011111010001010010110000100111001011000110010111010111001001011111010100001 e8a584e78e96e6bfa4e8a584e79fa9e6a49be8a584e58cbae4bea1
UHC 襄玖濤襄矩?襄?? 111001011101000111001111101110001101010010100110111001011101000111001111101110110011111111100101110100010011111100111111 e5d1cfb8d4a6e5d1cfbb3fe5d13f3f

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A "?" (0x3f) in a row marks a character that the charset cannot represent.
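
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and Python's mapping tables may differ from the tool's for some characters. The function names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the behavior visible in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become b"?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again so the "Character" column shows what survived the encoding.
    shown = raw.decode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def encode_table(text):
    """Encode the same text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # U+3042 is HIRAGANA LETTER A; only the ASCII-safe columns are printed.
    for label, (_shown, bits, hexed) in encode_table("\u3042").items():
        print(label, bits, hexed)
```

Each byte is formatted as eight binary digits, so the bit string is always exactly four times as long as the hexadecimal string.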