To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????[????[^ 0011111100111111001111110011111101011011001111110011111100111111001111110101101101011110 3f3f3f3f5b3f3f3f3f5b5e
SJIS-WIN 衣?衣?[衣?衣?[^ 100010001101111100111111100010001101111100111111010110111000100011011111001111111000100011011111001111110101101101011110 88df3f88df3f5b88df3f88df3f5b5e
EUC-JP 衣?衣?[衣?衣?[^ 101100001110000100111111101100001110000100111111010110111011000011100001001111111011000011100001001111110101101101011110 b0e13fb0e13f5bb0e13fb0e13f5b5e
UTF-8 衣렍衣렍[衣렍衣렍[^ 111010001010000110100011111010111010000010001101111010001010000110100011111010111010000010001101010110111110100010100001101000111110101110100000100011011110100010100001101000111110101110100000100011010101101101011110 e8a1a3eba08de8a1a3eba08d5be8a1a3eba08de8a1a3eba08d5b5e
UHC 衣렍衣렍[衣렍衣렍[^ 11101011111111011000111010100011111010111111110110001110101000110101101111101011111111011000111010100011111010111111110110001110101000110101101101011110 ebfd8ea3ebfd8ea35bebfd8ea3ebfd8ea35b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)