To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????[????[^ 0011111100111111001111110011111101011011001111110011111100111111001111110101101101011110 3f3f3f3f5b3f3f3f3f5b5e
SJIS-WIN ソ苟ソ苟[ソ苟ソ苟[^ 101111111110010010010001101111111110010010010001010110111011111111100100100100011011111111100100100100010101101101011110 bfe491bfe4915bbfe491bfe4915b5e
EUC-JP ソ苟ソ苟[ソ苟ソ苟[^ 10001110101111111110011111110001100011101011111111100111111100010101101110001110101111111110011111110001100011101011111111100111111100010101101101011110 8ebfe7f18ebfe7f15b8ebfe7f18ebfe7f15b5e
UTF-8 ソ苟ソ苟[ソ苟ソ苟[^ 111011111011110110111111111010001000101110011111111011111011110110111111111010001000101110011111010110111110111110111101101111111110100010001011100111111110111110111101101111111110100010001011100111110101101101011110 efbdbfe88b9fefbdbfe88b9f5befbdbfe88b9fefbdbfe88b9f5b5e
UHC ?苟?苟[?苟?苟[^ 001111111100111111000010001111111100111111000010010110110011111111001111110000100011111111001111110000100101101101011110 3fcfc23fcfc25b3fcfc23fcfc25b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)