To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN 他達其他袖淡^ 10010001101111001001001001000010100100011011010010010001101111001001000110110011100100100101011101011110 91bc924291b491bc91b392575e
EUC-JP 他達其他袖淡^ 11000010101111101100001110100011110000101011011011000010101111101100001010110101110000111011100001011110 c2bec3a3c2b6c2bec2b5c3b85e
UTF-8 他達其他袖淡^ 11100100101110111001011011101001100000011001010011100101100001011011011011100100101110111001011011101000101000101001011011100110101101111010000101011110 e4bb96e98194e585b6e4bb96e8a296e6b7a15e
UHC 他達其他袖淡^ 11110110111000101101001110111001110100001110110011110110111000101110001011000000110100111011111101011110 f6e2d3b9d0ecf6e2e2c0d3bf5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)