To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN 炭続辿揃達炭^ 10010010010110011001000110110001100100100100100010010001101101011001001001000010100100100101100101011110 925991b1924891b5924292595e
EUC-JP 炭続辿揃達炭^ 11000011101110101100001010110011110000111010100111000010101101111100001110100011110000111011101001011110 c3bac2b3c3a9c2b7c3a3c3ba5e
UTF-8 炭続辿揃達炭^ 11100111100000101010110111100111101101101001101011101000101111101011111111100110100011111000001111101001100000011001010011100111100000101010110101011110 e782ade7b69ae8bebfe68f83e98194e782ad5e
UHC 炭???達炭^ 11110111101010010011111100111111001111111101001110111001111101111010100101011110 f7a93f3f3fd3b9f7a95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)