To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN 菴削洲菴削洲^ 11100100101111011000110111101101100011110100011011100100101111011000110111101101100011110100011001011110 e4bd8ded8f46e4bd8ded8f465e
EUC-JP 菴削洲菴削洲^ 11101000101111111011101011101111101111011010011111101000101111111011101011101111101111011010011101011110 e8bfbaefbda7e8bfbaefbda75e
UTF-8 菴削洲菴削洲^ 11101000100011111011010011100101100010011000101011100110101101001011001011101000100011111011010011100101100010011000101011100110101101001011001001011110 e88fb4e5898ae6b4b2e88fb4e5898ae6b4b25e
UHC 菴削洲菴削洲^ 11100100111000001101111011111011111100011011110111100100111000001101111011111011111100011011110101011110 e4e0defbf1bde4e0defbf1bd5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)