Which bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??????^ | 00111111001111110011111100111111001111110011111101011110 | 3f3f3f3f3f3f5e |
| SJIS-WIN | 阯壌長阯壌町^ | 11101000100101111000111111101011100100101011011111101000100101111000111111101011100100101010110001011110 | e8978feb92b7e8978feb92ac5e |
| EUC-JP | 阯壌長阯壌町^ | 11101111111101111011111011101101110001001011100111101111111101111011111011101101110001001010111001011110 | eff7beedc4b9eff7beedc4ae5e |
| UTF-8 | 阯壌長阯壌町^ | 11101001100110001010111111100101101000111000110011101001100101011011011111101001100110001010111111100101101000111000110011100111100101001011101001011110 | e998afe5a38ce995b7e998afe5a38ce794ba5e |
| UHC | ??長??町^ | 001111110011111111101101111111100011111100111111111011111110101101011110 | 3f3fedfe3f3fefeb5e |

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
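
The conversion this page performs can be approximated in a few lines of Python. This is only a sketch, not the tool's actual implementation. The mapping from the charset labels to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) and the helper names `to_bits` and `encode_all` are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (hex 3f) via `errors="replace"`, which resembles the "?" entries in the results.

```python
# Assumed mapping from the charset labels used on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def to_bits(data: bytes) -> str:
    """Render each byte as eight binary digits, concatenated without separators."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_all(text: str) -> dict:
    """Encode text in every charset; return {label: (binary string, hex string)}."""
    result = {}
    for label, codec in CODECS.items():
        # Unrepresentable characters become "?" instead of raising an error.
        data = text.encode(codec, errors="replace")
        result[label] = (to_bits(data), data.hex())
    return result


if __name__ == "__main__":
    for label, (bits, hexadecimal) in encode_all("長^").items():
        print(label, bits, hexadecimal)
```

Results for characters outside a charset's repertoire depend on the codec tables of the library in use, so they may not match another converter byte for byte.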