To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???i???iB 001111110011111100111111011010010011111100111111001111110110100101000010 3f3f3f693f3f3f6942
SJIS-WIN 彗??i彗??iB 1001110001100001001111110011111101101001100111000110000100111111001111110110100101000010 9c613f3f699c613f3f6942
EUC-JP 彗??i彗??iB 1101011111000010001111110011111101101001110101111100001000111111001111110110100101000010 d7c23f3f69d7c23f3f6942
UTF-8 彗묔궗i彗묔궗iB 111001011011110110010111111010111010110010010100111010101011011010010111011010011110010110111101100101111110101110101100100101001110101010110110100101110110100101000010 e5bd97ebac94eab69769e5bd97ebac94eab6976942
UHC 彗묔궗i彗묔궗iB 111110111011001010010001111011101000001010101100011010011111101110110010100100011110111010000010101011000110100101000010 fbb291ee82ac69fbb291ee82ac6942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)