To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN シス鴫ヤ鄒イ 1111000111111100101111001111000110001110101111011000111010110000111100001010111111010100111001111011111010110010 f1fcbcf18ebd8eb0f0afd4e7beb2
EUC-JP ?シ?ス鴫?ヤ鄒イ 001111111000111010111100001111111000111010111101101111001011001000111111100011101101010011101110110000001000111010110010 3f8ebc3f8ebdbcb23f8ed4eec08eb2
UTF-8 シス鴫ヤ鄒イ 111011101000010110110111111011111011110110111100111011101000010010001001111011111011110110111101111010011011010010101011111011101000000110101110111011111011111010010100111010011000010010010010111011111011110110110010 ee85b7efbdbcee8489efbdbde9b4abee81aeefbe94e98492efbdb2
UHC ???????鄒? 00111111001111110011111100111111001111110011111100111111111101011101101100111111 3f3f3f3f3f3f3ff5db3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)