Which bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 í莰ií莰iB 1110110111101000100011101011000001101001111011011110100010001110101100000110100101000010 ede88eb069ede88eb06942
SJIS-WIN ???°i???°iB 00111111001111110011111110000001100010110110100100111111001111110011111110000001100010110110100101000010 3f3f3f818b693f3f3f818b6942
EUC-JP íè?°iíè?°iB 100011111010101110111111100011111010101110110010001111111010000111101011011010011000111110101011101111111000111110101011101100100011111110100001111010110110100101000010 8fabbf8fabb23fa1eb698fabbf8fabb23fa1eb6942
UTF-8 í莰ií莰iB 11000011101011011100001110101000110000101000111011000010101100000110100111000011101011011100001110101000110000101000111011000010101100000110100101000010 c3adc3a8c28ec2b069c3adc3a8c28ec2b06942
UHC ???°i???°iB 00111111001111110011111110100001110001100110100100111111001111110011111110100001110001100110100101000010 3f3f3fa1c6693f3f3fa1c66942

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
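
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the implementation behind this page. The codec names in `CHARSETS` are assumptions about which Python codecs correspond to the charset labels above (for example `cp932` for SJIS-WIN and `cp949` for UHC), and `encode_report` is a hypothetical helper name.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # Characters the charset cannot represent are replaced with "?".
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_report("iB").items():
        print(label, bits, hex_string)
```

With `errors="replace"`, a character that a charset cannot represent is emitted as `?` (byte `3f`), which is similar to the `3f` bytes in the SJIS-WIN and UHC rows above. Other rows may differ from the page's output, since the codecs used there are not stated.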