What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 »ó즏»ùŽ©»ó즏»ùÊŽ­» 1000111110111011111100111110110010100110100011111011101111111001100011101010100110001111101110111111001111101100101001101000111110111011111110011100101011101110100011101011001010001110101011011000111110111011 8fbbf3eca68fbbf98ea98fbbf3eca68fbbf9caee8eb28ead8fbb
SJIS-WIN ?????????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP ??óì¦??ù?©??óì¦??ùÊî?????? 001111110011111110001111101010111101000110001111101010111100000010001111101000101100001100111111001111111000111110101011111000110011111110001111101000101110110100111111001111111000111110101011110100011000111110101011110000001000111110100010110000110011111100111111100011111010101111100011100011111010101010110100100011111010101111000010001111110011111100111111001111110011111100111111 3f3f8fabd18fabc08fa2c33f3f8fabe33f8fa2ed3f3f8fabd18fabc08fa2c33f3f8fabe38faab48fabc23f3f3f3f3f3f
UTF-8 »ó즏»ùŽ©»ó즏»ùÊŽ­» 11000010100011111100001010111011110000111011001111000011101011001100001010100110110000101000111111000010101110111100001110111001110000101000111011000010101010011100001010001111110000101011101111000011101100111100001110101100110000101010011011000010100011111100001010111011110000111011100111000011100010101100001110101110110000101000111011000010101100101100001010001110110000101010110111000010100011111100001010111011 c28fc2bbc3b3c3acc2a6c28fc2bbc3b9c28ec2a9c28fc2bbc3b3c3acc2a6c28fc2bbc3b9c38ac3aec28ec2b2c28ec2adc28fc2bb
UHC ?????????????????????²?­?? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110101001111101110011111110100001101010010011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fa9f73fa1a93f3f

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
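
The conversion shown in the table can be approximated in a few lines of Python. This is only a minimal sketch, not the code behind this page: the mapping from the charset labels to Python codec names (for example `cp932` for SJIS-Win and `cp949` for UHC) is an assumption, and Python's codecs may not match this converter byte for byte. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which is the same pattern visible in the rows above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


# Print one row per charset for a sample input.
for label, codec in CODECS.items():
    binary, hexadecimal = to_bit_strings("é", codec)
    print(label, binary, hexadecimal)
```

For instance, `to_bit_strings("é", "utf-8")` yields the two-byte sequence `c3a9`, while the same character in ISO-8859-1 is the single byte `e9`.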