To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ûCް}ûCް{^ 1111101101000011100011101011000001111101111110110100001110001110101100000111101101011110 fb438eb07dfb438eb07b5e
SJIS-WIN ?C?°}?C?°{^ 00111111010000110011111110000001100010110111110100111111010000110011111110000001100010110111101101011110 3f433f818b7d3f433f818b7b5e
EUC-JP ûC?°}ûC?°{^ 1000111110101011111001010100001100111111101000011110101101111101100011111010101111100101010000110011111110100001111010110111101101011110 8fabe5433fa1eb7d8fabe5433fa1eb7b5e
UTF-8 ûCް}ûCް{^ 1100001110111011010000111100001010001110110000101011000001111101110000111011101101000011110000101000111011000010101100000111101101011110 c3bb43c28ec2b07dc3bb43c28ec2b07b5e
UHC ?C?°}?C?°{^ 00111111010000110011111110100001110001100111110100111111010000110011111110100001110001100111101101011110 3f433fa1c67d3f433fa1c67b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)