To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 î¯Úç´­éüŽ­}î¯Úç´­éüŽ­{^ 1110111010101111110110101110011110110100101011011110100111111100100011101010110101111101111011101010111111011010111001111011010010101101111010011111110010001110101011010111101101011110 eeafdae7b4ade9fc8ead7deeafdae7b4ade9fc8ead7b5e
SJIS-WIN ????´?????}????´?????{^ 00111111001111110011111100111111100000010100110000111111001111110011111100111111001111110111110100111111001111110011111100111111100000010100110000111111001111110011111100111111001111110111101101011110 3f3f3f3f814c3f3f3f3f3f7d3f3f3f3f814c3f3f3f3f3f7b5e
EUC-JP î¯Úç´?éü??}î¯Úç´?éü??{^ 10001111101010111100001010001111101000101011010010001111101010101110001010001111101010111010111010100001101011010011111110001111101010111011000110001111101010111110010000111111001111110111110110001111101010111100001010001111101000101011010010001111101010101110001010001111101010111010111010100001101011010011111110001111101010111011000110001111101010111110010000111111001111110111101101011110 8fabc28fa2b48faae28fabaea1ad3f8fabb18fabe43f3f7d8fabc28fa2b48faae28fabaea1ad3f8fabb18fabe43f3f7b5e
UTF-8 î¯Úç´­éüŽ­}î¯Úç´­éüŽ­{^ 11000011101011101100001010101111110000111001101011000011101001111100001010110100110000101010110111000011101010011100001110111100110000101000111011000010101011010111110111000011101011101100001010101111110000111001101011000011101001111100001010110100110000101010110111000011101010011100001110111100110000101000111011000010101011010111101101011110 c3aec2afc39ac3a7c2b4c2adc3a9c3bcc28ec2ad7dc3aec2afc39ac3a7c2b4c2adc3a9c3bcc28ec2ad7b5e
UHC ????´­???­}????´­???­{^ 0011111100111111001111110011111110100010101001011010000110101001001111110011111100111111101000011010100101111101001111110011111100111111001111111010001010100101101000011010100100111111001111110011111110100001101010010111101101011110 3f3f3f3fa2a5a1a93f3f3fa1a97d3f3f3f3fa2a5a1a93f3f3fa1a97b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
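
The converter's own implementation is not shown on this page. The following is a minimal Python sketch of the same idea: encode the input in a given character set, then print each byte as 8 binary digits and as 2 hexadecimal digits. The function name to_bit_strings is hypothetical. The codec names cp932 (for SJIS-Win) and cp949 (for UHC) are assumed to be the closest Python equivalents, so results for some characters may differ from the table above. Unencodable characters are replaced with "?" (3f), which is what the 3f bytes in the table suggest the converter does.

```python
def to_bit_strings(text, charset):
    """Return (binary, hexadecimal) strings for text encoded in charset."""
    # errors="replace" substitutes "?" (byte 0x3f) for characters
    # that the target character set cannot represent.
    data = text.encode(charset, errors="replace")
    # Each byte becomes exactly 8 binary digits, zero-padded on the left.
    binary = "".join(f"{byte:08b}" for byte in data)
    # bytes.hex() gives two lowercase hexadecimal digits per byte.
    return binary, data.hex()


# Assumed Python codec names for the charsets listed in the table.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    sample = "\u00e9}"  # "é" followed by "}"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

For example, to_bit_strings("\u00e9", "utf-8") yields the two bytes c3 a9, which matches the c3a9 that appears for "é" in the UTF-8 row of the table.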