What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ï玾åïÆҎ¨nï玾åïÆҎ¨nB 1110111111100111100011101011111011100101111011111100011011010010100011101010100001101110111011111110011110001110101111101110010111101111110001101101001010001110101010000110111001000010 efe78ebee5efc6d28ea86eefe78ebee5efc6d28ea86e42
SJIS-WIN ?????????¨n?????????¨nB 00111111001111110011111100111111001111110011111100111111001111110011111110000001010011100110111000111111001111110011111100111111001111110011111100111111001111110011111110000001010011100110111001000010 3f3f3f3f3f3f3f3f3f814e6e3f3f3f3f3f3f3f3f3f814e6e42
EUC-JP ïç??åïÆÒ?¨nïç??åïÆÒ?¨nB 10001111101010111100000110001111101010111010111000111111001111111000111110101011101010011000111110101011110000011000111110101001101000011000111110101010110100100011111110100001101011110110111010001111101010111100000110001111101010111010111000111111001111111000111110101011101010011000111110101011110000011000111110101001101000011000111110101010110100100011111110100001101011110110111001000010 8fabc18fabae3f3f8faba98fabc18fa9a18faad23fa1af6e8fabc18fabae3f3f8faba98fabc18fa9a18faad23fa1af6e42
UTF-8 ï玾åïÆҎ¨nï玾åïÆҎ¨nB 11000011101011111100001110100111110000101000111011000010101111101100001110100101110000111010111111000011100001101100001110010010110000101000111011000010101010000110111011000011101011111100001110100111110000101000111011000010101111101100001110100101110000111010111111000011100001101100001110010010110000101000111011000010101010000110111001000010 c3afc3a7c28ec2bec3a5c3afc386c392c28ec2a86ec3afc3a7c28ec2bec3a5c3afc386c392c28ec2a86e42
UHC ???¾??Æ??¨n???¾??Æ??¨nB 0011111100111111001111111010100011111010001111110011111110101000101000010011111100111111101000011010011101101110001111110011111100111111101010001111101000111111001111111010100010100001001111110011111110100001101001110110111001000010 3f3f3fa8fa3f3fa8a13f3fa1a76e3f3f3fa8fa3f3fa8a13f3fa1a76e42

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
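
The conversion shown in the table can be sketched in a few lines of Python: encode the input text in each charset, then print the resulting bytes as a binary string and as a hexadecimal string. This is only an illustration, not the converter behind this page. The mapping from the table's charset labels to Python codec names (SJIS-WIN to cp932, UHC to cp949) and the helper names encode_row and encode_table are assumptions made for this sketch. Characters that a charset cannot represent are replaced with "?" (0x3f), and the exact bytes for rare characters may differ from the table because mapping tables vary between implementations.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# The page's own converter may use different names or mapping tables.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become "?" (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one row per charset, in the same column order as the table.
    sample = "あ"
    for label, (binary, hexadecimal) in encode_table(sample).items():
        print(label, sample, binary, hexadecimal)
```

For example, encode_row("A", "utf-8") returns ("01000001", "41"), and the Japanese letter "あ" comes out as 82a0 in cp932, a4a2 in euc_jp, and e38182 in utf-8.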