To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???旭??????隍???旭??????蝗^ 001111110011111100111111100010001010111000111111001111110011111100111111001111110011111111101000101001000011111100111111001111111000100010101110001111110011111100111111001111110011111100111111111001011001101101011110 3f3f3f88ae3f3f3f3f3f3fe8a43f3f3f88ae3f3f3f3f3f3fe59b5e
EUC-JP ???旭??????隍???旭??????蝗^ 001111110011111100111111101100001011000000111111001111110011111100111111001111110011111111110000101001100011111100111111001111111011000010110000001111110011111100111111001111110011111100111111111010011111101101011110 3f3f3fb0b03f3f3f3f3f3ff0a63f3f3fb0b03f3f3f3f3f3fe9fb5e
UTF-8 쒔렺쒔旭렕롌뤏쮱굶ㆂ隍쒔렺쒔旭렕롌뤏쮱굶ㆂ蝗^ 11101100100100101001010011101011101000001011101011101100100100101001010011100110100101111010110111101011101000001001010111101011101000011000110011101011101001001000111111101100101011101011000111101010101101011011011011100011100001101000001011101001100110101000110111101100100100101001010011101011101000001011101011101100100100101001010011100110100101111010110111101011101000001001010111101011101000011000110011101011101001001000111111101100101011101011000111101010101101011011011011100011100001101000001011101000100111011001011101011110 ec9294eba0baec9294e697adeba095eba18ceba48fecaeb1eab5b6e38682e99a8dec9294eba0baec9294e697adeba095eba18ceba48fecaeb1eab5b6e38682e89d975e
UHC 쒔렺쒔旭렕롌뤏쮱굶ㆂ隍쒔렺쒔旭렕롌뤏쮱굶ㆂ蝗^ 101111101010110110001110110000101011111010101101111010011110111110001110101010101000111011010010100011111011111110101000100011101011000110111110101001001111001011111100110110111011111010101101100011101100001010111110101011011110100111101111100011101010101010001110110100101000111110111111101010001000111010110001101111101010010011110010111111001101100101011110 bead8ec2beade9ef8eaa8ed28fbfa88eb1bea4f2fcdbbead8ec2beade9ef8eaa8ed28fbfa88eb1bea4f2fcd95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)