What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????i??????????iB 0011111100111111001111110011111100111111001111110011111100111111001111110011111101101001001111110011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f3f6942
SJIS-WIN 族逢??縡???佚?i族逢??縡???佚?iB 10010001101100001000100010100111001111110011111111100011011100010011111100111111001111111001100011000011001111110110100110010001101100001000100010100111001111110011111111100011011100010011111100111111001111111001100011000011001111110110100101000010 91b088a73f3fe3713f3f3f98c33f6991b088a73f3fe3713f3f3f98c33f6942
EUC-JP 族逢??縡???佚?i族逢??縡???佚?iB 11000010101100101011000010101001001111110011111111100101110100100011111100111111001111111101000011000101001111110110100111000010101100101011000010101001001111110011111111100101110100100011111100111111001111111101000011000101001111110110100101000010 c2b2b0a93f3fe5d23f3f3fd0c53f69c2b2b0a93f3fe5d23f3f3fd0c53f6942
UTF-8 族逢렰렕縡닻렦렧佚렡i族逢렰렕縡닻렦렧佚렡iB 111001101001011110001111111010011000000010100010111010111010000010110000111010111010000010010101111001111011100010100001111010111000101110111011111010111010000010100110111010111010000010100111111001001011110110011010111010111010000010100001011010011110011010010111100011111110100110000000101000101110101110100000101100001110101110100000100101011110011110111000101000011110101110001011101110111110101110100000101001101110101110100000101001111110010010111101100110101110101110100000101000010110100101000010 e6978fe980a2eba0b0eba095e7b8a1eb8bbbeba0a6eba0a7e4bd9aeba0a169e6978fe980a2eba0b0eba095e7b8a1eb8bbbeba0a6eba0a7e4bd9aeba0a16942
UHC 族逢렰렕縡닻렦렧佚렡i族逢렰렕縡닻렦렧佚렡iB 11110000111010011101110011110001100011101011110110001110101010101110111010101101101101001110100110001110101101011000111010110110111011001110101010001110101100100110100111110000111010011101110011110001100011101011110110001110101010101110111010101101101101001110100110001110101101011000111010110110111011001110101010001110101100100110100101000010 f0e9dcf18ebd8eaaeeadb4e98eb58eb6ecea8eb269f0e9dcf18ebd8eaaeeadb4e98eb58eb6ecea8eb26942

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
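
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the helper name to_bit_strings and the CHARSETS mapping are hypothetical, and the Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) are assumed to be the closest equivalents of the charset labels above. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, zero-padded on the left.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "族iB"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

For example, to_bit_strings("iB", "utf-8") yields the final two bytes of every row above: 0110100101000010 in binary and 6942 in hexadecimal.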