To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 證蝎??淫???掌繃證蝎??淫???掌棚^ 11100110100110101110010110011001001111110011111110001000111110100011111100111111001111111000111110110110111000110111110111100110100110101110010110011001001111110011111110001000111110100011111100111111001111111000111110110110100100100100100101011110 e69ae5993f3f88fa3f3f3f8fb6e37de69ae5993f3f88fa3f3f3f8fb692495e
EUC-JP 證蝎??淫???掌繃證蝎??淫???掌棚^ 11101011111110101110100111111001001111110011111110110000111111000011111100111111001111111011111010111000111001011101111011101011111110101110100111111001001111110011111110110000111111000011111100111111001111111011111010111000110000111010101001011110 ebfae9f93f3fb0fc3f3f3fbeb8e5deebfae9f93f3fb0fc3f3f3fbeb8c3aa5e
UTF-8 證蝎렦렧淫렕綎렊掌繃證蝎렦렧淫렕綎렊掌棚^ 11101000101011011000100111101000100111011000111011101011101000001010011011101011101000001010011111100110101101111010101111101011101000001001010111100111101101101000111011101011101000001000101011100110100011101000110011100111101110011000001111101000101011011000100111101000100111011000111011101011101000001010011011101011101000001010011111100110101101111010101111101011101000001001010111100111101101101000111011101011101000001000101011100110100011101000110011100110101000111001101001011110 e8ad89e89d8eeba0a6eba0a7e6b7abeba095e7b68eeba08ae68e8ce7b983e8ad89e89d8eeba0a6eba0a7e6b7abeba095e7b68eeba08ae68e8ce6a39a5e
UHC 證蝎렦렧淫렕綎렊掌繃證蝎렦렧淫렕綎렊掌棚^ 1111000111111011110010101110100110001110101101011000111010110110111010111110001010001110101010101110111111110010100011101010000111101101111001101101110111011110111100011111101111001010111010011000111010110101100011101011011011101011111000101000111010101010111011111111001010001110101000011110110111100110110111011101110001011110 f1fbcae98eb58eb6ebe28eaaeff28ea1ede6dddef1fbcae98eb58eb6ebe28eaaeff28ea1ede6dddc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)