To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 縡?霽?鬱屯??趙貊?縡?制競?弔?僥 111000110111000100111111111010001100011100111111100111110101010010010011110101000011111100111111111001101110001011100110101110110011111111100011011100010011111110010000101001111000101110100011001111111001001010100010001111111001100101000110 e3713fe8c73f9f5493d43f3fe6e2e6bb3fe3713f90a78ba33f92a23f9946
EUC-JP 縡?霽?鬱屯??趙貊?縡?制競?弔?僥 111001011101001000111111111100001100100100111111110111011011010111000110110101100011111100111111111011001110010011101100101111010011111111100101110100100011111111000000101010011011011010100101001111111100010010100100001111111101000110100111 e5d23ff0c93fddb5c6d63f3fece4ecbd3fe5d23fc0a9b6a53fc4a43fd1a7
UTF-8 縡렕霽렢鬱屯렕렟趙貊볕縡렕制競歷弔렲僥 111001111011100010100001111010111010000010010101111010011001110010111101111010111010000010100010111010011010110010110001111001011011000110101111111010111010000010010101111010111010000010011111111010001011011010011001111010001011001010001010111010111011001110010101111001111011100010100001111010111010000010010101111001011000100010110110111001111010101110110110111011111010011010001100111001011011110010010100111010111010000010110010111001011000001110100101 e7b8a1eba095e99cbdeba0a2e9acb1e5b1afeba095eba09fe8b699e8b28aebb395e7b8a1eba095e588b6e7abb6efa68ce5bc94eba0b2e583a5
UHC 縡렕霽렢鬱屯렕렟趙貊볕縡렕制競歷弔렲僥 1110111010101101100011101010101011110000101110001000111010110011111010101010011011010100111010101000111010101010100011101011000011110000111000011101100011100111101110101011010111101110101011011000111010101010111100001010010011001100111001101110011010111000111100001100000010001110101111111110100011101001 eead8eaaf0b88eb3eaa6d4ea8eaa8eb0f0e1d8e7bab5eead8eaaf0a4cce6e6b8f0c08ebfe8e9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)