To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 霓?????矣??揶??霓?????矣??揶??B 11101000101111010011111100111111001111110011111100111111111000011110000100111111001111111001110110001000001111110011111111101000101111010011111100111111001111110011111100111111111000011110000100111111001111111001110110001000001111110011111101000010 e8bd3f3f3f3f3fe1e13f3f9d883f3fe8bd3f3f3f3f3fe1e13f3f9d883f3f42
EUC-JP 霓??沅??矣??揶??霓??沅??矣??揶??B 1111000010111111001111110011111110001111110001101110100100111111001111111110001011100011001111110011111111011001111010000011111100111111111100001011111100111111001111111000111111000110111010010011111100111111111000101110001100111111001111111101100111101000001111110011111101000010 f0bf3f3f8fc6e93f3fe2e33f3fd9e83f3ff0bf3f3f8fc6e93f3fe2e33f3fd9e83f3f42
UTF-8 霓얠떝沅좑㎖矣섎겱揶쏄릎霓얠떝沅좑㎖矣섎겱揶쏄릎B 11101001100111001001001111101100100101101010000011101011100101101001110111100110101100101000010111101100101000101001000111100011100011101001011011100111100111111010001111101100100001001000111011101010101100101011000111100110100011111011011011101100100011111000010011101011101001101000111011101001100111001001001111101100100101101010000011101011100101101001110111100110101100101000010111101100101000101001000111100011100011101001011011100111100111111010001111101100100001001000111011101010101100101011000111100110100011111011011011101100100011111000010011101011101001101000111001000010 e99c93ec96a0eb969de6b285eca291e38e96e79fa3ec848eeab2b1e68fb6ec8f84eba68ee99c93ec96a0eb969de6b285eca291e38e96e79fa3ec848eeab2b1e68fb6ec8f84eba68e42
UHC 霓얠떝沅좑㎖矣섎겱揶쏄릎霓얠떝沅좑㎖矣섎겱揶쏄릎B 11100111111001111011111011101100100010111011001111101010101101101010000011101111101001111010001011101011111110001001100011101011100000011011110111100101101010101001101111101010101110001010110111100111111001111011111011101100100010111011001111101010101101101010000011101111101001111010001011101011111110001001100011101011100000011011110111100101101010101001101111101010101110001010110101000010 e7e7beec8bb3eab6a0efa7a2ebf898eb81bde5aa9beab8ade7e7beec8bb3eab6a0efa7a2ebf898eb81bde5aa9beab8ad42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)