To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 鶯??誼??楡??野??鶯??誼??楡??野??B 111010011111001000111111001111111000101101100010001111110011111110011110101111100011111100111111100101101110110000111111001111111110100111110010001111110011111110001011011000100011111100111111100111101011111000111111001111111001011011101100001111110011111101000010 e9f23f3f8b623f3f9ebe3f3f96ec3f3fe9f23f3f8b623f3f9ebe3f3f96ec3f3f42
EUC-JP 鶯??誼??楡??野??鶯??誼??楡??野??B 111100101111010000111111001111111011010111000011001111110011111111011100110000000011111100111111110011001110111000111111001111111111001011110100001111110011111110110101110000110011111100111111110111001100000000111111001111111100110011101110001111110011111101000010 f2f43f3fb5c33f3fdcc03f3fccee3f3ff2f43f3fb5c33f3fdcc03f3fccee3f3f42
UTF-8 鶯ㅳ꺂誼븃퐴楡㏃젵野ㅽ뱚鶯ㅳ꺂誼븃퐴楡㏃젵野ㅽ뱚B 11101001101101101010111111100011100001011011001111101010101110101000001011101000101010101011110011101011101110001000001111101101100100001011010011100110101001011010000111100011100011111000001111101100101000001011010111101001100001111000111011100011100001011011110111101011101100011001101011101001101101101010111111100011100001011011001111101010101110101000001011101000101010101011110011101011101110001000001111101101100100001011010011100110101001011010000111100011100011111000001111101100101000001011010111101001100001111000111011100011100001011011110111101011101100011001101001000010 e9b6afe385b3eaba82e8aabcebb883ed90b4e6a5a1e38f83eca0b5e9878ee385bdebb19ae9b6afe385b3eaba82e8aabcebb883ed90b4e6a5a1e38f83eca0b5e9878ee385bdebb19a42
UHC 鶯ㅳ꺂誼븃퐴楡㏃젵野ㅽ뱚鶯ㅳ꺂誼븃퐴楡㏃젵野ㅽ뱚B 11100101101000111010010011100011100000111010101111101011111111101011101011101000101111011001110111101010111110001010011111101100101000001010100111100101101011111010010011101101100100111000000111100101101000111010010011100011100000111010101111101011111111101011101011101000101111011001110111101010111110001010011111101100101000001010100111100101101011111010010011101101100100111000000101000010 e5a3a4e383abebfebae8bd9deaf8a7eca0a9e5afa4ed9381e5a3a4e383abebfebae8bd9deaf8a7eca0a9e5afa4ed938142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)