To which bit string is a character (or several characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 凹?????閻??抑??凹?????閻??抑??B 10001001100110100011111100111111001111110011111100111111111010001000010100111111001111111001011101111101001111110011111110001001100110100011111100111111001111110011111100111111111010001000010100111111001111111001011101111101001111110011111101000010 899a3f3f3f3f3fe8853f3f977d3f3f899a3f3f3f3f3fe8853f3f977d3f3f42
EUC-JP 凹?????閻??抑??凹?????閻??抑??B 10110001111110100011111100111111001111110011111100111111111011111110010100111111001111111100110111011110001111110011111110110001111110100011111100111111001111110011111100111111111011111110010100111111001111111100110111011110001111110011111101000010 b1fa3f3f3f3f3fefe53f3fcdde3f3fb1fa3f3f3f3f3fefe53f3fcdde3f3f42
UTF-8 凹좊퉬溜곕젺閻뉖젪抑뜸섞凹좊퉬溜곕젺閻뉖젪抑뜸섞B 11100101100001111011100111101100101000101000101011101101100010011010110011101111101001111000101111101010101100111001010111101100101000001011101011101001100101101011101111101011100010011001011011101100101000001010101011100110100010101001000111101011100111001011100011101100100001001001111011100101100001111011100111101100101000101000101011101101100010011010110011101111101001111000101111101010101100111001010111101100101000001011101011101001100101101011101111101011100010011001011011101100101000001010101011100110100010101001000111101011100111001011100011101100100001001001111001000010 e587b9eca28aed89acefa78beab395eca0bae996bbeb8996eca0aae68a91eb9cb8ec849ee587b9eca28aed89acefa78beab395eca0bae996bbeb8996eca0aae68a91eb9cb8ec849e42
UHC 凹좊퉬溜곕젺閻뉖젪抑뜸섞凹좊퉬溜곕젺閻뉖젪抑뜸섞B 11101000111010101010000011101011101110011000010011101010111111101011000011101011101000001010110111100111101000101000011111101011101000001010001011100101111001001011011011100100101111001010111111101000111010101010000011101011101110011000010011101010111111101011000011101011101000001010110111100111101000101000011111101011101000001010001011100101111001001011011011100100101111001010111101000010 e8eaa0ebb984eafeb0eba0ade7a287eba0a2e5e4b6e4bcafe8eaa0ebb984eafeb0eba0ade7a287eba0a2e5e4b6e4bcaf42

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
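
A table like the one above can be reproduced with a minimal Python sketch. This is not the converter's own implementation. The function name `encode_table` is hypothetical, and the mapping of the charset labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the ISO-8859-1 row in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return one (charset, character, binary, hex) row per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        # Each byte becomes eight binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, data.decode(codec), bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexed in encode_table("凹B"):
        print(label, shown, bits, hexed)
```

For the input "凹B", this sketch yields the hexadecimal strings `3f42` for ISO-8859-1, `899a42` for SJIS-WIN, `b1fa42` for EUC-JP, and `e587b942` for UTF-8, matching the leading and trailing bytes of the corresponding rows above.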