To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 畑??乳?.湲??鴉??應??檍??萸 100101001010100000111111001111111001001111111011001111111000000101000100100111111101000100111111001111111110100111101011001111110011111110011100111001000011111100111111100111101111100000111111001111111110010011001110 94a83f3f93fb3f81449fd13f3fe9eb3f3f9ce43f3f9ef83f3fe4ce
EUC-JP 畑??乳?.湲??鴉??應??檍??萸 110010001010101000111111001111111100011011111101001111111010000110100101110111101101001100111111001111111111001011101101001111110011111111011000111001100011111100111111110111001111101000111111001111111110100011010000 c8aa3f3fc6fd3fa1a5ded33f3ff2ed3f3fd8e63f3fdcfa3f3fe8d0
UTF-8 畑밴퉭乳들.湲룸윹鴉곩쳞應뀁퍥檍우뼦萸 111001111001010110010001111010111011000010110100111011011000100110101101111001001011100110110011111010111001001110100100111011111011110010001110111001101011100110110010111010111010001110111000111011001001110010111001111010011011010010001001111010101011001110101001111011001011001110011110111001101000011110001001111010111000000010000001111011011000110110100101111001101010101010001101111011001001101010110000111010111011110010100110111010001001000010111000 e79591ebb0b4ed89ade4b9b3eb93a4efbc8ee6b9b2eba3b8ec9cb9e9b489eab3a9ecb39ee68789eb8081ed8da5e6aa8dec9ab0ebbca6e890b8
UHC 畑밴퉭乳들.湲룸윹鴉곩쳞應뀁퍥檍우뼦萸 1110111110100101101110011110101010111001100001011110101011100001101101011110100110100011101011101110101010111000101101111110101110011111101100111110010010111100100000011110010110101011100001001110101111101011101100101110110010111011100111001110010111100101101111111110110010010110101010011110101110101101 efa5b9eab985eae1b5e9a3aeeab8b7eb9fb3e4bc81e5ab84ebebb2ecbb9ce5e5bfec96a9ebad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)