What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??踰??揄??癰??遊?┃???野??? 1110000110011111001111110011111111100110111110100011111100111111100111011000100100111111001111111110000110011110001111110011111110010111010101100011111110000100101010110011111100111111001111111001011011101100001111110011111100111111 e19f3f3fe6fa3f3f9d893f3fe19e3f3f97563f84ab3f3f3f96ec3f3f3f
EUC-JP 癲??踰??揄??癰??遊?┃洹??野??? 11100010101000010011111100111111111011001111110000111111001111111101100111101001001111110011111111100001111111100011111100111111110011011011011100111111101010001010110110001111110001111011101000111111001111111100110011101110001111110011111100111111 e2a13f3fecfc3f3fd9e93f3fe1fe3f3fcdb73fa8ad8fc7ba3f3fccee3f3f3f
UTF-8 癲섍퉭踰됵쭏揄쒖쾸癰귘뮦遊븅┃洹숇즸野꺨룹녇 111001111001100110110010111011001000010010001101111011011000100110101101111010001011100010110000111010111001000010110101111011001010110110001111111001101000111110000100111011001001001010010110111011001011111010111000111001111001100110110000111010101011011110011000111010111010111010100110111010011000000110001010111010111011100010000101111000101001010010000011111001101011010010111001111011001000100010000111111011001010011010111000111010011000011110001110111010101011101010101000111010111010001110111001111010111000010110000111 e799b2ec848ded89ade8b8b0eb90b5ecad8fe68f84ec9296ecbeb8e799b0eab798ebaea6e9818aebb885e29483e6b4b9ec8887eca6b8e9878eeabaa8eba3b9eb8587
UHC 癲섍퉭踰됵쭏揄쒖쾸癰귘뮦遊븅┃洹숇즸野꺨룹녇 1110111110100110100110001110101010111001100001011110101110110010100010011110111110100111100010001110101011110001100111001110110010110010100011101110100010111001100000101110001010010010101100011110101110110100101110101110100110100110101011011110101010110111100110011110101110100011100010101110010110101111100000111100111010110111111011001000011010111110 efa698eab985ebb289efa788eaf19cecb28ee8b982e292b1ebb4bae9a6adeab799eba38ae5af83ceb7ec86be

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
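
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) and the function name encode_table are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the table above.

```python
# Hypothetical mapping from the table's labels to Python codec names.
# These pairings are assumptions, not taken from the page itself.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {label: (binary_string, hex_string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (binary, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexa) in encode_table("あ").items():
        print(label, binary, hexa)
```

For the single character "あ", this sketch yields e38182 in UTF-8, 82a0 in SJIS-WIN, a4a2 in EUC-JP, and 3f in ISO-8859-1, since Latin-1 has no hiragana.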