To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??踰??揄??癰??遊?┃???野 1110000110011111001111110011111111100110111110100011111100111111100111011000100100111111001111111110000110011110001111110011111110010111010101100011111110000100101010110011111100111111001111111001011011101100 e19f3f3fe6fa3f3f9d893f3fe19e3f3f97563f84ab3f3f3f96ec
EUC-JP 癲??踰??揄??癰??遊?┃洹??野 11100010101000010011111100111111111011001111110000111111001111111101100111101001001111110011111111100001111111100011111100111111110011011011011100111111101010001010110110001111110001111011101000111111001111111100110011101110 e2a13f3fecfc3f3fd9e93f3fe1fe3f3fcdb73fa8ad8fc7ba3f3fccee
UTF-8 癲섍퉭踰됵쭏揄쒖쾸癰귘뮦遊븅┃洹숇즸野 111001111001100110110010111011001000010010001101111011011000100110101101111010001011100010110000111010111001000010110101111011001010110110001111111001101000111110000100111011001001001010010110111011001011111010111000111001111001100110110000111010101011011110011000111010111010111010100110111010011000000110001010111010111011100010000101111000101001010010000011111001101011010010111001111011001000100010000111111011001010011010111000111010011000011110001110 e799b2ec848ded89ade8b8b0eb90b5ecad8fe68f84ec9296ecbeb8e799b0eab798ebaea6e9818aebb885e29483e6b4b9ec8887eca6b8e9878e
UHC 癲섍퉭踰됵쭏揄쒖쾸癰귘뮦遊븅┃洹숇즸野 1110111110100110100110001110101010111001100001011110101110110010100010011110111110100111100010001110101011110001100111001110110010110010100011101110100010111001100000101110001010010010101100011110101110110100101110101110100110100110101011011110101010110111100110011110101110100011100010101110010110101111 efa698eab985ebb289efa788eaf19cecb28ee8b982e292b1ebb4bae9a6adeab799eba38ae5af

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)