What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 症?蔚???指??籠逵蔚???指??纜 100011111100011100111111100010010101010100111111001111110011111110001110011101110011111100111111111000101100010011100111100111001000100101010101001111110011111100111111100011100111011100111111001111111110001110011100 8fc73f89553f3f3f8e773f3fe2c4e79c89553f3f3f8e773f3fe39c
EUC-JP 症?蔚???指??籠逵蔚???指??纜 101111101100100100111111101100011011011000111111001111110011111110111011110110000011111100111111111001001100011011101101111111001011000110110110001111110011111100111111101110111101100000111111001111111110010111111100 bec93fb1b63f3f3fbbd83f3fe4c6edfcb1b63f3f3fbbd83f3fe5fc
UTF-8 症렜蔚목렰렗指펨렎籠逵蔚목렰렗指펨렎纜 111001111001011110000111111010111010000010011100111010001001010010011010111010111010101010101001111010111010000010110000111010111010000010010111111001101000110010000111111011011000111010101000111010111010000010001110111001111011000110100000111010011000000010110101111010001001010010011010111010111010101010101001111010111010000010110000111010111010000010010111111001101000110010000111111011011000111010101000111010111010000010001110111001111011101010011100 e79787eba09ce8949aebaaa9eba0b0eba097e68c87ed8ea8eba08ee7b1a0e980b5e8949aebaaa9eba0b0eba097e68c87ed8ea8eba08ee7ba9c
UHC 症렜蔚목렰렗指펨렎籠逵蔚목렰렗指펨렎纜 1111000111111000100011101010111011101010101001011011100011110001100011101011110110001110101011001111001010100110110001101110100010001110101001001101011011101011110100001011000011101010101001011011100011110001100011101011110110001110101011001111001010100110110001101110100010001110101001001101010110111111 f1f88eaeeaa5b8f18ebd8eacf2a6c6e88ea4d6ebd0b0eaa5b8f18ebd8eacf2a6c6e88ea4d5bf

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
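
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) is an assumption, as are the helper names encode_row and encode_table. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is consistent with the ISO-8859-1 row above.

```python
# Charset label -> Python codec name. These pairings are assumptions about
# which Python codec corresponds most closely to each label in the table.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary, hexadecimal) bit strings for text encoded with codec."""
    # errors="replace" substitutes b"?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("症").items():
        print(label, binary, hexadecimal)
```

For the single character 症, this sketch yields e79787 in UTF-8, bec9 in EUC-JP, and 8fc7 in CP932, matching the leading bytes of the corresponding rows above. Results for other characters may differ from the page wherever the Python codecs and the page's converter disagree on a mapping.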