What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??znf??zn^}Y??znf??zn^}bE 00111111001111110111101001101110011001100011111100111111011110100110111001011110011111010101100100111111001111110111101001101110011001100011111100111111011110100110111001011110011111010110001001000101 3f3f7a6e663f3f7a6e5e7d593f3f7a6e663f3f7a6e5e7d6245
SJIS-WIN 惑癰znf惑癰zn^}Y惑癰znf惑癰zn^}bE 100110000110011011100001100111100111101001101110011001101001100001100110111000011001111001111010011011100101111001111101010110011001100001100110111000011001111001111010011011100110011010011000011001101110000110011110011110100110111001011110011111010110001001000101 9866e19e7a6e669866e19e7a6e5e7d599866e19e7a6e669866e19e7a6e5e7d6245
EUC-JP 惑癰znf惑癰zn^}Y惑癰znf惑癰zn^}bE 110011111100011111100001111111100111101001101110011001101100111111000111111000011111111001111010011011100101111001111101010110011100111111000111111000011111111001111010011011100110011011001111110001111110000111111110011110100110111001011110011111010110001001000101 cfc7e1fe7a6e66cfc7e1fe7a6e5e7d59cfc7e1fe7a6e66cfc7e1fe7a6e5e7d6245
UTF-8 惑癰znf惑癰zn^}Y惑癰znf惑癰zn^}bE 1110011010000011100100011110011110011001101100000111101001101110011001101110011010000011100100011110011110011001101100000111101001101110010111100111110101011001111001101000001110010001111001111001100110110000011110100110111001100110111001101000001110010001111001111001100110110000011110100110111001011110011111010110001001000101 e68391e799b07a6e66e68391e799b07a6e5e7d59e68391e799b07a6e66e68391e799b07a6e5e7d6245
UHC 惑癰znf惑癰zn^}Y惑癰znf惑癰zn^}bE 111110111110001111101000101110010111101001101110011001101111101111100011111010001011100101111010011011100101111001111101010110011111101111100011111010001011100101111010011011100110011011111011111000111110100010111001011110100110111001011110011111010110001001000101 fbe3e8b97a6e66fbe3e8b97a6e5e7d59fbe3e8b97a6e66fbe3e8b97a6e5e7d6245

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
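
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It rests on a few assumptions: the charset labels map to Python's `latin-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` codecs (the `cp949` choice for UHC in particular is an assumption), and characters a charset cannot represent are replaced with `?` (0x3f), as the ISO-8859-1 row suggests. The names `CHARSETS` and `encode_rows` are hypothetical.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",  # assumption: UHC corresponds to Windows code page 949
}


def encode_rows(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unrepresentable characters.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in encode_rows("惑癰znf"):
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.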