What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ?AFznf?AFzn^}Y?AFznf?AFzn^}bE | 0011111101000001010001100111101001101110011001100011111101000001010001100111101001101110010111100111110101011001001111110100000101000110011110100110111001100110001111110100000101000110011110100110111001011110011111010110001001000101 | 3f41467a6e663f41467a6e5e7d593f41467a6e663f41467a6e5e7d6245 |
| SJIS-WIN | 涯AFznf涯AFzn^}Y涯AFznf涯AFzn^}bE | 100010100101010101000001010001100111101001101110011001101000101001010101010000010100011001111010011011100101111001111101010110011000101001010101010000010100011001111010011011100110011010001010010101010100000101000110011110100110111001011110011111010110001001000101 | 8a5541467a6e668a5541467a6e5e7d598a5541467a6e668a5541467a6e5e7d6245 |
| EUC-JP | 涯AFznf涯AFzn^}Y涯AFznf涯AFzn^}bE | 101100111011011001000001010001100111101001101110011001101011001110110110010000010100011001111010011011100101111001111101010110011011001110110110010000010100011001111010011011100110011010110011101101100100000101000110011110100110111001011110011111010110001001000101 | b3b641467a6e66b3b641467a6e5e7d59b3b641467a6e66b3b641467a6e5e7d6245 |
| UTF-8 | 涯AFznf涯AFzn^}Y涯AFznf涯AFzn^}bE | 11100110101101101010111101000001010001100111101001101110011001101110011010110110101011110100000101000110011110100110111001011110011111010101100111100110101101101010111101000001010001100111101001101110011001101110011010110110101011110100000101000110011110100110111001011110011111010110001001000101 | e6b6af41467a6e66e6b6af41467a6e5e7d59e6b6af41467a6e66e6b6af41467a6e5e7d6245 |
| UHC | 涯AFznf涯AFzn^}Y涯AFznf涯AFzn^}bE | 111001001111001101000001010001100111101001101110011001101110010011110011010000010100011001111010011011100101111001111101010110011110010011110011010000010100011001111010011011100110011011100100111100110100000101000110011110100110111001011110011111010110001001000101 | e4f341467a6e66e4f341467a6e5e7d59e4f341467a6e66e4f341467a6e5e7d6245 |

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
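
A conversion like the one in the table can be sketched in Python. This is a minimal illustration, not the code behind this page. The codec names chosen for each charset label (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper names `encode_row` and `convert` are assumptions. Characters a charset cannot represent are replaced with `?` (0x3f), which matches the ISO-8859-1 row.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("涯A").items():
        print(label, bits, hex_string)
```

The byte count per character differs by charset: here 涯 takes three bytes in UTF-8 and two in SJIS-WIN and EUC-JP, while ASCII letters take one byte everywhere.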