To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??cnf??cn^}Y??cnf??cn^}bE 00111111001111110110001101101110011001100011111100111111011000110110111001011110011111010101100100111111001111110110001101101110011001100011111100111111011000110110111001011110011111010110001001000101 3f3f636e663f3f636e5e7d593f3f636e663f3f636e5e7d6245
SJIS-WIN 臧Lcnf臧Lcn^}Y臧Lcnf臧Lcn^}bE 111001000110100010000010011010110110001101101110011001101110010001101000100000100110101101100011011011100101111001111101010110011110010001101000100000100110101101100011011011100110011011100100011010001000001001101011011000110110111001011110011111010110001001000101 e468826b636e66e468826b636e5e7d59e468826b636e66e468826b636e5e7d6245
EUC-JP 臧Lcnf臧Lcn^}Y臧Lcnf臧Lcn^}bE 111001111100100110100011110011000110001101101110011001101110011111001001101000111100110001100011011011100101111001111101010110011110011111001001101000111100110001100011011011100110011011100111110010011010001111001100011000110110111001011110011111010110001001000101 e7c9a3cc636e66e7c9a3cc636e5e7d59e7c9a3cc636e66e7c9a3cc636e5e7d6245
UTF-8 臧Lcnf臧Lcn^}Y臧Lcnf臧Lcn^}bE 1110100010000111101001111110111110111100101011000110001101101110011001101110100010000111101001111110111110111100101011000110001101101110010111100111110101011001111010001000011110100111111011111011110010101100011000110110111001100110111010001000011110100111111011111011110010101100011000110110111001011110011111010110001001000101 e887a7efbcac636e66e887a7efbcac636e5e7d59e887a7efbcac636e66e887a7efbcac636e5e7d6245
UHC 臧Lcnf臧Lcn^}Y臧Lcnf臧Lcn^}bE 111011011111010110100011110011000110001101101110011001101110110111110101101000111100110001100011011011100101111001111101010110011110110111110101101000111100110001100011011011100110011011101101111101011010001111001100011000110110111001011110011111010110001001000101 edf5a3cc636e66edf5a3cc636e5e7d59edf5a3cc636e66edf5a3cc636e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)