To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??cnf??cn^}Y??cnf??cn^}bE 00111111001111110110001101101110011001100011111100111111011000110110111001011110011111010101100100111111001111110110001101101110011001100011111100111111011000110110111001011110011111010110001001000101 3f3f636e663f3f636e5e7d593f3f636e663f3f636e5e7d6245
SJIS-WIN 脛桒cnf脛桒cn^}Y脛桒cnf脛桒cn^}bE 111000111111100011111010111000110110001101101110011001101110001111111000111110101110001101100011011011100101111001111101010110011110001111111000111110101110001101100011011011100110011011100011111110001111101011100011011000110110111001011110011111010110001001000101 e3f8fae3636e66e3f8fae3636e5e7d59e3f8fae3636e66e3f8fae3636e5e7d6245
EUC-JP 脛桒cnf脛桒cn^}Y脛桒cnf脛桒cn^}bE 11100110111110101000111111000011110010010110001101101110011001101110011011111010100011111100001111001001011000110110111001011110011111010101100111100110111110101000111111000011110010010110001101101110011001101110011011111010100011111100001111001001011000110110111001011110011111010110001001000101 e6fa8fc3c9636e66e6fa8fc3c9636e5e7d59e6fa8fc3c9636e66e6fa8fc3c9636e5e7d6245
UTF-8 脛桒cnf脛桒cn^}Y脛桒cnf脛桒cn^}bE 1110100010000100100110111110011010100001100100100110001101101110011001101110100010000100100110111110011010100001100100100110001101101110010111100111110101011001111010001000010010011011111001101010000110010010011000110110111001100110111010001000010010011011111001101010000110010010011000110110111001011110011111010110001001000101 e8849be6a192636e66e8849be6a192636e5e7d59e8849be6a192636e66e8849be6a192636e5e7d6245
UHC 脛?cnf脛?cn^}Y脛?cnf脛?cn^}bE 1100110011101011001111110110001101101110011001101100110011101011001111110110001101101110010111100111110101011001110011001110101100111111011000110110111001100110110011001110101100111111011000110110111001011110011111010110001001000101 cceb3f636e66cceb3f636e5e7d59cceb3f636e66cceb3f636e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)