To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???znf???zn^}Y???znf???zn^}bE 0011111100111111001111110111101001101110011001100011111100111111001111110111101001101110010111100111110101011001001111110011111100111111011110100110111001100110001111110011111100111111011110100110111001011110011111010110001001000101 3f3f3f7a6e663f3f3f7a6e5e7d593f3f3f7a6e663f3f3f7a6e5e7d6245
SJIS-WIN 善癌鴨znf善癌鴨zn^}Y善癌鴨znf善癌鴨zn^}bE 1001000101010000100010101110000010001010100110110111101001101110011001101001000101010000100010101110000010001010100110110111101001101110010111100111110101011001100100010101000010001010111000001000101010011011011110100110111001100110100100010101000010001010111000001000101010011011011110100110111001011110011111010110001001000101 91508ae08a9b7a6e6691508ae08a9b7a6e5e7d5991508ae08a9b7a6e6691508ae08a9b7a6e5e7d6245
EUC-JP 善癌鴨znf善癌鴨zn^}Y善癌鴨znf善癌鴨zn^}bE 1100000110110001101101001110001010110011111110110111101001101110011001101100000110110001101101001110001010110011111110110111101001101110010111100111110101011001110000011011000110110100111000101011001111111011011110100110111001100110110000011011000110110100111000101011001111111011011110100110111001011110011111010110001001000101 c1b1b4e2b3fb7a6e66c1b1b4e2b3fb7a6e5e7d59c1b1b4e2b3fb7a6e66c1b1b4e2b3fb7a6e5e7d6245
UTF-8 善癌鴨znf善癌鴨zn^}Y善癌鴨znf善癌鴨zn^}bE 1110010110010110100001001110011110011001100011001110100110110100101010000111101001101110011001101110010110010110100001001110011110011001100011001110100110110100101010000111101001101110010111100111110101011001111001011001011010000100111001111001100110001100111010011011010010101000011110100110111001100110111001011001011010000100111001111001100110001100111010011011010010101000011110100110111001011110011111010110001001000101 e59684e7998ce9b4a87a6e66e59684e7998ce9b4a87a6e5e7d59e59684e7998ce9b4a87a6e66e59684e7998ce9b4a87a6e5e7d6245
UHC 善癌鴨znf善癌鴨zn^}Y善癌鴨znf善癌鴨zn^}bE 1110000010111100111001001101111111100100111001010111101001101110011001101110000010111100111001001101111111100100111001010111101001101110010111100111110101011001111000001011110011100100110111111110010011100101011110100110111001100110111000001011110011100100110111111110010011100101011110100110111001011110011111010110001001000101 e0bce4dfe4e57a6e66e0bce4dfe4e57a6e5e7d59e0bce4dfe4e57a6e66e0bce4dfe4e57a6e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)