To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????nf????n^}Y????nf????n^}bE 0011111100111111001111110011111101101110011001100011111100111111001111110011111101101110010111100111110101011001001111110011111100111111001111110110111001100110001111110011111100111111001111110110111001011110011111010110001001000101 3f3f3f3f6e663f3f3f3f6e5e7d593f3f3f3f6e663f3f3f3f6e5e7d6245
SJIS-WIN 鄭?蔚?nf鄭?蔚?n^}Y鄭?蔚?nf鄭?蔚?n^}bE 10010011010000010011111110001001010101010011111101101110011001101001001101000001001111111000100101010101001111110110111001011110011111010101100110010011010000010011111110001001010101010011111101101110011001101001001101000001001111111000100101010101001111110110111001011110011111010110001001000101 93413f89553f6e6693413f89553f6e5e7d5993413f89553f6e6693413f89553f6e5e7d6245
EUC-JP 鄭?蔚?nf鄭?蔚?n^}Y鄭?蔚?nf鄭?蔚?n^}bE 11000101101000100011111110110001101101100011111101101110011001101100010110100010001111111011000110110110001111110110111001011110011111010101100111000101101000100011111110110001101101100011111101101110011001101100010110100010001111111011000110110110001111110110111001011110011111010110001001000101 c5a23fb1b63f6e66c5a23fb1b63f6e5e7d59c5a23fb1b63f6e66c5a23fb1b63f6e5e7d6245
UTF-8 鄭렕蔚렎nf鄭렕蔚렎n^}Y鄭렕蔚렎nf鄭렕蔚렎n^}bE 11101001100001001010110111101011101000001001010111101000100101001001101011101011101000001000111001101110011001101110100110000100101011011110101110100000100101011110100010010100100110101110101110100000100011100110111001011110011111010101100111101001100001001010110111101011101000001001010111101000100101001001101011101011101000001000111001101110011001101110100110000100101011011110101110100000100101011110100010010100100110101110101110100000100011100110111001011110011111010110001001000101 e984adeba095e8949aeba08e6e66e984adeba095e8949aeba08e6e5e7d59e984adeba095e8949aeba08e6e66e984adeba095e8949aeba08e6e5e7d6245
UHC 鄭렕蔚렎nf鄭렕蔚렎n^}Y鄭렕蔚렎nf鄭렕蔚렎n^}bE 111011111111011110001110101010101110101010100101100011101010010001101110011001101110111111110111100011101010101011101010101001011000111010100100011011100101111001111101010110011110111111110111100011101010101011101010101001011000111010100100011011100110011011101111111101111000111010101010111010101010010110001110101001000110111001011110011111010110001001000101 eff78eaaeaa58ea46e66eff78eaaeaa58ea46e5e7d59eff78eaaeaa58ea46e66eff78eaaeaa58ea46e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)