To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 澳??弱??映у?穩??澳??弱??映уБ節??^ 111000000101001100111111001111111000111011100011001111110011111110001001011001101000010010000101001111111110001001110010001111110011111111100000010100110011111100111111100011101110001100111111001111111000100101100110100001001000010110000100010000011001000011011111001111110011111101011110 e0533f3f8ee33f3f896684853fe2723f3fe0533f3f8ee33f3f89668485844190df3f3f5e
EUC-JP 澳??弱??映у?穩??澳??弱??映уБ節??^ 110111111011010000111111001111111011110011100101001111110011111110110001110001111010011111100101001111111110001111010011001111110011111111011111101101000011111100111111101111001110010100111111001111111011000111000111101001111110010110100111101000101100000011100001001111110011111101011110 dfb43f3fbce53f3fb1c7a7e53fe3d33f3fdfb43f3fbce53f3fb1c7a7e5a7a2c0e13f3f5e
UTF-8 澳뽳쉑弱쀨쾮映у즺穩먨눍澳뽳쉑弱욑슨映уБ節닷뇿^ 11100110101111101011001111101011101111011011001111101100100010011001000111100101101111001011000111101100100000001010100011101100101111101010111011100110100110001010000011010001100000111110110010100110101110101110011110101001101010011110101110101000101010001110101110001000100011011110011010111110101100111110101110111101101100111110110010001001100100011110010110111100101100011110110010011010100100011110110010001010101010001110011010011000101000001101000110000011110100001001000111100111101011111000000011101011100010111011011111101011100001111011111101011110 e6beb3ebbdb3ec8991e5bcb1ec80a8ecbeaee698a0d183eca6bae7a9a9eba8a8eb888de6beb3ebbdb3ec8991e5bcb1ec9a91ec8aa8e698a0d183d091e7af80eb8bb7eb87bf5e
UHC 澳뽳쉑弱쀨쾮映у즺穩먨눍澳뽳쉑弱욑슨映уБ節닷뇿^ 11100111111111101001011011101111101111011010011111100101101100001001011111101000101100101000010111100111101100011010110011100101101000111000110011101000101100011001000011100101100001111010100111100111111111101001011011101111101111011010011111100101101100001001111011101111101111011011110011100111101100011010110011100101101011001010001011101111101111011011010011100101100001111010000001011110 e7fe96efbda7e5b097e8b285e7b1ace5a38ce8b190e587a9e7fe96efbda7e5b09eefbdbce7b1ace5aca2efbdb4e587a05e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)