A feed aggregator built in Python. It collects articles from the RSS feed URLs listed in a text file, stores the relevant information in an SQLite3 database, and creates a .xlsx file containing only the latest articles every time it is run.
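The "store everything, report only the latest" idea can be sketched as follows. This is a minimal illustration and not the code in scalerts.py: the table name `articles`, its columns, and the function `store_new_articles` are hypothetical, and entries are assumed to be plain dictionaries with a `link` key (for example, built from the entries that `feedparser.parse(url)` returns).

```python
import sqlite3


def store_new_articles(conn, entries):
    """Insert entries that are not yet stored and return only the new ones.

    The schema is an assumption for illustration: one row per article,
    keyed by its link so that repeated runs do not create duplicates.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS articles ("
        "link TEXT PRIMARY KEY, title TEXT, published TEXT)"
    )
    new_entries = []
    for entry in entries:
        cursor = conn.execute(
            "INSERT OR IGNORE INTO articles (link, title, published) "
            "VALUES (?, ?, ?)",
            (entry["link"], entry.get("title", ""), entry.get("published", "")),
        )
        # rowcount is 1 when the row was inserted, 0 when it already existed.
        if cursor.rowcount == 1:
            new_entries.append(entry)
    conn.commit()
    return new_entries
```

Only the list returned by such a function would need to be written to the spreadsheet, while the database keeps the full history.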
- Python 3.0 and newer versions
- SQLite3
- Any spreadsheet viewer that can open *.xlsx files.
Use pip install <module_name> on Windows or sudo pip install <module_name> on Ubuntu to install the following modules:
- feedparser module
- openpyxl module
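To confirm that both modules are available before running the program, a small check along these lines may help. It is only a sketch: the helper name `missing_modules` is made up for this example and is not part of the repository.

```python
import importlib.util

# The two third-party modules named in this README.
REQUIRED_MODULES = ("feedparser", "openpyxl")


def missing_modules(names=REQUIRED_MODULES):
    """Return the module names that cannot be found by the import system."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing modules: " + ", ".join(missing))
        print("Install them with: pip install " + " ".join(missing))
    else:
        print("All required modules are installed.")
```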
- Add python to your PATH variable.
- Navigate to the folder containing the file scalerts.py by using the cd command.
- Now you can use the python scalerts.py command to run the file.
- Navigate to the folder containing the file scalerts.py by using the cd command.
- Use the python scalerts.py command to run the file.
- The urlList.txt file in the repository contains a list of RSS feed URLs. This can and should be edited to add your own URLs. Since the program pulls the URLs from this file, it should under no circumstances be deleted.
- The scalerts.db and scalerts.xlsx files in this repository are example database and spreadsheet files that were created using the scalerts.py program. They should be deleted before the program is run for the first time.
- You can find instructions on how to make dedicated RSS feeds for any topic using Google alerts here.
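As an illustration of how a URL list like this can be read, here is a minimal sketch. It assumes one URL per line, which this README does not state explicitly, and the function name `read_feed_urls` is hypothetical rather than taken from scalerts.py.

```python
from pathlib import Path


def read_feed_urls(path="urlList.txt"):
    """Read feed URLs from a text file, assuming one URL per line.

    Blank lines and repeated URLs are skipped. A missing file raises an
    error, since there would be no feeds to read.
    """
    url_file = Path(path)
    if not url_file.is_file():
        raise FileNotFoundError(f"{path} not found; it is needed to locate the feeds")
    urls = []
    for line in url_file.read_text(encoding="utf-8").splitlines():
        url = line.strip()
        if url and url not in urls:
            urls.append(url)
    return urls
```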
- On running the program for the first time, you will see the accompanying database file scalerts.db and Excel file scalerts.xlsx being created.
- New incoming content is appended to the database file, which keeps increasing in size. This file should not be deleted casually.
- The Excel file will be overwritten with new posts every time the program is run. Maintaining daily backups of this file is the user's responsibility. Do not save backups with the same name (and in the same folder) as the scalerts.xlsx file.
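One possible way to keep daily backups is to copy the spreadsheet into a separate folder under a dated name before each run. The sketch below is an assumption about how this could be done, not a feature of scalerts.py; the `backups` folder and the function name `backup_spreadsheet` are hypothetical.

```python
import shutil
from datetime import date
from pathlib import Path


def backup_spreadsheet(source="scalerts.xlsx", backup_dir="backups", day=None):
    """Copy the spreadsheet to backup_dir with the date added to its name.

    Returns the path of the backup, or None if the source file does not exist.
    """
    src = Path(source)
    if not src.is_file():
        return None
    day = day or date.today()
    target_dir = Path(backup_dir)
    target_dir.mkdir(parents=True, exist_ok=True)
    # Example result: backups/scalerts_2024-01-02.xlsx
    target = target_dir / f"{src.stem}_{day.isoformat()}{src.suffix}"
    shutil.copy2(src, target)
    return target
```

Because the copy goes to a different folder and carries the date in its name, it cannot collide with the scalerts.xlsx file that the program overwrites.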