eMarchOnward/blog2epub

 
 


blog2epub

Maintenance MIT license

Convert a blog to an epub file using the command line or a GUI.

Supported blogs:

  • *.blogspot.com
  • *.wordpress.com and some blogs based on WordPress

Main features

  • command line interface (CLI) and graphical user interface (GUI),
  • the script downloads all text content of the selected blog into an epub file,
  • post comments are included where possible,
  • images are downsized (to a maximum of 800/600px) and converted to grayscale,
  • one post = one epub chapter,
  • chapters are sorted by date, ascending,
  • the cover is generated automatically from downloaded images.
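
The image and chapter rules in this list can be illustrated with a short sketch. It is not taken from the blog2epub source: `Post`, `fit_within`, `to_gray` and `order_chapters` are hypothetical names, the 800/600px limit is assumed to mean an 800 by 600 pixel bounding box with the aspect ratio preserved, and the grayscale weights are the common ITU-R 601 luma weights, which may differ from what the project actually does.

```python
from dataclasses import dataclass
from datetime import date

# Assumed reading of "maximum 800/600px": an 800x600 bounding box.
MAX_WIDTH, MAX_HEIGHT = 800, 600


def fit_within(width, height, max_width=MAX_WIDTH, max_height=MAX_HEIGHT):
    """Scale (width, height) down to fit the box, keeping the aspect ratio.

    Images that already fit are returned unchanged (never upscaled).
    """
    scale = min(max_width / width, max_height / height, 1.0)
    return max(1, round(width * scale)), max(1, round(height * scale))


def to_gray(red, green, blue):
    """Convert one RGB pixel to a gray level using ITU-R 601 luma weights."""
    return (299 * red + 587 * green + 114 * blue) // 1000


@dataclass
class Post:
    # Hypothetical stand-in for a downloaded blog post.
    title: str
    published: date


def order_chapters(posts):
    """One post becomes one chapter; chapters are sorted by date, ascending."""
    return sorted(posts, key=lambda post: post.published)


posts = [
    Post("Third", date(2023, 5, 1)),
    Post("First", date(2021, 1, 15)),
    Post("Second", date(2022, 8, 30)),
]
chapters = order_chapters(posts)
print([post.title for post in chapters])
print(fit_within(1600, 1200))
```

In a real pipeline the resizing and grayscale conversion would normally be delegated to an imaging library rather than computed per pixel; the helpers here only show the arithmetic.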

Example covers

Installation

  • for Windows and macOS users: available builds
  • python3 setup.py install

Running from sources

git clone git@github.com:bohdanbobrowski/blog2epub.git
cd blog2epub
python -m venv venv
source ./venv/bin/activate
pip install -r ./requirements.txt
./blog2epubgui.py

Screenshots of GUI

Windows 10

macOS

Linux

Mint 21.2 Cinnamon

CLI

blog2epub [blog url] <parameters>

Parameters

-l/--limit=[x] - limit epub file to x posts
-s/--skip=[x] - skip x latest posts
-q/--images-quality=[0-100] - included images quality (default is 40)
-n/--no-images - don't include images
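
As an illustration of how these parameters fit together, here is a minimal sketch of an equivalent parser built with the standard `argparse` module. It is an assumption, not the project's actual parsing code: `build_parser` and `quality` are hypothetical names, and the `None` defaults for `--limit` and `--skip` are guesses, since only the images quality default (40) is documented above.

```python
import argparse


def quality(value):
    """Parse an images quality value and enforce the documented 0-100 range."""
    number = int(value)
    if not 0 <= number <= 100:
        raise argparse.ArgumentTypeError("images quality must be between 0 and 100")
    return number


def build_parser():
    """Build a parser mirroring the documented blog2epub parameters."""
    parser = argparse.ArgumentParser(prog="blog2epub")
    parser.add_argument("url", help="blog url")
    # Defaults of None for limit and skip are assumptions of this sketch.
    parser.add_argument("-l", "--limit", type=int, default=None,
                        help="limit epub file to x posts")
    parser.add_argument("-s", "--skip", type=int, default=None,
                        help="skip x latest posts")
    parser.add_argument("-q", "--images-quality", type=quality, default=40,
                        help="included images quality (default is 40)")
    parser.add_argument("-n", "--no-images", action="store_true",
                        help="don't include images")
    return parser


# Parse an explicit argument list instead of sys.argv so the sketch is repeatable.
args = build_parser().parse_args(
    ["classicameras.blogspot.com", "--limit=10", "--no-images"]
)
print(args.url, args.limit, args.images_quality, args.no_images)
```

Validating the quality range in a `type` callable means an out-of-range value is rejected with a usage message before any download starts.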

Examples

blog2epub starybezpiek.blogspot.com
blog2epub velosov.blogspot.com -l=10
blog2epub poznanskiehistorie.blogspot.com -q=100
blog2epub classicameras.blogspot.com --limit=10 --no-images

Planned features and known bugs

  • code needs refactoring: classes are too "nested" within each other
  • there are some bugs in the wordpress.com crawler
  • Linux app/package(s)
  • Android app
  • more crawlers (e.g. GitHub Pages 🙃)
  • improve existing crawlers (allow)
  • improve GUI: configuration, allow saving to a selected location, etc.

Release notes

  • utilize GitHub Pages for this project documentation
  • save url text field history and allow easily choosing from these urls
  • change font from Lato (nice but large files) to much smaller ones (Alegreya and Martian Mono for UI)
  • make program window not resizable
  • add popup similar to about dialog after finished ebook generation
  • article loop fix (the crawler was reading from the Atom feed by default and returning a maximum of 25 articles)
  • refactor, as usual
  • url history saved into yaml file
  • Windows and macOS (unsigned) builds
  • fixed encoding error in month name
  • some minor refactors and fixes
  • Empty images list bug
  • Fixing macOS build (works on my machine - Ventura 13.3.1)
  • Improving macOS build - dmg now contains Applications folder shortcut to ease installation
  • Kivy threading fix
  • skip parameter fix
  • Linux build (now I'm working on adding package to various linux package repositories)
  • migration to Kivy :-)
  • some bugfixes in crawler
  • wordpress.com support!
  • migration to Gtk (for better support on multiple platforms)
  • requirements cleanup
  • about dialog
  • macOS dmg installer included
  • gzip html in cache folder
  • atom feed parsing
  • better system notifications, also under linux
  • improved saving GUI settings
  • system notification on finished download
  • saving GUI settings to yaml file
  • first macOS builds (pyinstaller)
