Skip to content

Needs to set a directory for pathless URLs #3

@stuartlangridge

Description

@stuartlangridge
14:40 ~/Scra+/pwbm (master) $ snap/local/pwbm http://kryogenix.org
Sat 18 Jan 14:40:29 GMT 2020 
Added URL: http://kryogenix.org
14:40 ~/Scra+/pwbm (master) $ snap/local/pwbm http://kryogenix.org/days/
Sat 18 Jan 14:40:32 GMT 2020 
Added URL: http://kryogenix.org/days/
14:40 ~/Scra+/pwbm (master) $ snap/local/pwbm
Sat 18 Jan 14:40:37 GMT 2020 
Start:
Sat 18 Jan 14:40:37 GMT 2020 
Processing: http://kryogenix.org
Sat 18 Jan 14:40:42 GMT 2020 http://kryogenix.org saved to ./archive/kryogenix.org//2020-01-18T14:40:37+00:00-index.html
Sat 18 Jan 14:40:42 GMT 2020 
Processing: http://kryogenix.org/days/
Sat 18 Jan 14:40:43 GMT 2020 http://kryogenix.org/days/ saved to ./archive/kryogenix.org/days/2020-01-18T14:40:42+00:00-index.html
Sat 18 Jan 14:40:43 GMT 2020 
Finish:
14:40 ~/Scra+/pwbm (master) $ ls -R archive
archive:
kryogenix.org

archive/kryogenix.org:
2020-01-18T14:40:37+00:00-index.html  days

archive/kryogenix.org/days:
2020-01-18T14:40:42+00:00-index.html

Note that the fetch of https://kryogenix.org goes into archive/kryogenix.org/ but the fetch of https://kryogenix.org/days goes into archive/kryogenix.org/days/ -- this is because the first one has no URL path, and so the script fetches it into ./archive/kryogenix.org//2020-01-18T14:40:37+00:00-index.html with two // in (because there's no path to put there). It should probably substitute a path in there if there isn't one ("index", perhaps?) Otherwise you have some pages directly in ./archive/domain and some in ./archive/domain/dir.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions