-
-
Notifications
You must be signed in to change notification settings - Fork 30
Test htmldate on further web pages and report bugs #8
Copy link
Copy link
Open
Labels
good first issueGood for newcomersGood for newcomersup for grabsGood for (first) contributorsGood for (first) contributors
Description
I have mostly tested htmldate on a set of English, German and French web pages I had run into by surfing or during web crawls. There are definitely further web pages and cases in other languages for which the extraction of a date doesn't work so far.
Please install the dateparser library beforehand as it significantly extends linguistic coverage: pipor pip3 install -U dateparser or pip install -U htmldate[all].
Corresponding bug reports can either be filed as a list in an issue like this one or in the code as XPath expressions in core.py (see DATE_EXPRESSIONS and ADDITIONAL_EXPRESSIONS).
Thanks!
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
good first issueGood for newcomersGood for newcomersup for grabsGood for (first) contributorsGood for (first) contributors