Skip to content

Stable hosting (+long-term archiving) of preprocessed data sets #227

@alexis-michaud

Description

@alexis-michaud

Having preprocessed data sets at hand matters a lot for easier experimenting. Links to online data can break. This happened for Persephone-related materials: #226. The issue was fixed quickly, but in the mid & long run the answer lies in stable hosting (+long-term archiving) of preprocessed data sets.

Some data sets preprocessed by @gw17 for experiments in 2020 are up here:
https://github.com/gw17/sltu_corpora

It's fine to have those in different places, hopefully with some sort of inventory somewhere (in Wiki mode?). Or could the Persephone / Elpis team also offer hosting solutions?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions