
Add workaround for NCBI Datasets' incompatibility issues #82

@joverlee521


Initially brought up by @tsibley in nextstrain/public#18 (comment)

Sigh. This isn't the first time this has happened (one example). Typically, new fields are backwards compatible with old software versions because the software is designed to be forwards compatible by ignoring fields it doesn't know about. NCBI's intentionally chosen to not include such forward compatibility, which means you basically need to stay in lockstep with the latest release. That's frustrating.

NCBI mentioned a workaround we could maybe use (№ 3):

Use `datasets summary virus genome taxon 3048448 --as-json-lines | dataformat tsv virus-genome`. This avoids triggering the error because `datasets summary` does not pick up the new field.

although I don't know how much overhead it adds compared to our current approach of downloading the bulk data package.
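
For reference, a rough sketch of how that workaround could be wrapped in our ingest scripts. The function name and the output path are hypothetical; only the `datasets summary ... | dataformat tsv virus-genome` pipeline comes from NCBI's suggestion, and I haven't checked whether the resulting columns match what we get from the bulk data package.

```shell
# Hypothetical helper (the name is illustrative, not part of any existing script).
# Streams summary records as JSON Lines and converts them to TSV,
# rather than downloading and unpacking the bulk data package.
#   $1: NCBI taxon ID (e.g. 3048448)
#   $2: path of the TSV file to write
fetch_virus_metadata_tsv() {
    datasets summary virus genome taxon "$1" --as-json-lines \
        | dataformat tsv virus-genome \
        > "$2"
}

# Example call (assumes both NCBI tools are on PATH):
#   fetch_virus_metadata_tsv 3048448 metadata.tsv
```

Running the example call under `time` next to the current bulk download would be one way to gauge the overhead.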
