Skip to content

validate_upload should not allow two lines to share the same data_path #1142

@jswelling

Description

@jswelling

Two different lines of an Upload metadata.tsv should never share the same data_path, because that would imply that the exact same data was to end up in two different datasets. This problem is not detected. Failure to detect the problem led to a situation when a multi-line metadata.tsv file with all data_path entries set to '.' caused all the files to be scanned by the plugin validators many times, once for each line of metadata.

I know of no way to describe a relationship between lines in the schema language which defines the table schemata, but it would be easy to insert an ad-hoc test of the 'data_path' field around here somewhere:

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions