Invalid Dataset Format #1

@NiteshMethani

Description

Hi,
Thanks for making your work open source.
I was trying to explore this repo to understand the XOR limitations better, but I couldn't run the evaluate.py script even for a single dataset. For example, for the conll2003 dataset, there seems to be an error in the read_data() function in conll2003.py, specifically at line 107.
Each line in the original dataset has 4 columns (token, WS, POS, NER tag), but the provided code splits it into 3 columns, hence the error.
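
To illustrate the kind of parsing I would expect to work, here is a minimal sketch of a reader that does not depend on the exact column count. It is not the repo's read_data(): the function name `read_conll_sentences` and the sample lines are my own, and I am assuming that the token is always the first column and the NER tag always the last.

```python
from typing import Iterable, List, Tuple


def read_conll_sentences(lines: Iterable[str]) -> List[List[Tuple[str, str]]]:
    """Group CoNLL-style lines into sentences of (token, ner_tag) pairs.

    Hypothetical helper: assumes whitespace-separated columns, with the
    token first and the NER tag last, whatever sits in between.
    """
    sentences: List[List[Tuple[str, str]]] = []
    current: List[Tuple[str, str]] = []
    for raw in lines:
        line = raw.strip()
        # Blank lines and -DOCSTART- markers end the current sentence.
        if not line or line.startswith("-DOCSTART-"):
            if current:
                sentences.append(current)
                current = []
            continue
        columns = line.split()
        if len(columns) < 2:
            raise ValueError(f"expected at least 2 columns, got: {raw!r}")
        # Keep only the first and last columns, so 3- and 4-column
        # files are both accepted.
        current.append((columns[0], columns[-1]))
    if current:
        sentences.append(current)
    return sentences


# Illustrative 4-column input (made up for this example).
sample = [
    "-DOCSTART- -X- -X- O",
    "",
    "EU NNP B-NP B-ORG",
    "rejects VBZ B-VP O",
    "",
    "German JJ B-NP B-MISC",
]
print(read_conll_sentences(sample))
```

If the code actually needs one of the middle columns, this sketch would not be enough, which is why knowing the expected format would help.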

Any suggestions on the expected dataset format to reproduce the results in the paper?
