[WIP] Add XLNet support for Reader by n0thingLLM · Pull Request #205 · cdqa-suite/cdQA

n0thingLLM · 2019-07-16T17:49:58Z

codecov · 2019-07-16T17:55:34Z

Codecov Report

Merging #205 (660760c) into master (bda1c32) will decrease coverage by 8.00%.
The diff coverage is 0.00%.

@@            Coverage Diff             @@
##           master     #205      +/-   ##
==========================================
- Coverage   31.23%   23.22%   -8.01%     
==========================================
  Files           7        9       +2     
  Lines        1508     2032     +524     
==========================================
+ Hits          471      472       +1     
- Misses       1037     1560     +523

Impacted Files                         Coverage Δ
cdqa/reader/reader_sklearn.py          0.00% <0.00%> (ø)
cdqa/reader/utils_squad.py             0.00% <0.00%> (ø)
cdqa/reader/utils_squad_evaluate.py    0.00% <0.00%> (ø)
cdqa/reader/bertqa_sklearn.py          58.90% <0.00%> (+0.15%) ⬆️

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

n0thingLLM · 2019-07-19T07:50:14Z

ValueError during evaluation after training:

Traceback (most recent call last):
  File "tutorial-train-xlnet-squad.py", line 39, in <module>
    out_eval, final_prediction = reader.evaluate(X='dev-v2.0.json')
ValueError: too many values to unpack (expected 2)
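The message means the call returned more than two items while the script unpacked exactly two. A minimal sketch of the failure and a defensive way to unpack follows; `fake_evaluate` is a hypothetical stand-in that returns three items, not the actual return value of `reader.evaluate()`, which this thread does not document.

```python
def fake_evaluate():
    # Hypothetical stand-in for reader.evaluate(); the real return value
    # is not shown in this thread. Here we assume three items come back.
    return {"exact": 0.0}, {"q1": "answer"}, {"q1": 0.5}


def unpack_first_two(result):
    # Starred unpacking keeps the first two items and collects the rest,
    # so an extra return value no longer raises ValueError.
    first, second, *extra = result
    return first, second, extra


# Reproduce the error from the traceback above.
try:
    out_eval, final_prediction = fake_evaluate()
    message = None
except ValueError as err:
    message = str(err)

# Defensive version: inspect what came back instead of failing.
out_eval, final_prediction, extra = unpack_first_two(fake_evaluate())
print(message)
print(len(extra), "unexpected extra item(s)")
```

Printing `len(reader.evaluate(...))` once is usually enough to see how many values the caller should expect.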

n0thingLLM · 2019-07-19T13:51:46Z

To use the XLNet reader with a pretrained .bin model:

import wget
from cdqa.reader.reader_sklearn import Reader

wget.download(url='https://github.com/cdqa-suite/cdQA/releases/download/XLNet_cased_vCPU/pytorch_model.bin', out='.')

# instantiate the Reader with the training parameters
reader = Reader(model_type='xlnet',
                model_name_or_path='xlnet-base-cased',
                output_dir='.',
                evaluate_during_training=False,
                no_cuda=False,
                fp16=False,
                pretrained_model_path='.')

# make some predictions
reader.predict(X='dev-v2.0-small.json')

n0thingLLM · 2019-07-19T13:55:25Z

hardware: GeForce RTX 208
training time: 9 hours

andrelmfarias · 2019-07-24T14:22:09Z

The implementation of XLNetForQuestionAnswering is quite different from BertForQuestionAnswering, and the official HF version does not output the logits for now.
XLNetForQuestionAnswering uses beam search to find the best (most probable) span, while BertForQuestionAnswering maximises the start_score and end_score separately.

from #196
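The difference between the two selection strategies can be illustrated with a toy sketch. This is a simplified illustration, not the Hugging Face code: the function names, the scores, and the `k` and `max_len` parameters are all assumptions, and the real XLNet head additionally conditions the end scores on each candidate start.

```python
def independent_argmax(start_scores, end_scores):
    # BERT-style: pick the best start and the best end separately.
    start = max(range(len(start_scores)), key=start_scores.__getitem__)
    end = max(range(len(end_scores)), key=end_scores.__getitem__)
    return start, end


def top_k_indices(scores, k):
    # Indices of the k highest scores, best first.
    return sorted(range(len(scores)), key=scores.__getitem__, reverse=True)[:k]


def joint_top_k_span(start_scores, end_scores, k=2, max_len=30):
    # Beam-style: score (start, end) pairs drawn from the top-k candidates
    # and keep the best pair that forms a valid span.
    best = None
    for s in top_k_indices(start_scores, k):
        for e in top_k_indices(end_scores, k):
            if e < s or e - s + 1 > max_len:
                continue  # skip reversed or overly long spans
            score = start_scores[s] + end_scores[e]
            if best is None or score > best[2]:
                best = (s, e, score)
    return best


# Toy scores chosen so that the two strategies disagree.
start_scores = [0.1, 2.0, 0.3, 1.5]
end_scores = [2.5, 0.2, 1.0, 0.4]

print(independent_argmax(start_scores, end_scores))  # end index precedes start
print(joint_top_k_span(start_scores, end_scores))    # valid span
```

With these toy scores the independent maxima give an end position before the start position, while the joint search returns a usable span.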

alex-movila · 2019-08-09T14:44:56Z

Any progress with this?
In the meantime we have even better models: RoBERTa and ERNIE 2.0

n0thingLLM · 2019-08-10T09:23:44Z

Hi @alex-movila

You can follow our progress on this PR here. We described all the steps needed to stay in sync with the latest changes made by @huggingface.

At the moment we depend on the pytorch-transformers repository as the backend for our QA system. The @huggingface community is progressively implementing new models. They are now in the process of adding RoBERTa (see this). They don't plan to add ERNIE at the moment (see this).

Their new API should allow the user to use any transformer to do QA. We are looking to provide the same thing with cdQA.

n0thingLLM · 2019-09-15T13:07:17Z

I could not replicate the official SQuAD 2.0 results with our trained XLNet model:

from cdqa.reader.reader_sklearn import Reader

reader = Reader(model_type='xlnet',
                model_name_or_path='xlnet-base-cased',
                fp16=False,
                output_dir='.',
                no_cuda=False,
                pretrained_model_path='.')

reader.evaluate(X='dev-v2.0.json')

See my colab notebook for reproducibility: https://colab.research.google.com/github/cdqa-suite/cdQA/blob/sync-huggingface/examples/tutorial-eval-xlnet-squad2.0.ipynb

{
  "exact": 35.643897919649625,
  "f1": 40.81892328134685,
  "total": 11873,
  "HasAns_exact": 67.29082321187585,
  "HasAns_f1": 77.65571459504568,
  "HasAns_total": 5928,
  "NoAns_exact": 4.087468460891506,
  "NoAns_f1": 4.087468460891506,
  "NoAns_total": 5945,
  "best_exact": 50.07159100480081,
  "best_exact_thresh": 0.0,
  "best_f1": 50.07159100480081,
  "best_f1_thresh": 0.0
}

{'HasAns_exact': 67.29082321187585,
 'HasAns_f1': 77.65571459504568,
 'HasAns_total': 5928,
 'NoAns_exact': 4.087468460891506,
 'NoAns_f1': 4.087468460891506,
 'NoAns_total': 5945,
 'best_exact': 50.07159100480081,
 'best_exact_thresh': 0.0,
 'best_f1': 50.07159100480081,
 'best_f1_thresh': 0.0,
 'exact': 35.643897919649625,
 'f1': 40.81892328134685,
 'total': 11873}
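A quick arithmetic check on the numbers above helps narrow down where the gap comes from. The sketch below only reuses the reported metrics; the interpretation is a reading of those numbers, not a confirmed diagnosis.

```python
# Metrics copied from the evaluation output above.
metrics = {
    "exact": 35.643897919649625,
    "total": 11873,
    "HasAns_exact": 67.29082321187585,
    "HasAns_total": 5928,
    "NoAns_exact": 4.087468460891506,
    "NoAns_total": 5945,
    "best_exact": 50.07159100480081,
}

# Recover the raw number of exact matches in each subset.
has_hits = round(metrics["HasAns_exact"] / 100 * metrics["HasAns_total"])
no_hits = round(metrics["NoAns_exact"] / 100 * metrics["NoAns_total"])

# The overall score is the hit count over all questions.
overall = 100 * (has_hits + no_hits) / metrics["total"]

# Score of a trivial baseline that answers "no answer" to every question.
always_null = 100 * metrics["NoAns_total"] / metrics["total"]

print(has_hits, no_hits)
print(round(overall, 4), round(always_null, 4))
```

The overall `exact` is reproduced from the two subsets, and `best_exact` equals the score of always predicting "no answer". Together with the low `NoAns_exact`, this suggests the model almost never abstains on unanswerable questions, so the no-answer handling may be worth checking alongside the hyperparameters.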

It might be an issue of non-optimized hyperparameters (see huggingface/transformers#822).

@andrelmfarias can you confirm the params you used during training? (https://github.com/cdqa-suite/cdQA/blob/sync-huggingface/examples/tutorial-train-xlnet-squad.py)

andrelmfarias · 2019-09-16T07:44:42Z

I had to reduce some parameters (max_length, batch_size, etc.) because the GPU could not handle training with the default parameters. That might be the cause.

n0thingLLM · 2019-10-13T09:19:09Z

This issue is being discussed here: huggingface/transformers#947 (comment)

n0thingLLM added 20 commits July 12, 2019 14:57

update latest code from HF

922fa7d

prepare for reverse-engineering and adaptation to HF release

5fb00f2

add utils_squad from HF original

9ccea6b

adapt utils_squad for cdqa

9a9b1a5

add reader_sklearn from HF original run_squad.py

12af10b

foundations sklearn wrapper XLNet

1ba2db8

replace args from parser by class parameters

211ca68

fix indent error

eb8ee3f

little fixes

a2ab7b5

add notebook for XLNet training on SQuAD

1318d25

add pytorch-transformers to reqs

d20e06d

sync with pytorch-transformers 1.0

1a60319

eval script SQuAD update

1681d97

sync with pytorch-transformers 1.0

2c86a94

fix import errors

3c34c63

add new reqs

02d81de

fix import error

95f214a

fix change params Reader()

e911d4c

remove verbose debug

53887d7

update training notebook

3501a40

n0thingLLM changed the title ~~Add XLNet support for Reader #196~~ Add XLNet support for Reader Jul 16, 2019

n0thingLLM mentioned this pull request Jul 16, 2019

XLNet support for Reader #196

Open

n0thingLLM added 7 commits July 17, 2019 10:46

return final_prediction in predict()

ca24672

debug

d675c3b

continue debug

67a6e46

fix cached_features_file in predict mode

c67739c

fix FileNotFoundError in torch.save()

b236cf7

Fix TypeError() in write_predictions_extended()

e850e9f

update XLNet / SQuAD 2.0 test notebook

860bdad

n0thingLLM added 9 commits July 17, 2019 17:41

sync HF

dfe2669

debug write_predictions_extended()

f6c73bb

update last notebook

7e27e7a

add script to train xlnet reader on SQuAD 2.0

b4f1a1f

allow reader.fit(X='train-v2.0.json')

bae9143

small fixes

8e7bb0e

update notebook (workflow verified)

9081abc

quick fix error evaluation during training

640f984

fix no_cuda

993ac5e

keep basic tokenizer when using pretrained model

e17db67

add notebook tutorial predict with XLNet on custom dataset

8979d02

n0thingLLM mentioned this pull request Jul 20, 2019

Support for SQuAD 2.0 #157

Closed

1 task

andrelmfarias changed the title ~~Add XLNet support for Reader~~ [WIP] Add XLNet support for Reader Jul 24, 2019

andrelmfarias mentioned this pull request Jul 26, 2019

How use pytorch fine tunned model #220

Closed

sync with latest HF changes

c3d8a7a

sync latest HF changes

8276470

n0thingLLM mentioned this pull request Aug 17, 2019

No module named 'cdqa.reader.reader_sklearn' #237

Closed

andrelmfarias and others added 3 commits August 25, 2019 13:48

added verbose_logging option + reformatted with black

2eeb34a

added last verbose conditions

eab6401

add colab notebook for xlnet eval on squad 2.0

660760c

lewtun mentioned this pull request Oct 5, 2019

Bert model prediction is slow , consider more practical implementation #197

Open

final_predictions_sorted = collections.OrderedDict(
    sorted(
        final_predictions.items(),
        key=lambda item: item[1]['start_log_prob'] + item[1]['end_log_prob'],
        reverse=True))
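For reference, here is a self-contained sketch of what the fragment above does: it orders predictions by the sum of their start and end log-probabilities, best first. The keys and values in `final_predictions` are toy data, and the exact shape of the dictionary in the PR is an assumption.

```python
import collections

# Toy predictions; the real structure in the PR may carry more fields.
final_predictions = {
    "a": {"text": "Paris", "start_log_prob": -0.2, "end_log_prob": -0.3},
    "b": {"text": "London", "start_log_prob": -1.0, "end_log_prob": -0.1},
    "c": {"text": "Rome", "start_log_prob": -0.05, "end_log_prob": -0.1},
}

# Sort by joint log-probability (start + end), highest first.
final_predictions_sorted = collections.OrderedDict(
    sorted(
        final_predictions.items(),
        key=lambda item: item[1]["start_log_prob"] + item[1]["end_log_prob"],
        reverse=True,
    )
)

for key, pred in final_predictions_sorted.items():
    print(key, pred["text"])
```

Summing log-probabilities corresponds to multiplying the start and end probabilities, so the first entry is the span the model considers most likely overall.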

n0thingLLM wants to merge 43 commits into master from sync-huggingface

3 participants
