Data quantity issue

The eval_data file downloaded from the provided link contains 1,399 data points for POPQA and 500 for Bio. Are these the full datasets used in the actual experiments, or only a subset?