ML predictor integration into pvacseq by jyao36 · Pull Request #1333 · griffithlab/pVACtools

jyao36 · 2025-11-03T23:29:37Z

…odel artifacts; modified argument parser and pvacseq run.py file to run the predictor as part of pvacseq; added test files and script.

…ts.yml

susannasiebert

This looks pretty good. I added some comments for improvements that could be made.

I would also like to see the addition of a standalone pvacseq add_ml_predictions command that someone could run standalone to add ml_predictions to the output from their pipeline if the ml option was not enabled in the original run. Have a look at pvactools/tools/pvacseq/mark_genes_of_interest.py and its test tests/test_pvacseq_mark_genes_of_interest.py to see how to do so. This command addition also requires an update to pvactools/tools/pvacseq/main.py and pvactools/tools/pvacseq/__init__.py to add this command.

Additionally, this new command should then be documented in docs/pvacseq/optional_downstream_analysis_tools.rst. Most of the documentation can be auto-generated from the command line docs by using the program-output tool but this would also be the spot to put any additional documentation you want to have around this feature.

susannasiebert

Left a couple of code suggestions to convert named required parameters to positional parameters in your parser.

Co-authored-by: Susanna Kiwala <susanna.kiwala@wustl.edu>

…or; updated argument format for add_ml_predictions.py

…mat.

…column is present

susannasiebert

+1

susannasiebert

I made some copy editing suggestions. Two thoughts I had while reviewing this:

It would be great to have some additional exploration of different candidates in ML section of the pVACview vignette. I wrote down some examples in a comment in that part of the text.
Have you and Malachi discussed whether it would be worthwhile to make the reject threshold configurable? It seems a bit weird to me to hardcode the reject threshold but make the accept threshold configurable.

susannasiebert

+1

susannasiebert · 2026-01-20T19:32:39Z

+     - Directory containing image files for pVACview. Not generated when running with presentation and immunogenicity algorithms only.  
+   * - ``ml_predict/<sample_name>_predict_pvacview.tsv`` (optional)
+     - ML-based neoantigen evaluation predictions file. Generated when both MHC Class I and Class II predictions are run and the ``--run-ml-predictions`` flag is set.
     - Directory containing image files for pVACview. Not generated when running only with presentation and immunogenicity algorithms only.


Suggested change

- Directory containing image files for pVACview. Not generated when running only with presentation and immunogenicity algorithms only.

This should fix the issue with this table not rendering.

… columns and fill with NaN; Updated ML file name, argument names; Updated documentation

susannasiebert

This looks good. Just two minor change requests.

susannasiebert · 2026-03-10T12:50:27Z

    parser.add_argument(
        "output_dir",
-        help="Directory where the ML prediction TSV files should be written."
+        nargs="?",


Instead of using nargs="?" and a positional parameter for optional parameters, we've been using the -- convention (like for the --ml-threshold-accept and --ml-threshold-predict). If you rename the parameter to --output-dir it will automatically be optional. It should also be moved behind all of the positional parameters, i.e between sample_name and --ml-threshold-accept).

Co-authored-by: Susanna Kiwala <susanna.kiwala@wustl.edu>

Incorporate ml_predictor module into pvactools; Uploaded associated m…

7adece8

…odel artifacts; modified argument parser and pvacseq run.py file to run the predictor as part of pvacseq; added test files and script.

jyao36 changed the title ~~Incorporate ml_predictor module into pvactools; Uploaded associated m…~~ ML predictor integration into pvacseq Nov 3, 2025

Add imbalanced-learn dependency installation to .github/workflows/tes…

3ef9937

…ts.yml

susannasiebert reviewed Nov 5, 2025

View reviewed changes

Comment thread pvactools/tools/pvacview/server.R

susannasiebert added this to the 7.0 milestone Nov 10, 2025

Base automatically changed from percentile_cutoffs to 7.0.0 November 14, 2025 13:32

jyao36 and others added 3 commits November 18, 2025 19:09

add add_ml_predictions.py and test script

6f44f1a

fixed test_pvacseq_add_ml_predictions.py

6a6e843

Comment out tests that fail nondeterministically

de5cc22

susannasiebert reviewed Nov 26, 2025

View reviewed changes

Comment thread pvactools/lib/ml_predictor.py Outdated

Comment thread pvactools/lib/ml_predictor.py Outdated

Comment thread pvactools/lib/ml_predictor.py Outdated

jyao36 and others added 4 commits November 26, 2025 14:41

Update pvactools/lib/ml_predictor.py

60c46ea

Co-authored-by: Susanna Kiwala <susanna.kiwala@wustl.edu>

Update pvactools/lib/ml_predictor.py

0a100bd

Co-authored-by: Susanna Kiwala <susanna.kiwala@wustl.edu>

Update pvactools/lib/ml_predictor.py

65ec594

Co-authored-by: Susanna Kiwala <susanna.kiwala@wustl.edu>

updated pvacview ui.R and server.R to include footnote for ML predict…

5992296

…or; updated argument format for add_ml_predictions.py

susannasiebert reviewed Dec 1, 2025

View reviewed changes

Comment thread pvactools/lib/ml_predictor.py Outdated

susannasiebert reviewed Dec 1, 2025

View reviewed changes

Comment thread ...ml_predictor/ml_predictor_test_output/HCC1395.MHC_I.all_epitopes.aggregated.ML_predicted.tsv

susannasiebert reviewed Dec 1, 2025

View reviewed changes

Comment thread pvactools/tools/pvacview/server.R Outdated

jyao36 added 3 commits December 1, 2025 16:29

update output file format to preserve original class I aggregated for…

5f37de5

…mat.

update server.R to include footnote at all times when the prediction …

61260a8

…column is present

Update python "None" value handling.

8b35c79

susannasiebert approved these changes Dec 2, 2025

View reviewed changes

jyao36 and others added 2 commits December 10, 2025 17:10

documentation for ML predictor

c926053

Merge branch '7.0.0' into ml_predictor3

1ad3651

susannasiebert requested changes Dec 11, 2025

View reviewed changes

jyao36 added 2 commits December 12, 2025 14:27

pin scikit-learn version

65b1ef8

update scikit-learn version

4132b45

jyao36 marked this pull request as ready for review December 15, 2025 16:47

jyao36 added 2 commits December 19, 2025 16:40

incorporate comments on documentation files

4b0ca2d

add threshold-reject instead of hardcoding the number

a9c7eae

susannasiebert approved these changes Jan 12, 2026

View reviewed changes

jyao36 commented Jan 20, 2026

View reviewed changes

Comment thread docs/pvacseq/output_files.rst

susannasiebert reviewed Jan 20, 2026

View reviewed changes

jyao36 and others added 4 commits January 20, 2026 15:18

Added test in tes_pvacseeq.py; Updated ml_predictor.py to add missing…

380675a

… columns and fill with NaN; Updated ML file name, argument names; Updated documentation

Merge remote-tracking branch 'origin/7.0.0' into ml_predictor3

16cfad8

Add missing dependency

358519b

updated output file saved location to Class I folder

0066b13

susannasiebert requested changes Mar 10, 2026

View reviewed changes

jyao36 and others added 3 commits March 10, 2026 10:37

Update docs/pvacseq/output_files.rst

ae97284

Co-authored-by: Susanna Kiwala <susanna.kiwala@wustl.edu>

updated outdir parameter

96af94a

Fix failing test

6210546

Conversation

jyao36 commented Nov 3, 2025

Uh oh!

susannasiebert left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

susannasiebert left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

susannasiebert left a comment

Choose a reason for hiding this comment

Uh oh!

susannasiebert left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

susannasiebert left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

susannasiebert Jan 20, 2026

Choose a reason for hiding this comment

Uh oh!

susannasiebert left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

susannasiebert Mar 10, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

susannasiebert left a comment •

edited

Loading