Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
535 commits
Select commit Hold shift + click to select a range
b9dd3a3
Dataframe implementation of class FDR
ypriverol Nov 10, 2021
81babe5
Dataframe implementation of class FDR
ypriverol Nov 10, 2021
7ba71d0
Dataframe implementation of class FDR
ypriverol Nov 10, 2021
b11b032
Dataframe implementation of class FDR
ypriverol Nov 11, 2021
da375c8
Dataframe implementation of class FDR
ypriverol Nov 11, 2021
5b40607
Dataframe implementation of class FDR
ypriverol Nov 11, 2021
3c9c785
Dataframe implementation of class FDR
ypriverol Nov 11, 2021
ed930cb
Dataframe implementation of class FDR
ypriverol Nov 11, 2021
ef35c54
Dataframe implementation of class FDR
ypriverol Nov 11, 2021
6d8f03a
Dataframe implementation of class FDR
ypriverol Nov 11, 2021
8784a02
support for triqler output
ypriverol Nov 12, 2021
3a7ba30
support for triqler output
ypriverol Nov 12, 2021
465ae6e
support for triqler output
ypriverol Nov 12, 2021
c6db15b
support for triqler output
ypriverol Nov 13, 2021
d757c20
support for triqler output
ypriverol Nov 13, 2021
3b0f340
support for triqler output
ypriverol Nov 13, 2021
b2693ce
explaining the Bayesian class FDR
ypriverol Nov 15, 2021
a300de2
explaining the Bayesian class FDR
ypriverol Nov 23, 2021
b45ccfd
Updated vcf-to-proteindb to parse using pandas and added support to f…
husensofteng Dec 1, 2021
1f3a4fe
requirement for pandas version
husensofteng Dec 1, 2021
c1a672f
Merge pull request #57 from bigbio/dev
ypriverol Dec 2, 2021
8676aa0
Merge branch 'master' of https://github.com/bigbio/py-pgatk into clas…
ypriverol Dec 2, 2021
c4265a1
merge new vcf reader in pandas
ypriverol Dec 2, 2021
abcbdc6
pythons github actions
ypriverol Dec 2, 2021
f9ad33c
pythons github actions
ypriverol Dec 2, 2021
c50d7b8
pythons github actions
ypriverol Dec 2, 2021
90c2818
pythons github actions
ypriverol Dec 2, 2021
59677cc
pythons github actions
ypriverol Dec 2, 2021
25d3d93
pythons github actions
ypriverol Dec 2, 2021
079f349
pythons github actions
ypriverol Dec 2, 2021
6bd97cb
pythons github actions
ypriverol Dec 2, 2021
4daddaa
pythons github actions
ypriverol Dec 2, 2021
d7c46a2
pythons github actions
ypriverol Dec 2, 2021
d936645
pythons github actions
ypriverol Dec 2, 2021
6820d57
pythons github actions
ypriverol Dec 2, 2021
631442b
pythons github actions
ypriverol Dec 2, 2021
c03cc2a
Remove bayesian class FDR.
ypriverol Dec 20, 2021
98f3246
Merge pull request #54 from bigbio/class-fdr
husensofteng Dec 20, 2021
1539e12
Update README.md
ypriverol May 8, 2022
7de493a
Added flag to specify separator in transcript description
husensofteng May 10, 2022
6f22e1a
Updated conda yaml to remove PyVCF from the requirements as it is no …
husensofteng May 23, 2022
a44d568
Updates version info for
ypriverol May 23, 2022
7d5ddc9
fixed separator for CDSs and other features from gffread fasta
husensofteng May 24, 2022
7f000f7
Merge branch 'dev' of https://github.com/bigbio/py-pgatk into class-fdr
ypriverol Jun 13, 2022
3331030
Remove bayesian class FDR.
ypriverol Jun 13, 2022
c59d309
Merge branch 'master' of https://github.com/bigbio/py-pgatk into clas…
ypriverol Jun 13, 2022
f41246b
change the config file
ypriverol Jun 13, 2022
f93006a
change the config file
ypriverol Jun 13, 2022
86b2976
change the config file
ypriverol Jun 13, 2022
620b09a
change the config file
ypriverol Jun 13, 2022
92c1c16
files can be now dowload to a file
ypriverol Jun 17, 2022
9e1ebf9
files can be now download to a file
ypriverol Jun 17, 2022
84799f5
files can be now download to a file
ypriverol Jun 17, 2022
a447e4f
Merge pull request #60 from bigbio/class-fdr
ypriverol Jun 17, 2022
bea7717
MANIFEST included
ypriverol Jun 20, 2022
bc5e0b0
Merge branch 'master' of https://github.com/bigbio/py-pgatk into clas…
ypriverol Jun 20, 2022
fc6c3ed
first iteration of ms-rescore configuration generation.
ypriverol Jun 20, 2022
23143c4
first iteration of ms-rescore configuration generation.
ypriverol Jun 21, 2022
97ed3b6
Updates version info for
ypriverol Jun 21, 2022
cc36a7b
first iteration of ms-rescore configuration generation.
ypriverol Jun 21, 2022
cd59fa1
first iteration of ms-rescore configuration generation.
ypriverol Jun 21, 2022
1f7324f
first iteration of ms-rescore configuration generation.
ypriverol Jun 21, 2022
39f4712
first iteration of ms-rescore configuration generation.
ypriverol Jun 21, 2022
7da51b3
first iteration of ms-rescore configuration generation.
ypriverol Jun 21, 2022
38ac1cd
first iteration of ms-rescore configuration generation.
ypriverol Jun 21, 2022
c85b9d7
first iteration of ms-rescore configuration generation.
ypriverol Jun 21, 2022
db5aa23
Merge pull request #62 from bigbio/class-fdr
ypriverol Jun 21, 2022
0956fc0
Merge remote-tracking branch 'origin/dev' into dev
ypriverol Jun 22, 2022
d233674
Updates version info for
ypriverol Jun 22, 2022
99d6eeb
Merge pull request #63 from bigbio/dev
ypriverol Jun 22, 2022
9bd1779
major changes dealing with configs files
ypriverol Jun 27, 2022
7edbd4c
major changes dealing with configs files
ypriverol Jun 28, 2022
621fee1
example of two psms for spectrumAI
husensofteng Jun 30, 2022
41b6d12
Merge branch 'dev' of https://github.com/bigbio/py-pgatk into dev
husensofteng Jun 30, 2022
2be5281
version command added
ypriverol Jul 4, 2022
16aab88
Merge branch 'dev' of https://github.com/bigbio/py-pgatk into dev
ypriverol Jul 4, 2022
042875d
default parameters captured.
ypriverol Jul 6, 2022
667edf8
changes in parameters handling and bug fixing
ypriverol Jul 6, 2022
88c57e9
fixing bug with thershold
ypriverol Jul 6, 2022
ac34362
extending tests
ypriverol Jul 6, 2022
8393f19
extending tests
ypriverol Jul 6, 2022
5baf3f8
fixed error in the ensembl.py
ypriverol Jul 6, 2022
7fba76a
small refactoring of the file structures
ypriverol Jul 6, 2022
ab5dfd4
bug fixed in ensembl default values
ypriverol Jul 6, 2022
3b80353
move header from var -> var_
ypriverol Jul 6, 2022
ee18274
fix error with accepted values in mutations.
ypriverol Jul 7, 2022
579d28a
fix errors with release- prefix and validation of taxonomies
ypriverol Jul 7, 2022
0bdf661
fix errors with release- prefix and validation of taxonomies
ypriverol Jul 7, 2022
443f14f
updated ReadMe adding the new commands
husensofteng Jul 7, 2022
bfb6e84
Merge pull request #64 from bigbio/dev
husensofteng Jul 7, 2022
611dd89
fixed bug in gtf file name, just keep version not release string
husensofteng Jul 8, 2022
32f8b66
Merge pull request #69 from bigbio/master
ypriverol Jul 13, 2022
2390235
Updates version info for
ypriverol Jul 13, 2022
2b4c34b
var_prefix -> protein_prefix
ypriverol Jul 13, 2022
3cb4c9f
Merge pull request #68 from bigbio/dev
husensofteng Jul 13, 2022
317631a
add spectrumAI
DongdongdongW Aug 23, 2022
2c316fd
fix and add test files
DongdongdongW Aug 26, 2022
cc5816d
ci test
DongdongdongW Aug 28, 2022
fcf48ca
fix matplotlib version
DongdongdongW Aug 28, 2022
cce6855
fix pyopenms version
DongdongdongW Aug 28, 2022
e689c49
fix
DongdongdongW Aug 28, 2022
73ecf19
Revert "fix"
DongdongdongW Aug 28, 2022
b1b1d4c
Update pythonpublish.yml
ypriverol Aug 29, 2022
c1029d8
update
DongdongdongW Aug 30, 2022
06a044a
Update requirements.txt
DongdongdongW Aug 30, 2022
ba78bdb
Create __init__.py
DongdongdongW Aug 30, 2022
fee9a00
Update setup.py
DongdongdongW Aug 30, 2022
0d9b9e3
add mztab class-fdr
DongdongdongW Sep 20, 2022
6d65d0c
Update mzTab_class_fdr.py
DongdongdongW Sep 20, 2022
1101a1c
update test
DongdongdongW Sep 21, 2022
6654dac
fix Codacy Static Code Analysis
DongdongdongW Sep 22, 2022
c6775f9
fix
DongdongdongW Sep 22, 2022
8d018e1
fix
DongdongdongW Sep 22, 2022
d3cf2a0
fix
DongdongdongW Sep 22, 2022
07fbebb
add blast
DongdongdongW Oct 6, 2022
d3d45e8
update biopython version
DongdongdongW Oct 6, 2022
af47ce2
add test file
DongdongdongW Oct 6, 2022
b0e8d22
Update pypgatk_tests.py
DongdongdongW Oct 6, 2022
f6d269d
Merge pull request #70 from DongdongdongW/spectrumAI
ypriverol Oct 17, 2022
a0d35b0
Fix the bug of class-fdr, adjust the blast to multi-process running t…
DongdongdongW Oct 31, 2022
99342ef
update pathos version
DongdongdongW Oct 31, 2022
bdb0d9d
update test command
DongdongdongW Oct 31, 2022
c334b18
Update pythonpackage.yml
DongdongdongW Oct 31, 2022
687bfab
Update pythonapp.yml
DongdongdongW Oct 31, 2022
dbac310
Update mzTab_class_fdr.py
DongdongdongW Oct 31, 2022
f333df6
Merge pull request #71 from DongdongdongW/spectrumAI
ypriverol Oct 31, 2022
77b5429
small changes in the name of the classes.
ypriverol Oct 31, 2022
5907cd6
minor changes in the naming
ypriverol Oct 31, 2022
9058a9c
minor refinements in the code.
ypriverol Oct 31, 2022
02bd95a
fixes issue #72
Nov 15, 2022
0f95bc4
Merge pull request #73 from bigbio/issue72
ypriverol Nov 15, 2022
228ddb8
update blast
DongdongdongW Mar 29, 2023
0154dbb
update spectrumAI
DongdongdongW Apr 16, 2024
3dd6869
Merge branch 'spectrumAI' of https://github.com/DongdongdongW/py-pgat…
DongdongdongW Apr 16, 2024
92384f3
update info
DongdongdongW Apr 16, 2024
1a8dda3
update
DongdongdongW Apr 18, 2024
3fcf781
update
DongdongdongW Apr 18, 2024
eead330
removing versions of dependencies
ypriverol Apr 19, 2024
6108524
changes on cbioportal download system.
ypriverol Apr 19, 2024
b7dfe7d
Merge branch 'spectrumAI' of https://github.com/DongdongdongW/py-pgat…
ypriverol Apr 19, 2024
cf00173
Merge branch 'master' of https://github.com/bigbio/py-pgatk into spec…
ypriverol Apr 19, 2024
b32f630
remove unnecesary code
ypriverol Apr 19, 2024
3792ad2
remove unnecesary code
ypriverol Apr 19, 2024
c72ff20
skip to tests not working with pytests pooling.
ypriverol Apr 19, 2024
b3ec7d9
skip to tests not working with pytests pooling.
ypriverol Apr 19, 2024
b07d14c
skip to tests not working with pytests pooling.
ypriverol Apr 19, 2024
98bee47
skip to tests not working with pytests pooling.
ypriverol Apr 19, 2024
3596398
skip to tests not working with pytests pooling.
ypriverol Apr 19, 2024
e9ebe47
skip to tests not working with pytests pooling.
ypriverol Apr 19, 2024
54db12e
skip to tests not working with pytests pooling.
ypriverol Apr 19, 2024
8eea6dc
skip to tests not working with pytests pooling.
ypriverol Apr 19, 2024
8cad583
skip to tests not working with pytests pooling.
ypriverol Apr 19, 2024
95c3354
minor changes in code.
ypriverol Apr 19, 2024
d5896bd
minor changes in code.
ypriverol Apr 19, 2024
4584a04
minor changes in code.
ypriverol Apr 19, 2024
f0023f2
minor changes in code.
ypriverol Apr 19, 2024
ec4c896
minor changes in code.
ypriverol Apr 19, 2024
3cef9ea
Merge pull request #77 from bigbio/spectrumAI
ypriverol Apr 19, 2024
c0ae4e4
small changes in blast -> USI accessions
ypriverol Apr 22, 2024
0df4a00
small changes in blast -> USI accessions
ypriverol Apr 22, 2024
279a94b
small changes in blast -> USI accessions
ypriverol Apr 22, 2024
9750bd4
Updates version info for
ypriverol Apr 22, 2024
adaa875
Merge branch 'master' of https://github.com/bigbio/py-pgatk into spec…
ypriverol Apr 23, 2024
01026c0
update new blast and spectrumAI
DongdongdongW Apr 24, 2024
8d673eb
Merge pull request #79 from DongdongdongW/spectrumAI
ypriverol Apr 24, 2024
fbf8a30
validate_peptides.py -> spectrumai.py
ypriverol Apr 24, 2024
da0f0fc
validate_peptides.py -> spectrumai.py
ypriverol Apr 24, 2024
efffc6d
validate_peptides.py -> spectrumai.py
ypriverol Apr 24, 2024
cfb7d02
Merge pull request #80 from bigbio/spectrumAI
ypriverol Apr 24, 2024
789dc87
disable cbioportal download test
ypriverol Apr 24, 2024
d4df92e
disable cbioportal download test
ypriverol Apr 24, 2024
a367755
disable cbioportal download test
ypriverol Apr 24, 2024
b98d59b
disable cbioportal download test
ypriverol Apr 24, 2024
b2951e8
Merge branch 'master' of https://github.com/bigbio/py-pgatk into spec…
ypriverol Apr 24, 2024
2a1b0d2
Merge remote-tracking branch 'origin/dev' into dev
ypriverol Apr 24, 2024
5339047
disable cbioportal download test
ypriverol Apr 24, 2024
4d7765c
Merge pull request #82 from bigbio/dev
ypriverol Apr 24, 2024
4cb65af
disable cbioportal download test
ypriverol Apr 26, 2024
89d4816
Merge pull request #83 from bigbio/dev
ypriverol Apr 26, 2024
5c91508
minor change in gitignore
ypriverol Jan 3, 2026
e737f0f
major changes to structure
ypriverol Jan 3, 2026
e9c28f9
minor changes
ypriverol Jan 3, 2026
6cdce09
support for major versions
ypriverol Jan 3, 2026
534a3e9
minor changes
ypriverol Jan 3, 2026
6b7a5f2
minor changes
ypriverol Jan 3, 2026
66fe927
minor changes
ypriverol Jan 3, 2026
e8e3311
minor chhanges for code review
ypriverol Jan 3, 2026
07ffbd2
minor changes
ypriverol Jan 3, 2026
ec32831
pinned versions
ypriverol Jan 3, 2026
7c42e06
minor changes
ypriverol Jan 3, 2026
96ac6cc
minonr changes
ypriverol Jan 3, 2026
27ced62
Minro changes
ypriverol Mar 1, 2026
7157769
Remove large Ensembl reference files from tracking
ypriverol Mar 1, 2026
e4f4586
Updates version info for
ypriverol Mar 1, 2026
6b7063f
refactor: rename pypgatk to pgatk
ypriverol Mar 1, 2026
daf560c
ci: exclude docs stylesheets from Codacy analysis
ypriverol Mar 1, 2026
46cc5fe
refactor: remove unused mzTab class FDR feature
ypriverol Mar 1, 2026
b6af1dc
refactor: add config registry for auto-loading bundled defaults
ypriverol Mar 1, 2026
bcf5e43
fix: resolve multiple bugs across codebase
ypriverol Mar 1, 2026
656b086
refactor: remove dead code and improve logging
ypriverol Mar 1, 2026
bcae956
refactor: remove OpenMS FDR computation and peptide-class-fdr command
ypriverol Mar 1, 2026
008e471
Merge branch 'dev' of https://github.com/bigbio/py-pgatk into refacto…
ypriverol Mar 1, 2026
0a2c376
merge: integrate master with PRs #86, #87, #88 into refactor/rename
ypriverol Mar 1, 2026
29cd2f9
chore: remove large Ensembl test data files from tracking
ypriverol Mar 1, 2026
d54c0a4
fix: replace deprecated Bio.pairwise2 with sliding window in blast
ypriverol Mar 1, 2026
ed534eb
refactor: rename pypgatk_tests.py to pgatk_tests.py
ypriverol Mar 1, 2026
9840614
fix(#67): verify downloaded files exist and are non-empty in tests
ypriverol Mar 1, 2026
4bd8e43
style: remove unnecessary lambda in blast_get_position
ypriverol Mar 1, 2026
576f190
fix: resolve multiple bugs from code review
ypriverol Mar 1, 2026
cf17786
docs: add graph engine design and Phase 0 implementation plan
ypriverol Mar 2, 2026
a97c42c
test: add conftest.py with testdata/config path fixtures
ypriverol Mar 2, 2026
b141af8
fix: rename shadowing variable decoy_sequence in print_target_decoy_c…
ypriverol Mar 2, 2026
c1fd195
refactor: convert SNP to dataclass, rename .type to .mutation_type
ypriverol Mar 2, 2026
b2a06df
refactor: replace pathos with concurrent.futures in blast_get_position
ypriverol Mar 2, 2026
ae5272a
refactor: replace pathos with concurrent.futures in spectrumai, remov…
ypriverol Mar 2, 2026
4ee6268
refactor: replace print() with logger across all production modules
ypriverol Mar 2, 2026
f7b961b
refactor: replace broad except Exception with specific exception types
ypriverol Mar 2, 2026
290c382
refactor: add get_config_value helper, DRY up config fallback pattern
ypriverol Mar 2, 2026
f5d952c
refactor: convert digest_mutant_protein from standalone script to Cli…
ypriverol Mar 2, 2026
479f210
refactor: convert map_peptide2genome from standalone script to Click …
ypriverol Mar 2, 2026
4ce76c2
refactor: use pathlib.Path for file path operations
ypriverol Mar 2, 2026
38e0a93
refactor: add type hints to all public API methods
ypriverol Mar 2, 2026
37ff86b
test: add unit tests for get_altseq, get_mut_pro_seq, and revswitch
ypriverol Mar 2, 2026
fdde235
fix: resolve Codacy static analysis warnings
ypriverol Mar 2, 2026
5848290
fix: prefix unused tuple variables with underscore in tests
ypriverol Mar 2, 2026
234194b
ci: exclude test files from Codacy static analysis
ypriverol Mar 2, 2026
db034fc
Minor changes
ypriverol Mar 2, 2026
e84d16d
Minor change
ypriverol Mar 2, 2026
e20f8c8
Merge pull request #89 from bigbio/refactor/rename
ypriverol Mar 2, 2026
61020a2
docs: add ClinVar/NCBI RefSeq support design document
ypriverol Mar 2, 2026
3242516
docs: add ClinVar/NCBI implementation plan (7 tasks, TDD)
ypriverol Mar 2, 2026
8cf5040
refactor: extract shared VCF utility functions to toolbox/vcf_utils.py
ypriverol Mar 2, 2026
be2195e
feat: add ChromosomeMapper for NC_/numeric/UCSC chromosome name conve…
ypriverol Mar 2, 2026
136d70a
feat: add ClinVar config with CLNSIG filtering and NCBI FTP URLs
ypriverol Mar 2, 2026
343619e
feat: add ClinVarService pipeline for VCF-to-protein generation
ypriverol Mar 2, 2026
366349b
fix: handle multi-allelic ALT and add CLNSIG metadata to FASTA headers
ypriverol Mar 3, 2026
b9416e5
feat: add NcbiDataDownloader for RefSeq/ClinVar file downloads
ypriverol Mar 3, 2026
3e5e4eb
feat: add clinvar-to-proteindb and ncbi-downloader CLI commands
ypriverol Mar 3, 2026
409d2b7
test: add end-to-end CLI integration tests for ClinVar pipeline
ypriverol Mar 3, 2026
e23deca
docs: add ClinVar pipeline hardening implementation plan
ypriverol Mar 3, 2026
f8860ea
feat(clinvar): activate MC consequence filtering from config
ypriverol Mar 3, 2026
989c463
refactor(clinvar): single-pass VCF reading — build overlap map from D…
ypriverol Mar 3, 2026
7debf23
feat(clinvar): add biotype filtering from GTF gene_biotype attribute
ypriverol Mar 3, 2026
335cbeb
fix(clinvar): add duplicate variant-transcript guard, clean up _read_vcf
ypriverol Mar 3, 2026
3d4eb7c
test(clinvar): add integration tests for MC filtering and output vali…
ypriverol Mar 3, 2026
0b07b70
minor changes
ypriverol Mar 3, 2026
6db35d9
Merge branch 'dev' of https://github.com/bigbio/py-pgatk into refacto…
ypriverol Mar 3, 2026
90c5c22
clean scripts
ypriverol Mar 3, 2026
9daa268
minor changes
ypriverol Mar 3, 2026
d9c00f5
Merge pull request #91 from bigbio/refactor/rename
ypriverol Mar 3, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
5 changes: 5 additions & 0 deletions .codacy.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
---
exclude_paths:
- "docs/stylesheets/**"
- "docs/plans/**"
- "pgatk/tests/**"
43 changes: 43 additions & 0 deletions .github/workflows/deploy-docs.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,43 @@
name: Deploy Documentation

on:
push:
branches: ["master"]
workflow_dispatch:

permissions:
contents: write

jobs:
deploy-docs:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v4
with:
fetch-depth: 0

- name: Set up Python 3.10
uses: actions/setup-python@v5
with:
python-version: "3.10"

- name: Cache pip dependencies
uses: actions/cache@v4
with:
path: ~/.cache/pip
key: ${{ runner.os }}-pip-mkdocs
restore-keys: |
${{ runner.os }}-pip-

- name: Install MkDocs and dependencies
run: |
python -m pip install --upgrade pip
python -m pip install mkdocs-material

- name: Build MkDocs documentation
run: |
mkdocs build --strict

- name: Deploy to GitHub Pages
run: |
mkdocs gh-deploy --force --message "docs: deploy documentation [skip ci]"
4 changes: 2 additions & 2 deletions .github/workflows/pythonapp.yml
Original file line number Diff line number Diff line change
Expand Up @@ -46,7 +46,7 @@ jobs:
- name: Test with unittest
shell: bash -l {0}
run: |
cd pypgatk
cd pgatk
python -m unittest discover -s tests -p "*_tests.py" -v

- name: Update changelog
Expand All @@ -67,7 +67,7 @@ jobs:
shell: bash -l {0}
run: |
pip install pytest pytest-cov coverage
cd pypgatk
cd pgatk
coverage run -m unittest discover -s tests -p "*_tests.py"
coverage xml -o coverage.xml

Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/pythonpackage.yml
Original file line number Diff line number Diff line change
Expand Up @@ -46,5 +46,5 @@ jobs:
- name: Test with unittest
shell: bash -l {0}
run: |
cd pypgatk
cd pgatk
python -m unittest discover -s tests -p "*_tests.py" -v
29 changes: 21 additions & 8 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -108,17 +108,30 @@ venv.bak/
.pydevproject

.idea/
pypgatk/database_ensembl/
pypgatk/database_cbioportal/
pypgatk/database_cosmic/
pypgatk/config/private/
pgatk/database_ensembl/
pgatk/database_cbioportal/
pgatk/database_cosmic/
pgatk/config/private/
.pypirc
/pypgatk/test_all.bed
/pypgatk/test_annotated.vcf

/pgatk/test_all.bed
/pgatk/test_annotated.vcf

# Test-generated output files (produced by pgatk tests)
pgatk/testdata/proteindb_from_*.fa
pgatk/testdata/output_decoy.fa
pgatk/testdata/test_cbioportal_data_mutations_mskcc_proteindb*.fa
pgatk/testdata/test_cosmic_mutations_proteindb*.fa
pgatk/testdata/test_blast_psms_out.tsv
pgatk/testdata/test_blast_validate_psms_out.tsv
pgatk/testdata/test_validate_psms_out.tsv
pgatk/testdata/20151020_QE3_UPLC8_DBJ_SA_HCT116_Rep2_46frac_10_consensus_filter.idxml
pgatk/testdata/proteindb_minus_strand_test.fa

# Large Ensembl reference files (download at test time, not in git)
pgatk/testdata/Meleagris_gallopavo*

#Ignore cursor AI rules
.cursor/rules/codacy.mdc
.DS_Store
.codacy/
.cursor/
.cursor/
74 changes: 37 additions & 37 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,96 +1,96 @@
# Changelog

## [v0.0.24](https://github.com/bigbio/py-pgatk/tree/v0.0.24) (2024-04-19)
## [v0.0.24](https://github.com/bigbio/pgatk/tree/v0.0.24) (2024-04-19)

**Implemented enhancements:**

- change var\_prefix [\#66](https://github.com/bigbio/py-pgatk/issues/66)
- change var\_prefix [\#66](https://github.com/bigbio/pgatk/issues/66)

## [v0.0.23](https://github.com/bigbio/py-pgatk/tree/v0.0.23) (2022-07-07)
## [v0.0.23](https://github.com/bigbio/pgatk/tree/v0.0.23) (2022-07-07)

## [v0.0.22](https://github.com/bigbio/py-pgatk/tree/v0.0.22) (2022-06-21)
## [v0.0.22](https://github.com/bigbio/pgatk/tree/v0.0.22) (2022-06-21)

## [v0.0.21](https://github.com/bigbio/py-pgatk/tree/v0.0.21) (2022-06-17)
## [v0.0.21](https://github.com/bigbio/pgatk/tree/v0.0.21) (2022-06-17)

**Implemented enhancements:**

- gffread fasta generated file uses space and semicolon [\#59](https://github.com/bigbio/py-pgatk/issues/59)
- gffread fasta generated file uses space and semicolon [\#59](https://github.com/bigbio/pgatk/issues/59)

**Fixed bugs:**

- Write urls for download into file instead of downloading [\#61](https://github.com/bigbio/py-pgatk/issues/61)
- changing values in the config files have no effect [\#58](https://github.com/bigbio/py-pgatk/issues/58)
- Write urls for download into file instead of downloading [\#61](https://github.com/bigbio/pgatk/issues/61)
- changing values in the config files have no effect [\#58](https://github.com/bigbio/pgatk/issues/58)

## [v0.0.20](https://github.com/bigbio/py-pgatk/tree/v0.0.20) (2022-05-10)
## [v0.0.20](https://github.com/bigbio/pgatk/tree/v0.0.20) (2022-05-10)

**Implemented enhancements:**

- Implementation of Class FDR [\#56](https://github.com/bigbio/py-pgatk/issues/56)
- Cleaning the code [\#50](https://github.com/bigbio/py-pgatk/issues/50)
- Implementation of Class FDR [\#56](https://github.com/bigbio/pgatk/issues/56)
- Cleaning the code [\#50](https://github.com/bigbio/pgatk/issues/50)

**Fixed bugs:**

- pyVCF reader deprecated only supported for py3.6 [\#55](https://github.com/bigbio/py-pgatk/issues/55)
- pyVCF reader deprecated only supported for py3.6 [\#55](https://github.com/bigbio/pgatk/issues/55)

**Closed issues:**

- Fasta input for vcf-to-proteindb [\#29](https://github.com/bigbio/py-pgatk/issues/29)
- Fasta input for vcf-to-proteindb [\#29](https://github.com/bigbio/pgatk/issues/29)

## [v0.0.19](https://github.com/bigbio/py-pgatk/tree/v0.0.19) (2021-03-31)
## [v0.0.19](https://github.com/bigbio/pgatk/tree/v0.0.19) (2021-03-31)

## [v0.0.18](https://github.com/bigbio/py-pgatk/tree/v0.0.18) (2021-03-30)
## [v0.0.18](https://github.com/bigbio/pgatk/tree/v0.0.18) (2021-03-30)

**Fixed bugs:**

- Downloading all the cBioPortal experiments failing [\#20](https://github.com/bigbio/py-pgatk/issues/20)
- Downloading all the cBioPortal experiments failing [\#20](https://github.com/bigbio/pgatk/issues/20)

## [v0.0.17](https://github.com/bigbio/py-pgatk/tree/v0.0.17) (2021-03-12)
## [v0.0.17](https://github.com/bigbio/pgatk/tree/v0.0.17) (2021-03-12)

## [v0.0.16](https://github.com/bigbio/py-pgatk/tree/v0.0.16) (2021-03-06)
## [v0.0.16](https://github.com/bigbio/pgatk/tree/v0.0.16) (2021-03-06)

## [v0.0.15](https://github.com/bigbio/py-pgatk/tree/v0.0.15) (2021-02-17)
## [v0.0.15](https://github.com/bigbio/pgatk/tree/v0.0.15) (2021-02-17)

## [v0.0.14](https://github.com/bigbio/py-pgatk/tree/v0.0.14) (2021-02-15)
## [v0.0.14](https://github.com/bigbio/pgatk/tree/v0.0.14) (2021-02-15)

## [v0.0.13](https://github.com/bigbio/py-pgatk/tree/v0.0.13) (2021-02-15)
## [v0.0.13](https://github.com/bigbio/pgatk/tree/v0.0.13) (2021-02-15)

## [v0.0.12](https://github.com/bigbio/py-pgatk/tree/v0.0.12) (2021-02-11)
## [v0.0.12](https://github.com/bigbio/pgatk/tree/v0.0.12) (2021-02-11)

## [v0.0.11](https://github.com/bigbio/py-pgatk/tree/v0.0.11) (2021-02-07)
## [v0.0.11](https://github.com/bigbio/pgatk/tree/v0.0.11) (2021-02-07)

## [v0.0.10](https://github.com/bigbio/py-pgatk/tree/v0.0.10) (2021-02-05)
## [v0.0.10](https://github.com/bigbio/pgatk/tree/v0.0.10) (2021-02-05)

**Implemented enhancements:**

- Ensembl release [\#23](https://github.com/bigbio/py-pgatk/issues/23)
- Add a changelog.md to the project [\#22](https://github.com/bigbio/py-pgatk/issues/22)
- Ensembl release [\#23](https://github.com/bigbio/pgatk/issues/23)
- Add a changelog.md to the project [\#22](https://github.com/bigbio/pgatk/issues/22)

## [v0.0.9](https://github.com/bigbio/py-pgatk/tree/v0.0.9) (2021-02-03)
## [v0.0.9](https://github.com/bigbio/pgatk/tree/v0.0.9) (2021-02-03)

**Implemented enhancements:**

- overlap between config parameters in cgenomes between cosmic and cbioportal downloaders [\#17](https://github.com/bigbio/py-pgatk/issues/17)
- Major review to the code [\#8](https://github.com/bigbio/py-pgatk/issues/8)
- overlap between config parameters in cgenomes between cosmic and cbioportal downloaders [\#17](https://github.com/bigbio/pgatk/issues/17)
- Major review to the code [\#8](https://github.com/bigbio/pgatk/issues/8)

**Fixed bugs:**

- Error from ENSEMBL URL retry [\#25](https://github.com/bigbio/py-pgatk/issues/25)
- Error from ENSEMBL URL retry [\#25](https://github.com/bigbio/pgatk/issues/25)

## [v0.0.8](https://github.com/bigbio/py-pgatk/tree/v0.0.8) (2021-01-19)
## [v0.0.8](https://github.com/bigbio/pgatk/tree/v0.0.8) (2021-01-19)

**Implemented enhancements:**

- Cosmic mutations for celllines [\#28](https://github.com/bigbio/py-pgatk/issues/28)
- Cosmic mutations for celllines [\#28](https://github.com/bigbio/pgatk/issues/28)

## [v0.0.7](https://github.com/bigbio/py-pgatk/tree/v0.0.7) (2021-01-11)
## [v0.0.7](https://github.com/bigbio/pgatk/tree/v0.0.7) (2021-01-11)

## [v0.0.6](https://github.com/bigbio/py-pgatk/tree/v0.0.6) (2020-05-02)
## [v0.0.6](https://github.com/bigbio/pgatk/tree/v0.0.6) (2020-05-02)

**Implemented enhancements:**

- Add the decoy Sanger tool to the library [\#19](https://github.com/bigbio/py-pgatk/issues/19)
- Major refactoring of the library to be more package oriented design [\#13](https://github.com/bigbio/py-pgatk/issues/13)
- Create / Download Canonical protein fasta file from ENSEMBL [\#1](https://github.com/bigbio/py-pgatk/issues/1)
- Add the decoy Sanger tool to the library [\#19](https://github.com/bigbio/pgatk/issues/19)
- Major refactoring of the library to be more package oriented design [\#13](https://github.com/bigbio/pgatk/issues/13)
- Create / Download Canonical protein fasta file from ENSEMBL [\#1](https://github.com/bigbio/pgatk/issues/1)



Expand Down
8 changes: 3 additions & 5 deletions Dockerfile
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
FROM biocontainers/biocontainers:debian-stretch-backports
MAINTAINER Yasset Perez-Riverol <ypriverol@gmail.com>
LABEL software="pypgatk" \
container="pypgatk" \
LABEL software="pgatk" \
container="pgatk" \
software.version="0.0.26" \
version="1"

Expand All @@ -13,11 +13,9 @@ WORKDIR /data
RUN mkdir -p /tool/source

RUN git config --global http.sslVerify false
RUN git clone --depth 1 https://github.com/bigbio/py-pgatk.git /tool/source
RUN git clone --depth 1 https://github.com/bigbio/pgatk.git /tool/source
Comment on lines 15 to +16
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🟠 Major

Do not disable Git SSL verification during image build.

http.sslVerify false weakens transport security for source fetches and increases supply-chain risk.

🔒 Proposed fix
-RUN git config --global http.sslVerify false
 RUN git clone --depth 1 https://github.com/bigbio/pgatk.git /tool/source
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
RUN git config --global http.sslVerify false
RUN git clone --depth 1 https://github.com/bigbio/py-pgatk.git /tool/source
RUN git clone --depth 1 https://github.com/bigbio/pgatk.git /tool/source
RUN git clone --depth 1 https://github.com/bigbio/pgatk.git /tool/source
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@Dockerfile` around lines 15 - 16, Remove the insecure override by deleting
the RUN git config --global http.sslVerify false line and ensure the git clone
step (git clone --depth 1 https://github.com/bigbio/pgatk.git /tool/source) runs
with normal SSL verification; if the base image lacks CA certificates,
install/enable the distro CA bundle (e.g., ensure ca-certificates is present) or
switch to a verified transport (SSH key or explicitly set a trusted CA via
GIT_SSL_CAINFO) so the clone succeeds without disabling http.sslVerify.

WORKDIR /tool/source
RUN pip3 install --no-cache-dir -r requirements.txt && pip3 install --no-cache-dir -e .

ENV LC_ALL=C.UTF-8
ENV LANG=C.UTF-8
ENV PATH=$PATH:/tool/source/pypgatk/
RUN chmod +x /tool/source/pypgatk/pypgatkc.py
4 changes: 2 additions & 2 deletions MANIFEST.in
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
include *.md
include *.txt
include *.yaml
recursive-include pypgatk *.json
recursive-include pypgatk *.yaml
recursive-include pgatk *.json
recursive-include pgatk *.yaml
Loading
Loading