-
Notifications
You must be signed in to change notification settings - Fork 13
Major refactor of the database generation tool #90
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Changes from all commits
Commits
Show all changes
535 commits
Select commit
Hold shift + click to select a range
b9dd3a3
Dataframe implementation of class FDR
ypriverol 81babe5
Dataframe implementation of class FDR
ypriverol 7ba71d0
Dataframe implementation of class FDR
ypriverol b11b032
Dataframe implementation of class FDR
ypriverol da375c8
Dataframe implementation of class FDR
ypriverol 5b40607
Dataframe implementation of class FDR
ypriverol 3c9c785
Dataframe implementation of class FDR
ypriverol ed930cb
Dataframe implementation of class FDR
ypriverol ef35c54
Dataframe implementation of class FDR
ypriverol 6d8f03a
Dataframe implementation of class FDR
ypriverol 8784a02
support for triqler output
ypriverol 3a7ba30
support for triqler output
ypriverol 465ae6e
support for triqler output
ypriverol c6db15b
support for triqler output
ypriverol d757c20
support for triqler output
ypriverol 3b0f340
support for triqler output
ypriverol b2693ce
explaining the Bayesian class FDR
ypriverol a300de2
explaining the Bayesian class FDR
ypriverol b45ccfd
Updated vcf-to-proteindb to parse using pandas and added support to f…
husensofteng 1f3a4fe
requirement for pandas version
husensofteng c1a672f
Merge pull request #57 from bigbio/dev
ypriverol 8676aa0
Merge branch 'master' of https://github.com/bigbio/py-pgatk into clas…
ypriverol c4265a1
merge new vcf reader in pandas
ypriverol abcbdc6
pythons github actions
ypriverol f9ad33c
pythons github actions
ypriverol c50d7b8
pythons github actions
ypriverol 90c2818
pythons github actions
ypriverol 59677cc
pythons github actions
ypriverol 25d3d93
pythons github actions
ypriverol 079f349
pythons github actions
ypriverol 6bd97cb
pythons github actions
ypriverol 4daddaa
pythons github actions
ypriverol d7c46a2
pythons github actions
ypriverol d936645
pythons github actions
ypriverol 6820d57
pythons github actions
ypriverol 631442b
pythons github actions
ypriverol c03cc2a
Remove bayesian class FDR.
ypriverol 98f3246
Merge pull request #54 from bigbio/class-fdr
husensofteng 1539e12
Update README.md
ypriverol 7de493a
Added flag to specify separator in transcript description
husensofteng 6f22e1a
Updated conda yaml to remove PyVCF from the requirements as it is no …
husensofteng a44d568
Updates version info for
ypriverol 7d5ddc9
fixed separator for CDSs and other features from gffread fasta
husensofteng 7f000f7
Merge branch 'dev' of https://github.com/bigbio/py-pgatk into class-fdr
ypriverol 3331030
Remove bayesian class FDR.
ypriverol c59d309
Merge branch 'master' of https://github.com/bigbio/py-pgatk into clas…
ypriverol f41246b
change the config file
ypriverol f93006a
change the config file
ypriverol 86b2976
change the config file
ypriverol 620b09a
change the config file
ypriverol 92c1c16
files can be now dowload to a file
ypriverol 9e1ebf9
files can be now download to a file
ypriverol 84799f5
files can be now download to a file
ypriverol a447e4f
Merge pull request #60 from bigbio/class-fdr
ypriverol bea7717
MANIFEST included
ypriverol bc5e0b0
Merge branch 'master' of https://github.com/bigbio/py-pgatk into clas…
ypriverol fc6c3ed
first iteration of ms-rescore configuration generation.
ypriverol 23143c4
first iteration of ms-rescore configuration generation.
ypriverol 97ed3b6
Updates version info for
ypriverol cc36a7b
first iteration of ms-rescore configuration generation.
ypriverol cd59fa1
first iteration of ms-rescore configuration generation.
ypriverol 1f7324f
first iteration of ms-rescore configuration generation.
ypriverol 39f4712
first iteration of ms-rescore configuration generation.
ypriverol 7da51b3
first iteration of ms-rescore configuration generation.
ypriverol 38ac1cd
first iteration of ms-rescore configuration generation.
ypriverol c85b9d7
first iteration of ms-rescore configuration generation.
ypriverol db5aa23
Merge pull request #62 from bigbio/class-fdr
ypriverol 0956fc0
Merge remote-tracking branch 'origin/dev' into dev
ypriverol d233674
Updates version info for
ypriverol 99d6eeb
Merge pull request #63 from bigbio/dev
ypriverol 9bd1779
major changes dealing with configs files
ypriverol 7edbd4c
major changes dealing with configs files
ypriverol 621fee1
example of two psms for spectrumAI
husensofteng 41b6d12
Merge branch 'dev' of https://github.com/bigbio/py-pgatk into dev
husensofteng 2be5281
version command added
ypriverol 16aab88
Merge branch 'dev' of https://github.com/bigbio/py-pgatk into dev
ypriverol 042875d
default parameters captured.
ypriverol 667edf8
changes in parameters handling and bug fixing
ypriverol 88c57e9
fixing bug with thershold
ypriverol ac34362
extending tests
ypriverol 8393f19
extending tests
ypriverol 5baf3f8
fixed error in the ensembl.py
ypriverol 7fba76a
small refactoring of the file structures
ypriverol ab5dfd4
bug fixed in ensembl default values
ypriverol 3b80353
move header from var -> var_
ypriverol ee18274
fix error with accepted values in mutations.
ypriverol 579d28a
fix errors with release- prefix and validation of taxonomies
ypriverol 0bdf661
fix errors with release- prefix and validation of taxonomies
ypriverol 443f14f
updated ReadMe adding the new commands
husensofteng bfb6e84
Merge pull request #64 from bigbio/dev
husensofteng 611dd89
fixed bug in gtf file name, just keep version not release string
husensofteng 32f8b66
Merge pull request #69 from bigbio/master
ypriverol 2390235
Updates version info for
ypriverol 2b4c34b
var_prefix -> protein_prefix
ypriverol 3cb4c9f
Merge pull request #68 from bigbio/dev
husensofteng 317631a
add spectrumAI
DongdongdongW 2c316fd
fix and add test files
DongdongdongW cc5816d
ci test
DongdongdongW fcf48ca
fix matplotlib version
DongdongdongW cce6855
fix pyopenms version
DongdongdongW e689c49
fix
DongdongdongW 73ecf19
Revert "fix"
DongdongdongW b1b1d4c
Update pythonpublish.yml
ypriverol c1029d8
update
DongdongdongW 06a044a
Update requirements.txt
DongdongdongW ba78bdb
Create __init__.py
DongdongdongW fee9a00
Update setup.py
DongdongdongW 0d9b9e3
add mztab class-fdr
DongdongdongW 6d65d0c
Update mzTab_class_fdr.py
DongdongdongW 1101a1c
update test
DongdongdongW 6654dac
fix Codacy Static Code Analysis
DongdongdongW c6775f9
fix
DongdongdongW 8d018e1
fix
DongdongdongW d3cf2a0
fix
DongdongdongW 07fbebb
add blast
DongdongdongW d3d45e8
update biopython version
DongdongdongW af47ce2
add test file
DongdongdongW b0e8d22
Update pypgatk_tests.py
DongdongdongW f6d269d
Merge pull request #70 from DongdongdongW/spectrumAI
ypriverol a0d35b0
Fix the bug of class-fdr, adjust the blast to multi-process running t…
DongdongdongW 99342ef
update pathos version
DongdongdongW bdb0d9d
update test command
DongdongdongW c334b18
Update pythonpackage.yml
DongdongdongW 687bfab
Update pythonapp.yml
DongdongdongW dbac310
Update mzTab_class_fdr.py
DongdongdongW f333df6
Merge pull request #71 from DongdongdongW/spectrumAI
ypriverol 77b5429
small changes in the name of the classes.
ypriverol 5907cd6
minor changes in the naming
ypriverol 9058a9c
minor refinements in the code.
ypriverol 02bd95a
fixes issue #72
0f95bc4
Merge pull request #73 from bigbio/issue72
ypriverol 228ddb8
update blast
DongdongdongW 0154dbb
update spectrumAI
DongdongdongW 3dd6869
Merge branch 'spectrumAI' of https://github.com/DongdongdongW/py-pgat…
DongdongdongW 92384f3
update info
DongdongdongW 1a8dda3
update
DongdongdongW 3fcf781
update
DongdongdongW eead330
removing versions of dependencies
ypriverol 6108524
changes on cbioportal download system.
ypriverol b7dfe7d
Merge branch 'spectrumAI' of https://github.com/DongdongdongW/py-pgat…
ypriverol cf00173
Merge branch 'master' of https://github.com/bigbio/py-pgatk into spec…
ypriverol b32f630
remove unnecesary code
ypriverol 3792ad2
remove unnecesary code
ypriverol c72ff20
skip to tests not working with pytests pooling.
ypriverol b3ec7d9
skip to tests not working with pytests pooling.
ypriverol b07d14c
skip to tests not working with pytests pooling.
ypriverol 98bee47
skip to tests not working with pytests pooling.
ypriverol 3596398
skip to tests not working with pytests pooling.
ypriverol e9ebe47
skip to tests not working with pytests pooling.
ypriverol 54db12e
skip to tests not working with pytests pooling.
ypriverol 8eea6dc
skip to tests not working with pytests pooling.
ypriverol 8cad583
skip to tests not working with pytests pooling.
ypriverol 95c3354
minor changes in code.
ypriverol d5896bd
minor changes in code.
ypriverol 4584a04
minor changes in code.
ypriverol f0023f2
minor changes in code.
ypriverol ec4c896
minor changes in code.
ypriverol 3cef9ea
Merge pull request #77 from bigbio/spectrumAI
ypriverol c0ae4e4
small changes in blast -> USI accessions
ypriverol 0df4a00
small changes in blast -> USI accessions
ypriverol 279a94b
small changes in blast -> USI accessions
ypriverol 9750bd4
Updates version info for
ypriverol adaa875
Merge branch 'master' of https://github.com/bigbio/py-pgatk into spec…
ypriverol 01026c0
update new blast and spectrumAI
DongdongdongW 8d673eb
Merge pull request #79 from DongdongdongW/spectrumAI
ypriverol fbf8a30
validate_peptides.py -> spectrumai.py
ypriverol da0f0fc
validate_peptides.py -> spectrumai.py
ypriverol efffc6d
validate_peptides.py -> spectrumai.py
ypriverol cfb7d02
Merge pull request #80 from bigbio/spectrumAI
ypriverol 789dc87
disable cbioportal download test
ypriverol d4df92e
disable cbioportal download test
ypriverol a367755
disable cbioportal download test
ypriverol b98d59b
disable cbioportal download test
ypriverol b2951e8
Merge branch 'master' of https://github.com/bigbio/py-pgatk into spec…
ypriverol 2a1b0d2
Merge remote-tracking branch 'origin/dev' into dev
ypriverol 5339047
disable cbioportal download test
ypriverol 4d7765c
Merge pull request #82 from bigbio/dev
ypriverol 4cb65af
disable cbioportal download test
ypriverol 89d4816
Merge pull request #83 from bigbio/dev
ypriverol 5c91508
minor change in gitignore
ypriverol e737f0f
major changes to structure
ypriverol e9c28f9
minor changes
ypriverol 6cdce09
support for major versions
ypriverol 534a3e9
minor changes
ypriverol 6b7a5f2
minor changes
ypriverol 66fe927
minor changes
ypriverol e8e3311
minor chhanges for code review
ypriverol 07ffbd2
minor changes
ypriverol ec32831
pinned versions
ypriverol 7c42e06
minor changes
ypriverol 96ac6cc
minonr changes
ypriverol 27ced62
Minro changes
ypriverol 7157769
Remove large Ensembl reference files from tracking
ypriverol e4f4586
Updates version info for
ypriverol 6b7063f
refactor: rename pypgatk to pgatk
ypriverol daf560c
ci: exclude docs stylesheets from Codacy analysis
ypriverol 46cc5fe
refactor: remove unused mzTab class FDR feature
ypriverol b6af1dc
refactor: add config registry for auto-loading bundled defaults
ypriverol bcf5e43
fix: resolve multiple bugs across codebase
ypriverol 656b086
refactor: remove dead code and improve logging
ypriverol bcae956
refactor: remove OpenMS FDR computation and peptide-class-fdr command
ypriverol 008e471
Merge branch 'dev' of https://github.com/bigbio/py-pgatk into refacto…
ypriverol 0a2c376
merge: integrate master with PRs #86, #87, #88 into refactor/rename
ypriverol 29cd2f9
chore: remove large Ensembl test data files from tracking
ypriverol d54c0a4
fix: replace deprecated Bio.pairwise2 with sliding window in blast
ypriverol ed534eb
refactor: rename pypgatk_tests.py to pgatk_tests.py
ypriverol 9840614
fix(#67): verify downloaded files exist and are non-empty in tests
ypriverol 4bd8e43
style: remove unnecessary lambda in blast_get_position
ypriverol 576f190
fix: resolve multiple bugs from code review
ypriverol cf17786
docs: add graph engine design and Phase 0 implementation plan
ypriverol a97c42c
test: add conftest.py with testdata/config path fixtures
ypriverol b141af8
fix: rename shadowing variable decoy_sequence in print_target_decoy_c…
ypriverol c1fd195
refactor: convert SNP to dataclass, rename .type to .mutation_type
ypriverol b2a06df
refactor: replace pathos with concurrent.futures in blast_get_position
ypriverol ae5272a
refactor: replace pathos with concurrent.futures in spectrumai, remov…
ypriverol 4ee6268
refactor: replace print() with logger across all production modules
ypriverol f7b961b
refactor: replace broad except Exception with specific exception types
ypriverol 290c382
refactor: add get_config_value helper, DRY up config fallback pattern
ypriverol f5d952c
refactor: convert digest_mutant_protein from standalone script to Cli…
ypriverol 479f210
refactor: convert map_peptide2genome from standalone script to Click …
ypriverol 4ce76c2
refactor: use pathlib.Path for file path operations
ypriverol 38e0a93
refactor: add type hints to all public API methods
ypriverol 37ff86b
test: add unit tests for get_altseq, get_mut_pro_seq, and revswitch
ypriverol fdde235
fix: resolve Codacy static analysis warnings
ypriverol 5848290
fix: prefix unused tuple variables with underscore in tests
ypriverol 234194b
ci: exclude test files from Codacy static analysis
ypriverol db034fc
Minor changes
ypriverol e84d16d
Minor change
ypriverol e20f8c8
Merge pull request #89 from bigbio/refactor/rename
ypriverol 61020a2
docs: add ClinVar/NCBI RefSeq support design document
ypriverol 3242516
docs: add ClinVar/NCBI implementation plan (7 tasks, TDD)
ypriverol 8cf5040
refactor: extract shared VCF utility functions to toolbox/vcf_utils.py
ypriverol be2195e
feat: add ChromosomeMapper for NC_/numeric/UCSC chromosome name conve…
ypriverol 136d70a
feat: add ClinVar config with CLNSIG filtering and NCBI FTP URLs
ypriverol 343619e
feat: add ClinVarService pipeline for VCF-to-protein generation
ypriverol 366349b
fix: handle multi-allelic ALT and add CLNSIG metadata to FASTA headers
ypriverol b9416e5
feat: add NcbiDataDownloader for RefSeq/ClinVar file downloads
ypriverol 3e5e4eb
feat: add clinvar-to-proteindb and ncbi-downloader CLI commands
ypriverol 409d2b7
test: add end-to-end CLI integration tests for ClinVar pipeline
ypriverol e23deca
docs: add ClinVar pipeline hardening implementation plan
ypriverol f8860ea
feat(clinvar): activate MC consequence filtering from config
ypriverol 989c463
refactor(clinvar): single-pass VCF reading — build overlap map from D…
ypriverol 7debf23
feat(clinvar): add biotype filtering from GTF gene_biotype attribute
ypriverol 335cbeb
fix(clinvar): add duplicate variant-transcript guard, clean up _read_vcf
ypriverol 3d4eb7c
test(clinvar): add integration tests for MC filtering and output vali…
ypriverol 0b07b70
minor changes
ypriverol 6db35d9
Merge branch 'dev' of https://github.com/bigbio/py-pgatk into refacto…
ypriverol 90c5c22
clean scripts
ypriverol 9daa268
minor changes
ypriverol d9c00f5
Merge pull request #91 from bigbio/refactor/rename
ypriverol File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,5 @@ | ||
| --- | ||
| exclude_paths: | ||
| - "docs/stylesheets/**" | ||
| - "docs/plans/**" | ||
| - "pgatk/tests/**" |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,43 @@ | ||
| name: Deploy Documentation | ||
|
|
||
| on: | ||
| push: | ||
| branches: ["master"] | ||
| workflow_dispatch: | ||
|
|
||
| permissions: | ||
| contents: write | ||
|
|
||
| jobs: | ||
| deploy-docs: | ||
| runs-on: ubuntu-latest | ||
| steps: | ||
| - uses: actions/checkout@v4 | ||
| with: | ||
| fetch-depth: 0 | ||
|
|
||
| - name: Set up Python 3.10 | ||
| uses: actions/setup-python@v5 | ||
| with: | ||
| python-version: "3.10" | ||
|
|
||
| - name: Cache pip dependencies | ||
| uses: actions/cache@v4 | ||
| with: | ||
| path: ~/.cache/pip | ||
| key: ${{ runner.os }}-pip-mkdocs | ||
| restore-keys: | | ||
| ${{ runner.os }}-pip- | ||
|
|
||
| - name: Install MkDocs and dependencies | ||
| run: | | ||
| python -m pip install --upgrade pip | ||
| python -m pip install mkdocs-material | ||
|
|
||
| - name: Build MkDocs documentation | ||
| run: | | ||
| mkdocs build --strict | ||
|
|
||
| - name: Deploy to GitHub Pages | ||
| run: | | ||
| mkdocs gh-deploy --force --message "docs: deploy documentation [skip ci]" |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,5 +1,5 @@ | ||
| include *.md | ||
| include *.txt | ||
| include *.yaml | ||
| recursive-include pypgatk *.json | ||
| recursive-include pypgatk *.yaml | ||
| recursive-include pgatk *.json | ||
| recursive-include pgatk *.yaml |
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do not disable Git SSL verification during image build.
http.sslVerify falseweakens transport security for source fetches and increases supply-chain risk.🔒 Proposed fix
-RUN git config --global http.sslVerify false RUN git clone --depth 1 https://github.com/bigbio/pgatk.git /tool/source📝 Committable suggestion
🤖 Prompt for AI Agents