A thesis project conducted in 2023/2024 at Pluskal lab, IOCB Prague. The project aimed to perform a bioinformatics analysis of characterized terpene synthases and mine putative terpene synthases from large-scale repositories.
The project is organized into three directories corresponding to different parts of the thesis:
01_tps_db_analysis- analysis of characterized terpene synthases02_mining- mining of putative terpene synthases03_mining_analysis- analysis of the mining results
Each directory contains a README.md with overview of the directory, and a Snakefile pipeline for file processing and creation.
The analysis directories also include notebooks directory containing various analysis notebooks.
conda and python packages that were used can be found in packages.txt.