Project for Jack Ringer as part of the CFDE 2025 Internship @UNM.
Although this project has been used to compare the computational similarities of ligands active within different protein kinase groups, the Snakemake workflow could be adapted to explore other protein families or to assist with other projects using ChEMBL data.
This work investigates whether there is a relationship between the 2D computational similarity of ligands and their activity within specific human protein kinase groups. Using data from ChEMBL binding assays, the distribution of pairwise Tanimoto similarity coefficients computed between all protein kinase ligands was compared against the distribution of pairwise similarity values computed between ligands active within a specific protein kinase group. Except for the CK1 group, no significant group-specific differences were found. These results suggest that 2D similarity metrics have limited utility for identifying ligand selectivity across most protein kinase groups. However, given the many confounders that exist when performing large-scale computational analyses of ChEMBL bioassay data, these results are not definitive; limitations and potential follow-ups are discussed in detail.
For more information see the project report and poster.
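The comparison described above can be illustrated with a minimal sketch. It is not the project's implementation: fingerprints are represented here as plain sets of on-bit indices, the ligand names, bit values, and the `TK` label are toy data, and the helper names `tanimoto` and `pairwise_similarities` are hypothetical. The sketch only shows how an "all ligands" similarity distribution and a "within one group" distribution are built from the same pairwise measure.

```python
from itertools import combinations
from statistics import mean


def tanimoto(a: frozenset, b: frozenset) -> float:
    """Tanimoto coefficient |A & B| / |A | B| for two sets of on-bit indices."""
    union = len(a | b)
    if union == 0:
        return 0.0  # convention chosen for this sketch when both fingerprints are empty
    return len(a & b) / union


def pairwise_similarities(fingerprints):
    """Similarity of every unordered pair of fingerprints."""
    return [tanimoto(x, y) for x, y in combinations(fingerprints, 2)]


# Toy data (hypothetical): ligand id -> (fingerprint bits, kinase group label)
ligands = {
    "lig1": (frozenset({1, 2, 3, 4}), "CK1"),
    "lig2": (frozenset({1, 2, 3, 5}), "CK1"),
    "lig3": (frozenset({7, 8, 9}), "TK"),
    "lig4": (frozenset({2, 8, 9, 10}), "TK"),
}

# Background distribution: all ligand pairs, regardless of group
all_sims = pairwise_similarities([fp for fp, _ in ligands.values()])

# Group-specific distribution: only pairs of ligands active in one group
ck1_sims = pairwise_similarities(
    [fp for fp, group in ligands.values() if group == "CK1"]
)

print("mean similarity, all pairs:", mean(all_sims))
print("mean similarity, CK1 pairs:", mean(ck1_sims))
```

In a real analysis the two lists would be far larger and would be compared with a statistical test rather than by their means alone; the choice of fingerprint and test is left open here.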
- Set up PostgreSQL on your system by following these instructions
- Install Conda environment: `conda env create -f environment.yaml`
- Modify `config.yaml` to your desired params
- (Optional): Run `export PGPASSWORD=<your_password>` - avoids password prompts during the DB build process
- If you don't have it installed already, set up the ChEMBL PostgreSQL DB: `snakemake --snakefile Snakefile_DB_Setup --cores 1`
- Run main Snakemake workflow: `snakemake --cores 1`
  - If prompted for `Password:`, enter the `<password>` of your SUPERUSER
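For repeated runs, the two Snakemake steps above could be wrapped in a small script. The following is a sketch only: `build_steps` and `run_steps` are hypothetical helper names, and it assumes the Conda environment has already been created and activated and that `config.yaml` has been edited. The commands themselves are the ones listed above.

```python
import os
import subprocess


def build_steps(db_installed: bool) -> list[list[str]]:
    """Return the Snakemake commands to run, in order."""
    steps = []
    if not db_installed:
        # One-time setup of the ChEMBL PostgreSQL DB
        steps.append(["snakemake", "--snakefile", "Snakefile_DB_Setup", "--cores", "1"])
    # Main workflow
    steps.append(["snakemake", "--cores", "1"])
    return steps


def run_steps(steps, dry_run: bool = True) -> None:
    """Print each command; execute it only when dry_run is False."""
    if "PGPASSWORD" not in os.environ:
        print("note: PGPASSWORD is not set, so password prompts may appear")
    for cmd in steps:
        print(" ".join(cmd))
        if not dry_run:
            # check=True stops at the first failing step
            subprocess.run(cmd, check=True)


# Dry run by default: only prints the commands that would be executed
run_steps(build_steps(db_installed=False))
```

Calling `run_steps(..., dry_run=False)` would actually launch the workflow; the dry-run default is a deliberate choice so the script can be inspected safely first.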
Output of `snakemake --forceall --rulegraph | dot -Tpng > rulegraph.png`:
Note: This workflow has been tested with PostgreSQL version psql (PostgreSQL) 16.9 (Ubuntu 16.9-0ubuntu0.24.04.1)
If you do not already have PostgreSQL installed, you can follow the instructions here.
After installing PostgreSQL, you need to make your user a superuser prior to DB setup:
- Switch to postgres user: `sudo -u postgres psql`
- Make yourself a superuser: `CREATE ROLE "<username>" WITH SUPERUSER PASSWORD '<password>';`
- Enable login: `ALTER ROLE "<username>" WITH LOGIN;`

