Skip to content

lauramago/Improving_PDB_visualizations

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

269 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

README


2022_group12_final_project


This it the GitHub repository for the final project of the R for Bio Data Science course at DTU. The project was done by:

  • Deeptha Sri - s210230
  • Eric Bautista - s212514
  • Jonathan Funk - s212697
  • Laura Machado - s212775

Introduction


The objective of our final project was to analyze the meta-data of the RCSB protein data bank

The data was analyzed using5 R tidyverse](https://www.tidyverse.org/). The project structure is inspired by the Josh Reich’s Load-clean-func-do-thought and this 2009 paper by William Stafford Noble.

Data


We combined data from different sources for our analysis, namely:

which were accessed on the 03/05/2022.

Data analysis pieline


The data was processed and analyzed based on a flowchart below:

![](doc/img/R_flowchart.jpg){width=75%}

Results


We recreated pie charts which are on the RCSB and visalized the data as bar plots. some of the categories which were chosen by creators of the plots were changed during this project based on our preferences:

{width=45%} {width=45%} {width=45%} {width=45%} {width=45%} {width=45%} {width=40%} {width=40%}

About

This it the GitHub repository for the project of Group 12

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • R 100.0%