Software that discovers and displays order of mutation patterns in Breast Cancer fragments
This software was created over 10 weeks in the Schatz Lab at Cold Spring Harbor Laboratory. It is a DNA analysis and visualization algorithm using Python and Graphviz software. In order to create this software, I created a new Graph class that implemented nodes with start and end “ports” which better represents DNA fragments. The program is able to reveal a clear and detailed representation of the mutation process of a convoluted area in the HER2+ breast cancer genome by showing the "jumps" or structural variations as colorful paths that the user can trace with a finger. Please see abstract.pdf and ALFORD.pdf for further details on project scope and impact.
My mentor for that summer, Maria Nattestad, has recently published an open-use, sophisticated DNA mapping software, SplitThreader (http://splitthreader.com/), which uses the same graph structure that we came up with, and ideas inspired by our discussions about my project.
-
Download GraphViz,
main.py, andgraph_template.dot -
Open the terminal and navigate to the file that contains
main.pyandgraph_template.dot -
Type into terminal:
python main.py <filename> -
A .png file with the filename specified will open revealing the DNA mutation mapping of the Her2 region.
You will also see a legend in the terminal describing the paths.
This indicates that path 5, for example, can be attained by tracing along the green lines from chromosome fragment Y to chromosome fragment B17 to chromosome fragment Z, and that the minimum copy number along that path is 121.

