Tumor Classification with RNA-Seq Data Set

This dataset is from the UCI Machine Learning repository. This collection of data is part of the RNA-Seq (HiSeq) PANCAN data set, it is a random extraction of gene expressions of patients having different types of tumor: BRCA, KIRC, COAD, LUAD and PRAD.

The datast is divided in data.csv and labels.csv. We need both to correctly analyse the dataset.

Our goal is to preprocess the large dataset using 3 different dimensionality reduction models: PCA, TSNE, UMAP and then apply different classification models on the reduced data.

Dataset: https://archive.ics.uci.edu/dataset/401/gene+expression+cancer+rna+seq

Contributors: https://github.com/EmiljaB https://github.com/kleagjoshi https://github.com/sindiziu1

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Data_Mining		Data_Mining
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Tumor Classification with RNA-Seq Data Set

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Tumor Classification with RNA-Seq Data Set

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages