Skip to content
deena-b edited this page Jun 27, 2019 · 4 revisions

Determine cell lineage relationships w/ machine learning

Aim: use machine learning to create a dendogram that shows mitochondria sequence similarity between cells

Data

Input: 1 fasta file per cell

Methods

Calculate a sequence similarity matrix

  • how?

Perform heirarchical clustering

Generate a cluster heatmap

  • seaborn (See Serge's ipynb, link above)

Google these terms or read these pages to find out more about methods we should consider

Download Practice Fasta Files

  • Flu genes are short and there are tons of them. For an example on how to download ~100 fasta files that are ~1,500 nucleotides (nt) long, see our tutorial "dwnld flu fa"

Clone this wiki locally