Skip to content

A codebase to extract a specific set of genes from bacteria gene sequences and construct phylogenetic trees to analyse the evolution of bacteria using them.

Notifications You must be signed in to change notification settings

IsuruBoyagane15/phylo

Repository files navigation

CS4742 - Bioinformatics

Assignment - Phylogenetic Trees

Group - LabRats

protein_set = {

  • site-specific DNA-methyltransferase,
  • LysR family transcriptional regulator,
  • helix-turn-helix domain-containing protein,
  • efflux transporter outer membrane subunit
    }

STEP 1 - Get the set of bactria species which have all 4 proteins in protein_set

STEP 2 - Download the gene sequence of species in common_bacteria_set

STEP 3 - Extract gene sequence of each protein for each species and write them to homologous_gene_sequences

STEP 4 - Build trees

About

A codebase to extract a specific set of genes from bacteria gene sequences and construct phylogenetic trees to analyse the evolution of bacteria using them.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages