Monday, November 5, 2012

Tree Distance Heatmaps

This is an attempt to find a “goodness” measure for phylogenetic trees generated by programs such as Ninja (http://nimbletwist.com/software/ninja/index.html) or RAxML (http://www.phylo.org/news/RAxML). For each pair of sequences we calculate a distance based on the structure of the tree, which we refer to as the “Tree Distance”. We then compare it against the original pairwise distances computed for the sequences from either local alignment or multiple sequence alignment. The definition of tree distance is not unique; we currently use the “Edge Sum” definition given below. “Edge Count” is another possible definition, though we have not tested it yet.
  • Edge Sum
    • Given two sequences A and B, we find the shortest path from A to B in the tree and sum the values on the edges along that path.
  • Edge Count
    • Given two sequences A and B, we find the shortest path from A to B in the tree and count the number of edges on it.
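The two definitions can be illustrated with a minimal Python sketch. This is not the code used for the analysis: the adjacency-dictionary representation, the function names (`build_adjacency`, `tree_path`, `edge_sum`, `edge_count`) and the toy edge values are all assumptions made for illustration. It relies on the fact that in a tree the path between two nodes is unique, so the "shortest path" can be found with a plain breadth-first search.

```python
from collections import deque


def build_adjacency(edges):
    """Build an undirected adjacency dict {node: {neighbor: edge_value}}."""
    adj = {}
    for u, v, value in edges:
        adj.setdefault(u, {})[v] = value
        adj.setdefault(v, {})[u] = value
    return adj


def tree_path(adj, a, b):
    """Return the nodes on the path from a to b (unique in a tree)."""
    parent = {a: None}
    queue = deque([a])
    while queue:
        node = queue.popleft()
        if node == b:
            break
        for neighbor in adj[node]:
            if neighbor not in parent:
                parent[neighbor] = node
                queue.append(neighbor)
    if b not in parent:
        raise ValueError(f"no path between {a!r} and {b!r}")
    # Walk back from b to a through the recorded parents.
    path = []
    node = b
    while node is not None:
        path.append(node)
        node = parent[node]
    return path[::-1]


def edge_sum(adj, a, b):
    """Edge Sum: sum of the edge values along the path from a to b."""
    path = tree_path(adj, a, b)
    return sum(adj[u][v] for u, v in zip(path, path[1:]))


def edge_count(adj, a, b):
    """Edge Count: number of edges on the path from a to b."""
    return len(tree_path(adj, a, b)) - 1


# Hypothetical toy tree: leaves A, B, C and internal nodes n1, n2.
toy_edges = [
    ("A", "n1", 0.10),
    ("B", "n1", 0.20),
    ("n1", "n2", 0.05),
    ("C", "n2", 0.30),
]
toy_tree = build_adjacency(toy_edges)

if __name__ == "__main__":
    print(edge_sum(toy_tree, "A", "C"), edge_count(toy_tree, "A", "C"))
```

In the toy tree, the path from A to C runs through n1 and n2, so its Edge Count is 3 and its Edge Sum is the total of the three edge values on that path. Running a fresh search for every pair is fine for a sketch; for thousands of sequences, one traversal per leaf would be the more natural choice.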
We have performed this analysis for two datasets.
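The heatmaps themselves were produced with a separate plotting tool whose details are not given here. As a rough sketch of the underlying idea only, the snippet below bins (x, y) distance pairs into a square grid of counts, which is one common way to turn two pairwise distance lists into a density heatmap. The function name `density_grid`, the bin count and the [0, 1] range are assumptions for illustration, not the settings used for the plots.

```python
def density_grid(xs, ys, bins=4, lo=0.0, hi=1.0):
    """Count (x, y) pairs per cell of a bins x bins grid over [lo, hi]^2.

    grid[row][col] holds the number of pairs whose y value falls in bin
    `row` and whose x value falls in bin `col`. Pairs outside the range
    are skipped.
    """
    grid = [[0] * bins for _ in range(bins)]
    width = (hi - lo) / bins
    for x, y in zip(xs, ys):
        if not (lo <= x <= hi and lo <= y <= hi):
            continue
        # Clamp so that a value exactly equal to `hi` lands in the last bin.
        col = min(int((x - lo) / width), bins - 1)
        row = min(int((y - lo) / width), bins - 1)
        grid[row][col] += 1
    return grid


if __name__ == "__main__":
    # Hypothetical values: x = original distance, y = tree distance.
    original = [0.1, 0.1, 0.9]
    tree = [0.1, 0.2, 0.9]
    for row in density_grid(original, tree, bins=2):
        print(row)
```

If the tree distance tracked the original distance perfectly, all counts would fall on the diagonal of the grid; spread away from the diagonal indicates pairs for which the tree distorts the original distance.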

Heatmaps for Fungi 200

3D DA-SMACOF of SWG PID Vs SWG PID
[Heatmap image: 3.)whole-plot-DASMACOF-SWG-PID-Vs-SWG-PIDDensitySat[0.85]-large]

Edge Sum Ninja from SWG PID Vs 3D DA-SMACOF of SWG PID
[Heatmap image: 1.)whole-plot-Ninja-Edgesum-Vs-DASMACOF-SWG-PIDDensitySat[0.85]-large]

Edge Sum Ninja from SWG PID Vs SWG PID
[Heatmap image: 2.)whole-plot-Ninja-Edgesum-Vs-SWG-PIDDensitySat[0.85]-large]

Heatmaps of Fungi 2133

3D DA-SMACOF of SWG PID Vs SWG PID
[Heatmap image: whole-plot-DASMACOF-SWG-PID-Vs-SWG-PIDDensitySat[0.85]-large]

Edge Sum Ninja from 3D DA-SMACOF of SWG PID Vs 3D DA-SMACOF of SWG PID
[Heatmap image: whole-plot-Ninja-Edgesum-Vs-DASMACOF-SWG-PIDDensitySat[0.85]-large]

Edge Sum Ninja from 3D DA-SMACOF of SWG PID Vs SWG PID
[Heatmap image: whole-plot-Ninja-Edgesum-Vs-SWG-PIDDensitySat[0.85]-large]
   
   
   
Edge Sum RAxML (20 iterations) from ClustalO MSA Vs SWG PID
[Heatmap image: whole-plot-Raxml(20)ClustlO-Edgesum-Vs-SWG-PIDDensitySat[0.85]-large-1]

Edge Sum RAxML (20 iterations) from ClustalO MSA (distances from 0 to 0.15) Vs SWG PID
[Heatmap image: whole-plot-Raxml(20)ClustlO-Edgesum-Vs-SWG-PIDDensitySat[0.85]-large-2]
   
   
   
Edge Sum RAxML (20 iterations) from ClustalO MSA Vs Strict PIDNG
[Heatmap image: whole-plot-Raxml(20)ClustlO-Edgesum-Vs-Strict-PIDNGDensitySat[0.85]-large-1]

Edge Sum RAxML (20 iterations) from ClustalO MSA (distances from 0 to 0.15) Vs Strict PIDNG
[Heatmap image: whole-plot-Raxml(20)ClustlO-Edgesum-Vs-Strict-PIDNGDensitySat[0.85]-large-2]
