![]() |
![]() |
|||||
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
| A forest of 14 billion trees | ||
|
|
||||
|
Example of a phylogenetic tree for the 13 species studied in the Campanulaceae project. The numbers along the lines are an estimate of evolutionary distance, expressed as the expected number of evolutionary events.
|
![]() |
|||
|
Each phylogenetic reconstruction, called a tree, represents one possible history of the species. The Campanulaceae project begins with 12 modern species of bluebell and a single species of tobacco. Tobacco is used as an outgroup, a species that is clearly very distant from the others, and is used to identify the root of the tree. To predict the evolutionary history, almost 14 billion trees must be built and compared to one another. Bader, Moret and their colleague Tandy Warnow of the computer science department at the University of Texas at Austin go far beyond constructing the underlying tree and its eventual outcome, also calculating gene order for each predicted ancestor within the trees. That means a whopping 100 billion genomes must be reconstructed.
|
||||
![]() |
![]() |
Diagram of conversion from breakpoint median to the travelling salesperson problem.
|
||
|
A breakpoint occurs any time two genes are adjacent in one genome but are not adjacent in a genome to which the first is compared. An internal node's label is derived by finding the gene order that minimizes the number of breakpoints between a node and its three closest neighbors. "This is where the parsimony criterion comes in," say Moret. "We find a label that minimizes the amount of change at this place in the tree." A travelling salesperson problem solver—a common, if expensive, mathematical method of solving optimization problems—is used to find the median, calculating the hypothesized gene order data for each node.
|
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |