A consistent collection of characters is one for which a binary treeT can be constructed
whose splits match those taken from the split notations of the characters.
The split notation is a way for a character to be encoded into a split S1∣Sc1, where
S1 and Sc1 are the two disjointsets of taxa that the character separates.
On the other hand, removing an edgee from a binary tree results in the creation of two subtrees,
each containing a subset of the taxa, which also induces a split S2∣Sc2.
Thus, we can label e with the split S2∣Sc2, and so we have two ways of
generating a split: from a character and from an edge of a binary tree.
A collection of characters is called consistent if a binary tree T can be found
whose edge splits do not conflict with the splits inferred from characters.
Such a conflict will occur precisely when we have one split S1∣Sc1 inferred
from a character, another split S2∣Sc2 inferred from an edge of T, and
all four intersectionsS1∩S2, S1∩Sc2, Sc1∩S2,
Sc1∩Sc2 are nonempty.
For a simple example, consider the conflicting quartets{a,b}∣{c,d} and {a,c}∣{b,d},
which must correspond to two distinct trees on the four taxa a, b, c, and d.
Report a typo
Page:
Context:
Flag as inappropriate
Are you sure you want to flag this comment as inappropriate?
Welcome to Rosalind!
Rosalind is a platform for learning bioinformatics through problem solving.
Please login with Google/Twitter/Facebook or
register a new account.