Sequence Comparison Analysis

Sequence Comparison Analysis

Insthoviricetes

Pair Frequencies
1. ii (Identical Pairs): Base pairs in DNA that are the same when two sequences are compared. They indicate the degree of similarity or genetic relationship between the sequences.
2. si (Transitional Pairs): Substitutions between purines (A and G) or between pyrimidines (C and T) in DNA. These mutations are more common because of the chemical structure similarities between purines and between pyrimidines.
3. sv (Transversional Pairs): Substitutions between a purine and a pyrimidine (e.g., A to C or T, G to C or T) in DNA. These are less common due to the more significant chemical structure changes involved.
4. R (si/sv): The ratio of transitional to transversional pairs. This ratio helps in understanding the predominance of mutation types in DNA and can indicate evolutionary pressures or repair mechanisms.
Polymorphic Sites
1. Conserved Sites: DNA sequence positions that remain unchanged across different individuals or species, indicating evolutionary importance and functional significance.
2. Variable Site: Positions in DNA sequences where variations are observed when comparing different individuals or species, used to study genetic diversity.
3. Parsim-Info Sites: Sites that provide information for the least number of evolutionary changes, useful in phylogenetic construction for simplifying evolutionary assumptions.
4. Singleton Sites: Positions in a DNA sequence that show a mutation in only one individual within a population, potentially indicating recent mutation events.
5. CpG Sites: Specific DNA regions where a cytosine is followed by a guanine, often involved in gene regulation through methylation processes.
Genetic Distance
"Others" refers to all other genera within Insthoviricetes, excluding Alphainfluenzavirus, Betainfluenzavirus, Gammainfluenzavirus, and Deltainfluenzavirus.