Identification of physicochemical selective pressure on protein encoding nucleotide sequences
Research output: Contribution to journal › Journal article › Research › peer-review
Standard
Identification of physicochemical selective pressure on protein encoding nucleotide sequences. / Wong, Wendy S. W.; Sainudiin, Raazesh; Nielsen, Rasmus.
In: BMC Bioinformatics, Vol. 7, 148, 2006.Research output: Contribution to journal › Journal article › Research › peer-review
Harvard
APA
Vancouver
Author
Bibtex
}
RIS
TY - JOUR
T1 - Identification of physicochemical selective pressure on protein encoding nucleotide sequences
AU - Wong, Wendy S. W.
AU - Sainudiin, Raazesh
AU - Nielsen, Rasmus
PY - 2006
Y1 - 2006
N2 - Background: Statistical methods for identifying positively selected sites in protein coding regions are one of the most commonly used tools in evolutionary bioinformatics. However, they have been limited by not taking the physiochemical properties of amino acids into account. Results: We develop a new codon-based likelihood model for detecting site-specific selection pressures acting on specific physicochemical properties. Nonsynonymous substitutions are divided into substitutions that differ with respect to the physicochemical properties of interest, and those that do not. The substitution rates of these two types of changes, relative to the synonymous substitution rate, are then described by two parameters, γ and ω respectively. The new model allows us to perform likelihood ratio tests for positive selection acting on specific physicochemical properties of interest. The new method is first used to analyze simulated data and is shown to have good power and accuracy in detecting physicochemical selective pressure. We then re-analyze data from the class-I alleles of the human Major Histocompatibility Complex (MHC) and from the abalone sperm lysine. Conclusion: Our new method allows a more flexible framework to identify selection pressure on particular physicochemical properties.
AB - Background: Statistical methods for identifying positively selected sites in protein coding regions are one of the most commonly used tools in evolutionary bioinformatics. However, they have been limited by not taking the physiochemical properties of amino acids into account. Results: We develop a new codon-based likelihood model for detecting site-specific selection pressures acting on specific physicochemical properties. Nonsynonymous substitutions are divided into substitutions that differ with respect to the physicochemical properties of interest, and those that do not. The substitution rates of these two types of changes, relative to the synonymous substitution rate, are then described by two parameters, γ and ω respectively. The new model allows us to perform likelihood ratio tests for positive selection acting on specific physicochemical properties of interest. The new method is first used to analyze simulated data and is shown to have good power and accuracy in detecting physicochemical selective pressure. We then re-analyze data from the class-I alleles of the human Major Histocompatibility Complex (MHC) and from the abalone sperm lysine. Conclusion: Our new method allows a more flexible framework to identify selection pressure on particular physicochemical properties.
U2 - 10.1186/1471-2105-7-148
DO - 10.1186/1471-2105-7-148
M3 - Journal article
C2 - 16542458
AN - SCOPUS:33645649132
VL - 7
JO - B M C Bioinformatics
JF - B M C Bioinformatics
SN - 1471-2105
M1 - 148
ER -
ID: 222644168