Detecting coevolving amino acid sites using Bayesian mutational mapping

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

Detecting coevolving amino acid sites using Bayesian mutational mapping. / Dimmic, Matthew W.; Hubisz, Melissa J.; Bustamente, Carlos D.; Nielsen, Rasmus.

In: Bioinformatics, Vol. 21, No. 1, 2005, p. i126-i135.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Dimmic, MW, Hubisz, MJ, Bustamente, CD & Nielsen, R 2005, 'Detecting coevolving amino acid sites using Bayesian mutational mapping', Bioinformatics, vol. 21, no. 1, pp. i126-i135. https://doi.org/10.1093/bioinformatics/bti1032

APA

Dimmic, M. W., Hubisz, M. J., Bustamente, C. D., & Nielsen, R. (2005). Detecting coevolving amino acid sites using Bayesian mutational mapping. Bioinformatics, 21(1), i126-i135. https://doi.org/10.1093/bioinformatics/bti1032

Vancouver

Dimmic MW, Hubisz MJ, Bustamente CD, Nielsen R. Detecting coevolving amino acid sites using Bayesian mutational mapping. Bioinformatics. 2005;21(1):i126-i135. https://doi.org/10.1093/bioinformatics/bti1032

Author

Dimmic, Matthew W. ; Hubisz, Melissa J. ; Bustamente, Carlos D. ; Nielsen, Rasmus. / Detecting coevolving amino acid sites using Bayesian mutational mapping. In: Bioinformatics. 2005 ; Vol. 21, No. 1. pp. i126-i135.

Bibtex

@article{23c4b3e074c311dbbee902004c4f4f50,
title = "Detecting coevolving amino acid sites using Bayesian mutational mapping",
abstract = "Motivation: The evolution of protein sequences is constrained by complex interactions between amino acid residues. Because harmful substitutions may be compensated for by other substitutions at neighboring sites, residues can coevolve. We describe a Bayesian phylogenetic approach to the detection of coevolving residues in protein families. This method, Bayesian mutational mapping (BMM), assigns mutations to the branches of the evolutionary tree stochastically, and then test statistics are calculated to determine whether a coevolutionary signal exists in the mapping. Posterior predictive P-values provide an estimate of significance, and specificity is maintained by integrating over uncertainty in the estimation of the tree topology, branch lengths and substitution rates. A coevolutionary Markov model for codon substitution is also described, and this model is used as the basis of several test statistics. Results: Results on simulated coevolutionary data indicate that the BMM method can successfully detect nearly all coevolving sites when the model has been correctly specified, and that non-parametric statistics such as mutual information are generally less powerful than parametric statistics. On a dataset of eukaryotic proteins from the phosphoglycerate kinase (PGK) family, interdomain site contacts yield a significantly greater coevolutionary signal than interdomain non-contacts, an indication that the method provides information about interacting sites. Failure to account for the heterogeneity in rates across sites in PGK resulted in a less discriminating test, yielding a marked increase in the number of reported positives at both contact and non-contact sites. ",
author = "Dimmic, {Matthew W.} and Hubisz, {Melissa J.} and Bustamente, {Carlos D.} and Rasmus Nielsen",
year = "2005",
doi = "10.1093/bioinformatics/bti1032",
language = "English",
volume = "21",
pages = "i126--i135",
journal = "Computer Applications in the Biosciences",
issn = "1471-2105",
publisher = "Oxford University Press",
number = "1",

}

RIS

TY - JOUR

T1 - Detecting coevolving amino acid sites using Bayesian mutational mapping

AU - Dimmic, Matthew W.

AU - Hubisz, Melissa J.

AU - Bustamente, Carlos D.

AU - Nielsen, Rasmus

PY - 2005

Y1 - 2005

N2 - Motivation: The evolution of protein sequences is constrained by complex interactions between amino acid residues. Because harmful substitutions may be compensated for by other substitutions at neighboring sites, residues can coevolve. We describe a Bayesian phylogenetic approach to the detection of coevolving residues in protein families. This method, Bayesian mutational mapping (BMM), assigns mutations to the branches of the evolutionary tree stochastically, and then test statistics are calculated to determine whether a coevolutionary signal exists in the mapping. Posterior predictive P-values provide an estimate of significance, and specificity is maintained by integrating over uncertainty in the estimation of the tree topology, branch lengths and substitution rates. A coevolutionary Markov model for codon substitution is also described, and this model is used as the basis of several test statistics. Results: Results on simulated coevolutionary data indicate that the BMM method can successfully detect nearly all coevolving sites when the model has been correctly specified, and that non-parametric statistics such as mutual information are generally less powerful than parametric statistics. On a dataset of eukaryotic proteins from the phosphoglycerate kinase (PGK) family, interdomain site contacts yield a significantly greater coevolutionary signal than interdomain non-contacts, an indication that the method provides information about interacting sites. Failure to account for the heterogeneity in rates across sites in PGK resulted in a less discriminating test, yielding a marked increase in the number of reported positives at both contact and non-contact sites.

AB - Motivation: The evolution of protein sequences is constrained by complex interactions between amino acid residues. Because harmful substitutions may be compensated for by other substitutions at neighboring sites, residues can coevolve. We describe a Bayesian phylogenetic approach to the detection of coevolving residues in protein families. This method, Bayesian mutational mapping (BMM), assigns mutations to the branches of the evolutionary tree stochastically, and then test statistics are calculated to determine whether a coevolutionary signal exists in the mapping. Posterior predictive P-values provide an estimate of significance, and specificity is maintained by integrating over uncertainty in the estimation of the tree topology, branch lengths and substitution rates. A coevolutionary Markov model for codon substitution is also described, and this model is used as the basis of several test statistics. Results: Results on simulated coevolutionary data indicate that the BMM method can successfully detect nearly all coevolving sites when the model has been correctly specified, and that non-parametric statistics such as mutual information are generally less powerful than parametric statistics. On a dataset of eukaryotic proteins from the phosphoglycerate kinase (PGK) family, interdomain site contacts yield a significantly greater coevolutionary signal than interdomain non-contacts, an indication that the method provides information about interacting sites. Failure to account for the heterogeneity in rates across sites in PGK resulted in a less discriminating test, yielding a marked increase in the number of reported positives at both contact and non-contact sites.

U2 - 10.1093/bioinformatics/bti1032

DO - 10.1093/bioinformatics/bti1032

M3 - Journal article

C2 - 15961449

VL - 21

SP - i126-i135

JO - Computer Applications in the Biosciences

JF - Computer Applications in the Biosciences

SN - 1471-2105

IS - 1

ER -

ID: 87235