Model-based clustering of DNA methylation array data: a recursive-partitioning algorithm for high-dimensional data arising as a mixture of beta distributions
E. Andrés Houseman, Brock C. Christensen, Ru‐Fang Yeh, Carmen J. Marsit, Margaret R. Karagas, Margaret Wrensch, Heather H. Nelson, Joseph L. Wiemels, Shichun Zheng, John K. Wiencke, Karl T. Kelsey
BMC Bioinformatics · 2008 · ▲ 216 citations
Abstract
BACKGROUND: Epigenetics is the study of heritable changes in gene function that cannot be explained by changes in DNA sequence. One of the most commonly studied epigenetic alterations is cytosine methylation, which is a well recognized mechanism of epigenetic gene silencing and often occurs at tumor suppressor gene loci in human cancer. Arrays are now being used to study DNA methylation at a large number of loci; for example, the Illumina GoldenGate platform assesses DNA methylation at 1505 loci associated with over 800 cancer-related genes. Model-based cluster analysis is often used to identify DNA methylation subgroups in data, but it is unclear how to cluster DNA methylation data from arrays in a scalable and reliable manner.
RESULTS: We propose a novel model-based recursive-partitioning algorithm to navigate clusters in a beta mixture model. We present simulations that show that the method is more reliable than competing nonparametric clustering approaches, and is at least as reliable as conventional mixture model methods. We also show that our proposed method is more computationally efficient than conventional mixture model approaches. We demonstrate our method on the normal tissue samples and show that the clusters are associated with tissue type as well as age.
CONCLUSION: Our proposed recursively-partitioned mixture model is an effective and computationally efficient method for clustering DNA methylation data.
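The abstract does not spell out the algorithm, so the following is only an illustrative sketch of the general idea: one two-class split of a beta mixture fitted by EM, the kind of step a recursive-partitioning scheme could apply repeatedly to each resulting class. Everything here is an assumption rather than the authors' method: the function names (`beta_logpdf`, `moment_fit`, `two_class_split`), the row-mean initialization, the independence of loci within a class, and the weighted method-of-moments update (swapped in for a full maximum-likelihood M-step to keep the example short). No stopping rule for the recursion is shown.

```python
import math
import numpy as np

# Elementwise log-gamma using only the standard library (avoids a SciPy dependency).
_lgamma = np.vectorize(math.lgamma)

def beta_logpdf(y, a, b):
    """Log density of Beta(a, b) at y, elementwise; y must lie strictly in (0, 1)."""
    return (_lgamma(a + b) - _lgamma(a) - _lgamma(b)
            + (a - 1) * np.log(y) + (b - 1) * np.log1p(-y))

def moment_fit(y, w):
    """Weighted method-of-moments beta fit, one (a, b) pair per locus (column).

    Assumption: this replaces a maximum-likelihood M-step for brevity.
    y has shape (n_samples, n_loci); w has shape (n_samples,).
    """
    w = w / w.sum()
    m = w @ y                                   # weighted mean per locus
    v = w @ (y - m) ** 2                        # weighted variance per locus
    v = np.clip(v, 1e-6, 0.99 * m * (1 - m))    # keep the implied a, b positive
    common = m * (1 - m) / v - 1
    return m * common, (1 - m) * common

def two_class_split(y, n_iter=50):
    """Split samples into two classes with a two-component beta mixture (EM).

    Returns the responsibility of class 1 for each sample and the class-1 proportion.
    Loci are treated as independent given the class (an assumption of this sketch).
    """
    y = np.clip(np.asarray(y, dtype=float), 1e-6, 1 - 1e-6)
    s = y.mean(axis=1)
    # Hypothetical initialization: samples with above-median mean lean toward class 1.
    r = np.where(s > np.median(s), 0.9, 0.1)
    for _ in range(n_iter):
        pi = r.mean()
        a1, b1 = moment_fit(y, r)
        a0, b0 = moment_fit(y, 1 - r)
        l1 = np.log(pi) + beta_logpdf(y, a1, b1).sum(axis=1)
        l0 = np.log1p(-pi) + beta_logpdf(y, a0, b0).sum(axis=1)
        # E-step: posterior probability of class 1, with the exponent clipped for stability.
        r = 1.0 / (1.0 + np.exp(np.clip(l0 - l1, -700, 700)))
    return r, float(r.mean())
```

A recursive scheme would call `two_class_split` on the full data, then again on the samples assigned to each class, stopping by some model-selection criterion that this sketch leaves out.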
Provenance
- Source: OpenAlex
- DOI: 10.1186/1471-2105-9-365
- Fetched: 2026-06-03 MST
Cite this
APA
Houseman, E.A., Christensen, B.C., Yeh, R., Marsit, C.J., Karagas, M.R., Wrensch, M., Nelson, H.H., Wiemels, J.L., Zheng, S., Wiencke, J.K., & Kelsey, K.T. (2008). Model-based clustering of DNA methylation array data: a recursive-partitioning algorithm for high-dimensional data arising as a mixture of beta distributions. BMC Bioinformatics. https://doi.org/10.1186/1471-2105-9-365
Vancouver
Houseman EA, Christensen BC, Yeh R, Marsit CJ, Karagas MR, Wrensch M, et al. Model-based clustering of DNA methylation array data: a recursive-partitioning algorithm for high-dimensional data arising as a mixture of beta distributions. BMC Bioinformatics. 2008. doi:10.1186/1471-2105-9-365.
BibTeX
@article{e2008Modelb,
title = {Model-based clustering of DNA methylation array data: a recursive-partitioning algorithm for high-dimensional data arising as a mixture of beta distributions},
author = {E. Andrés Houseman and Brock C. Christensen and Ru‐Fang Yeh and Carmen J. Marsit and Margaret R. Karagas and Margaret Wrensch and Heather H. Nelson and Joseph L. Wiemels and Shichun Zheng and John K. Wiencke and Karl T. Kelsey},
journal = {BMC Bioinformatics},
year = {2008},
doi = {10.1186/1471-2105-9-365},
}
Related findings
Bioinformatics 2012
Open access · OA
A new statistical approach to detecting differentially methylated loci for case control Illumina array methylation data
PLoS ONE 2008
Open access · CC-BY
Epigenotyping in Peripheral Blood Cell DNA and Breast Cancer Risk: A Proof of Principle Study
Journal of Nutrition 2002
Citation only
DNA Methylation and Atherosclerosis
BMC Bioinformatics 2012
Open access · CC-BY
DNA methylation arrays as surrogate measures of cell mixture distribution
Epigenetics 2011
Open access · OA
Infant growth restriction is associated with distinct patterns of DNA methylation in human placentas
Journal of the American Chemical Society 2014
Citation only