- Poster presentation
- Open Access
Comparison of multiplexed reduced representation bisulfite sequencing (mRRBS) with the 450K Illumina Human BeadChip: from concordance to practical applications for methylomic profiling in epigenetic epidemiologic studies
Epigenetics & Chromatinvolume 6, Article number: P36 (2013)
Reduced representation bisulfite sequencing (RRBS) is an efficient approach for large-scale base-pair resolution DNA methylation analysis. RRBS utilizes Mspl digestion to enrich for CpG dinucleotides prior to sequencing. RRBS works with considerably lower amounts of DNA compared to the human 450K BeadChip Illumina microarray. Previous studies have compared these two methodologies with good concordance on a relatively small set of overlapping sites. Boyle and colleagues recently proposed a variant of RRBS which is gel-free and high-throughput, allowing for the simultaneous processing of multiple samples . Given the potential of mRRBS to characterize the methylome in larger epidemiology studies, there is a need to compare the performance of the multiplexed RRBS (mRRBS) with the 450K BeadChip, especially in terms of reproducibility of the results and genomic coverage.
Materials and methods
We have compared mRRBS with the 450K BeadChip using buffy coat genomic DNA extracted from 24 samples from 12 males in an existing cohort. Additionally, 6 of the 24 samples were replicated in the mRRBS study as analytic duplicates. A further 12 samples were sequenced again at a higher cluster density. Sequencing with 75bp single-end reads used both 6 or 12 sample pools on the Illumina HiSeq 2000. Post-processing included read trimming with Trim Galore, alignment using Bismark, and merging of informative reads from both strands for CpGs. Data from the 450k beadchip were normalized using a recent comprehensive pipeline .
Among 42 samples sequenced, 28 had more than 5M reads and after alignment of >71% of trimmed reads, these samples had a median of 1.3M CpGs at ≥10x depth (300K to 2.5M CpGs). Samples <5 million reads were related to particular Illumina sequencing adapters and position on the library preparation plate. Sequencing at a higher cluster density yielded ~300K extra CpGs at >10x depth. To represent a population, we took the best passing sample (>5M reads, n=11) from each individual. There were 160K shared sites among all 11 samples at ≥10x depth with 1M found in 8 of 11. Between 24K and 124K sites per sample overlapped with Illumina 450K sites. Pearson correlation coefficients between RRBS %methylation (≥10x depth) and quantile normalized 450K beta values for these 11 samples ranged from 0.92 to 0.95.
Given the observed differences in reads by library position and adapter ligation efficiency, a more even distribution of reads per sample may be achieved by screening adapters and concentration matching prior to pooling samples. These results support the use of mRRBS for methylomics in epigenetic epidemiologic studies, and further investigation of sample quality and measurement variability is ongoing.
Boyle P, Clement K, Gu H, Smith ZD, Ziller M, Fostel JL, Holmes L, Meldrim J, Kelley F, Gnirke A, Meissner A: Gel-free multiplexed reduced representation bisulfite sequencing for large-scale DNA methylation profiling. Genome Biol. 2012, 13: R92-10.1186/gb-2012-13-10-r92.
Touleimat N, Tost J: Complete pipeline for Infinium((R)) Human Methylation 450K BeadChip data processing using subset quantile normalization for accurate DNA methylation estimation. Epigenomics. 2012, 4: 325-341. 10.2217/epi.12.21.