Global analysis of DNA methylation patterns in non-small cell carcinoma. (A) Correlation between experimental replicates. Each point represents the raw methylation signal (mean coverage in 10 bp bins). Density of points (log10 scale) is shown in different shades of blue. Pearson’s correlation coefficient is denoted in the top left corner of each scatter plot. (B) Correlation between tumor (x axis) and paired lung tissue sample (y axis). (C) Raw reads from a lung sample (above panel) and its matching tumor sample (middle panel) in the position chr2:176,716,200 to 176,738,910 viewed in the UCSC genome browser; normalized P-value is depicted in the bottom panel. Y-axis depicts the number of sequence reads in each region per sample. Dashed line represents the maximum number of reads (highest peak) in each sample; (D) Representation of the distribution of hypomethylated (green) and hypermethylated (red) regions across chromosomes in a tumor versus paired lung sample. (E) Composition of hypo- and hypermethylated regions in comparison to randomly sampled regions across the genome. Bars indicate the mean proportions of regions covered by each distinct genomic feature. Error bars denote standard error of the mean. Promoters were defined as regions ± 1 kb from all Ensembl transcription start site. Distinct classes of repeats were retrieved from the UCSC Table Browser (RepeatMasker table for hg18). Repeats were excluded from all subsequent features. Random regions were sampled across the genome independently seven times and each time their number was matched to the number of differentially methylated region in one sample pair. Statistical testing was done against random regions using one-tailed Student’s t-test (*P-≤0.01; **P-≤0.001; ***p≤0.0001).