Skip to main content

Ethnicity-specific epigenetic variation in naïve CD4+ T cells and the susceptibility to autoimmunity



Genetic and epigenetic variability contributes to the susceptibility and pathogenesis of autoimmune diseases. T cells play an important role in several autoimmune conditions, including lupus, which is more common and more severe in people of African descent. To investigate inherent epigenetic differences in T cells between ethnicities, we characterized genome-wide DNA methylation patterns in naïve CD4+ T cells in healthy African-Americans and European-Americans, and then confirmed our findings in lupus patients.


Impressive ethnicity-specific clustering of DNA methylation profiling in naïve CD4+ T cells was revealed. Hypomethylated loci in healthy African-Americans were significantly enriched in pro-apoptotic and pro-inflammatory genes. We also found hypomethylated genes in African-Americans to be disproportionately related to autoimmune diseases including lupus. We then confirmed that these genes, such as IL32, CD226, CDKN1A, and PTPRN2 were similarly hypomethylated in lupus patients of African-American compared to European-American descent. Using patch DNA methylation and luciferase reporter constructs, we showed that methylation of the IL32 promoter region reduces gene expression in vitro. Importantly, bisulfite DNA sequencing demonstrated that cis-acting genetic variants within and directly disrupting CpG sites account for some ethnicity-specific variability in DNA methylation.


Ethnicity-specific inherited epigenetic susceptibility loci in CD4+ T cells provide clues to explain differences in the susceptibility to autoimmunity and possibly other T cell-related diseases between populations.


DNA methylation is part of the epigenetic regulatory system in the cell and serves as an intermediary between an individual’s environment and their genetic background. One classic example is monozygotic twins; they share a nearly identical genotype, but display increased differences in DNA methylation and histone acetylation as they age [1]. Differences in DNA methylation also contribute to the phenotypic variation present between individuals and ethnicities. Epigenetic differences between ethnicities are an important consideration when investigating autoimmune disorders with a pattern of heterogeneous risk. Systemic lupus erythematosus (SLE, or lupus) and scleroderma both show a preponderance of risk and younger age of onset among African-Americans compared to European-Americans, while type-1 diabetes and multiple sclerosis have higher incidence rates among European-Americans [2]. Ethnicity can also be associated with disease severity and outcome. In a survey of SLE patients in southeastern Michigan, there was a 2.2-fold increase in renal involvement and 3.4-fold increase in end-stage renal disease in African-Americans compared to European-Americans [3].

Previous studies of the DNA methylation patterns in different ethnicities revealed that peripheral blood cells from non-Hispanic African-Americans display a global decrease in methylation of the retrotransposon LINE-1 compared to European-Americans [4]. DNA methylation differences seen in individuals of African-American, European-American, and Han Chinese ancestry were associated with genes related to xenobiotic metabolism and immune system function, among others [5]. These examples of epigenetic variation between ethnicities may provide answers as to what contribution inherent epigenetic differences make to the development and progression of autoimmune diseases.

DNA methylation changes in CD4+ T cells play an important role in the pathogenesis and clinical presentation of SLE, and these changes can be identified in naïve CD4+ T cells prior to T cell activation [6, 7]. Because SLE is more common and more severe in African-Americans compared to European-Americans, we hypothesized that inherent T cell DNA methylation differences might exist in African-Americans compared to European-Americans. To address this hypothesis, we evaluated the DNA methylome of naïve CD4+ T cells in healthy African-American and European-American populations, and then confirmed our findings in SLE patients.


DNA methylation differences in naïve CD4+ T cells between African-Americans and European-Americans

The Illumina Infinium HumanMethylation450 microarray interrogates the DNA methylation status of a total 485,512 probes. Differential methylation between African-American and European-American individuals was performed on 425,161 probes that remained after initial filtering. There was a clear distinction in the naïve CD4+ T cell DNA methylation profile between African-Americans and European-Americans as demonstrated in a multidimensional scaling plot of the 1000 most variable array probes after batch effect adjustment (Fig. 1). DNA methylation profile clustering can distinguish between the two ethnicities, with the exception of one European-American individual with a methylation profile that clustered with our African-American samples.

Fig. 1

Multidimensional scaling plots of the Euclidean distances calculated using the 1000 most variable CpG sites after batch effect adjustment with ComBat and color coded by ethnicity (a). b Shows a kernel density estimate distribution of all β values before (red line) and after (blue line) batch effect adjustment. Density is a unit-less value of the probability distribution of probes within a certain range of β values

Applying a threshold of |Δβ| > 0.1 and FDR-adjusted P value ≤0.05, we found 306 hypomethylated CpG sites associated with 144 genes and 375 hypermethylated CpG sites associated with 164 genes in naïve CD4+ T cells isolated from African-Americans compared to European -Americans (Additional file 1: Table S1). The most hypomethylated gene was NRGN (Δβ = −0.41; P = 1.82E−05) (Table 1). This gene encodes the protein neurogranin (Ng), which is a regulator of the calcium-binding calmodulin protein [8]. Other hypomethylated genes in African-Americans included GSTM1, IL32, and CD226.

Table 1 Top 30 most hypomethylated CpG sites in naïve CD4+ T cells from healthy African-Americans compared to European Americans

The most hypermethylated CpG site in African-Americans (Δβ = 0.39; P = 1.99E−08) was not associated with a gene. Other hypermethylated loci were associated with CCS (Δβ = 0.36; P = 4.78E−05) and DUSP22 (Δβ = 0.25; P = 3.16E−04). CCS encodes a copper chaperone for super oxide dismutase that delivers copper ions to copper-dependent enzymes [9]. DUSP22 encodes a protein tyrosine phosphatase [10].

Gene ontologies associated with apoptosis are overrepresented in hypomethylated CpG sites in African-Americans

Using the functional annotation database DAVID, we explored biological processes and cellular component gene ontologies in genes associated with hypomethylated CpG sites in healthy African-Americans. We found genes related to regulating and promoting cellular apoptosis to be enriched among hypomethylated genes with the most significant being “positive regulation of apoptosis” (GO:0043065, FDR = 8.4 %) (Table 2). Hypermethylated CpG sites in African-American naïve CD4+ T cells were found to be enriched for gene ontologies related to the regulation of muscle adaptation and contraction (GO:0043502, FDR = 4.9 %; GO:0006936, FDR = 5.2 %), and did not appear to be related to naïve CD4+ T cell activity or growth.

Table 2 Biological process and cellular component gene ontologies represented in hypomethylated CpG sites in naïve CD4+ T cells from healthy African-Americans

Hypomethylated genes in African-Americans are enriched in genes related to autoimmunity and are hypomethylated in lupus patients

As pro-apoptotic gene ontologies were associated with hypomethylated genes in African-Americans, and because autoimmune diseases such as lupus tend to be more common and more severe in African-Americans, we investigated if hypomethylated genes in naïve CD4+ T cells in African-Americans are enriched for autoimmunity-related genes. Using the literature mining software IRIDESCENT [1113], which scans all published MEDLINE abstracts for co-occurring terms (and their synonyms), we identified how many of these 144 genes had been published with the terms “autoimmune diseases” and “SLE”. We found 27/144 hypomethylated genes had been associated with “autoimmune diseases” (P < 0.005), and 18/144 associated with “SLE” (P < 0.037) within MEDLINE abstracts (Fig. 2). Each gene needed to be co-mentioned a minimum of three times with each query term to be considered “associated”. To validate that the CpG sites within the 18 lupus-related genes that are hypomethylated in healthy African-Americans compared to European-Americans are also differentially methylated between the two ethnicities in lupus, we compared DNA methylation levels across these sites between African-American and European-American lupus patients. We demonstrate that the majority of these CpG sites remain differentially methylated between the two ethnicities during disease, including multiple CpG sites in the cytokine gene IL32 promoter region (Table 3).

Fig. 2

Hypomethylated genes in African-Americans that are connected in the literature to autoimmune diseases (a) and SLE (b). Thickness of the lines is proportional to how many publications mention each of the connections depicted, and color and size of the nodes are proportional to how many connections each node has in the network (red = most, green = fewest)

Table 3 Differential DNA methylation between African-American and European-American lupus patients in CpG sites identified to be differentially hypomethylated in healthy African-Americans and located in genes connected to lupus using IRIDESCENT literature mining analysis

Gene co-regulation analysis

To further characterize differential epigenetic accessibility in naïve CD4+ T cells between African-Americans and European-Americans, we investigated the transcriptional co-regulation of the 144 hypomethylated genes in African-Americans using a meta-analysis of 3900 human gene expression microarrays [14]. Pair-wise gene–gene correlations of the 144 genes revealed three tightly co-regulated gene blocks (Fig. 3). Literature mining analysis for commonality between the genes within each block, using IRIDESCENT, revealed general relationships to IL-2 signaling, cell migration, and the glutathione S-transferase gene GSTO1, for the gene groups in blocks 1, 2 and 3, respectively (Fig. 3).

Fig. 3

Analysis of the transcriptional correlation of 144 genes hypomethylated in African-American naïve CD4+ T cells was conducted by calculating Pearson’s correlation coefficients of the reported transcriptional fold changes across 3900 human 2-color microarray data sets. Correlation values range from 1.0 (perfect positive correlation represented in red) to −1.0 (perfect negative correlation represented in green). Three blocks of transcriptionally co-regulated genes can be seen in the colored squares. Using literature mining, the highest scoring themes among the genes in the yellow, blue and purple blocks are the cytokine IL-2, cell migration, and the glutathione S-transferase gene GSTO1, respectively

Cis-acting genetic variants accounts for some population-specific methylation differences

To investigate if cis-acting genetic variants might explain at least some of the DNA methylation differences between African-American and European-American individuals, the presence of single-nucleotide polymorphisms (SNPs) with a minor allele frequency of >1 % located within the top 30 hypomethylated CpG dinucleotides in African-Americans was determined using the UCSC Genome Browser. These SNPs located beyond the array probe sequences do not influence probe hybridization and the technical ability of probes to accurately measure DNA methylation. The top 30 hypomethylated CpG sites included four SNPs (substitutions or insertion–deletions), three of which were associated with genes including NRGN, BPIFA3 (C20orf71) and INTS1. Rs55661361 is an intragenic SNP located in the NRGN CpG site most hypomethylated in African-Americans. The ancestral A allele changes the CpG site in NRGR to a CpA site, which would prevent the methylation of the cytosine residue in this locus. The ancestral A allele has a frequency of 79 and 37 % in the Yoruba from Ibadan, Nigeria (YRI) and the European-Americans from Utah, USA (CEU) populations, respectively. This suggests that loss of DNA methylation in this CpG site in African-Americans might be explained by this genetic variation. To confirm this, we performed bisulfite DNA sequencing in 10 African-American and 21 European-American healthy individuals to determine the genotype–methylation relationship in this locus. We found a distinct pattern of association between the genotype of rs55661361 and the average DNA methylation of NRGN (Fig. 4). Individuals in this study can be categorized into high methylation, intermediate methylation, and low methylation groups representing three genotypes: G/G, G/A and A/A in rs55661361, respectively. The high methylation group (G/G) included 15 individuals (13 EA; 2 AA; average  % methylation = 81.6), the intermediate methylation group (G/A) included 13 individuals (8 EA; 5 AA; average  % methylation = 56.5), and the low methylation group (A/A) included 3 individuals (0 EA; 3 AA; average  % methylation = 0.0) (Fig. 4). Therefore, the DNA methylation difference between the two ethnicities on this CpG site can be completely explained by the genotype on rs55661361.

Fig. 4

The CpG-SNP rs55661361 (A > G) in NRGN has an allele frequency that differs between CEU and YRI populations (a). Bisulfite sequencing of the CpG-SNP region in healthy individuals (b) revealed three distinct methylation categories based on being heterozygous or homozygous for the disruptive A allele [high methylation (G/G); intermediate methylation (G/A); low methylation (A/A)]. A cartoon (c) demonstrating the effect of the A allele on cytosine methylation by DNMT1. Average DNA methylation levels in NRGN stratified by genotype in 31 healthy individuals included in the study (d). Genotype, average methylation ± SD, N: G/G, 0.82 ± 0.07, N = 15; G/A, 0.56 ± 0.07, N = 13; A/A, 0 ± 0, N = 3

Patch methylation of the IL32 promoter region suppresses gene expression in vitro

To confirm that DNA methylation changes we report are transcriptionally relevant, we tested if the differentially methylated regions we observed in IL32 can influence gene expression using in vitro reporter assays. Both of the upstream promoter/5'-untranslated regions hypomethylated in African-Americans compared to European-Americans, in healthy individuals and in lupus patients, displayed decreased luminescence after being fully methylated in vitro and inserted upstream of a Lucia luciferase reporter gene compared to an identical sequence that remained unmethylated. The Lucia/firefly luciferase ratio for the unmethylated Region 1 insert (−230 to +232 from IL32 transcription start site) was 2.43-fold greater than the methylated insert (P = 0.0001). The ratio for unmethylated Region 2 insert (+194 to +573 from IL32 transcription start site) was 2.74-fold greater than the methylated insert (P = 0.001) (Fig. 5).

Fig. 5

Using a luciferase reporter vector with an inserted promoter/5'-untranslated regions from IL32 shows a significant difference in luciferase expression between fully methylated and unmethylated regions as measured by a ratio of Lucia luciferase over firefly luciferase luminescence. A cartoon (a) of the linearized plasmid showing the location of the IL32 promoter/5'-untranslated region inserts in the Lucia Luciferase reporter vector and the number of CpG sites present in each insert. The expression of the Lucia luciferase reported as a ratio of Lucia/firefly luciferase (b)


We present the first comparison of the genome-wide DNA methylation status of peripheral naïve CD4+ T cells in healthy African-American and European-American individuals. Site-specific methylation differences between ethnicities was due in part to the presence of genetic variants that alter the methylation status of CpG dinucleotides. One mechanism behind these differences is the disruption of the CpG dinucleotide motif recognized by DNA methyltransferase 1 enzyme (DNMT1) [15]. DNMT1 recognizes hemi-methylated CpG dinucleotides during DNA replication and methylates the unmethylated CpG dinucleotide on the newly synthesized DNA strand. The effects of these polymorphisms in the genome would be a reduction in average DNA methylation of CpG sites in individuals with the disruptive allele. Differences in average DNA methylation would be greatest for CpG-SNPs that have significantly different allele frequencies between populations. The occurrence of CpG-SNPs and their effect on allele-specific methylation regions is highly population and cell-type specific. One study reported that 38–88 % of allele-specific methylation regions measured across 16 human adult and pluripotent cell lines were present solely due to CpG-SNPs. This allele-specific methylation was also found to extend beyond the individual CpG site in some instances, resulting in reduced methylation at nearby sites that contained no overlapping SNPs [16].

The most hypomethylated CpG site in African-Americans in this study in NRGN is a prime example of the effect genetic variation can have on DNA methylation. The presence of the A allele is associated with reduced methylation and is the major allele in African-Americans, but the minor allele in European-Americans. We demonstrated a stratification of African-American and European-American individuals by methylation status and rs55661361 genotype, with African-Americans having significant reduction in average DNA methylation in this locus compared to European-Americans. In addition to the NRGN locus, 3 others of the top 30 most hypomethylated CpG sites included disruptive SNPs. The population-specific frequency of these disruptive alleles varies between CEU and YRI populations. There are likely more CpG-SNPs across the genome that alter DNA methylation and account for differences between populations.

Neurogranin (Ng), encoded by NRGN, regulates calcium-dependent apoptosis in mature T cells [17]. These IL-2-dependent cells express Ng and interact with the calcium-binding protein calmodulin. Devireddy et al. proposed an interaction model in which an IL-2 deprived environment increases Ng expression and Ng-calmodulin interaction displaces calmodulin-bound Ca2+ and thus raises the intercellular Ca2+ concentration, promoting apoptosis. It is worth noting that lupus T cells are characterized by defective IL-2 expression, enhanced calcium influx, and increased apoptosis, and that lupus is more common in African-Americans compared to European-derived populations [18]. Further, recent reports suggest increased Ng expression in PBMCs of lupus patients [19, 20].

Apoptosis was found to be a recurring theme among genes associated with hypomethylated CpG sites in naïve CD4+ T cells from African-American individuals.

Gene ontology terms for the hypomethylated genes show an enrichment of cellular apoptosis driven by the hypomethylation of genes such as TNFRSF10A, COL18A1, CDKN1A, AHRR, UNC13D, RIPK1, GRIN1, AKAP13, VAV2, and PRODH. Tumor necrosis factor receptor superfamily, member 10a (or TRAILR1) is a cell surface protein encoded by TNFRSF10A. It recognizes extracellular Apo-2L (or TRAIL) and acts as a signal for the downstream recruitment of pro-apoptotic caspases in the cell [21].

Four CpG sites upstream of the cytokine gene IL32 were hypomethylated in African-American individuals independently of any known SNP. An 803-bp region (chr16:3115083–3115886; HG19) immediately upstream of and including the IL32 gene transcription start site and its 5'-untranslated region contained these four CpG sites. These sites were cloned with their surrounding sequence and inserted into a CpG-free luciferase reporter vector. The unmethylated and in vitro methylated regions were compared and showed a significant reduction in the expression of luciferase in transfected HEK 293 cells when all CpG sites were methylated. These regions contain regulatory elements that are influenced by DNA methylation and could regulate the expression of IL32 in peripheral T cells. IL-32 is a cytokine that induces the production of MIP-2, IL-8 and TNF in monocytic cell lines [22]. IL-32 (primarily IL-32β) expression in activated CD4 + T cells was also associated with increased apoptosis and was presumed to play a role in activation-induced cell death [23]. The hypomethylation of apoptosis-related genes could indicate an epigenetic susceptibility to apoptosis and might contribute to the pathogenesis of diseases in which T cell apoptosis is an associated factor. A significant increase in T cell apoptosis has been reported in SLE patients compared to controls [24]. Apoptosis was also increased in active (SLE Disease Activity Index >4) SLE patients compared to inactive patients. Taken together, hypomethylation of pro-apoptotic genes in naïve CD4+ T cells from healthy African-Americans may contribute to the increased frequency and severity of lupus in African-Americans.

The glutathione S-transferase mu 1 protein (GSTM1) encoded by GSTM1 is involved in the metabolism of xenobiotic compounds including carcinogens and products of oxidative stress. It plays a role in processing carcinogens and has been previously associated with cancer risk [25]. There were 4 CpG sites in GSTM1, two within 200 bp upstream of the transcription start site, one in the 1st exon, and one in the gene body, with significant hypomethylation in African-American naïve CD4+ T cells.

We also found significant hypomethylation in CD226 in African-American naïve CD4+ T cells. CD226 is expressed on the surface of T cells and natural killer T cells. It is involved in the activation, adhesion, cytotoxicity, and differentiation of T cells, and has previously been identified as a genetic risk locus for SLE [26]. The risk locus is located downstream of the CD226 gene body while the demethylated CpG site is within 1500 bp of the transcription start site.

In summary, we identified significant epigenetic difference in naïve CD4+ T cells between African-American and European-American healthy individuals. These ethnicity-specific epigenetic differences are at least in part derived from cis-acting genetic polymorphisms. We found significant hypomethylation in pro-apoptotic genes in African-Americans, and hypomethylation of pro-inflammatory genes such as IL32. This epigenetic variability might play a role in increased frequency and severity of some autoimmune diseases such as lupus, which is characterized by increased T cell apoptosis. Indeed, of the 144 hypomethylated genes in African-American naïve CD4+ T cells, 18 and 27 have been previously related to lupus and autoimmune diseases, respectively. We demonstrated that the majority of these 18 lupus-related genes were also hypomethylated in African-American compared to European-American lupus patients, providing further validation and disease-relevance of the ethnicity-specific DNA methylation changes reported in this study. Further work is required to determine if these hypomethylated sites exist in a transient state or are maintained as the cell develops, and whether or not they are relevant to other cell types. Exploration of the DNA methylome could help further resolve what inherent epigenetic differences in immune activity exist among populations that might influence the pathogenesis of complex heterogeneous diseases.


We provide evidence for wide-spread ethnicity-specific differences in DNA methylation patterns in naïve CD4+ T cells. These differences promote an overall T cell chromatin architecture that is more permissive to autoimmunity in African-Americans compared to European-Americans. Indeed, hypomethylated loci in African-Americans are enriched in pro-apoptotic genes involved in autoimmunity. These same genes are also hypomethylated in African-American compared to European-American lupus patients. Our data also provide strong evidence that these methylation differences between ethnicities are at least in part a result of genetic variants that directly disrupt CpG dinucleotides. Therefore, inherited epigenetic susceptibility loci contribute to explaining differences in susceptibility to autoimmune diseases, including lupus, between populations.


Study participants

A total of 66 healthy female individuals (21 African-Americans and 45 European-Americans) were recruited for our studies. There was no significant difference in age between the African-American (mean age = 39.3 years) and European-American (mean age = 42.5 years) groups (P value = 0.34). All study participants signed an informed consent prior to enrollment into the study. The institutional review boards at the University of Michigan and the Henry Ford Health System approved this study. To validate the relevance of detected ethnicity-specific DNA methylation differences in an autoimmune disease, a group of age-matched 21 African-American and 42 European-American female lupus patients were studied. All patients met the American College of Rheumatology classification criteria for SLE [27].

Isolation of naïve CD4+ T cells

Whole blood was collected at the time of enrollment and processed within 6 h of collection. Peripheral blood mononuclear cells were isolated using Ficoll density gradient centrifugation media (GE Healthcare Biosciences, USA) followed by indirect isolation of untouched naïve CD4+ T cells using the Naïve CD4+ T cell Isolation Kit II (Miltenyi Biotec Inc., USA). DNA was isolated using the DNeasy Blood and Tissue Kit (Qiagen, USA) and DNA concentration was determined by spectrophotometry on the NanoDrop 2000 (Thermo Scientific, USA). Flow cytometry was used to confirm purity of isolated naïve CD4+ T cells using CD4 and CD45RA fluorochrome-conjugated antibodies (BioLegend, USA).

DNA methylation analysis

500 ng of DNA from each sample was bisulfite converted using the EZ DNA Methylation™ Kit (Zymo Research, USA) as previously described [6]. DNA methylation was evaluated by hybridizing bisulfite-converted DNA to the Infinium HumanMethylation450 BeadChips (Illumina, USA) according to manufacturer’s directions. The Infinium array interrogates over 485,000 methylation sites across the human genome, covering 96 % of CpG islands, shores and flanking regions as well as 99 % of RefSeq genes with an average of 17 CpG sites per gene region. The array covers the promoter, 5′-untranslated region, first exon, gene body, and 3′-untranslated region of each gene. Array hybridization of the bisulfite-converted DNA was performed at the University of Michigan DNA Sequencing Core.

Methylation data processing and analysis

DNA methylation data analysis was performed using the minfi (1.12.0) statistical analysis package in the R environment (3.1.2) [28, 29] (Additional file 1: Figure S1). Minfi is an analysis suite for Infinium HumanMethylation450 data that allows user flexibility in preprocessing, quality control, and identification of differentially methylated regions. Raw data were imported into minfi as a single object of red and green channel intensity values and sample phenotype information. Preprocessing of these channel values was accomplished using a method included in minfi that is equivalent to the data preprocessing method in the Illumina GenomeStudio Methylation Module that converts the red and green channel intensities into methylated and unmethylated values. This included correction for background signal and normalization of each channel to control probes included on each array. An array was chosen at random for normalization. After preprocessing, 306 probes with a detection P value >0.05 in >20 % of the probes were removed from analysis. 60,045 probes were identified as having a SNP within 10 base pairs of the interrogated nucleotide and were removed from analysis as well. Using minfi’s “fixMethOutliers” command, extreme values in the methylated and unmethylated channels were identified separately. An intensity cutoff value was calculated by first log2(x + 0.5) transforming all intensity values and then calculating the median intensity and median intensity absolute deviation. An intensity cutoff was defined as the median + (−3 * median absolute deviation). Any intensity less than the cutoff value was considered an outlier, and adjusted to be equal to the intensity cutoff. The methylated and unmethylated probe values were used to calculate both a Beta (β) and M value ratio. β values are calculated using the formula: \(\beta = \frac{\text{Methylated}}{{{\text{Methylated}}\; + \;{\text{Unmethylated}}\; +\; 100}}\) and M values are calculated as the logit(β).

Batch effect correction was performed using the ComBat script available as part of the sva (3.12.0) package [30]. ComBat utilizes a parametric empirical Bayes method to first standardize means and variances across batches for normalized data. Next it uses the distribution of the standardized data to determine batch effect parameters, and finally, uses the empirically determined batch effect estimators to adjust the data. Individual Infinium microarray chips used were input into ComBat as batches. M values were calculated from normalized β values and used as input for ComBat, as it assumes a normal distribution of the data, and the batch effect-adjusted matrix of data was converted back into β values for further analysis.

After adjusting for batch effect, the delta β (Δβ) of each probe between the African-American and European-American groups was calculated as \(\varDelta \beta = {\text{Mean}} \beta \left( {{\text{African}} - {\text{American}}} \right) - {\text{Mean}} \beta ({\text{European}} - {\text{American}})\). A two-sided Student’s t test was performed assuming equal variances of the mean \(\beta\) in each group to calculate a P value adjusted for false discovery rate (FDR). Further analysis was limited to probes with an FDR-adjusted P value of less than or equal to 0.05 and a |Δβ| greater than 0.1.

Gene ontology and bioinformatics analysis

Gene ontologies (GO) identification of hypo- and hypermethylated probes was performed using the Database for Annotation, Visualization and Integrated Discovery (DAVID v6.7) [31, 32]. Search parameters used in DAVID were a minimum gene group membership of 2, a Modified Fisher Exact P value (EASE Score) maximum of 0.1, and the human genome as background. Further bioinformatics analysis was conducted using IRIDESCENT [12].

Bisulfite DNA sequencing

Confirmation of the methylation status reported by the BeadChip array for each donor was conducted by first bisulfite converting 100 ng of genomic DNA from 31 samples (21 EA; 10 AA) as described earlier in this paper. Polymerase chain reaction (PCR) was used to amplify a 581-bp region of NRGN (chr11:124613509–124614089; HG19) using forward (5′-GAATGTGAAGTTTAGGAAATGAGGT-3′) and reverse (5′-TTCAACCTAATAATAATCAAACCCACCTCT-3′) primers added to ZymoTaq™ PreMix (Zymo Research, USA). PCR thermocycler settings were as follows: DNA denaturation at 95 °C for 10 min followed by 40 cycles at 95 °C for 30 s, then 56 °C for 40 s, and finally 72 °C for 1 min. PCR amplification of the desired region was confirmed by electrophoresis of an aliquot from each product on a 2.0 % agarose gel at 93 V for 1 h. The remaining PCR product was prepared for sequencing using the DNA Clean and Concentrator-5 Kit (Zymo Research, USA) following manufacturer’s directions and DNA concentration was determined using the NanoDrop 2000 (Thermo Scientific, USA). Sequencing was performed on an ABI Model 3730XL sequencing platform. Average DNA methylation and genotypes were determined using Epigenetic Sequencing MEthylation (ESME) analysis software [33]. β values determined by ESME and the HumanMethylation450 array showed a high, positive correlation (R 2 = 0.8217).

Co-regulation analysis of hypomethylated genes

The co-expression of 144 hypomethylated genes was determined using 3900 processed, 2-color human gene expression microarrays retrieved from the Gene Expression Omnibus as previously described [14]. Global gene–gene pair co-expression was quantified using Pearson’s correlation coefficients.

Ligation of IL32 promoter regions into luciferase reporter vector

Two regions containing CpG sites of interest within the IL32 promoter/5'-untranslated regions were amplified from human genomic DNA. Region 1 (chr16:3115083–3115545; HG19) (forward primer: 5′-TTTGAGCTCCCTATCTTGTCCCACAGGTAGA-3′, reverse primer: 5′-AGGTACTCGAGACAGACAGAGACAGAGACAGAG-3′) contains cg00239353, cg26724967, and cg23813257 and region 2 (chr16:3115507–3115886; HG19) (forward primer: 5′-CCCGAGCTCTCTCTGTCTGTCTCTGTCTCTG-3′, reverse primer: 5′-ATGTACTCGAGGCACCCTTACGGTCTGTTT-3′) contains cg18350391, cg08978665, and cg00471190. These regions also contained other CpG sites not included on the Infinium array. The PCR thermocycler conditions used were as follows: 30 s at 98 °C, and 3 cycles of 98 °C for 10 s followed by 67 °C for 30 s followed by 72 °C for 20 s, and 27 cycles of 98 °C for 10 s followed by 58 °C for 30 s followed by 72 °C for 20 s, and 2 min at 72 °C. Each amplified region was Sanger sequenced to verify the correct sequence was amplified. The correctly amplified regions were ligated using T4 DNA ligase (New England BioLabs, USA) into a CpG-free Lucia luciferase reporter vector (pCpGfree-Lucia; InvivoGen, USA) (Fig. 5a) digested with ScaI. Both regions were inserted immediately upstream of a human EF1 promoter. Each ligation reaction contained a 5:1 insert: vector molar ratio, and was incubated overnight at 16 °C, then incubated 30 min at room temperature and heat inactivated at 65 °C.

The plasmids were transformed into E. coli GT115 (InvivoGen, USA) using the Mix and Go E. coli transformation kit (Zymo Research, USA) with 1 h of outgrowth, and grown on Zeocin selective plates and media (InvivoGen, USA). Plasmid DNA from transformed liquid cultures was extracted using the QIAprep Spin Miniprep Kit (Qiagen, USA), and the direction of each insert within the plasmid was determined by Sanger sequencing using the forward primer previously listed for each region.

Patch methylation of IL32-promoter region

0.5 µg of each complete plasmid were methylated via M.SssI methyltransferase (New England BioLabs, USA) or mock methylated (methyltransferase omitted). Each methylation reaction contained 640 mM S-adenosylmethionine and 16U M.SssI, and was incubated overnight at 37 °C and heat inactivated at 65 °C for 20 min. The CpG-free promoter vector is modified to contain no CpG sites, so the inserted promoter regions are exclusively methylated. Complete methylation was verified by digesting 100 ng of plasmid with SacI and methylation-sensitive restriction enzymes (HpyCH4IV for region 1, and AciI for region 2) at 37 °C for 1 h (New England BioLabs, USA). For region 1, the expected digestion product size was either 3.7 kb if methylated or 1.8 kb if unmethylated. For region 2, the expected product size was 3.7 kb if methylated or 1.6 and 1.9 kb if unmethylated. Only methylation reactions that were verified as completely methylated by gel electrophoresis were used in transfection.

Luciferase reporter assay of methylated and unmethylated IL32 promoter regions

HEK 293 cells in an opaque white 96 well plate were transfected in triplicate with 50 ng control firefly luciferase plasmid (Promega, USA) and 50 ng of pCpGfree-Lucia constructs, using Lipofectamine 3000 (Invitrogen, USA) according to manufacturer’s instruction. The pCpGfree-Lucia constructs used were either the methylated or mock methylated plasmids containing either region 1 or region 2, as well as a vector containing no insert, and a DNA-free control. After 48 h, the luciferase activity was measured after simultaneously treating an aliquot of the supernatant media with QUANTI-Luc reagent (InvivoGen, USA) to measure Lucia luciferase activity, and treating the adherent cells with Dual-Glo Luciferase reagent (Promega, USA) to measure firefly luciferase activity, and immediately reading the resulting luminescence with a Synergy H1 microplate reader (BioTek, USA) with a gain of 100 units. The reads were background corrected to the DNA-free control, then controlled for transfection efficiency by reporting the ratio of Lucia/firefly luciferase luminescence. These ratios were compared to determine the effect of methylation on luciferase activity using a Student’s t test with a statistical significance threshold of α ≤ 0.05 (Fig. 5b).



Systemic lupus erythematosus




Database of Annotation, Visualization and Integrated Discovery


Gene ontology


Long interspersed nucleotide element 1






Yoruba from Ibadan


CEPH (Centre d’Etude du Polymorphisme Humain) from Utah




False discovery rate


  1. 1.

    Fraga MF, Ballestar E, Paz MF, Ropero S, Setien F, Ballestar ML, et al. Epigenetic differences arise during the lifetime of monozygotic twins. Proc Natl Acad Sci USA. 2005;102(30):10604–9. doi:10.1073/pnas.0500398102.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  2. 2.

    Cooper GS, Stroehla BC. The epidemiology of autoimmune diseases. Autoimmun Rev. 2003;2(3):119–25. doi:10.1016/s1568-9972(03)00006-5.

    Article  PubMed  Google Scholar 

  3. 3.

    Somers EC, Marder W, Cagnoli P, Lewis EE, DeGuire P, Gordon C, et al. Population-based incidence and prevalence of systemic lupus erythematosus: the Michigan Lupus Epidemiology and Surveillance program. Arthritis Rheumatol. 2014;66(2):369–78. doi:10.1002/art.38238.

    Article  PubMed  PubMed Central  Google Scholar 

  4. 4.

    Zhang FF, Cardarelli R, Carroll J, Fulda KG, Kaur M, Gonzalez K, et al. Significant differences in global genomic DNA methylation by gender and race/ethnicity in peripheral blood. Epigenetics. 2014;6(5):623–9. doi:10.4161/epi.6.5.15335.

    Article  Google Scholar 

  5. 5.

    Heyn H, Moran S, Hernando-Herraez I, Sayols S, Gomez A, Sandoval J, et al. DNA methylation contributes to natural human variation. Genome Res. 2013;23(9):1363–72. doi:10.1101/gr.154187.112.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  6. 6.

    Coit P, Jeffries M, Altorok N, Dozmorov MG, Koelsch KA, Wren JD, et al. Genome-wide DNA methylation study suggests epigenetic accessibility and transcriptional poising of interferon-regulated genes in naive CD4+ T cells from lupus patients. J Autoimmun. 2013;43:78–84. doi:10.1016/j.jaut.2013.04.003.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  7. 7.

    Coit P, Renauer P, Jeffries MA, Merrill JT, McCune WJ, Maksimowicz-McKinnon K, et al. Renal involvement in lupus is characterized by unique DNA methylation changes in naive CD4+ T cells. J Autoimmun. 2015;61:29–35. doi:10.1016/j.jaut.2015.05.003.

    CAS  Article  PubMed  Google Scholar 

  8. 8.

    Diez-Guerra FJ. Neurogranin, a link between calcium/calmodulin and protein kinase C signaling in synaptic plasticity. IUBMB Life. 2010;62(8):597–606. doi:10.1002/iub.357.

    CAS  Article  PubMed  Google Scholar 

  9. 9.

    Huffman DL, O’Halloran TV. Function, structure, and mechanism of intracellular copper trafficking proteins. Annu Rev Biochem. 2001;70:677–701. doi:10.1146/annurev.biochem.70.1.677.

    CAS  Article  PubMed  Google Scholar 

  10. 10.

    Alonso A, Sasin J, Bottini N, Friedberg I, Friedberg I, Osterman A, et al. Protein tyrosine phosphatases in the human genome. Cell. 2004;117(6):699–711. doi:10.1016/j.cell.2004.05.018.

    CAS  Article  PubMed  Google Scholar 

  11. 11.

    Wren JD. Extending the mutual information measure to rank inferred literature relationships. BMC Bioinform. 2004;5:145. doi:10.1186/1471-2105-5-145.

    Article  Google Scholar 

  12. 12.

    Wren JD, Garner HR. Shared relationship analysis: ranking set cohesion and commonalities within a literature-derived relationship network. Bioinformatics. 2004;20(2):191–8.

    CAS  Article  PubMed  Google Scholar 

  13. 13.

    Wren JD, Bekeredjian R, Stewart JA, Shohet RV, Garner HR. Knowledge discovery by automated identification and ranking of implicit relationships. Bioinformatics. 2004;20(3):389–98. doi:10.1093/bioinformatics/btg421.

    CAS  Article  PubMed  Google Scholar 

  14. 14.

    Wren JD. A global meta-analysis of microarray expression data to predict unknown gene functions and estimate the literature-data divide. Bioinformatics. 2009;25(13):1694–701. doi:10.1093/bioinformatics/btp290.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  15. 15.

    Bestor TH. The DNA methyltransferases of mammals. Hum Mol Genet. 2000;9(16):2395–402. doi:10.1093/hmg/9.16.2395.

    CAS  Article  PubMed  Google Scholar 

  16. 16.

    Shoemaker R, Deng J, Wang W, Zhang K. Allele-specific methylation is prevalent and is contributed by CpG-SNPs in the human genome. Genome Res. 2010;20(7):883–9. doi:10.1101/gr.104695.109.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  17. 17.

    Devireddy LR, Green MR. Transcriptional program of apoptosis induction following interleukin 2 deprivation: identification of RC3, a calcium/calmodulin binding protein, as a novel proapoptotic factor. Mol Cell Biol. 2003;23(13):4532–41. doi:10.1128/mcb.23.13.4532-4541.2003.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  18. 18.

    Moulton VR, Tsokos GC. T cell signaling abnormalities contribute to aberrant immune cell function and autoimmunity. J Clin Invest. 2015;125(6):2220–7. doi:10.1172/JCI78087.

    Article  PubMed  Google Scholar 

  19. 19.

    Li S, Jiang W, Xiong Y, Wang X, Li Y, Shen S, et al. Neurogranin overexpression on peripheral blood mononuclear cells from patients with systemic lupus erythematosus. Chin J Rheumatol. 2008;12(4):265–8.

    CAS  Google Scholar 

  20. 20.

    Luo X, He X, Wu C, Ni J. Dong Ll, Li S. The discovery of novel splicing variants of neurogranin and their role in the pathogenesis of systemic lupus erythematosus. Arthritis Rheumatol. 2014;66(10 (Supplement)):S1173–4.

    Google Scholar 

  21. 21.

    Pan G, O’Rourke K, Chinnaiyan AM, Gentz R, Ebner R, Ni J, et al. The receptor for the cytotoxic ligand TRAIL. Science. 1997;276(5309):111–3. doi:10.1126/science.276.5309.111.

    CAS  Article  PubMed  Google Scholar 

  22. 22.

    Kim SH, Han SY, Azam T, Yoon DY, Dinarello CA. Interleukin-32: a cytokine and inducer of TNFalpha. Immunity. 2005;22(1):131–42. doi:10.1016/j.immuni.2004.12.003.

    CAS  PubMed  Google Scholar 

  23. 23.

    Goda C, Kanaji T, Kanaji S, Tanaka G, Arima K, Ohno S, et al. Involvement of IL-32 in activation-induced cell death in T cells. Int Immunol. 2006;18(2):233–40. doi:10.1093/intimm/dxh339.

    CAS  Article  PubMed  Google Scholar 

  24. 24.

    Dhir V, Singh AP, Aggarwal A, Naik S, Misra R. Increased T-lymphocyte apoptosis in lupus correlates with disease activity and may be responsible for reduced T-cell frequency: a cross-sectional and longitudinal study. Lupus. 2009;18(9):785–91. doi:10.1177/0961203309103152.

    CAS  Article  PubMed  Google Scholar 

  25. 25.

    Rebbeck TR. Molecular epidemiology of the human glutathione S-transferase genotypes GSTM1 and GSTT1 in cancer susceptibility. Cancer Epidemiol Biomark Prev. 1997;6(9):733–43.

    CAS  Google Scholar 

  26. 26.

    Lofgren SE, Delgado-Vega AM, Gallant CJ, Sanchez E, Frostegard J, Truedsson L, et al. A 3′-untranslated region variant is associated with impaired expression of CD226 in T and natural killer T cells and is associated with susceptibility to systemic lupus erythematosus. Arthritis Rheum. 2010;62(11):3404–14. doi:10.1002/art.27677.

    Article  PubMed  Google Scholar 

  27. 27.

    Chung SA, Tian C, Taylor KE, Lee AT, Ortmann WA, Hom G, et al. European population substructure is associated with mucocutaneous manifestations and autoantibody production in systemic lupus erythematosus. Arthritis Rheum. 2009;60(8):2448–56. doi:10.1002/art.24707.

    Article  PubMed  PubMed Central  Google Scholar 

  28. 28.

    Aryee MJ, Jaffe AE, Corrada-Bravo H, Ladd-Acosta C, Feinberg AP, Hansen KD, et al. Minfi: a flexible and comprehensive bioconductor package for the analysis of Infinium DNA methylation microarrays. Bioinformatics. 2014;30(10):1363–9. doi:10.1093/bioinformatics/btu049.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  29. 29.

    Team RC. R: A language and environment for statistical computing. 2014.

  30. 30.

    Jeffrey T. Leek WEJ, Hilary S. Parker, Elana J. Fertig, Andrew E. Jaffe, John D. Storey. sva: Surrogate Variable Analysis.

  31. 31.

    da Huang W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4(1):44–57. doi:10.1038/nprot.2008.211.

    CAS  Article  Google Scholar 

  32. 32.

    da Huang W, Sherman BT, Lempicki RA. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 2009;37(1):1–13. doi:10.1093/nar/gkn923.

    Article  PubMed Central  Google Scholar 

  33. 33.

    Lewin J, Schmitt AO, Adorjan P, Hildmann T, Piepenbrock C. Quantitative DNA methylation analysis based on four-dye trace data from direct sequencing of PCR amplificates. Bioinformatics. 2004;20(17):3005–12. doi:10.1093/bioinformatics/bth346.

    CAS  Article  PubMed  Google Scholar 

Download references

Authors’ contributions

PC performed the DNA methylome experiments, performed statistical and bioinformatics analysis, and wrote the manuscript; MO performed the bisulfite sequencing and genotyping experiments; EG performed the patch methylation and reporter assay experiments; KMM provided critical material for the study; JDW performed bioinformatics analysis; AHS conceived the study, designed the experiments, and wrote the manuscript. All authors approved this version of the manuscript and agree with submission. All authors were involved in drafting or revising the manuscript. All authors read and approved the final manuscript.


Research reported in this publication was supported by the National Institute of Allergy and Infectious Diseases, NIH under award number R01AI097134.

Competing interests

The authors declare that they have no competing interests.

Author information



Corresponding author

Correspondence to Amr H. Sawalha.

Additional file

Additional file 1.

Supplementary material (Table S1 and Figure S1).

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Coit, P., Ognenovski, M., Gensterblum, E. et al. Ethnicity-specific epigenetic variation in naïve CD4+ T cells and the susceptibility to autoimmunity. Epigenetics & Chromatin 8, 49 (2015).

Download citation


  • Epigenetic
  • Autoimmunity
  • Lupus
  • Ethnicity specific
  • Genetic
  • T cell