- Open Access
Generation of bivalent chromatin domains during cell fate decisions
Epigenetics & Chromatinvolume 4, Article number: 9 (2011)
In self-renewing, pluripotent cells, bivalent chromatin modification is thought to silence (H3K27me3) lineage control genes while 'poising' (H3K4me3) them for subsequent activation during differentiation, implying an important role for epigenetic modification in directing cell fate decisions. However, rather than representing an equivalently balanced epigenetic mark, the patterns and levels of histone modifications at bivalent genes can vary widely and the criteria for identifying this chromatin signature are poorly defined.
Here, we initially show how chromatin status alters during lineage commitment and differentiation at a single well characterised bivalent locus. In addition we have determined how chromatin modifications at this locus change with gene expression in both ensemble and single cell analyses. We also show, on a global scale, how mRNA expression may be reflected in the ratio of H3K4me3/H3K27me3.
While truly 'poised' bivalently modified genes may exist, the original hypothesis that all bivalent genes are epigenetically premarked for subsequent expression might be oversimplistic. In fact, from the data presented in the present work, it is equally possible that many genes that appear to be bivalent in pluripotent and multipotent cells may simply be stochastically expressed at low levels in the process of multilineage priming. Although both situations could be considered to be forms of 'poising', the underlying mechanisms and the associated implications are clearly different.
In recent years it has been suggested that the epigenetic programme may play a key role in determining cell fate, including the decision to undergo self-renewal or commitment. Based on genome-wide chromatin immunoprecipitation (ChIP) studies combined with expression analysis, it has been suggested that the chromatin associated with many genes controlling lineage fate decisions is uniquely marked in stem cells. Their histone signature is referred to as bivalent as it includes modifications associated both with repression (H3K27me3) imposed by the polycomb group proteins (PcG), and activation (H3K4me3) encoded by the Set/MLL histone methyltransferase, the mammalian homologue of the trithorax group proteins (trxG) [1–6]. Despite having both 'active' and 'repressive' chromatin marks, such genes were thought not to be expressed. Taken together, these observations led to an attractive model suggesting that a preimposed epigenetic signature suppresses expression of lineage control genes in stem cells (maintaining a pluripotent state) while at the same time 'poising' such genes for subsequent activation (reviewed in ). In favour of this, many lineage-control genes have a bivalent signature [1–5]. However, as the model has evolved, more recently it has been shown that RNA polymerase II (PolII) may be present but stalled at the promoters of bivalent genes [8, 9] and that short (abortive) transcripts may be detected at their promoters . Furthermore, although embryonic stem cells (ES cells) lacking the PcG repressive complex 2 (PRC2) aberrantly express developmental regulators  they maintain pluripotency . Similarly, two recent experiments in which components of the SET1/MLL core subunit (Dpy-30, RbBP5 and WDR5) were reduced to similar levels showed opposite phenotypes. In one study there was maintenance of self-renewal with a defect in differentiation , and in another there was a loss of self-renewal . Together these observations suggest that the current models explaining the significance bivalently marked chromatin may require revision.
An understated problem in testing the prevailing bivalent chromatin hypothesis is that the criteria for identifying such signatures are poorly defined. Closer analysis of publicly available chromatin datasets from human embryonic stem (ES) cells shows that even contiguous bivalent chromatin domains can be modified in widely different ways with respect to the relative levels and the distributions of H3K4me3 and H3K27me3 across the locus (see Additional file 1). Sequential ChIP analyses of a few developmental genes have shown that H3K4me3 and H3K27me3 may colocalise and, by extrapolation, it has been implied that all genes whose promoters are marked (to any degree) by both modifications are truly bivalent. According to this paradigm, many specialised, lineage-specific genes are bivalent [3, 4, 15] and as cells differentiate, chromatin modifications resolve into active or repressed states. However, since the original observations, it has become clear that bivalent chromatin modifications (indistinguishable from those seen in pluripotent cells) can also be newly established and/or maintained in differentiating cells [16–19]. Therefore, the functional significance of such bivalently marked genes has been questioned  and more studies have been urged to determine the mechanisms underlying these chromatin structures .
The human α globin genes are located within a well characterised multigene cluster whose analysis has elucidated many of the general principles underlying the transcriptional and epigenetic regulation of mammalian gene expression. The α globin cluster (5'-HBZ-HBM-HBA2-HBA1-HBQ-3') provides unequivocal examples of specialised, tissue-specific genes consistently scored as bivalent in ES cells [4, 5] (Figure 1). Their fully activated expression depends on one or more of four remote conserved regulatory elements (MCS-R1 to R4) , which interact with their promoters in erythroid cells via a looping mechanism . Although expressed in a strictly tissue-specific and developmental-stage-specific manner during erythropoiesis, the α globin promoters and much of the body of the associated genes lie within unmethylated CpG islands . Previously, we have described many aspects of the transcription factor and epigenetic programmes associated with hematopoiesis and how they are played out on the α globin cluster [24–27]. Here, we have used this model to investigate in detail the relationship between chromatin marks and mRNA expression during commitment and differentiation into erythroid cells. It appears that, rather than carrying a preimposed bivalent epigenetic signature which silences them, in pluripotent ES cells the α globin genes are repressed by PcG and comodified at readily detectable levels by H3K4me3 when expressed, even at basal levels. To ensure that characterisation of the α globin genes is revealing a general principle that could be relevant to other bivalent genes, we performed global analysis of H3K4me3/H3K27me3 modification and gene expression. The results suggest that our understanding of chromatin bivalency at the α globin locus may explain similar marks found at many other bivalent genes both in pluripotent and differentiating cells and highlights an alternative mechanism for generating bivalent domains.
Results and discussion
H3K4me3 at the α globin genes occurs at a low but significant level in ES cells and increases during erythroid differentiation
To dissect the mechanism(s) underlying epigenetic 'bivalency' in ES cells, we studied histone modifications across the telomeric region of chromosome 16 containing the human α globin locus. We found that, whereas H3K4me1 and H3K4me2 were both enriched at the α globin locus (Figure 1a, i and 1ii), H3K4me3 (considered to be a sensitive mark of recent or ongoing transcriptional activity) was relatively low (Figure 1a, iii). However, quantitative real time PCR (qPCR) (Figure 1b, iii) showed that the level of H3K4me3 at the α globin promoter in ES cells was significantly higher than in lymphocytes in which the α globin genes are considered to be fully repressed (see below). Similar results were obtained at the promoter of the HBM gene, a minor α globin-like gene whose promoter is also associated with a large CpG island .
To quantify this low level of H3K4me3, we compared the degree of enrichment seen in pluripotent ES cells to that seen in an artificially mixed population of erythroid and non-erythroid cells. This showed (Figure 1b, iii) that the H3K4me3 enrichment seen in ES cells at the α globin gene was even less than that obtained in a mixed population of cells consisting of 10% erythroid cells (presumably fully modified by H3K4me3) (Figure 1b, iii) and 90% lymphocytes (unmodified by H3K4me3, Figure 1b, iii).
These results show that the chromatin associated with the α globin genes is modified at a significant but low level by H3K4me3 in chromatin derived from a population of pluripotent stem cells. However, the level of H3K4me3 modification increases dramatically (15-fold) as cells differentiate into erythroid cells (Figure 1b, iii). This phenomenon is common to many other genes (previously noted to be bivalent in ES cells), which are expressed at high levels late in erythroid differentiation (for example, BLVRB, FAM83F, MTSS1, TNXB) . Conversely, H3K4me3 is barely detectable in lymphocytes in which the α globin genes are repressed.
H3K27me3 at the α globin genes occurs at high levels in ES cells and decreases during erythroid differentiation
We have previously shown that α globin expression is repressed in non-erythroid cells by PcG and its associated silencing mark H3K27me3. This repression is put in place early in development and then is either reduced in the erythroid lineage or maintained in non-globin-expressing cell types .
Here, we extended these observations to determine the pattern of H3K27me3 in pluripotent cells across the telomeric 500 kb of chromosome 16p. As noted in several non-erythroid differentiated cells , H3K27me3 enrichment extended across a broad region of the α globin cluster in ES cells (Figure 1a, iv).
Of importance, we next determined the relative levels of H3K27me3 enrichment at the HBM and α globin promoters in ES cells and lymphocytes in which the α globin genes are repressed. ChIP-chip results (Figure 1, iv and ), in accordance with qPCR data (Figure 1b, iv), indicate that the locus is modified to a similar extent in both cell types. These findings suggest that the α globin genes are highly (possibly maximally) modified by H3K27me3 in pluripotent cells and that this level of modification is maintained when cells differentiate into non-erythroid lineages. By contrast, in the erythroid population, as the PcG is completely cleared , H3K27me3 is reduced fourfold compared to that seen in non-erythroid cells (Figure 1b, iv) but not totally removed.
The α globin locus is bivalently modified in ES cells
Chromatin modification at the α globin locus thus resembles that seen at other bivalent domains (for example, CDX2 (see Additional file 1)) at which there is a high level of H3K27me3, and a low level of H3K4me3 which increases (with expression) or decreases (with silencing) in specific lineages as cells differentiate. To determine if the observed bivalent architecture results from colocalisation of H3K27me3 and H3K4me3 rather than simply reflecting the presence of two distinct subpopulations of active and silent cells, we performed sequential ChIP analyses.
At α globin and HBM promoters, chromatin precipitated with an antibody against H3K27me3 was sequentially precipitated by an antibody against H3K4me3 (Figure 2a). Similar results were seen by the 'reverse' sequential ChIP (Figure 2b). This shows that at least some chromatin at the α globin genes is truly comodified by H3K27me3 and H3K4me3.
The α globin locus is expressed at basal levels in a significant proportion of pluripotent cells
Chromatin modification may influence the probability that a locus is transcribed and/or reflect its recent transcriptional state. As for many bivalent genes, the assertion that the α globin genes are not expressed in pluripotent cells is based on the failure to identify binding of PolII in ChIP experiments and the very low (background) signals obtained on RNA microarray analyses. However, using more sensitive assays it has become increasingly clear that in pluripotent cells many bivalent genes are being transcribed either to produce a variety of short, abortive transcripts [8, 10] or to produce very low levels of full length RNA transcripts [15, 29].
To determine the transcriptional status of the α globin bivalent domain, we firstly looked for, but could not detect, high levels of 5' abortive transcripts (see Additional file 2). Then, we analysed expression of spliced transcripts at the cell population level. We found that in ES cells the level of α globin mRNA, although very low compared to erythroblasts (approximately 30,000 to 40,000 times less), was at least 10 times higher than that measured in Epstein-Barr virus (EBV)-transformed lymphoblastoid (EBV-Ly) cell lines (Figure 3a). Similarly, HBM mRNA could be detected in ES cells (although 50 times less than α globin) but was not detectable in lymphoblastoid cells (see Additional file 2). The lower expression of HBM than α globin might be due to differences in promoter sequences or in mRNA stability of this minor globin gene . By comparison, no β globin RNA transcripts were detected in ES and lymphoblastoid cells. We next estimated how many ES cells within each population express α globin using single-cell RT-PCR with primers that detect full-length mRNA transcripts. Using a multiplex analysis, this showed a detectable level of α globin in 12% of OCT4 positive ES cells (Figure 3b). Since a single normal erythroid cell contains approximately 20,000 (α + β) globin RNA molecules , it can be estimated that pluripotent ES cells express α globin only at basal (two or three copies per cell) levels. This is consistent with observations that chromatin modifications (H3K36me3 and H3K9me3) associated with high rates of transcription were not detected in ES cells (see Additional file 3). By contrast, no α globin expression was detected in more than 100 EBV-Ly cells (Figure 3c), confirming that in these cells, unlike ES cells, the state of chromatin at the α globin locus is in a completely repressed transcriptional configuration.
Changes in expression and chromatin modification in hematopoietic stem cells and progenitors
To examine how α globin expression changes during hematopoiesis, we studied primary hematopoietic progenitors at various stages of commitment and differentiation (Figure 3a). Mixed populations enriched for hemopoietic stem cells (HSCs) and common myeloid progenitors (CMPs) exhibited a level of α globin gene expression higher than in ES cells. Expression then increased further in fractions enriched for megakaryocyte-erythroid progenitors (MEPs) but decreased in granulocyte-monocyte progenitors (GMPs), confirming that a gradual restriction in the differentiation potential is associated with an upregulation of lineage-specific genes and silencing of lineage-inappropriate genes.
In HSCs we detected high levels of H3K27me3 and relatively low levels of H3K4me3 (Figure 3d); a pattern consistent with the accepted criteria for a bivalent chromatin signature (and as defined for this gene in pluripotent ES cells). The levels of H3K27me3 (and its associated methyltransferase EZH2) in HSCs, CMPs, GMPs, MEPs and EBV-Ly cells were similar and decreased as MEPs differentiated into erythroid cells (Figure 3d and see Additional file 4). By contrast the levels of H3K4me3, associated with the parallel increase of expression, started to increase as CMPs differentiated into MEPs, prior to the clearance of PcG and H3K27me3. Therefore, at each of these stages of differentiation, the ratio of H3K4me3/H3K27me3 changed, reflecting the level of α globin expression (Figure 3a).
Bivalent genes with a higher H3K4me3 occupancy at the promoter are more often transcribed in ES cells
In genome-wide analyses it has been shown that, in general, the level of H3K4me3 modification correlates with gene expression and H3K27me3 correlates with silencing [16, 17, 31, 32]. But does this correlation also apply to bivalent genes? The detailed analysis of a single gene presented here shows that, at the bivalent α globin locus, changes in the levels of basal gene expression are reflected by changes in the H3K4me3 mark. Is this behaviour an exception? Here we evaluated the relationship between H3K4me3/H3K27me3 modifications and gene expression at other, previously identified bivalent genes.
On a global scale, we determined how the H3K4me3/H3K27me3 ratio differed at bivalent genes known to be expressed at different levels in ES cells. Data from human ES cell studies [4, 6] were crossreferenced and the bivalent genes subdivided into three 'bins' according to their expression. We found that in a population of ES cells, the chromatin associated with genes with the highest expression had the highest H3K4me3/H3K27me3 ratios, while those with the lowest relative expression had the lowest ratios (Figure 4a). In addition, there was a significant positive association between the absolute level of RNA expression and the H3K4me3/H3K27me3 ratio at the associated transcription start sites (TSSs) (P = 2.2 × 10-16) (Figure 4b). We also analysed the same data set plotting for each individual bivalent gene the ratio of H3K4me3/H3K27me3 against expression. Clearly they follow the same trend but, due both to biological variation and insensitivity of microarray data to low levels of transcription, there is a considerable scatter (see Additional file 5).
In a cell population, the relative levels of H3K4me3/H3K27me3 at previously identified bivalent genes could reflect the relative proportions of cells in which the gene in question is marked exclusively by H3K4me3 or H3K27me3. To address this we looked at a subset of the previously studied bivalent loci with the highest levels of H3K27me3 (top 20%). It seemed likely that, for this set, the genes in question would be modified by H3K27me3 in most, if not all, cells. So it was of interest that, even in this subgroup, we found a significant positive association between the absolute levels of RNA expression and the levels of associated H3K4me3 (P = 2.25 × 10-8) (see Additional file 6). Therefore, in comparison to other more stringent forms of silencing (for example DNA methylation), PcG silencing may be incomplete and allow stochastic gene expression.
Finally, we examined the relationship between the ratio of H3K4me3/H3K27me3 at specific bivalent genes and the ability to detect single-cell transcripts expressed from these genes. It has been reported that, as for α globin, expression of other bivalent genes can be sporadically detected in single ES cells . The bivalent genes studied by Gibson et al.  were crossreferenced with ChIP data from Ku et al.  and subdivided into three classes according to the proportion of cells in which the transcript was detected. Although the number of genes analysed by single-cell RT-PCR was small, we found that there was a clear tendency for the genes whose transcripts were never detected, to have the lowest H3K4me3/H3K27me3 ratios (Figure 4c).
The central question raised here is whether or not bivalent domains, marked both by repressive (H3K27me3) and active (H3K4me3) histone modifications, represent a preprogrammed epigenetic signature of silent but poised chromatin or more simply reflect different degrees of silencing (H3K27me3) and transcription (H3K4me3) within a population of cells. The observation that most clearly supports the prevailing concept of a preprogrammed epigenetic mark in pluripotent cells is the presence of bivalent domains in which, despite the presence of both activating (H3K4me3) and repressive (H3K27me3) marks, no PolII elongation and full length transcription are detected, leading to the conclusion that all of such genes are poised [8, 9]. However through evaluation of public datasets it is readily notable that the levels and patterns of histone modifications at bivalent genes are very variable. Similarly, a wide range of H3K4me3 levels within bivalent domains of embryonic and haematopoietic stem and progenitor cells has been recently identified [13, 19].
Here, the detailed analysis of a single well characterised bivalent gene (α globin) expressed during haematopoiesis and the global analysis of the relationship between H3K4me3/H3K27me3 ratio and gene expression suggest another explanation for bivalent chromatin domains that might be considered. It is possible that some genes with bivalent signatures may be poised in pluripotent cells by a common but as yet undefined molecular mechanism. If so, then it is not clear why the associated chromatin modifications are so variable. An alternative explanation, proposed here, is that bivalently marked genes are regulated by PcG and marked by H3K27me3; the different levels of associated H3K4me3 may simply be a sensitive marker of different levels of transcriptional activity. The latter scenario is consistent with previous observations showing that many lineage-specific genes can be stochastically expressed at low but variable levels in multipotent cells (so called multilineage priming) .
Although both situations could be considered to be poised the underlying mechanisms and their implications are different. In fact, the first model (repressed but poised for later activation) implies that most of the cells harbour coexisting repressive and activating marks and that both PcG and trxG have to be maintained at the given locus through cell divisions to keep the repressed-but-poised status until further changes in the balance of their activity. By contrast, the second (repressed and marked by variable degrees of stochastic transcription) implies that PcG is not fully efficient at repressing stochastic transcriptional noise. This would provide fertile ground for subsequent activation and changes in cell fate decisions as a tissue-specific transcription factor programme emerges . Definitive experiments to discriminate between these two possibilities will require the development of assays to correlate chromatin modification and gene expression within single cells.
The part of the study involving human participants was approved by the Institutional Ethics Committee (approval number 06/Q1606/110). A written informed consent was obtained from all patients.
Primary cells and cell culture
EBV-Ly cell lines were cultured in RPMI 1640 supplemented with 10% (v/v) fetal calf serum, 2 mM l-glutamine, 50 U/ml penicillin and 50 μg/ml streptomycin. Isolation and culture of primary human erythroblasts was carried out as described previously . The human embryonic stem cell line, H1 (WiCell, Madison, WI, USA), was grown on irradiated mouse embryonic fibroblasts in medium containing Dulbecco's modified Eagle medium (DMEM):F12, serum replacer (Invitrogen, Paisley, UK), l-glutamine, β-mercaptoethanol, non-essential amino acids and basic fibroblast growth factor (8 ng/ml). Cells were passaged every 5-7 days with collagenase IV to maintain undifferentiated human embryonic stem cells.
Flow cytometric analysis and sorting
Normal human bone marrow samples were collected from individuals undergoing total hip replacement for osteoarthritis. CD34+ cells were enriched using MACS (Miltenyi Biotech, Bergisch Gladbach, Germany) immunomagnetic beads and cryopreserved in 90% fetal bovine serum (FBS)/10% dimethylsulfoxide (DMSO). All experiments were carried out using cryopreserved CD34+ cells that were thawed, washed in Iscove's modified Dulbecco's medium (IMDM)/10% FBS and cultured overnight in StemSpanSFEM (StemCell Technologies, Grenoble, France) in the presence of recombinant human stem cell factor (100 ng/ml), Flt3-ligand (100 ng/ml) and thrombopoietin (100 ng/ml) (Peprotech, Rock Hill, NJ, USA).
CD34+ cells were first stained with purified anti-CD2, RPA-2.10; CD3, HIT3a; CD4, RPA-T4; CD7,124-1D1; CD8, RPA-T8; CD10, CB-CALLA; CD11b, ICRF44; CD14, 61D3; CD19, HIB19; CD20, 2H7; CD56, MEM188; GPA, GA-R2 (eBioscience, San Diego, CA, USA). Subsequently cells were stained with Pacific Blue conjugated goat F(ab')2 anti-mouse IgG conjugates (H+L) (Invitrogen). Finally, cells were stained with fluorescein isothiocyanate (FITC)-conjugated anti-CD38 (HIT2), phycoerythrin (PE)-conjugated anti-CD45RA (HI100), PE-Cy7-conjugated anti-CD123 (6H6) (eBioscience), PE-Cy5-conjugated anti-CD34 (581) (Beckman Coulter, High Wycombe, UK) and allophycocyanin (APC)-conjugated anti-CD110 (BAH-1) (Becton Dickinson, Franklin Lakes, NJ, USA). HSCs were isolated as Lin- CD34+ CD38-, CMPs as Lin- CD34+ CD38+ CD123low/+ CD45RA- CD110-, GMPs as Lin- CD34+ CD38+ CD123+ CD45RA+ CD110- and MEPs as Lin- CD34+ CD38+ CD123-/low CD45RA- CD110-. Dead cells were excluded by Hoechst 33258 (Invitrogen) staining or by 7-aminoactinomycin D (Sigma-Aldrich, Gillingham, UK). Appropriate unstained, single stained and Fluorescence Minus One controls were used to determine the background staining level and compensation in each channel. All sorting and analyses were performed on three laser-equipped MoFlo (Dako Cytomation, Ely, UK) or BD FACS AriaII (Becton Dickinson) machines. Single ES cells were sorted with an automated cell deposition unit into 96-well plates. FACS data were analysed with Summit software (Dako Cytomation).
ChIP analyses were performed according to the Millipore ChIP protocol (Millipore 17-295, Billerica, MA, USA), as described previously . Input and immunoprecipitated material were analysed by real time PCR using a series of PCR amplicons and 5'FAM-3'TAMRA probes across the α globin locus . ChIP-chip experiments were carried out on the custom α globin tiling path microarray, as described previously . The enrichment of ChIP DNA over input DNA was calculated as ratio of the background corrected ChIP signal divided by the background corrected input signal (both globally normalised). The antibodies used in the experiments were: anti-mono/di/trimethyl Lys4 histone H3 (07-436, 07-030, and 07-473, respectively), anti-trimethyl Lys27 histone H3 (07-449) and anti-trimethyl Lys9 histone H3 (07-442 lot 24416) from Millipore; anti-trimethyl Lys36 histone H3 (ab9050) from Abcam (Cambridge, UK); anti-RNA-PolII (H224) from Santa Cruz Biotechnology (Santa Cruz, CA, USA), anti-EZH2 (36-6300) from Zymed (San Francisco, CA, USA).
For ChIP in the artificially mixed population of cells, 10% of primary erythroblasts were mixed with 90% of EBV-ly cells. At any one time, about 80% of erythroid cells are positive for nascent α globin RNA transcripts  and 100% of these cells accumulate high levels of globin RNA during the late stages of erythropoiesis. Therefore these cells are presumed to be 100% modified by H3K4me3. By contrast, in lymphocytes, the α globin genes are repressed and the H3K4me3 signal at the α globin promoter is at background level.
Sequential ChIP analyses were performed as follows: 5 μg of H3K4me3 or H3K27me3 antibody were immobilised to 50 μl of protein A agarose beads (Millipore) by crosslinking with 2.5 mM BS3 (Thermo Fisher Scientific, Waltham, MA, USA). Chromatin precipitated using the first antibody was eluted in 500 μl of 0.1 M NaHCO3 and 1% SDS. This solution was then diluted tenfold in ChIP Dilution buffer (Millipore) and subjected to immunoprecipitation using the second antibody. The second antibody was not crosslinked to protein A.
Gene expression analysis
Total RNA was extracted using TriReagent (Sigma). Contaminating DNA was removed from RNA preps with the DNA-free kit (Ambion/Applied Biosystems, Austin, TX, USA) according to the manufacturer's instructions. cDNA was generated using 1-5 μg of total RNA and random primers using the Prostar RT-PCR kit (Stratagene/Agilent Technologies, Santa Clara, CA, USA). Negative control cDNA samples generated without reverse transcriptase were analysed in all experiments. Real time qPCR experiments were carried out on ABI Prism 7000 Sequence Detection System (Applied Biosystems) using a set of primers/probe detecting HBA2 cDNAs (see Additional file 7). The results were normalised to a control sequence in the 18S ribosomal RNA (RNRI) gene (Eurogentec, Southampton, UK). To detect short abortive transcripts at the 5' region of HBA cDNA, two different sets of primers were used in a SybrGreen qPCR reaction (see Additional file 7).
Multiplex single-cell RT-PCR analysis was performed as previously described [33, 37]. Single cells were deposited into 96-well PCR plates using a single cell depositor unit coupled to a fluorescence-activated cell sorting (FACS) ARIAII cell sorter (providing single cells in >99% of the wells, and no wells with more than 1 cell as assessed by routinely sorting fluorescent beads or cells prior to and after single cell sorting). Each well contained 4 μl of lysis buffer (0.4% NP40, 2.3 mM dithiothreitol (DTT), 0.07 mM dNTP, 0.5 U/μl Rnase Inhibitor). Cell lysates were reverse transcribed in a 10 μl reaction with Superscript III Reverse Transcriptase (Invitrogen) and gene-specific primers (R1s in Additional file 7). A first round of PCR (40 cycles) was performed by the addition of a PCR mix containing PCR buffer, 1.25 U of Taq polymerase (Invitrogen) and gene-specific forward primers (F1s in Additional file 7). A total of 1 μl of 1:10 diluted first-round PCR products were amplified in a second-round PCR, which was carried out using fully nested gene-specific primers (F2-R2 in Additional file 7). PCR products were gel electrophoresed and visualised by ethidium bromide staining. Only control-positive (OCT4 in ES cells, HPRT in EBV-Ly) wells were considered as informative and scored. A total of 270 individual human ES cells and 110 EBV-Ly were analysed.
In order to correlate genes expression and chromatin state in human ES cells, expression data (accession number GSE8439 ) and single cell transcript detection data  were crossreferenced with ChIP-Seq data (accession number GSE13084 ). The promoter chromatin state was calculated as relative ratio of the signal derived from the number of H3K4me3 and H3K27me3 sequence reads across a window between -3 kb and +3 kb of the annotated TSS. The relationship between H3K4me3/H3K27me3 ratio and expression was calculated by averaging of the H3K4me3/H3K27me3 ratio within a sliding window 100 observations wide, incrementing by 1, using a Spearman rank correlation. Considering the different source of the two ES cell datasets (H1, male cell line, for expression data and H9, female cell line, for ChIP-Seq data), X-chromosome linked genes were excluded from the analysis.
Azuara V, Perry P, Sauer S, Spivakov M, Jorgensen HF, John RM, Gouti M, Casanova M, Warnes G, Merkenschlager M, Fisher AG: Chromatin signatures of pluripotent cell lines. Nat Cell Biol. 2006, 8: 532-538. 10.1038/ncb1403.
Bernstein BE, Mikkelsen TS, Xie X, Kamal M, Huebert DJ, Cuff J, Fry B, Meissner A, Wernig M, Plath K, Jaenisch R, Wagschal A, Feil R, Schreiber SL, Lander ES: A bivalent chromatin structure marks key developmental genes in embryonic stem cells. Cell. 2006, 125: 315-326. 10.1016/j.cell.2006.02.041.
Mikkelsen TS, Ku M, Jaffe DB, Issac B, Lieberman E, Giannoukos G, Alvarez P, Brockman W, Kim TK, Koche RP, Lee W, Mendenhall E, O'Donovan A, Presser A, Russ C, Xie X, Meissner A, Wernig M, Jaenisch R, Nusbaum C, Lander ES, Bernstein BE: Genome-wide maps of chromatin state in pluripotent and lineage-committed cells. Nature. 2007, 448: 553-560. 10.1038/nature06008.
Pan G, Tian S, Nie J, Yang C, Ruotti V, Wei H, Jonsdottir GA, Stewart R, Thomson JA: Whole-genome analysis of histone H3 lysine 4 and lysine 27 methylation in human embryonic stem cells. Cell Stem Cell. 2007, 1: 299-312. 10.1016/j.stem.2007.08.003.
Zhao XD, Han X, Chew JL, Liu J, Chiu KP, Choo A, Orlov YL, Sung WK, Shahab A, Kuznetsov VA, Bourque G, Oh S, Ruan Y, Ng HH, Wei CL: Whole-genome mapping of histone H3 Lys4 and 27 trimethylations reveals distinct genomic compartments in human embryonic stem cells. Cell Stem Cell. 2007, 1: 286-298. 10.1016/j.stem.2007.08.004.
Ku M, Koche RP, Rheinbay E, Mendenhall EM, Endoh M, Mikkelsen TS, Presser A, Nusbaum C, Xie X, Chi AS, Adli M, Kasif S, Ptaszek LM, Cowan CA, Lander ES, Koseki H, Bernstein BE: Genomewide analysis of PRC1 and PRC2 occupancy identifies two classes of bivalent domains. PLoS Genet. 2008, 4: e1000242-10.1371/journal.pgen.1000242.
Pietersen AM, van Lohuizen M: Stem cell regulation by polycomb repressors: postponing commitment. Curr Opin Cell Biol. 2008, 20: 201-207. 10.1016/j.ceb.2008.01.004.
Guenther MG, Levine SS, Boyer LA, Jaenisch R, Young RA: A chromatin landmark and transcription initiation at most promoters in human cells. Cell. 2007, 130: 77-88. 10.1016/j.cell.2007.05.042.
Brookes E, Pombo A: Modifications of RNA polymerase II are pivotal in regulating gene expression states. EMBO Rep. 2009, 10: 1213-1219. 10.1038/embor.2009.221.
Kanhere A, Viiri K, Araujo CC, Rasaiyaah J, Bouwman RD, Whyte WA, Pereira CF, Brookes E, Walker K, Bell GW, Pombo A, Fisher AG, Young RA, Jenner RG: Short RNAs are transcribed from repressed polycomb target genes and interact with polycomb repressive complex-2. Mol Cell. 2010, 38: 675-688. 10.1016/j.molcel.2010.03.019.
Boyer LA, Plath K, Zeitlinger J, Brambrink T, Medeiros LA, Lee TI, Levine SS, Wernig M, Tajonar A, Ray MK, Bell GW, Otte AP, Vidal M, Gifford DK, Young RA, Jaenisch R: Polycomb complexes repress developmental regulators in murine embryonic stem cells. Nature. 2006, 441: 349-353. 10.1038/nature04733.
Chamberlain SJ, Yee D, Magnuson T: Polycomb repressive complex 2 is dispensable for maintenance of embryonic stem cell pluripotency. Stem Cells. 2008, 26: 1496-1505. 10.1634/stemcells.2008-0102.
Jiang H, Skukla A, Wang X, Chen WY, Bernstein BE, Roeder RG: Role for Dpy-30 in ES cell-fate specification by regulation of H3K4 methylation within bivalent domains. Cell. 2011, 144: 513-525. 10.1016/j.cell.2011.01.020.
Ang YS, Tsai SY, Lee DF, Monk J, Su J, Ratnakumar K, Ding J, Ge Y, Darr H, Chang B, Wang J, Rendl M, Bernstein E, Schaniel C, Leminschka IR: Wdr5 mediates self-renewal and reprogramming via the embryonic stem cell core transcriptional network. Cell. 2011, 145: 183-197. 10.1016/j.cell.2011.03.003.
Efroni S, Duttagupta R, Cheng J, Dehghani H, Hoeppner DJ, Dash C, Bazett-Jones DP, Le Grice S, McKay RD, Buetow KH, Gingeras TR, Misteli T, Meshorer E: Global transcription in pluripotent embryonic stem cells. Cell Stem Cell. 2008, 2: 437-447. 10.1016/j.stem.2008.03.021.
Roh TY, Cuddapah S, Cui K, Zhao K: The genomic landscape of histone modifications in human T cells. Proc Natl Acad Sci USA. 2006, 103: 15782-15787. 10.1073/pnas.0607617103.
Barski A, Cuddapah S, Cui K, Roh TY, Schones DE, Wang Z, Wei G, Chepelev I, Zhao K: High-resolution profiling of histone methylations in the human genome. Cell. 2007, 129: 823-837. 10.1016/j.cell.2007.05.009.
Cui K, Zang C, Roh TY, Schones DE, Childs RW, Peng W, Zhao K: Chromatin signatures in multipotent human hematopoietic stem cells indicate the fate of bivalent genes during differentiation. Cell Stem Cell. 2009, 4: 80-93. 10.1016/j.stem.2008.11.011.
Adli M, Zhu J, Bernstein BE: Genome-wide chromatin maps derived from limited numbers of hematopoietic progenitors. Nat Methods. 7: 615-618.
Christophersen NS, Helin K: Epigenetic control of embryonic stem cell fate. J Exp Med. 207: 2287-95.
Hughes JR, Cheng JF, Ventress N, Prabhakar S, Clark K, Anguita E, De Gobbi M, de Jong P, Rubin E, Higgs DR: Annotation of cis-regulatory elements by identification, subclassification, and functional assessment of multispecies conserved sequences. Proc Natl Acad Sci USA. 2005, 102: 9830-9835. 10.1073/pnas.0503401102.
Vernimmen D, Marques-Kranc F, Sharpe JA, Sloane-Stanley JA, Wood WG, Wallace HA, Smith AJ, Higgs DR: Chromosome looping at the human alpha-globin locus is mediated via the major upstream regulatory element (HS-40). Blood. 2009, 114: 4253-4260. 10.1182/blood-2009-03-213439.
Bird AP, Taggart MH, Nicholls RD, Higgs DR: Non-methylated CpG-rich islands at the human alpha-globin locus: implications for evolution of the alpha-globin pseudogene. EMBO J. 1987, 6: 999-1004.
Anguita E, Hughes J, Heyworth C, Blobel GA, Wood WG, Higgs DR: Globin gene activation during haemopoiesis is driven by protein complexes nucleated by GATA-1 and GATA-2. EMBO J. 2004, 23: 2841-2852. 10.1038/sj.emboj.7600274.
Vernimmen D, De Gobbi M, Sloane-Stanley JA, Wood WG, Higgs DR: Long-range chromosomal interactions regulate the timing of the transition between poised and active gene expression. EMBO J. 2007, 26: 2041-2051. 10.1038/sj.emboj.7601654.
De Gobbi M, Anguita E, Hughes J, Sloane-Stanley JA, Sharpe JA, Koch CM, Dunham I, Gibbons RJ, Wood WG, Higgs DR: Tissue-specific histone modification and transcription factor binding in alpha globin gene expression. Blood. 2007, 110: 4503-4510. 10.1182/blood-2007-06-097964.
Garrick D, De Gobbi M, Samara V, Rugless M, Holland M, Ayyub H, Lower K, Sloane-Stanley J, Gray N, Koch C, Dunham I, Higgs DR: The role of the polycomb complex in silencing alpha-globin gene expression in nonerythroid cells. Blood. 2008, 112: 3889-3899. 10.1182/blood-2008-06-161901.
Goh SH, Terry Lee Y, Bhanu NV, Cam MC, Desper R, Martin BM, Moharram R, Gherman RB, Miller JL: A newly discovered human α-globin gene. Blood. 2005, 106: 1466-1472. 10.1182/blood-2005-03-0948.
Gibson JD, Jakuba CM, Boucher N, Holbrook KA, Carter MG, Nelson CE: Single-cell transcript analysis of human embryonic stem cells. Integr Biol (Camb). 2009, 1: 540-551. 10.1039/b908276j.
Bhanu NV, Trice TA, Lee YT, Miller JL: A signaling mechanism for growth-related expression of fetal hemoglobin. Blood. 2004, 103: 1929-1933. 10.1182/blood-2003-05-1624.
Schubeler D, MacAlpine DM, Scalzo D, Wirbelauer C, Kooperberg C, van Leeuwen F, Gottschling DE, O'Neill LP, Turner BM, Delrow J, Bell SP, Groudine M: The histone modification pattern of active genes revealed through genome-wide chromatin analysis of a higher eukaryote. Genes Dev. 2004, 18: 1263-1271. 10.1101/gad.1198204.
Weishaupt H, Sigvardsson M, Attema JL: Epigenetic chromatin states uniquely define the developmental plasticity of murine hematopoietic stem cells. Blood. 2010, 115: 247-256. 10.1182/blood-2009-07-235176.
Hu M, Krause D, Greaves M, Sharkis S, Dexter M, Heyworth C, Enver T: Multilineage gene expression precedes commitment in the hemopoietic system. Genes Dev. 1997, 11: 774-785. 10.1101/gad.11.6.774.
Rase JM, O'Shea EK: Control of stochasticity in eukatyotic gene expression. Science. 2004, 304: 1811-1814. 10.1126/science.1098641.
Pope SH, Fibach E, Sun J, Chin K, Rodgers GP: Two-phase liquid culture system models normal human adult erythropoiesis at the molecular level. Eur J Haematol. 2000, 64: 292-303. 10.1034/j.1600-0609.2000.90032.x.
Brown JM, Leach J, Reittie JE, Atzberger A, Lee-Prudhoe J, Wood WG, Higgs DR, Iborra FJ, Buckle VJ: Coregulated human globin genes are frequently in spatial proximity when active. J Cell Biol. 2006, 172: 177-187. 10.1083/jcb.200507073.
Mansson R, Hultquist A, Luc S, Yang L, Anderson K, Kharazi S, Al-Hashmi S, Liuba K, Thoren L, Adolfsson J, Buza-Vidas N, Qian H, Soneji S, Enver T, Sigvardsson M, Jacobsen SE: Molecular evidence for hierarchical transcriptional lineage priming in fetal and adult stem cells and multipotent progenitors. Immunity. 2007, 26: 407-419. 10.1016/j.immuni.2007.02.013.
Vakoc CR, Sachdeva MM, Wang H, Blobel GA: Profile of histone lysine methylation across transcribed mammalian chromatin. Mol Cel Biol. 2006, 26: 9185-9195. 10.1128/MCB.01529-06.
We thank the CBRG for bioinformatic support, I Dunham, C Langford and the Microarray Facility of the Wellcome Trust Sanger Institute for assistance with microarrays, K Clark for flow sorting, and T Milne for critical reading of the manuscript. This work was supported by the National Institute for Health Research Biomedical Research Centre Programme.
The authors declare that they have no competing interests.
MDG participated in the design of the study, carried out the molecular studies and drafted the manuscript; DG, ML, DV, NG, SL, KML, JAS-S, CP and RR carried out the molecular studies; DG, ML and RJG helped to draft the manuscript; JRH and ST participated in the bioinformatic and statistical analyses; SS performed the statistical analysis; SEWJ, PV, RJG and TE participated in the design of the study and its coordination; DHR conceived the study and participated in its design and coordination and drafted the manuscript. All authors read and approved the final manuscript.