Skip to main content

The cohesin-associated protein Wapal is required for proper Polycomb-mediated gene silencing



The cohesin complex consists of multiple core subunits that play critical roles in mitosis and transcriptional regulation. The cohesin-associated protein Wapal plays a central role in off-loading cohesin to facilitate sister chromatid separation, but its role in regulating mammalian gene expression is not understood. We used embryonic stem cells as a model, given that the well-defined transcriptional regulatory circuits were established through master transcription factors and epigenetic pathways that regulate their ability to maintain a pluripotent state.


RNAi-mediated depletion of Wapal causes a loss of pluripotency, phenocopying loss of core cohesin subunits. Using chromatin immunoprecipitation coupled with next-generation sequencing (ChIP-seq), we determine that Wapal occupies genomic sites distal to genes in combination with CTCF and core cohesin subunits such as Rad21. Interestingly, genomic sites occupied by Wapal appear enriched for cohesin, implying that Wapal does not off-load cohesin at regions it occupies. Wapal depletion induces derepression of Polycomb group (PcG) target genes without altering total levels of Polycomb-mediated histone modifications, implying that PcG enzymatic activity is preserved. By integrating ChIP-seq and gene expression changes data, we identify that Wapal binding is enriched at the promoters of PcG-silenced genes and is required for proper Polycomb repressive complex 2 (PRC2) recruitment. Lastly, we demonstrate that Wapal is required for the interaction of a distal cis-regulatory element (CRE) with the c-Fos promoter.


Collectively, this work indicates that Wapal plays a critical role in silencing of PcG target genes through the interaction of distal CREs with promoters.


Gene expression is regulated by the complex interplay of cis-acting DNA elements and trans acting molecules such as transcription factors (TFs). The cohesin complex plays a critical role in connecting distal cis-acting DNA elements to gene promoters by facilitating DNA loops [13]. In addition, the cohesin complex mediates sister chromatid cohesion during mitosis, ensuring proper genomic segregation [4]. In mitosis, the core cohesin subunits Smc1a, Smc3, and Rad21 are loaded onto sister chromatids by Nipbl in early G1/late telophase and off-loaded starting in prophase by Wapal [5, 6]. While the function of core cohesin subunits and accessory proteins (Nipbl, Wapal) are well established during mitosis, Wapal’s role during transcriptional regulation remains less well defined. The core cohesin subunits facilitate interactions between genes and distal cis-regulatory elements by “looping-out” the intervening chromatin segment [24, 7]. In embryonic stem cells (ESCs), the majority of cohesin-binding sites are co-occupied by CTCF, but a minority of cohesin-binding sites represent transcriptional enhancers [2, 810]. Depletion of cohesin core subunits by RNA interferences (RNAi) in ESCs causes differentiation secondary to either transcriptional or cell-cycle changes [2, 8, 11]. Whether Wapal participates in gene regulation through facilitating DNA loops or even binds to specific genomic regions remains unknown. However, recent studies indicate Wapal plays a role during interphase through controlling the dynamic association of cohesin with chromatin [12, 13]. However, because the precise genomic sites occupied by Wapal are unknown, it is difficult to know its precise role in regulating cohesin’s association with specific chromatin regions.

Given the critical role of cohesin in mitosis, mutations within core subunits would be expected to cause significant mitotic defect(s). A subset of patients with Cornelia de Lange Syndrome (CdLS), characterized by microcephaly, cognitive impairment, abnormal facies, and other malformations has mutations within core subunits of the cohesin complex including Smc1a, Smc3, and Rad21 (reviewed in [14, 15]. However, these patients have sporadic heterozygous mutations, implying that complete loss of cohesin activity through null mutations is incompatible with life. Given that these mutations behave in an autosomal dominant fashion with unaffected parents, it implies that the majority of CdLS mutations occur within the parental germ cells. Surprisingly, CdLS patient samples exhibit a normal cell cycle, implying that cohesin haploinsufficiency does not cause CdLS through alterations in mitosis. Recent work has also demonstrated that heterozygous mutations in cohesin are a common (5–20 %) occurrence in patients with acute myeloid leukemia (AML) and related disorders [16, 17]. Given that AML samples rarely exhibit significant changes in chromosomal number, it again highlights that cohesin mutations likely cause disease by alterations in gene expression.

Compared to the core cohesin subunits, far less is known about the role of Wapal in transcriptional regulation. In mammals, Wapal plays a role in off-loading cohesin to prevent chromatin condensation [13], implying that Wapal likely antagonizes core cohesin subunits during transcriptional regulation. However, because the specific genomic sites occupied by Wapal are unknown, its precise role in mammalian transcriptional regulation remains unclear. In Drosophila, Wapal promotes Polycomb group silencing, although the mechanism is unclear and whether it applies to mammals is unknown [12].

How Polycomb complexes are targeted to specific genomic regions remains a critical question within epigenetics given their important role in cellular differentiation [18]. In Drosophila, Polycomb targeting is mediated by specific distal cis-regulatory elements (CREs) termed Polycomb response elements (PREs) [18, 19]. In contrast, mammalian PREs are rare [19], and other proposed mechanisms for PcG targeting include noncoding RNAs [20], nonspecific silencing of all transcription which must be subsequently relieved by transcriptional activators [21], interactions with other sequence-specific DNA-binding proteins such as CTCF [22, 23], or the presence of CpG islands within the promoter. Collectively, the diversity of proposed targeting mechanisms illustrates the lack of consensus within the field.

To better delineate the role of Wapal in mammalian transcriptional regulation, we have chosen murine ESCs as a model system. ESCs are unique because they possess two critical properties: pluripotency, or the ability to differentiate into all three primitive germ layers (mesoderm, ectoderm, and endoderm) that give rise to the embryo, and self-renewal, or the ability to propagate indefinitely in an undifferentiated state. The transcriptional and epigenetic pathways that regulate both functions have been well defined through a series of genome-wide approaches. Our study demonstrates that Wapal plays a central role in regulating transcription by assisting in Polycomb group-mediated gene silencing.


Wapal depletion causes ESCs to differentiate

Depletion of cohesin complex core subunits (Smc3, Smc1a, Scc1/Rad21) causes a loss of pluripotency when depleted in ESCs [2, 8, 11]. To examine whether loss of Wapal had a similar effect, we depleted Wapal in ESCs using RNAi. We infected ESCs with puromycin-resistance-encoding lentiviruses containing short-hairpin RNAs (shRNAs) to deplete Wapal. As a positive control, we depleted the pluripotency TF Sall4 with a single shRNA; a second shRNA to Sall4 gives similar results [24, 25]. Depletion of both Wapal and Sall4 induced a loss of compact, spherical colonies indicative of differentiation 6 days after infection (Fig. 1a, top panels). Surface expression of the pluripotency marker alkaline phosphatase was partially reduced and highlights the significant morphological changes that occurred following Wapal depletion (Fig. 1a, bottom panels). To quantitate the changes in pluripotency, we used Cas9-mediated genomic editing to create an ESC line with an internal ribosomal entry site:enhanced green fluorescent protein reporter (IRES:EGFP) cassette inserted into the Oct4 locus, allowing EGFP expression to be a surrogate marker for pluripotency. The same approach has been used to generate a flow cytometry-based assay to quantitate changes in pluripotency [26, 27]. After 6 days of puromycin selection, we observed a statistically significant reduction in the mean fluorescent intensity (MFI) of the GFP peak after depleting with shRNAs to Sall4 or Wapal (shRNA #2, Additional file 1: Figure S1a). Wapal shRNA #1 caused a reduction in the MFI but did not reach statistical significance (p value <0.08). Taken together, the changes in cell morphology and alkaline phosphatase activity and decrease in Oct4:IRES:EGFP expression indicate that Wapal depletion induces a loss of pluripotency and subsequent differentiation of ESCs. Similar results have been observed by other groups when depleting Smc3, Smc1a [2], or Rad21 [8], indicating that depletion of Wapal phenocopies the loss of core cohesin subunits on ESC pluripotency.

Fig. 1
figure 1

a ESCs were infected with lentiviruses encoding shRNAs to Sall4 and Wapal. Cells with a high multiplicity of infection (MOI) were selected by addition of puromycin for 6 days. Images were taken by bright field (BF, ×10) or after staining for the pluripotency marker alkaline phosphatase (AP, ×10). b Changes in pluripotency markers (Nanog, Oct4, Sall4, and Rex1) and Wapal were measured by RT-qPCR. Fold change was calculated relatively to cells infected with the empty vector and plotted linearly on the y-axis. Error bars represent SEM of at least two experiments. Asterisk indicates statistically significant reductions compared to empty vector (p value <0.05). c Similar to b, but for differentiation markers Brachyury (mesoderm), Fgf5 (ectoderm), and Cdx2 (trophectoderm). a Log10 scale for fold change is used because of strong induction of these markers. Asterisk indicates statistically significant increases compared to empty vector (p value <0.05). d Protein levels after depletion of Wapal or Sall4 for core cohesin subunits Rad21 and Smc3. GAPDH is shown as a loading control. Intervening, unrelated lanes were removed, but all samples were run on the same gel. e Propidium iodide (PI) staining was used to measure DNA levels in cells by flow cytometry. The percent of cells within each phase of the cell cycle and which exhibited abnormal DNA content (<2N or >4N) is shown. The only statistically significant difference was in the fraction within G0/G1 after depletion of Sall4 or Wapal as compared to empty vector (asterisk indicates p value <0.05)

We further examined the effects of each shRNA on known pluripotency markers (Fig. 1b). Both shRNAs induced a strong knockdown of Wapal by greater than 80 %. Wapal knockdown was associated with a significant reduction in the expression of pluripotency markers such as Oct4, Rex1, and Nanog. To examine whether differentiation markers are increased in expression after Wapal depletion, we screened a panel of lineage-specific markers including Cdx2 (trophectoderm), Brachyury (mesoderm), and Fgf5 (ectoderm). As shown in Fig. 1c, depletion of Wapal is associated with a dramatic increase in each marker, indicating differentiation into these lineages. Collectively, these data indicate that Wapal is required for ESC pluripotency. To determine whether Wapal depletion induces differentiation through indirect effects on other members of the cohesin completion, we performed Western blots to assess the levels of two core subunits of the cohesin complex, Rad21 and Smc3 (Fig. 1d). Wapal depletion had minimal effect on Rad21 or Smc3 protein levels, indicating that the loss of pluripotency is unlikely to be secondary to altered expression of other core cohesin subunits.

Given the role of Wapal in off-loading cohesin from sister chromatids at the end of mitosis, we assessed the cell cycle to determine whether Wapal depletion induced aneuploidy, which could cause a loss of pluripotency. We measured DNA content by propidium iodide (PI) staining in ESCs 48 h after Wapal depletion and assessed the percent of cells in different stages of the cell cycle or that exhibited aneuploidy (Fig. 1e). We chose an earlier time point than our differentiation experiments because subtle changes in the cell cycle early in the process could be responsible for differentiation. Depletion of neither the pluripotency TF Sall4 nor Wapal increased the number of aneuploid cells. We saw a slight increase in the number of cells in G0/G1 when we depleted either Sall4 or Wapal, but little change in the percentage of cells in M or S phase. Collectively, these data indicate that Wapal depletion does not dramatically alter the cell cycle or promote aneuploidy in ESCs. Thus, the loss of pluripotency is unlikely to be caused by a mitotic defect or aneuploidy.

To ensure that the pluripotency changes were not secondary to off-target effects of our shRNAs, we generated an ESC line expressing an epitope (v5)-tagged version of full-length Wapal immune to shRNA #2 by mutating every third base in each codon within the shRNA recognition site. This strategy prevents recognition by the shRNA without altering the linear amino acid sequence. Infection with shRNA #2 on wild-type cells caused a loss of pluripotency as assessed by cell morphology that was completely rescued in cells expressing the full-length Wapal immune to the shRNA (FL Wapal Imm, Additional file 1: Figure S1b). In addition to rescuing the morphology, expression of FL Wapal Imm rescued the changes in pluripotency markers such as Nanog and Oct4 (Additional file 1: Figure S1c) and suppressed the induction of differentiation markers such as Brachyury, Bmp2, Cdx2, and Fgf5 (Additional file 1: Figure S1d). We assessed protein levels of a subset of markers (Additional file 1: Figure S1e), which were decreased (Nanog) or increased (Brachyury) after Wapal depletion. Expression of FL Wapal Imm blocked the protein changes in Nanog and Brachyury observed after Wapal depletion. Collectively, the phenotypic rescue by FL Wapal Imm indicated that the loss of pluripotency we observed is secondary to Wapal depletion and not an off-target effect of RNAi.

Wapal preferentially occupies genomic regions with other cohesin proteins

To identify genomic regions occupied by Wapal, we screened for antibodies which met ENCODE criteria [28] for chromatin immunoprecipitation coupled with next-generation sequencing (ChIP-seq). All antibodies tested were unsuitable for ChIP-seq; therefore, we turned to a metabolic labeling approach to tag Wapal with biotin [24, 29]. As a first step, we determined the level of expression of the tagged Wapal constructs stably expressed in ESCs. By Western blot (Additional file 2: Figure S2a), the total levels of Wapal protein were unchanged between two clones expressing tagged Wapal (FB Wapal) versus a control cell line expressing the biotin ligase BirA in the absence of tagged Wapal. As a positive control, we included a previously published cell line expressing a tagged Nanog (FB Nanog, [29]). Thus, FB Wapal is expressed at subendogenous levels, consistent with our previous published data, and meets ENCODE criteria for ChIP-seq [24, 28, 29]. We performed ChIP-seq on two independent FB Wapal clones and subtracted genomic regions also present in our BirA-only control for downstream analyses (Additional file 2: Figure S2b, see “Methods” section for details). Five Wapal-binding sites were verified by ChIP-qPCR at genomic sites co-occupied by Wapal and other subunits of the cohesin complex (Additional file 2: Figure S2c and below).

As a first step, we analyzed all Wapal-binding sites to determine their distribution within the genome. The majority (≈60 %) were in extragenic regions, and only a small percentage (5 %) were within gene promoters (Fig. 2a). Wapal is known to directly interact with the core cohesin subunit Rad21 during mitosis to facilitate off-loading cohesin [5, 30]. We have also demonstrated that Wapal interacts with the pluripotency-associated transcription factor Nanog [29]. CTCF, which has pleiotropic functions in regulating transcription, directly interacts with core cohesin subunits (Smc3, Rad21, and Stag1/2) to stabilize DNA loops required for long-range chromatin interactions, although a direct CTCF–Wapal interaction has not been reported [3, 31, 32]. We overlapped our Wapal-binding sites with those of Rad21, Nanog, and CTCF to determine whether they co-occupy the same genomic regions (Fig. 2b). We were surprised that Wapal and Nanog showed minimal overlap (4 %) with each other, implying that they bind to distinct genomic regions. In contrast, Wapal showed a high degree of overlap with CTCF (97 %). Genomic sites occupied by Wapal showed a strong overlap with core cohesin subunits (75 % overlap between Wapal and Rad21, Fig. 2b, c). Collectively, this indicates that Wapal co-occupies DNA elements with core cohesin subunits and CTCF. Importantly, given that CTCF recruits core cohesin subunits to DNA elements [3, 33], it is likely that Wapal is recruited to these sites in conjunction with other cohesin subunits rather than through a direct Wapal–CTCF interaction (Additional file 3: Figure S3).

Fig. 2
figure 2

a The genomic distribution of Wapal-binding sites. b Overlap of Wapal-binding sites with CTCF, Nanog, or core subunits of the cohesin complex Smc3 and Rad21. Percentages listed indicate the area(s) of overlap of Wapal-binding sites with a specific factor. c ChIP-seq tracks for Adm to demonstrate the binding of Wapal, CTCF, and cohesin components. The y-axis represents the # of ChIP-seq tags recovered for a given genomic bin. The range for each track is displayed to the right of track name. A lower threshold of 2 was set for all tracks to minimize background. In all cases, the y-axis for BirA and Wapal are the same to demonstrate binding specificity

Wapal does not affect cohesin binding at co-occupied genomic sites

Recently, it has been demonstrated that Wapal antagonizes cohesin binding to chromatin during interphase, similar to its role during mitosis [5, 12, 13]. However, because Wapal’s genomic occupancy was unknown, previous studies were performed with bulk analysis of cohesin binding to chromatin and not at specific genomic sites [12, 13]. We examined all loci occupied by Wapal in combination with core cohesin subunits (Rad21, Smc1a, and Smc3) to determine whether Wapal-occupied genomic sites display lower levels of cohesin binding. Genomic sites occupied by Wapal consistently displayed higher ChIP-tag densities for Rad21, Smc3, or Smc1a (Fig. 3a, b; Additional file 4: Figure S4) as compared to sites where these factors bound without Wapal. To confirm this result directly, we measured the chromatin occupancy of Rad21 after Wapal depletion by ChIP-qPCR at a handful of loci co-occupied by both Wapal and Rad21 (Fig. 3c). If Wapal promotes cohesin of off-loading, Wapal depletion should increase Rad21 occupancy as measured by ChIP-qPCR. We did not observe a statistically significant increase in Rad21 binding at four loci as compared to the empty vector. Collectively, these data would indicate that Wapal depletion did not promote off-loading cohesin at genomic they co-occupy.

Fig. 3
figure 3

a The ChIP-seq tag densities of Rad21 were compared at genomic sites occupied with (red) or without (green) Wapal in a 2.5 kb window around peak center. b ChIP-seq tag densities for each Rad21 binding site within the genome is visualized as individual rows, with the color scale indicating a relative linear gradient from 0 to maximal (100 %) tag values. Two different plots were generated based upon the binding with (left) or without (right) Wapal. Row order is from lowest binding to highest binding. c Wapal was depleted with two, independent shRNAs for 48 h and genomic occupancy of Rad21 measured by ChIP-qPCR. The genomic sites (x-axis) are all co-occupied by both Wapal and Rad21. All sites demonstrated statistically significant enrichment in the empty vector sample compared with input (p value <0.05). Plus symbol indicates a statistically significant reduction from empty vector (p value <0.05)

Wapal depletion is associated with the derepression of Polycomb targets

To assess transcriptome changes after Wapal depletion, we hybridized cDNA generated from ESCs 48 h post-lentiviral infection to microarrays. We chose a short time frame to minimize differentiation effects and enrich for target genes. Wapal-depleted cells showed no changes in morphology as compared to empty vector at this early time point (Additional file 5: Figure S5a). To assess whether pluripotency and differentiation markers are altered at this early time point, we performed reverse transcription coupled with quantitative PCR (RT-qPCR) for various markers that exhibited changes after longer periods (6 days) of Wapal depletion. We observed minimal changes in Nanog, Rad21, or Smc3 (Additional file 5: Figure S5b). Differentiation markers Brachyury, Bmp2, and Cdx2 did not increase. Thus, pluripotency is not compromised, and the cells do not differentiate significantly 48 h after Wapal depletion.

As further proof that the gene expression changes we observed were unique to Wapal depletion and not secondary to differentiation effects, we downloaded and reanalyzed microarray data from ESCs induced to differentiate by depleting two pluripotency-associated transcription factors, Nanog or Oct4 [34]. We compared the 100 genes either upregulated or downregulated by Wapal depletion and mapped changes at the same genes after Nanog or Oct4 depletion, which are known to induce ESCs to differentiate. We observed very little concordance between the gene expression changes seen with Wapal and Nanog/Oct4 depletion, implying that at least initially the gene expression changes are distinct, even though the end result (ESC differentiation) is the same (Fig. 4a). We also wondered whether Wapal depletion would affect the transcriptome differently than loss of a core subunit of the cohesin complex. We depleted Smc3 using two independent shRNAs and compared the gene expression changes with those after Wapal depletion. There was a striking similarity in the gene expression patterns when comparing Wapal- and Smc3-depleted cells (Fig. 4a), implying that the transcriptome changes were very similar. Collectively, our data suggest that Wapal depletion induces transcriptome changes similar to Smc3 loss, but distinct from depletion of Nanog or Oct4. Thus, although depletion of Wapal/Smc3 and Nanog/Oct4 all causes ESCs to differentiate, they are likely through distinct mechanisms.

Fig. 4
figure 4

a The 100 probes most upregulated (red) or downregulated (green) with shRNA #1 to Wapal were identified and the Log2 fold change (Log2 FC) calculated for Wapal shRNA #2, Smc3 shRNAs, and from published studies in which the pluripotency-associated TFs Nanog or Oct4 were depleted by siRNAs. Log2 FC is displayed, with the color scheme indicated. b GSEA was used (see “Methods” section for details) to identify pathways globally dysregulated by Wapal depletion. Six gene sets from pluripotent cells which all exhibited substantial enrichment in Wapal-depleted samples are shown. Positive normalized enrichment score (NES) indicates that the gene set is enriched in Wapal-depleted samples as compared to samples infected with the empty vector. False discovery rate (FDR < 25 %) and p value <0.01 were used to determine statistically significant changes in gene set expression. c ChIP-seq datasets were used to generate a list of high-confidence targets of Polycomb-mediated gene silencing by the presence of both H3K27me3 and H2Aub1 in their promoters (1455 genes). For each gene, the expression changes after Wapal or Smc3 depletion were measured by microarray. For genes with multiple probes, maximal probe deviation was used. Genes that showed minimal change in their expression after Wapal depletion or the two shRNAs did not demonstrate similar changes were eliminated because of possible off-target effects. Log2 fold change is displayed, with a color scheme shown below

To identify pathways or groups of genes dysregulated by Wapal depletion, we utilized gene set enrichment analysis (GSEA) that measures whether a defined set of genes shows a statistically significant, concordant difference between two biological conditions [35]. We screened 3395 different curated gene sets generated by perturbing mammalian cell systems by chemical, RNAi, or genetic means to identify Wapal-induced gene expression changes. We were not surprised that a large number of gene sets (Additional file 6: Table S5) were either upregulated (181) or downregulated (218) after Wapal depletion in a statistically significant manner (p value <0.01 and false discovery rate-FDR <25 %). As a first step, we focused on gene sets derived from pluripotent cells as being more pertinent to our analyses. In particular, we noted that Polycomb-marked genes were derepressed after Wapal depletion in pluripotent cells [3638]. We observed a total of six different Polycomb target gene sets derepressed following Wapal depletion (Fig. 4b) with a high degree of statistical significance. These data are consistent with existing literature in Drosophila that Wapal promotes Polycomb-mediated gene silencing [12]. Two gene sets significantly downregulated after Wapal depletion is described as “core” modules of the pluripotent state, indicating that genes required to maintain ESCs in an undifferentiated state exhibit decreased expression (Additional file 5: Figure S5c). Thus, Wapal may play a role in silencing of Polycomb target genes in mammalian ESCs.

To validate whether Wapal depletion causes a derepression of PcG-silenced genes, we downloaded two different, independent gene sets where PcG target genes were identified [39, 40]. Both of these gene sets demonstrated a statistically significant upregulation after Wapal depletion (Additional file 5: Figure S5d), indicating that the gene sets demonstrated increased expression after Wapal depletion. As a second, independent confirmation, we downloaded and reanalyzed ChIP-seq data for the PRC1 mark of ubiquitination of histone 2A (H2AUb1) [39] and the PRC2 mark trimethylation of histone 3 on lysine 27 (H3K27me3) [41]. Genes marked by either PRC1 (H2Aub1) or PRC2 (H3K27me3) in their promoter showed a global increase after Wapal depletion, again consistent with a role for Wapal in the silencing of PcG genes (Additional file 5: Figure S5e). Given that GSEA analyzes all genes as a group, we performed a differential expression analysis to address whether PcG-silenced genes are globally derepressed or a subset of genes was skewing our analysis. To accomplish this, we generated a high-confidence list of PcG-marked genes by the presence of both H3K27me3 and H2Aub1 in their promoters. Next, we assessed transcriptional changes after Wapal depletion for each gene (Fig. 4c) and observed a 3:1 ratio of genes upregulated versus downregulated after Wapal depletion. A similar pattern was seen after Smc3 depletion (Fig. 4c). Collectively, our data indicate that Wapal depletion is associated with derepression of PcG-silenced genes, similar to the gene expression phenotype observed in Drosophila [12].

Wapal depletion selectively reduces Ring1b occupancy at PcG-marked genes

Recent studies in Drosophila [42] indicate that the cohesin complex plays a critical role in mediating gene expression by sequestering the PRC1 complex at active genes, thereby preventing PRC1 from inducing gene silencing. To address whether this is also occurring in mammalian cells, we performed ChIP-seq with an antibody to Ring1b, a core subunit of the PRC1 complex(es), which mediates histone ubiquitination, after Wapal depletion. We saw robust binding of Ring1b at PcG-silenced genes (1455 total), which decreased after Wapal depletion (Additional file 7: Figure S6a), consistent with Wapal playing a role in PcG targeting. To examine whether Wapal depletion induced a broad redistribution of Ring1b genomic occupancy, we examined a similar number (1455) of high- and low-expression genes from our microarray data. We saw virtually no binding of Ring1b to high-expression genes, which was essentially unchanged after Wapal depletion (Additional file 7: Figure S6b). We saw low-level binding of Ring1b to low-expression genes, which again remained unchanged after Wapal depletion. Collectively, our data imply that loss of Wapal induced the changes in Ring1b occupancy at PcG-marked genes alone and does not induce redistribution across high- and low-expressed genes.

We also hypothesized that if Ring1b genomic changes were secondary to differentiation of ESCs, rather than genomic occupancy induced by Wapal, we should see altered Ring1b occupancy at genes either upregulated or downregulated during differentiation. To address this, we identified the top 1000 genes upregulated or downregulated by depletion of Nanog or Oct4 and assessed changes in Ring1b occupancy (Additional file 7: Figure S6c). We found that both sets of genes showed minimal enrichment for Ring1b and minimal change in Ring1b occupancy after Wapal depletion. Given this finding, it is highly unlikely that the transcriptome and epigenomic changes we observe after Wapal depletion are secondary to ESC differentiation.

Wapal is required for maintenance of the PRC2 mark H3K27me3 at PcG-silenced genes

Given that PcG-mediated gene silencing occurs predominantly through histone modifications deposited within promoters, we were surprised that Wapal depletion caused PcG-marked gene derepression, given that only a small fraction (5 %) of Wapal-binding sites are within promoters. In fact, only 3 % of the high-confidence PcG-marked genes (1455) we identified had a Wapal-binding site within 2 kb of the transcriptional start site (TSS, Additional file 8: Table S1). In Drosophila, PcG genomic targeting is mediated by distal CREs termed Polycomb response elements (PREs), but in mammals PREs are rare [18, 19, 43]. Additionally in Drosophila, cohesin depletion decreases long-range interactions between a PRE and target gene(s) [42]. Given this, we hypothesized that Wapal may bind at distal CREs but through higher-ordered chromatin structures be brought into close physical proximity of gene promoters. In this case, ChIP-seq signals can often be detected at a gene’s promoter, although the binding is not substantial enough to give a positive enrichment through peak-calling algorithms. This type of binding, referred to as indirect binding, has been extensively studied by one group [44] and identifies long-range chromatin interactions critical for proper gene expression. As a first test, we examined the binding of Wapal to PcG-marked genes (Fig. 5) and observed that Wapal exhibited elevated binding at these gene promoters as compared with the BirA control. We also analyzed the subset of PcG-marked genes derepressed after Wapal depletion (Additional file 9: Figure S7a, b) and found a similar result, indicating that Wapal binding is enriched at this subset of promoters as well. A very similar pattern of binding was seen for Smc3 at PcG-marked genes (Additional file 9: Figure S7c, d), indicating that Wapal and core cohesin components behave similarly in terms of binding to PcG-marked genes in ESCs.

Fig. 5
figure 5

a A total of 1455 Polycomb-marked (H3K27me3+/H2Aub1+) genes were identified and the ChIP-seq tag densities measured for Wapal and the negative control cell line expressing BirA alone. A 5 kb window around the TSS of each gene is shown along the x-axis. b Similar to 3B, each row represents an individual Polycomb-marked gene, and the relative binding of Wapal and BirA is shown for each. Row order is determined by binding intensity for Wapal. Color gradient is a relative linear gradient from no binding (0 %) to maximal binding (100 %)

To perform a candidate gene analysis, we identified six PcG-marked genes derepressed after Wapal depletion for further analysis (Additional file 10: Figure S8a). Of these six genes, two (Snai2 and Ppp1r3c) had combined Wapal-/CTCF-binding sites outside the promoter but within 10 kb of the TSS. In both cases (Additional file 10: Figure S8b), a clear peak of Wapal binding near the TSS could be seen, but did not reach the threshold to be identified as a binding site during peak calling. Among the four other genes (HoxD1, Sox11, Twist1, and Lefty1), we were unable to identify a Wapal-binding site within 10 kb of the gene, but nonetheless noted evidence of Wapal enrichment around the TSS (Additional file 11: Figure S9). Collectively, this indicates that Wapal binds predominantly at distal CREs, but nonetheless displays binding at PcG-silenced genes.

To determine whether Wapal depletion caused global alterations in the activity of either PRC1 or PRC2 complexes, we measured protein levels by Western blotting with antibodies specific for PRC marks (H2Aub1 and H3K27me3) along with core subunits Ring1B (PRC1), Ezh2 (PRC2), and Suz12 (PRC2). In each case, we saw trivial changes in the protein expression of the histone marks and core PRC subunits (Additional file 12: Figure S10). This was surprising given our derepression data and implies that global PcG activity is unaffected by loss of Wapal. To further study this issue, we performed ChIP-qPCR at six derepressed genes (HoxD1, Sox11, Twist1, Snai2, Ppp1r3c, and Lefty1) 48 h after depleting Wapal with two shRNAs (Fig. 6). Primers to the Nanog promoter or two intergenic regions (Ctrl1, 2) were included as a negative control; we observed no enrichment at these loci using a nonspecific IgG (data not shown). Given the results of our GSEA, we focused on the PRC2 mark H3K27me3. In addition, per the classical models of PcG action in mammals PRC1 activity is preceded by PRC2-mediated deposition of H3K27me3 [18]. Strikingly, we found a substantial loss in the PRC2 mark H3K27me3 (Fig. 6a). Given these findings, we examined the genomic occupancy of Suz12, a core subunit of the PRC2, at these same genomic elements (Fig. 6b). We saw a statistically significant decrease in Suz12 occupancy at four loci with both shRNAs, indicating that the loss of H3K27me3 is associated with reduced occupancy by PRC2. The remaining two loci (HoxD1 and Lefty1) showed a statistically significant reduction in Suz12 occupancy with only one shRNA, but the trend was consistent with the second shRNA. Collectively, these data demonstrate that Wapal depletion is associated with decreased PRC2 occupancy and activity at derepressed genes.

Fig. 6
figure 6

a ChIP-qPCR was performed at six genomic loci (HoxD1, Sox11, Twist1, Snai2, Ppp1r3c, and Lefty1) all derepressed after Wapal depletion with an antibody specific to H3K27me3. Nanog, which is well expressed in ESCs, is used as a negative control. Two additional negative controls (Ctrl 1 and Ctrl 2) are included—genomic regions within gene deserts. Genomic coordinates amplified by each primer set are given in Additional file 15: Table S4. Asterisk indicates a statistically significant difference from input (p value <0.05). b to a, but with an antibody to a core subunit of the PRC2 complex, Suz12. Asterisk indicates a statistically significant difference from input. Plus symbol indicates a statistically significant reduction in Suz12 enrichment from the empty vector control

Wapal is required for proper DNA looping of a distal CRE in the c-Fos locus

Given our findings of derepression of genes marked by PRC2 after Wapal depletion with binding of Wapal to their promoters, we hypothesized that Wapal may assist with recruiting distal CREs into gene promoters to mediate PRC2 recruitment. There is strong evidence that CTCF, which overlaps with 97 % of Wapal-binding sites (Fig. 2b), plays a role in Polycomb targeting. First, CTCF is required for Polycomb-mediated imprinting of the Igf2/H19 locus in murine cells and directly interacts with Suz12 [22, 45]. Second, CTCF plays a central role in mediating developmental regulation of the Hox clusters, which are classically silenced in non-expressed cell types through a PcG-dependent mechanism [4649]. Third, our prior work indicates that CTCF-binding sites can serve as predictors of Polycomb-mediated gene silencing in ESCs [40]. Thus, there is strong evidence in the literature that CTCF either directly recruits the PRC2 complex to silence gene expression or plays an indirect role by mediating higher-order chromatin structures.

To address whether Wapal plays a role in facilitating chromatin interactions, we focused on the c-Fos locus for several reasons. First, it is silenced in pluripotent cells and derepressed after Wapal depletion (Additional file 13: Figure S11a and [50]). Second, approximately 14 kb downstream of the gene is a combined Wapal-/CTCF-binding site, making it the closest genomic site capable of regulating c-Fos expression in-cis. Third is that the c-Fos locus is relatively isolated, with no other protein-coding loci within 100 kb on either side, making it less likely this Wapal-/CTCF-binding site is acting on other nearby genes.

As a first step, we verified that CTCF mRNA levels were unaffected by Wapal depletion (Additional file 13: Figure S11a), which was consistent with our microarray data (Additional file 14: Table S3). In addition, we verified that Wapal depletion resulted in decreased H3K27me3 at the c-Fos promoter (Additional file 13: Figure S11b). We also detected the presence of H3K27me3 at the Wapal-binding site, indicating that PRC2 activity is also present at this site distal to the c-Fos promoter. Surprisingly, Wapal depletion had minimal effect on H3K27me3 levels at this site, indicating that Wapal depletion did not cause global changes in H3K27me3 levels across the entire locus. Collectively, these data indicate that Wapal is required for Polycomb-mediated silencing of c-Fos expression.

To address whether Wapal plays a role in regulating the interaction between this distal CRE and the c-Fos promoter, we utilized chromosomal conformational capture (3C) to directly measure the interaction frequency between the two. An area spanning 26 kb downstream of the c-Fos promoter was probed for interactions using an anchor primer near the TSS. An interaction was observed between the c-Fos promoter and the Wapal/CTCF distal CRE in cells infected with the empty vector, but no interactions were observed with primers flanking the Wapal-binding site (Fig. 7). Wapal depletion by two shRNAs results in decreased interaction between the 14.3 kb distal CRE and c-Fos promoter. Importantly, this result was observed 2 days following Wapal depletion, prior to cells displaying signs of differentiation. This indicates that Wapal is necessary for the distal CRE to interact with the c-Fos promoter. Collectively, these data imply that Wapal directly binds to a distal CRE within the c-Fos locus and assists with its recruitment to the TSS to mediate transcriptional repression through the PRC2 complex. Whether this is through participating in the chromatin looping directly or indirectly by influencing core cohesin subunits remains an open question.

Fig. 7
figure 7

Top panel shows ChIP-seq tracks for PRC1 (H2Aub1, Ring1b), PRC2 (H3K27me3, Suz12, Ezh2), Wapal, CTCF, and BirA only as a negative control. Scale for each track is listed. Primers for 3C are shown below the gene, with the anchor primer labeled. Bottom panel normalized interaction frequency for different primers from cells infected with the empty vector or two different shRNAs to Wapal. Asterisk indicates a statistically significant (p value <0.05) reduction from empty vector


Our work demonstrates that Wapal is critical to ESC pluripotency. This is consistent with a recent report that Wapal / mice die early in embryogenesis [13]. Given our cell-cycle data (Fig. 1e), it appears that mitosis is unaffected 48 h after Wapal depletion and therefore likely not responsible for the loss of pluripotency we observe. The lack of cell-cycle changes in our experiments may result from redundancy by the separase complex, which can mediate sister chromatid separation independent of Wapal [4, 51]. A recent report using heterokaryons indicates that Rad21 deletion may induce pluripotency factor expression and can improve reprogramming efficiency through increased expression of c-Myc [11]. Given the redundancy of the separase complex and the fact we use a depletion rather than genetic deletion strategy, our findings are most consistent with Wapal playing a critical role in pluripotency through modulating gene expression rather than altering the cell cycle. However, given these contradictory findings, further studies to understand Wapal and cohesin’s role in regulating pluripotency are required to separate effects on the cell cycle versus gene expression.

The biology of the cohesin complex has extensively been studied in mitosis, and the role of different subunits for sister chromatid cohesin has been used as a framework to understand their role during transcriptional regulation. Therefore, the canonical function of Wapal (off-loading cohesin) has been used as a model to understand Wapal’s role during interphase. Genetic deletion of Wapal [13] or a dominant-negative Wapal allele [12] increased bulk cohesin interactions with chromatin, as measured by fluorescence recovery after photobleaching (FRAP). The studies described herein used an alternative approach and identified >8000 genomic sites occupied by Wapal in ESCs by ChIP-seq. By overlaying these data with datasets for other core cohesin subunits, we demonstrate that Wapal co-occupies genomic sites with other cohesin proteins. In fact, genomic sites occupied by cohesin subunits with Wapal demonstrate slightly higher cohesin ChIP-tag density than sites occupied without Wapal, and there was minimal change in Rad21 chromatin occupancy after Wapal depletion (Fig. 3). This illustrates that Wapal does not appear to antagonize cohesin binding at these sites. Our data cannot exclude that Wapal affects the kinetics of cohesin occupancy at shared sites, given that we measure steady-state levels in a population of cells by ChIP-qPCR, rather than dynamic association rates which are measured by FRAP. However, we favor a model in which Wapal binds to specific genomic site in conjunction with other cohesin complex members. We favor this model because Wapal depletion phenocopies the biology of core cohesin subunits such as Rad21 or Smc3 in terms of (1) effects on ESC pluripotency after depletion; (2) chromatin occupancy; (3) transcriptome changes; (4) facilitating chromatin loops. At other sites within the genome not specifically occupied by Wapal, it likely off-loads cohesin to prevent chromatin condensation [13]. This highlights that Wapal’s role in regulating gene expression appears similar to core cohesin subunits in our studies.

Our data demonstrate that Wapal plays a critical role in regulating the epigenome, specifically PRC2-mediated H3K27me3 deposition. Whether this ultimately causes Wapal-depleted cells to lose pluripotency remains an unresolved issue. Deletion of either the catalytic subunit (Ezh2) or the core subunit (EED) required for PRC2 stability causes an early embryonic lethal phenotype in mice. However, both Ezh2 / and EED / ESCs are pluripotent even though they lack PRC2 activity [52, 53]. In contrast, total H3K27me3 levels are unchanged after Wapal depletion by RNAi, implying that global PRC2 activity is preserved, a clear distinction with EED / cells that lack H3K27me3. It is more likely that improper genomic localization of PRC2, rather than a complete loss of its activity, results in a loss of pluripotency. Polycomb-silenced genes are typically developmental regulators [54, 55], and therefore, derepression of even a handful of these loci would be sufficient to mediate a loss of pluripotency. For example, AP-1 family members such as c-Fos and c-Jun appear to be negative regulators of pluripotency, and depletion of either can enhance defined factor reprogramming [50].

Most intriguing is the role of Wapal in repression of c-Fos expression, with Wapal depletion associated with decreased H3K27me3 at the promoter and reduced interaction with a CTCF-occupied distal CRE. Given the high degree of overlap between Wapal and CTCF (97 %), we hypothesize two different models to explain how Wapal and CTCF may facilitate Polycomb targeting to effect gene silencing (Fig. 8). In the first (Fig. 8a), CTCF targets the PRC2 complex through a direct physical interaction with Suz12, similar to what occurs at Igf2/H19 [22]. In this model, Wapal and cohesin work together to stabilize the resulting DNA loop, permitting gene silencing. In the second (Fig. 8b), CTCF blocks a distal transcriptional enhancer from accessing the gene promoter. In this model, Polycomb complexes bind to all sites of transcription and non-specifically silence them [21, 56]. Transcriptional enhancers are then required for Polycomb eviction, thereby facilitating gene expression [57]. The role of CTCF in enhancer blocking is well established and dependent on cohesin function [3]. While our data cannot distinguish between the two models, further studies utilizing genomic editing to delete different CREs coupled with genome-wide chromatin capture technologies (such as 4C, [58]) can be used to address this fundamental question.

Fig. 8
figure 8

Two models to explain the role of Wapal and cohesin in regulating PRC2 targeting. a A CTCF-occupied distal CRE is responsible for recruiting PRC2 to mediate gene silencing. The recruitment is through a direct CTCF:Suz12 interaction [22]. In this model, the CTCF site would function as a mammalian PRE. b Polycomb complexes target all sites of transcription, and need to be “evicted” from the TSS by a transcriptional enhancer. CTCF prevents this by “blocking” enhancer access to gene. Cohesin facilitates the required DNA loop to stabilize the CTCF/gene chromatin interaction

Seminal studies in Drosophila regarding the role of cohesin in regulating gene expression and the epigenome demonstrate some similar but also disparate results from our own. In Drosophila, cohesin depletion decreased long-range interactions between a PRE and the inv-en gene complex [42], which is analogous to our results (Fig. 7). In contrast, work from Dr. Dale Dorsett’s laboratory [42, 59, 60] using the Drosophila wing imaginal disk as a model indicates that cohesin plays a fundamental role in sequestering the PRC1 complex at actively transcribed genes, thereby limiting its availability to bind to silenced genes. This sequestration is facilitated by a protein/protein interaction between cohesin and PRC1 in Drosophila [61]. While this exciting model may explain some of our observation, key distinctions between Drosophila and mammals likely prevent a direct comparison of cohesin’s function in regulating gene expression between these species. Specifically, cohesin genomic occupancy in Drosophila and mammals differs significantly [62]. In mammals, Wapal and cohesin predominantly bind chromatin in conjunction with CTCF [3, 63], mediated by a protein/protein interaction between CTCF and Stag2 [33]. In contrast, there are multiple insulator proteins in Drosophila beyond dCTCF (BEAF-32, Zw5, and Su(Hw)), which lack mammalian homologues and currently a paucity of evidence that these proteins interact with cohesin [64]. In fact, the Drosophila protein CP190 plays a critical role in mediating chromatin looping but lacks a mammalian homologue [44, 65]. Thus, it is likely that fundamental differences in the genomic localization of cohesin between Drosophila and mammals during interphase make direct comparisons between these species challenging.


The core cohesin subunits play a critical role in transcriptional regulation by facilitating DNA loops that permit the interaction of distal cis-regulatory regions with gene promoters. However, the role of the cohesin off-loading protein Wapal in regulating gene expression has not been described. We show that Wapal depletion by RNAi induces the differentiation of ESCs, implying that it is required for pluripotency much like other core cohesin subunits. Wapal co-occupies genomic sites distal to genes in conjunction with other core cohesin subunits and CTCF, and at genomic sites it occupies Wapal does not appear to promote cohesin of off-loading. Wapal depletion causes a global derepression of Polycomb-silenced genes, although overall PcG activity remains unchanged. Lastly, Wapal facilitates the interaction between a distal CRE with the c-Fos promoter. Collectively, these data demonstrate that Wapal plays a critical role in PcG-mediated gene silencing.


ESC culture

Gelatin-adapted ESCs (129SVJ-dervied) were cultured as previously [25]. Bright-field images and alkaline phosphatase staining were performed as previously [24].

Protein, RNA isolation, and quantitative PCR

Total RNA and total protein were collected as previously [25, 66]. If needed, RNA was further purified following manufacturer’s protocol (RNeasy Micro, Qiagen). cDNA conversions were performed using the iScript cDNA Synthesis kit (Biorad) according to manufacturer’s protocols. Quantitative real-time PCR (qPCR) was performed as previously [24, 25]. Primers used for reverse transcriptase-qPCR (RT-qPCR), ChIP-qPCR, and 3C are listed in Additional file 15: Table S4. The genomic regions of the primers (3C) or the amplified region (ChIP-qPCR) are indicated in Additional file 15: Table S4 (mm9).

Western blots

Whole-cell extracts were prepared from cells by lysis in 50 mM Tris (pH 8.0), 10 % glycerol, 0.7 % NP-40 substitute (Sigma, 74385), 0.1 mM EDTA, 250 mM NaCl, 50 mM NaF, and 0.1 mM Na3VO4. Protease inhibitors were added to buffer as follows: 1:1000 DTT (Sigma, 646563), 5:1000 PMSF (Sigma, 93482-50ML-F), and 1:1000 Protease Inhibitor Cocktail (Sigma, P8340-5ML). Equal protein (10–50 μg) was run using standard SDS-PAGE separation techniques and blotted. Protein quantification was performed with Image J. All samples were normalized to a loading control (either GAPDH or total histone H3) and an empty vector or normal control.


Smc3 (Abcam, Ab9263), Sall4 (Abcam, Ab29112), Wapal (Abcam, Ab70741), Scc1/Rad21 (Abcam, Ab992), Nanog (Millipore, AB5731), GAPDH-HRP (Santa Cruz, sc-25778), total H3 (Active Motif, 61278), H3K27me3 (Millipore, 07-449), H3K4me3 (Active Motif, 39916), anti-Brachyury (Santa Cruz, sc-17743), anti-v5 (Invitrogen, 46-0708), Streptavidin–HRP (Invitrogen, SA1007), anti-Suz12 (Santa Cruz, sc-46264), anti-Ezh2 (Millipore, 17-662), anti-Ring1b (Western Blot-Active Motif, 39664; ChIP-seq Abcam, Ab101273), anti-H2Aub1 (Cell Signaling, 8240).

Lentiviral generation and RNAi experiments

RNAi/TRC consortium lentiviral constructs were obtained from either Open Biosystems or Sigma-Aldrich. The following constructs were utilized: pLKO.1 is the parental viral vector without shRNA. Sall4 (TRCN0000097821) was used as a positive control and can be rescued by the Sall4b cDNA [24], indicating that the loss of pluripotency is not secondary to off-target effects. Other lentiviruses used are against Wapal (TRCN0000178287, TRCN0000177268) or Smc3 (TRCN0000109007, TRCN0000160440). Lentiviruses were generated and ESCs infected as done previously [25]. Puromycin selection (4 μg/mL) was started after infection and continued daily for 2–6 days to select for cells with a high multiplicity of infection.

Chromatin immunoprecipitation

Antibody-mediated ChIP and biotin-mediated ChIP (bioChIP) were performed similar to our previous work [25]. Primers used for ChIP-qPCR are indicated in Additional file 15: Table S4. Antibodies used are as above. A modified version of [67] was used for antibody-mediated ChIP on histones. Briefly, cells were cross-linked for 5 min at room temperature with 1 % fresh formaldehyde. Reaction was quenched with glycine (125 mM final concentration) for 5 min at room temperature. Cells were lysed in 1 % SDS lysis buffer (1 % SDS, 10 mM EDTA, 50 mM Tris–HCl pH 8.0, protease inhibitors). Chromatin was sonicated to a mean size of 300–500 bp. Lysed cells were cleared by centrifugation and supernatant diluted 1:20 with ChIP dilution buffer (0.01 % SDS, 1.1 % Triton X-100, 1.2 mM EDTA, 16.7 mM Tris–HCl, pH8.0, 167 mM NaCl, protease inhibitors). In total, 2–4 μg of antibodies were added and incubated overnight with constant rotation at 4 °C. Washed Dynal Protein A beads (Invitrogen) were added and incubated at 4 °C for an additional 90 min. Beads were isolated with a magnet and then washed with ChIP dilution buffer, low-salt buffer (0.1 % SDS, 1 % Triton X-100, 2 mM EDTA, 20 mM Tris–HCl pH 8.0, NaCl 150 mM), LiCl wash (250 mM LiCl, 0.5 % NP-40, 0.5 % deoxycholate, 1 mM EDTA, 10 mM Tris–HCl pH 8.0), and TE. DNA was eluted and de-cross-linked overnight at 65 °C in SDS elution buffer (1 % SDS, 10 mM EDTA, 50 mM Tris–HCl pH 8.0). The next day RNase A and proteinase K were added, and the DNA was precipitated in the presence of glycogen. DNA for downstream applications was quantitated by fluorometry (Qubit, Invitrogen), and equal DNA was used for each ChIP-qPCR and normalized to sheared, non-IP’d genomic DNA (input).

Plasmid and cell line generation

The Wapal open reading frame was obtained from Open Biosystems and used as a template to perform site-directed mutagenesis to disrupt the third base pair of each codon within the shRNA complementary region, thus making it immune to the shRNA without altering the protein-coding sequence. A c-terminal v5 epitope tag was added to aid in detection. The cDNA was then cloned into the pPyCAG iH vector ([68], hygromycin resistance) for expression under a ubiquitous (CAG) promoter. CJ9 ESCs were electroporated with the linearized plasmid in the presence of hygromycin and individual clones isolated, as we have done previously [24]. All constructs were fully sequenced to ensure the coding sequence was correct. Expression of a tagged protein of the appropriate size was confirmed for each clone. The Wapal cDNA was cloned into the Flag/Bio expression vector and electroporated into BirA-expressing ESCs as we have done previously [24]. Individual colonies were expanded, and expression of a tagged, biotin-containing band of the correct size was verified by Western Blot with streptavidin and IP with streptavidin and blotting with anti-Wapal.

Generation of Oct4:IRES:EGFP reporter ESC line

To generate the Oct4:IRES:EGFP reporter line used in Additional file 1: Figure S1A, a published guide RNA (gRNA) and plasmid were used to target CJ9 ESCs [27] and insert an IRES:EGFP cassette into the 3′ UTR of Oct4. Briefly, a guide RNA specific to the 3′ UTR of the Oct4 locus (CTCAGTGATGCTGTTGATC) was cloned into the Cas9-expressing vector px459 [69]. A homology donor repair (HDR) vector containing an IRES:EGFP cassette flanked by homology regions from Oct4 was obtained from Addgene (plasmid #48681) and co-transfected along with the guide RNA into 1–2 × 106 wild-type ESCs. Transfected cells were selected with puromycin (2 μg/mL first 2 days only) and G418 (500 μg/mL daily) until single colonies appeared. Individual clones were isolated and cells that expressed GFP only in the pluripotent state were isolated, expanded, and used for subsequent experiments.


Biotin-mediated Chromatin IP (bioChIP) was performed similar to previously [24, 25]. Briefly, 10–20 ng of ChIP DNA from two independently derived Flag/Bio Wapal clones, and control cells (expressing BirA) alone, along with an equal amount of sheared genomic DNA (input), were used for library generation (NEXTflex Chip-seq kit, Bioo Scientific) according to manufacturer’s protocol. Samples were sequenced on an Illumina HiSeq 2000 at Beijing Genomics Institute and obtained at least 50 million aligned reads per sample. For ChIP-seq on Wapal-depleted cells, a similar approach was used but cells were cross-linked and chromatin prepared 48 h post-infection. Library preparation was performed with NEB E7645 and sequenced on an Illumina NextSeq 500, 75 cycles, single end. The first 5 bp was trimmed from all reads due to lower Q30 scores, and then, the next 49 bp was used for alignment to recreate the analysis pipeline used for Wapal. After adaptor removal, reads were aligned to a reference genome (mm9) using Bowtie [70] with the following parameters (-e 70 -n 2 -k 1 -m 1), and then, peak calling was performed with MACS [71] using the default parameters (p value of 10−5). For each sample (input, BirA ChIP, Wapal ChIP #1 and #2), at least 50 million aligned reads were used for peak calling, and peak calling was performed by comparing experimental samples to input as a control (i.e., two-sided peak calling). Annotation of peaks was done using CisGenome [72]. Published datasets were obtained from the GEO Omnibus and are listed in Additional file 16: Table S2 for the following ChIP-seq datasets (Smc1a, Smc3, Rad21, CTCF, H3K27me3, Nanog, Stag1, Stag2, H2Aub1, Ring1b, Ezh2, and Suz12). All downloaded SRA-type files were analyzed using the same parameters as above. A file in.bed format with all Wapal peaks regions is included in Additional file 8: Table S1. A total of 266 genomic sites displayed significant biotinylation in the BirA-only-expressing cells, likely secondary to biotinylated histones, and are also included in Additional file 8: Table S1. We subtracted these regions from our subsequent analysis and identified a total of 8296 Wapal-binding sites between the two datasets. ChIP-seq tag pileup files were normalized using Haystack [73] when comparisons were made between different wiggle files.

Transcriptome analysis

Total RNA was extracted using Trizol from ESCs infected with either empty vector (pLKO.1) or two, independent shRNAs to Wapal or Smc3. Two biological replicates for each shRNA or empty vector were used. Probes were generated and hybridized to Affymetrix Mouse Genome 430 2.0 arrays as previously [24]. CEL files were normalized using RMA [74] and subsequently processed using GenePattern [75] and GSEA [35] similar to our previous work using default parameters [24] unless otherwise indicated. For our initial GSEA, we used the C2:CGP group of datasets (v5) available through the Molecular Signature Database (MSigDB;

Cell-cycle analysis

ESCs were trypsinized, counted, and washed with PBS-/-. In 100 μL of PBS-/-, 0.5–1 × 106 cells were resuspended, and 1.9 mL of cold 70 % ethanol was added in a dropwise fashion to fix cells. Cells were incubated at −20 °C for 30 min, spun down, and RNase digested at 37 °C for 30 min. Cells were stained with propidium iodide (PI, final concentration 10 μg/mL) for at least 5 min. Flow cytometry was performed and data was analyzed with FlowJo.

Chromosome conformation capture (3C): modified from [76]

Library preparation

For each library generated, approximately 15 × 106 mESC were fixed in a 1 % formaldehyde/10 % FBS solution for 10 min. Nuclei were isolated and digested with 1000U HaeIII (NEB R0108M) overnight at 37 °C while shaking. DNA was ligated with 7000 units of T4 DNA ligase (NEB M0202M) for 5 h at 16 °C. Samples were then de-cross-linked overnight at 65 °C. Following two phenol/chloroform extractions, the library was precipitated with a standard ethanol/sodium acetate protocol.

BAC preparation

Twenty micrograms of BAC RP23206I6 (CHORI) was digested with 1250U HaeIII overnight at 37 °C while shaking. Digested BAC DNA was phenol/chloroform extracted then ethanol precipitated. BAC was then ligated with 7000 units of T4 DNA ligase overnight in a 16 °C water bath. Ligated BAC template was phenol/chloroform extracted and ethanol precipitated. Template was resuspended in TE.

Determination of interaction frequency

Template and BAC loading were normalized by subtracting the CT of the internal primers from the CT of the test primers yielding ∆CT. The ∆CT from the BAC was then subtracted from ∆CT of the template to remove random interactions and PCR biases from the test primers. Interaction frequency is then determined by the value of 2^−(∆CT of template - ∆CT of BAC).

Statistical analysis

Unless otherwise stated, t test comparisons were made and p values <0.05 were considered significantly different. All error bars are SEM derived from three experiments unless specifically stated otherwise.

Genome-wide datasets

All microarray and ChIP-seq data are available in GEO Omnibus (GSE63325).



embryonic stem cell


chromatin immunoprecipitation coupled with next-generation sequencing


Polycomb group


acute myeloid leukemia


Polycomb repressive complex 1


Polycomb repressive complex 2


cis-regulatory elements


Cornelia de Lange Syndrome


transcription factor


RNA interference


Polycomb response element


short hairpin RNAs


internal ribosomal entry site


enhanced green fluorescent protein


mean fluorescent intensity


propidium iodide




reverse transcription


quantitative PCR


gene set enrichment analysis


ubiquitination of histone 2A


trimethylation of histone 3 on lysine 27


transcriptional start site


chromosomal conformational capture


alkaline phosphatase


multiplicity of infection


normalized enrichment scores


false discovery rate




  1. Dorsett D. Cohesin: genomic insights into controlling gene transcription and development. Curr Opin Genet Dev. 2011;21:199–206.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  2. Kagey MH, Newman JJ, Bilodeau S, Zhan Y, Orlando DA, van Berkum NL, Ebmeier CC, Goossens J, Rahl PB, Levine SS, Taatjes DJ, Dekker J, Young RA. Mediator and cohesin connect gene expression and chromatin architecture. Nature. 2010;467:430–5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. Wendt KS, Yoshida K, Itoh T, Bando M, Koch B, Schirghuber E, Tsutsumi S, Nagae G, Ishihara K, Mishiro T, Yahata K, Imamoto F, Aburatani H, Nakao M, Imamoto N, Maeshima K, Shirahige K, Peters J-M. Cohesin mediates transcriptional insulation by CCCTC-binding factor. Nature. 2008;451:796–801.

    Article  CAS  PubMed  Google Scholar 

  4. Nasmyth K, Haering CH. Cohesin: its roles and mechanisms. Annu Rev Genet. 2009;43:525–58.

    Article  CAS  PubMed  Google Scholar 

  5. Kueng S, Hegemann B, Peters B, Lipp J, Schleiffer A, Mechtler K, Peters J. Wapl controls the dynamic association of cohesin with chromatin. Cell. 2006;127:955–67.

    Article  CAS  PubMed  Google Scholar 

  6. Shintomi K, Hirano T. Releasing cohesin from chromosome arms in early mitosis: opposing actions of Wapl–Pds5 and Sgo1. Genes Dev. 2009;23:2224–36.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Seitan VC, Hao B, Tachibana-Konwalski K, Lavagnolli T, Mira-Bontenbal H, Brown KE, Teng G, Carroll T, Terry A, Horan K, Marks H, Adams DJ, Schatz DG, Aragon L, Fisher AG, Krangel MS, Nasmyth K, Merkenschlager M. A role for cohesin in T-cell-receptor rearrangement and thymocyte differentiation. Nature. 2011;476:467–71.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Nitzsche A, Paszkowski-Rogacz M, Matarese F, Janssen-Megens EM, Hubner NC, Schulz H, De Vries I, Ding L, Huebner N, Mann M, Stunnenberg HG, Buchholz F. RAD21 cooperates with pluripotency transcription factors in the maintenance of embryonic stem cell identity. PLoS One. 2011;6:e19470.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Dowen JM, Fan ZP, Hnisz D, Ren G, Abraham BJ, Zhang LN, Weintraub AS, Schuijers J, Lee TI, Zhao K, Young RA. Control of cell identity genes occurs in insulated neighborhoods in mammalian chromosomes. Cell. 2014;159:374–87.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Dowen JM, Bilodeau S, Orlando DA, Hübner MR, Abraham BJ, Spector DL, Young RA. Multiple structural maintenance of chromosome complexes at transcriptional regulatory elements. Stem Cell Rep. 2013;1:371–8.

    Article  CAS  Google Scholar 

  11. Lavagnolli T, Gupta P, Hörmanseder E, Mira-Bontenbal H, Dharmalingam G, Carroll T, Gurdon JB, Fisher AG, Merkenschlager M. Initiation and maintenance of pluripotency gene expression in the absence of cohesin. Genes Dev. 2015;29:23–38.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Cunningham MD, Gause M, Cheng Y, Noyes A, Dorsett D, Kennison JA, Kassis JA. Wapl antagonizes cohesin binding and promotes Polycomb-group silencing in Drosophila. Development. 2012;139:4172–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Tedeschi A, Wutz G, Huet S, Jaritz M, Wuensche A, Schirghuber E, Davidson IF, Tang W, Cisneros DA, Bhaskara V, Nishiyama T, Vaziri A, Wutz A, Ellenberg J, Peters J-M. Wapl is an essential regulator of chromatin structure and chromosome segregation. Nature. 2013;501:564–8.

    Article  CAS  PubMed  Google Scholar 

  14. Boyle MI, Jespersgaard C, Brøndum-Nielsen K, Bisgaard AM, Tümer Z. Cornelia de Lange syndrome. Clin Genet. 2015;88:1–12.

    Article  CAS  PubMed  Google Scholar 

  15. Liu J, Krantz ID. Cornelia de Lange syndrome, cohesin, and beyond. Clin Genet. 2009;76:303–14.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. Cancer Genome Atlas Research Network. Genomic and epigenomic landscapes of adult de novo acute myeloid leukemia. N Engl J Med. 2013;368:2059–74.

    Article  Google Scholar 

  17. Thota S, Viny AD, Makishima H, Spitzer B, Radivoyevitch T, Przychodzen B, Sekeres MA, Levine RL, Maciejewski JP. Genetic alterations of the cohesin complex genes in myeloid malignancies. Blood. 2014;124:1790–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Simon JA, Kingston RE. Occupying chromatin: polycomb mechanisms for getting to genomic targets, stopping transcriptional traffic, and staying put. Mol Cell. 2013;49:808–24.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Woo CJ, Kharchenko PV, Daheron L, Park PJ, Kingston RE. A region of the human HOXD cluster that confers polycomb-group responsiveness. Cell. 2010;140:99–110.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Zhao J, Ohsumi TK, Kung JT, Ogawa Y, Grau DJ, Sarma K, Song JJ, Kingston RE, Borowsky M, Lee JT. Genome-wide identification of polycomb-associated RNAs by RIP-seq. Mol Cell. 2010;40:939–53.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Kaneko S, Son J, Shen SS, Reinberg D, Bonasio R. PRC2 binds active promoters and contacts nascent RNAs in embryonic stem cells. Nat Struct Mol Biol. 2013;20:1258–64.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Li T, Hu J-F, Qiu X, Ling J, Chen H, Wang S, Hou A, Vu TH, Hoffman AR. CTCF regulates allelic expression of Igf2 by orchestrating a promoter-polycomb repressive complex 2 intrachromosomal loop. Mol Cell Biol. 2008;28:6473–82.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Deng Z, Wang Z, Stong N, Plasschaert R, Moczan A, Chen H-S, Hu S, Wikramasinghe P, Davuluri RV, Bartolomei MS, Riethman H, Lieberman PM. A role for CTCF and cohesin in subtelomere chromatin organization, TERRA transcription, and telomere end protection. EMBO J. 2012;31:4165–78.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Rao S, Zhen S, Roumiantsev S, McDonald LT, Yuan G-C, Orkin SH. Differential roles of Sall4 isoforms in embryonic stem cell pluripotency. Mol Cell Biol. 2010;30:5364–80.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Pulakanti K, Pinello L, Stelloh C, Blinka S, Allred J, Milanovich S, Kiblawi S, Peterson J, Wang A, Yuan G-C, Rao S. Enhancer transcribed RNAs arise from hypomethylated, Tet-occupied genomic regions. Epigenetics. 2013;8:1303–20.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Lengner C, Camargo F, Hochedlinger K, Welstead G, Zaidi S, Gokhale S, Scholer H, Tomilin A, Jaenisch R. Oct4 expression is not required for mouse somatic stem cell self-renewal. Cell Stem Cell. 2007;1:403–15.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Yang H, Wang H, Shivalila CS, Cheng AW, Shi L, Jaenisch R. One-step generation of mice carrying reporter and conditional alleles by CRISPR/Cas-mediated genome engineering. Cell. 2013;154:1370–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  28. Landt SG, Marinov GK, Kundaje A, Kheradpour P, Pauli F, Batzoglou S, Bernstein BE, Bickel P, Brown JB, Cayting P, Chen Y, DeSalvo G, Epstein C, Fisher-Aylor KI, Euskirchen G, Gerstein M, Gertz J, Hartemink AJ, Hoffman MM, Iyer VR, Jung YL, Karmakar S, Kellis M, Kharchenko PV, Li Q, Liu T, Liu XS, Ma L, Milosavljevic A, Myers RM, et al. ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia. Genome Res. 2012;22:1813–31.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Wang J, Rao S, Chu J, Shen X, Levasseur DN, Theunissen TW, Orkin SH. A protein interaction network for pluripotency of embryonic stem cells. Nature. 2006;444:364–8.

    Article  CAS  PubMed  Google Scholar 

  30. Wu N, Kong X, Ji Z, Zeng W, Potts PR, Yokomori K, Yu H. Scc1 sumoylation by Mms21 promotes sister chromatid recombination through counteracting Wapl. Genes Dev. 2012;26:1473–85.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  31. Ong C-T, Corces VG. CTCF: an architectural protein bridging genome topology and function. Nat Rev Genet. 2014;15:234–46.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Phillips JE, Corces VG. CTCF: master weaver of the genome. Cell. 2009;137:1194–211.

    Article  PubMed  PubMed Central  Google Scholar 

  33. Xiao T, Wallace J, Felsenfeld G. Specific sites in the C terminus of CTCF interact with the SA2 subunit of the cohesin complex and are required for cohesin-dependent insulation activity. Mol Cell Biol. 2011;31:2174–83.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Loh Y-H, Wu Q, Chew J-L, Vega VB, Zhang W, Chen X, Bourque G, George J, Leong B, Liu J, Wong K-Y, Sung KW, Lee CWH, Zhao X-D, Chiu K-P, Lipovich L, Kuznetsov VA, Robson P, Stanton LW, Wei C-L, Ruan Y, Lim B, Ng H-H. The Oct4 and Nanog transcription network regulates pluripotency in mouse embryonic stem cells. Nat Genet. 2006;38:431–40.

    Article  CAS  PubMed  Google Scholar 

  35. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA. 2005;102:15545–50.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Ben-Porath I, Thomson MW, Carey VJ, Ge R, Bell GW, Regev A, Weinberg RA. An embryonic stem cell-like gene expression signature in poorly differentiated aggressive human tumors. Nat Genet. 2008;40:499–507.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Mikkelsen TS, Ku M, Jaffe DB, Issac B, Lieberman E, Giannoukos G, Alvarez P, Brockman W, Kim T-K, Koche RP, Lee W, Mendenhall E, O’Donovan A, Presser A, Russ C, Xie X, Meissner A, Wernig M, Jaenisch R, Nusbaum C, Lander ES, Bernstein BE. Genome-wide maps of chromatin state in pluripotent and lineage-committed cells. Nature. 2007;448:553–60.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  38. Wong DJ, Liu H, Ridky TW, Cassarino D, Segal E, Chang HY. Module map of stem cell genes guides creation of epithelial cancer stem cells. Cell Stem Cell. 2008;2:333–44.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Brookes E, de Santiago I, Hebenstreit D, Morris KJ, Carroll T, Xie SQ, Stock JK, Heidemann M, Eick D, Nozaki N, Kimura H, Ragoussis J, Teichmann SA, Pombo A. Polycomb associates genome-wide with a specific RNA polymerase II variant, and regulates metabolic genes in ESCs. Cell Stem Cell. 2012;10:157–70.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Liu Y, Shao Z, Yuan G-C. Prediction of Polycomb target genes in mouse embryonic stem cells. Genomics. 2010;96:17–26.

    Article  CAS  PubMed  Google Scholar 

  41. Wamstad JA, Alexander JM, Truty RM, Shrikumar A, Li F, Eilertson KE, Ding H, Wylie JN, Pico AR, Capra JA, Erwin G, Kattman SJ, Keller GM, Srivastava D, Levine SS, Pollard KS, Holloway AK, Boyer LA, Bruneau BG. Dynamic and coordinated epigenetic regulation of developmental transitions in the cardiac lineage. Cell. 2012;151:206–20.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  42. Schaaf CA, Misulovin Z, Gause M, Koenig A, Gohara DW, Watson A, Dorsett D. Cohesin and Polycomb proteins functionally interact to control transcription at silenced and active genes. PLoS Genet. 2013;9:e1003560.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Vasanthi D, Nagabhushan A, Matharu NK, Mishra RK. A functionally conserved Polycomb response element from mouse HoxD complex responds to heterochromatin factors. Sci Rep. 2013;3:3011.

    Article  PubMed  PubMed Central  Google Scholar 

  44. Liang J, Lacroix L, Gamot A, Cuddapah S, Queille S, Lhoumaud P, Lepetit P, Martin PGP, Vogelmann J, Court F, Hennion M, Micas G, Urbach S, Bouchez O, Nöllmann M, Zhao K, Emberly E, Cuvier O. Chromatin immunoprecipitation indirect peaks highlight long-range interactions of insulator proteins and Pol II pausing. Mol Cell. 2014;53:672–81.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. Lin S, Ferguson-Smith AC, Schultz RM, Bartolomei MS. Nonallelic transcriptional roles of CTCF and cohesins at imprinted loci. Mol Cell Biol. 2011;31:3094–104.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  46. Xu M, Zhao G-N, Lv X, Liu G, Wang LY, Hao D-L, Wang J, Liu D-P, Liang C-C. CTCF controls HOXA cluster silencing and mediates PRC2-repressive higher-order chromatin structure in NT2/D1 cells. Mol Cell Biol. 2014;34:3867–79.

    Article  PubMed  PubMed Central  Google Scholar 

  47. Narendra V, Rocha PP, An D, Raviram R, Skok JA, Mazzoni EO, Reinberg D. Transcription. CTCF establishes discrete functional chromatin domains at the Hox clusters during differentiation. Science. 2015;347:1017–21.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  48. Noordermeer D, Leleu M, Splinter E, Rougemont J, De Laat W, Duboule D. The dynamic architecture of Hox gene clusters. Science. 2011;334:222–5.

    Article  CAS  PubMed  Google Scholar 

  49. Noordermeer D, Leleu M, Schorderet P, Joye E, Chabaud F, Duboule D. Temporal dynamics and developmental memory of 3D chromatin architecture at Hox gene loci. Elife. 2014;3:e02557.

    Article  PubMed  PubMed Central  Google Scholar 

  50. Liu J, Han Q, Peng T, Peng M, Wei B, Li D, Wang X, Yu S, Yang J, Cao S, Huang K, Hutchins AP, Liu H, Kuang J, Zhou Z, Chen J, Wu H, Guo L, Chen Y, Chen Y, Li X, Wu H, Liao B, He W, Song H, Yao H, Pan G, Chen J, Pei D. The oncogene c-Jun impedes somatic cell reprogramming. Nat Cell Biol. 2015;17:856–67.

    Article  CAS  PubMed  Google Scholar 

  51. Nakajima M, Kumada K, Hatakeyama K, Noda T, Peters J-M, Hirota T. The complete removal of cohesin from chromosome arms depends on separase. J Cell Sci. 2007;120:4188–96.

    Article  CAS  PubMed  Google Scholar 

  52. Shen X, Kim W, Fujiwara Y, Simon MD, Liu Y, Mysliwiec MR, Yuan G-C, Lee Y, Orkin SH. Jumonji modulates Polycomb activity and self-renewal versus differentiation of stem cells. Cell. 2009;139:1303–14.

    Article  PubMed  PubMed Central  Google Scholar 

  53. Denholtz M, Bonora G, Chronis C, Splinter E, De Laat W, Ernst J, Pellegrini M, Plath K. Long-range chromatin contacts in embryonic stem cells reveal a role for pluripotency factors and polycomb proteins in genome organization. Cell Stem Cell. 2013;13:602–16.

    Article  CAS  PubMed  Google Scholar 

  54. Bernstein BE, Mikkelsen TS, Xie X, Kamal M, Huebert DJ, Cuff J, Fry B, Meissner A, Wernig M, Plath K, Jaenisch R, Wagschal A, Feil R, Schreiber SL, Lander ES. A bivalent chromatin structure marks key developmental genes in embryonic stem cells. Cell. 2006;125:315–26.

    Article  CAS  PubMed  Google Scholar 

  55. Ku M, Koche RP, Rheinbay E, Mendenhall EM, Endoh M, Mikkelsen TS, Presser A, Nusbaum C, Xie X, Chi AS, Adli M, Kasif S, Ptaszek LM, Cowan CA, Lander ES, Koseki H, Bernstein BE. Genomewide analysis of PRC1 and PRC2 occupancy identifies two classes of bivalent domains. PLoS Genet. 2008;4:e1000242.

    Article  PubMed  PubMed Central  Google Scholar 

  56. Kaneko S, Son J, Bonasio R, Shen SS. Nascent RNA interaction keeps PRC2 activity poised and in check. Genes Dev. 2014;28:1983–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  57. Vernimmen D, Lynch MD, De Gobbi M, Garrick D, Sharpe JA, Sloane-Stanley JA, Smith AJH, Higgs DR. Polycomb eviction as a new distant enhancer function. Genes Dev. 2011;25:1583–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  58. van de Werken HJG, de Vree PJP, Splinter E, Holwerda SJB, Klous P, de Wit E, De Laat W. 4C technology: protocols and data analysis. Methods Enzymol. 2012;513:89–112.

    Article  PubMed  Google Scholar 

  59. Dorsett D, Kassis JA. Checks and balances between cohesin minireview and Polycomb in gene silencing and transcription. Curr Biol. 2014;24:R535–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  60. Dorsett D, Merkenschlager M. Cohesin at active genes: a unifying theme for cohesin and gene expression from model organisms to humans. Curr Opin Cell Biol. 2013;25:327–33.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  61. Strübbe G, Popp C, Schmidt A, Pauli A, Ringrose L, Beisel C, Paro R. Polycomb purification by in vivo biotinylation tagging reveals cohesin and Trithorax group proteins as interaction partners. Proc Natl Acad Sci. 2011;108:5572–7.

    Article  PubMed  PubMed Central  Google Scholar 

  62. Peric-Hupkes D, van Steensel B. Linking cohesin to gene regulation. Cell. 2008;132:925–8.

    Article  CAS  PubMed  Google Scholar 

  63. Parelho V, Hadjur S, Spivakov M, Leleu M, Sauer S, Gregson HC, Jarmuz A, Canzonetta C, Webster Z, Nesterova T, Cobb BS, Yokomori K, Dillon N, Aragon L, Fisher AG, Merkenschlager M. Cohesins functionally associate with CTCF on mammalian chromosome arms. Cell. 2008;132:422–33.

    Article  CAS  PubMed  Google Scholar 

  64. Karch F. In vivo studies of the Drosophila insulator factor CTCF reach a Catch 22. BMC Biol. 2015;13:1–3.

    Article  Google Scholar 

  65. Schwartz YB, Linder-Basso D, Kharchenko PV, Tolstorukov MY, Kim M, Li H-B, Gorchakov AA, Minoda A, Shanower G, Alekseyenko AA, Riddle NC, Jung YL, Gu T, Plachetka A, Elgin SCR, Kuroda MI, Park PJ, Savitsky M, Karpen GH, Pirrotta V. Nature and function of insulator protein binding sites in the Drosophila genome. Genome Res. 2012;22:2188–98.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  66. Milanovich S, Peterson J, Allred J, Stelloh C, Rajasekaran K, Fisher J, Duncan SA, Malarkannan S, Rao S. Sall4 overexpression blocks murine hematopoiesis in a dose-dependent manner. Exp Hematol. 2015;43(53–64):e1–8.

    PubMed  Google Scholar 

  67. Nelson JD, Denisenko O, Bomsztyk K. Protocol for the fast chromatin immunoprecipitation (ChIP) method. Nat Protoc. 2006;1:179–85.

    Article  CAS  PubMed  Google Scholar 

  68. Aubert J, Dunstan H, Chambers I, Smith A. Functional gene screening in embryonic stem cells implicates Wnt antagonism in neural differentiation. Nat Biotechnol. 2002;20:1240–5.

    Article  CAS  PubMed  Google Scholar 

  69. Cong L, Ran FA, Cox D, Lin S, Barretto R, Habib N, Hsu PD, Wu X, Jiang W, Marraffini LA, Zhang F. Multiplex genome engineering using CRISPR/Cas systems. Science. 2013;339:819–23.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  70. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9:357–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  71. Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nussbaum C, Myers RM, Brown M, Li W, Liu XS. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 2008;9:R137.

    Article  PubMed  PubMed Central  Google Scholar 

  72. Ji H, Jiang H, Ma W, Johnson DS, Myers RM, Wong WH. An integrated software system for analyzing ChIP-chip and ChIP-seq data. Nat Biotechnol. 2008;26:1293–300.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  73. Pinello L, Xu J, Orkin SH, Yuan G-C. Analysis of chromatin-state plasticity identifies cell-type-specific regulators of H3K27me3 patterns. Proc Natl Acad Sci. 2014;111:E344–53.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  74. Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U, Speed TP. Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics. 2003;4:249–64.

    Article  PubMed  Google Scholar 

  75. Reich M, Liefeld T, Gould J, Lerner J, Tamayo P, Mesirov JP. GenePattern 2.0. Nat Genet. 2006;38:500–1.

    Article  CAS  PubMed  Google Scholar 

  76. Hagège H, Klous P, Braem C, Splinter E, Dekker J, Cathala G, De Laat W, Forné T. Quantitative analysis of chromosome conformation capture assays (3C-qPCR). Nat Protoc. 2007;2:1722–33.

    Article  PubMed  Google Scholar 

Download references

Authors’ contributions

CS and MR performed the bulk of the experiments. JP, S. Roumiantsev, SM, and SB performed specific experiments. KP performed the bulk of the computational analyses presented. SJ, MJH, LP, and SR performed the additional analyses. GCY provided critical assistance with the development and design of many of the experiments and computational analyses. S. Rao conceived the study and participated in the design and analysis and performed all the transcriptome analyses shown, as well as writing the manuscript. All authors read and approved the final manuscript.


This work was supported in part by funding from NHBLI (HL087951), a pilot grant from the American Cancer Society (Institutional Research Grant #86-004), the Children’s Research Institute, the Midwest Athletes against Childhood Cancer, and Hyundai Hope on Wheels, and generous philanthropic support from Mr. Douglas Ziegler, all to S. Rao.

Competing interests

The authors declare that they have no competing interests.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Sridhar Rao.

Additional information

Cary Stelloh and Michael H. Reimer Jr. have contributed equally to this work

Additional files


Additional file 1: Figure S1. A) Cells expressing EGFP under the control of the Oct4 locus were generated (see Materials and Methods section for details). GFP mean fluorescence intensity (MFI) was measured 2, 4, and 6 days post-infection. All data are normalized to the GFP MFI of cells infected with the empty vector on day 0, displayed on the y-axis. B) Wild-type (WT) or cells expressing a full-length Wapal cDNA immune to shRNA #2 were infected and bright field (10x, top panel) or alkaline phosphatase stained (4x, bottom panel) are shown 6 days post-infection. Domain structure of Wapal is shown to right of images. C) mRNA levels of different cohesin (Wapal and Smc3) and pluripotency (Nanog, Oct4) are shown. A linear scale of relative expression (compared to WT cells infected with the empty vector) is shown on the y-axis. * indicates a statistically significant reduction from the WT+empty vector control (p value<0.05). D) mRNA levels of differentiation markers. A Log10 scale (left) or linear (right) of relative expression (compared to WT cells infected with the empty vector) is shown on the y-axis. * indicates a statistically significant increase from the WT+empty vector control (p value<0.05). + indicates a statistically significant reduction in expression from WT+Wapal shRNA #2 (p-value<0.05). E) Western blots on cells six days after infection. Antibodies used are indicated on the right. E=empty vector, W= Wapal shRNA #2, WT= wild-type.


Additional file 2: Figure S2. A)Western blot of either the parental (BirA) or cells expressing Flag/biotinylated (FB) version of Nanog or Wapal. Streptavidin HRP specifically recognizes the biotinylated versions of the proteins. * indicates nonspecific bands. B) Overlap between the ChIP-seq datasets derived from BirA cells and the two experimental cell lines. C) Five different Wapal sites identified by ChIP-seq were confirmed by ChIP-qPCR. All FB Wapal samples yielded statistically significant (p value <0.05) enrichment compared to input.


Additional file 3: Figure S3. IGV screen captures of the c-Myc locus occupied by multiple members of the cohesin complex, including Wapal. Other loci tested in Figure S2c are shown in other figures (see below). For each graph, the genomic coordinates are shown below the x-axis. The y-axis represents the # of ChIP-seq tags recovered for a given genomic bin. The range for each track is displayed to the right of track name. A lower threshold of 2 was set for all tracks to minimize background. In all cases, the y-axis for BirA and Wapal is the same to demonstrate binding specificity.


Additional file 4: Figure S4. The ChIP-seq tag densities of Smc3 or Smc1a were compared at genomic sites occupied with (Red) or without (Green) Wapal in a 2.5kb window around peak center. ChIP-seq tag densities for each Smc3 or Smc1a binding site within the genome are visualized as individual rows, with the color scale indicating a linear gradient from 0 to maximal tag values. Two different plots were generated based upon the binding with (left) or without (right) Wapal. Row order is from lowest binding to highest binding. # of sites used for each plot is indicated below.


Additional file 5: Figure S5. A) Bright-field images of ESCs 48 hours post-infection with the empty vector or two shRNAs to Wapal. B) RT-qPCR to measure transcript levels 48hrs after Wapal depletion with two shRNAs. * indicates a statistically significant difference from empty vector (p value<0.05). C) Two gene sets recovered from gene set enrichment analysis that showed negative enrichment in the Wapal-depleted cells. NES=normalized enrichment scores, negative score indicates enrichment in the empty vector sample. D) Two independently derived lists of Polycomb target genes were used as gene sets and displayed significant enrichment with positive NES scores in Wapal-depleted samples, indicating that these genes are derepressed after Wapal depletion. E) Similar to (D), but using genes identified as PRC1 marked (H2Aub1) or PRC2 marked (H3K27me3) in their promoter.

Additional file 6: Table S5. All statistically significant altered gene sets from GSEA analysis after Wapal depletion.


Additional file 7: Figure S6. A) The normalized ChIP-seq tag densities of Ring1b were compared at PcG-marked genes in cells infected with the empty vector (Black) or two separate shRNAs to Wapal (Red). X-axis is the distance in bp around TSS, and y-axis is the normalized tag #. Heat maps are similar to S4. A total of 1,455 PcG-marked genes were used for these analyses. B) Ring1b binding before (Black) or after Wapal depletion (Red) was measured at 1455 genes (same # as in A), which were either expressed at low (left) or high (right) levels. C) Similar to B, but genes where went down (left) or up (right) after depletion of Nanog or Oct4 in ESCs are shown.

Additional file 8: Table S1. Genomic sites occupied by Wapal.


Additional file 9: Figure S7. Both (A) and (B) are similar to Figure 5, but only a subset of genes is used. A) 384 Polycomb-marked genes derepressed after Wapal depletion were used to measure the number of ChIP-seq tags recovered for either Wapal or the negative control BirA. B) For 384 genes, the relative binding at each gene is shown for Wapal and BirA alone samples. C) Smc3 binding at 1,455 PcG-marked genes. Smc3 control is an input sample. D) A total of 1,455 PcG-marked genes were used to measure number of ChIP-seq tags recovered for Smc3.


Additional file 10: Figure S8. A) Log2 fold change in expression of six Polycomb-marked genes derepressed after Wapal depletion with two different shRNAs. Color scheme shown to right. B) IGV screen captures of two loci (Snai2 and Ppp1r3c) in which a combined Wapal-/CTCF-binding site is >2kb from the TSS, but also illustrates increased binding of Wapal around the TSS. Other aspects are similar to Figure S3.


Additional file 11: Figure S9. Similar to (S8b), but for additional loci. A clear peak of Wapal binding is not present within 10kb of these four derepressed genes.


Additional file 12: Figure S10. Protein levels after Wapal depletion of various histone marks and core components of the PRC1 and PRC2 complexes. Total H3 is used as a loading control. Quantification is described within Materials and Methods.


Additional file 13: Figure S11. A) mRNA expression of Smc3, CTCF, and c-Fos 48 hours after Wapal depletion is shown. * indicates a statistically significant increased from empty vector (p value<0.05). B) ChIP-qPCR with an antibody to H3K27me3 after Wapal depletion at two genomic elements, the c-Fos promoter and a combined Wapal/CTCF site approximately 16kb downstream of the TSS. The genomic region is shown in Figure S3. * indicates statistically significant increase of empty vector over input (p value<0.05). + indicates statistically significant decrease of Wapal-depleted samples from empty vector (p value<0.05).

Additional file 14: Table S3. Normalized microarray expression data.

Additional file 15: Table S4. List of all primers used.

Additional file 16: Table S2. List of datasets, total # of reads, and # of aligned reads used.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Stelloh, C., Reimer, M.H., Pulakanti, K. et al. The cohesin-associated protein Wapal is required for proper Polycomb-mediated gene silencing. Epigenetics & Chromatin 9, 14 (2016).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: