- Open Access
Nucleosomal DNA binding drives the recognition of H3K36-methylated nucleosomes by the PSIP1-PWWP domain
Epigenetics & Chromatinvolume 6, Article number: 12 (2013)
Recognition of histone modifications by specialized protein domains is a key step in the regulation of DNA-mediated processes like gene transcription. The structural basis of these interactions is usually studied using histone peptide models, neglecting the nucleosomal context. Here, we provide the structural and thermodynamic basis for the recognition of H3K36-methylated (H3K36me) nucleosomes by the PSIP1-PWWP domain, based on extensive mutational analysis, advanced nuclear magnetic resonance (NMR), and computational approaches.
The PSIP1-PWWP domain binds H3K36me3 peptide and DNA with low affinity, through distinct, adjacent binding surfaces. PWWP binding to H3K36me nucleosomes is enhanced approximately 10,000-fold compared to a methylated peptide. Based on mutational analyses and NMR data, we derive a structure of the complex showing that the PWWP domain is bound to H3K36me nucleosomes through simultaneous interactions with both methylated histone tail and nucleosomal DNA.
Concerted binding to the methylated histone tail and nucleosomal DNA underlies the high- affinity, specific recognition of H3K36me nucleosomes by the PSIP1-PWWP domain. We propose that this bipartite binding mechanism is a distinctive and general property in the recognition of histone modifications close to the nucleosome core.
Chemical modifications of nucleosomes, the complex of DNA and histone proteins that packages the eukaryotic genome, are key in the regulation of transcription, maintenance of genomic integrity, chromosome condensation and segregation . Modifications such as acetylation or methylation of lysine residues of histone proteins can serve to recruit effector proteins to specific genomic sites . Methylation of lysine-36 in histone H3 (H3K36me) is conserved from yeast to human and is predominantly associated with actively transcribed chromatin . H3K36me has been implicated in diverse processes including splicing, DNA repair, repression of cryptic transcription and histone exchange . Recently, PWWP domains have been identified as H3K36me recognition domains, based on histone peptide interaction studies . PWWP domains feature an aromatic cage, as in other royal Tudor family proteins  that bind the methylated lysine side chain via cation-π interactions . Interestingly, interaction studies have shown that PWWP domains bind methylated H3K36 histone tail peptides with very low affinity [8, 9], in stark contrast with the high affinity recognition of tri-methylated lysine-4 of H3 (H3K4me3) by PHD fingers [10, 11].
Here, we address the structural basis for H3K36me recognition by PWWP domains in the nucleosomal context. Unlike other methylated lysines in the unstructured N-terminal tail of histone H3, K36 is located at the point where the H3 tail protrudes from the nucleosome core . Since PWWP domains were previously also implicated in DNA-binding [13, 14], we hypothesized that the close proximity of the nucleosomal DNA may critically contribute to binding affinity and/or specificity of PWWP domains for H3K36me. We concentrate on the PWWP domain containing protein PSIP1, as its association with naked and chromatinized DNA has been the focus of previous studies [15, 16] and it was recently identified as a H3K36me-interacting protein using synthetic peptides . PSIP1 (LEDGF/p75) was first isolated as an transcriptional co-activator  and tethers the HIV integrase to active host chromatin dependent on its PWWP domain [19, 20]. PSIP1 is an essential subunit of the MLL complex in MLL oncogenic transformations via HOX gene regulation , and is implicated in the homologous recombination pathway for DNA repair .
Using various experimental approaches, we show that concerted, low-affinity interactions of the PSIP1-PWWP domain with both nucleosomal DNA and methylated histone tail result in specific and high-affinity binding to H3K36-methylated nucleosomes. During the final stages of our work, a similar conclusion was reached in another study . Our study underscores this notion with a NMR analysis of the PWWP-nucleosome complex, a structural model of the complex based on experimental interaction data and an extensive in vitro and in vivo validation. Finally, based on a comparison with other PWWP domains and H3K36me-binding modules, we propose that the bipartite recognition of methylated histone tail and nucleosomal DNA is a general feature of H3K36me recognition.
Results and discussion
H3K36 methylation-dependent nucleosome binding by PSIP1-PWWP
To characterize the interaction of the PSIP1-PWWP domain with H3K36me nucleosomes, binding assays with immobilized GST-PWWP and mono-nucleosomal fractions from wild type or mutant Saccharomyces cerevisiae were performed. Elimination of H3K36 methylation by deletion of the SET2 histone methyltransferase gene or alanine substitutions of histone H3K36 strongly reduced the binding of nucleosomes to the PWWP domain (Figure 1a). In contrast, loss of H3K4 methylation in a Δset1 strain or a H3K4A substitution had no effect on PWWP binding but, as expected, eliminated the binding to the TAF3-PHD finger .
To determine the contribution of residues neighboring H3K36, mono-nucleosomes carrying point mutations for residues 31 to 39 in histone H3 were used. Of these only V35A, K36A and K36R affect PWWP-binding (Figure 1b), suggesting limited involvement of the H3 amino acid sequence around the K36 methylation site in determining binding specificity. The specific interaction of the PWWP domain with mono-, di-, and tri-methylated H3K36 was confirmed using biotinylated H3 tail peptides (Figure 1c) and mutation of W21 in the hydrophobic pocket of the domain completely abolished binding even in context of full-length PSIP1 (Additional file 1: Figure S1).
Next, immobilized GST-PWWP was used in binding to mono-nucleosomes prepared from mammalian cells. The bound fractions were enriched for H3K36me3 and H3K36me2 modifications, whereas they showed little enrichment for H3K79me3 and H3K4me3. Comparable results were obtained using an extended fragment of the PWWP domain including the flanking AT-hook region (Figure 1d) or full-length PSIP1 protein (Additional file 1: Figure S1b).
Adjacent PWWP surfaces bind weakly to H3K36me3 peptides and DNA
In order to address the structural basis for its interaction with H3K36-methylated nucleosomes, we solved the solution structure of the PSIP1-PWWP domain (Figure 2a,b, Additional file 1: Figure S2 and Additional file 1: Table S1). As an HDGF-related PWWP domain , the core is formed by a five-stranded β-sheet core onto which two α-helices are packed. PSIP1-PWWP residues M15, Y18, W21, and F44 form an aromatic cage acceptor for a methylated lysine side chain. Notably, the surface of the PSIP1-PWWP domain is rich in basic residues that may interact with nucleosomal DNA (Figure 2b). As a first step in dissecting the contributions of histone tail and nucleosomal DNA in the PSIP1-PWWP-nucleosome interaction, NMR titration experiments were performed using a methylated histone peptide and a 10-bp DNA fragment.
Addition of a H3K36me3 peptide (residues 28 to 41) resulted in clear chemical shift changes for the backbone amide resonances of residues around the aromatic cage and strand β4 of the PWWP domain (Figure 2c,d left panels). The affinity for the H3K36me3 peptide is very low with a KD of 17 mM (Figure 2f left panel). In part, this may be due to the relatively closed conformation of the binding pocket when compared to crystal structures of related PWWP domains bound to methylated peptides (Additional file 1: Figure S2c). Strikingly, this very low-affinity interaction is completely dependent on the methylation of H3K36, as no changes in chemical shift upon addition of non-methylated H3K36 peptide were observed, even at 11 mM of peptide (Additional file 1: Figure S3).
Addition of a dsDNA fragment that was previously suggested as a substrate for PSIP1-PWWP  resulted in chemical shift changes for a distinct set residues localizing to a single basic patch on the PWWP surface (Figure 2c and d, right panel). A KD of 150 ± 50 μM was determined, indicating a low binding affinity for DNA (Figure 2f, right panel). Imino-proton resonances of DNA base pairs did not change in chemical shift or relative intensity during the titration, suggesting that the interaction lacks sequence-specificity (Additional file 1: Figure S4).
The DNA and histone interaction surfaces of the PWWP domain are adjacent and overlap with areas of positive or negative potential, respectively (Figure 2e). This arrangement provides an excellent electrostatic match to the negative phosphate backbone of the DNA and positive histone tail in the context of the nucleosome.
Aromatic cage and basic surface determine binding specificity and affinity
In order to study the interaction of PSIP1-PWWP with H3K36me3 in the nucleosomal context, we made use of the methylated lysine analogue (MLA) approach , in which a cysteine is alkylated to result in a thioether mimic of the methylated lysine (referred to as H3KC36me3). Nucleosomes reconstituted from recombinantly expressed histones including H3KC36me3 are able to bind endogenous PSIP1 from a HeLa nuclear extract in a pull-down assay (Figure 3a). In a control experiment, a H3KC36me3 peptide and its native counterpart were found to bind the PSIP1-PWWP domain to a similar degree in a peptide pull-down assay (Figure 3b). Furthermore, comparison of NMR titration experiments of PWWP domain with either a native H3K36me3 or a H3KC36me3 peptide showed that peptide binding affects the same PWWP residues in a highly similar manner (Figure 3c). Based on the observed binding curve, we estimated a KD value of 11 mM for the H3KC36me3 peptide (95% confidence interval: 6 < KD < 26 mM), comparable to the KD of the native peptide. This correspondence increases further when restricting the fit to the same titration interval, in which case the best-fit KD is 13 mM for the native H3K36me3 peptide (95% confidence interval: 9 < KD < 22 mM) (Figure 3d).
Having established the validity of the MLA approach, an extensive mutational analysis of the PWWP domain was carried out to dissect the contributions of the histone tail and DNA interaction surfaces to the binding affinity and specificity for methylated nucleosomes using electrophoretic mobility shift assays (EMSA). Wild type GST-PWWP preferentially binds to methylated over non-methylated nucleosomes (Figure 4a). A residual band shift observed in cases of non-modified nucleosomes suggests that PSIP1-PWWP can also associate non-specifically to nucleosomes, at least under the low ionic strength conditions of the EMSA experiment. Alanine substitutions of the aromatic cage residues M15 and Y18 strongly reduce specificity towards H3KC36me3 nucleosomes as shown from the similar patterns observed for modified and non-modified nucleosomes (Figure 4a). Notably, their effect on binding affinity is much less pronounced. Likewise, alanine mutation of two residues flanking the other side of the aromatic cage (H48 and E49) results predominantly in a loss of specificity (Additional file 1: Figure S5a). Alanine mutations of W21 and F44 that are both part of the aromatic cage showed no binding whatsoever (Figure 4a). These mutations may interfere with proper folding of the domain as they are part of the hydrophobic core of the protein and also suggested by their reduced soluble expression levels.
Mutations in the putative nucleosomal DNA-binding surface show a markedly different result. The K70A mutant binds with a lower affinity to nucleosomes, but retains preference for the modified nucleosomes. Strikingly, alanine substitutions of the solvent exposed R74 abolished the interaction with nucleosomes (Figure 4a). In contrast, the K34A mutant showed a comparable binding pattern to wild type PWWP (Figure 4a). Nearly all charge mutants showed severely reduced binding, but retained specificity for H3KC36me3 nucleosomes (Additional file 1: Figure S5a). Of these, K39 (shown in Figure 4a) and K56 were initially not found to interact with a DNA fragment in the NMR titration experiment, suggesting that binding to nucleosomal DNA involves a larger interaction surface.
To investigate the in vivo relevance of our findings, we examined the contribution of the PWWP domain to the mobility and distribution of PSIP1 in cells. To this end wt and mutant PWWPs were introduced in the context of full-length PSIP1 fused to GFP and expressed in HeLa cells (Additional file 1: Figure S6). The nuclear mobility of the GFP-PSIP1 proteins was measured by fluorescence recovery after photobleaching (FRAP). HeLa cells expressing NLS-GFP or histone H2B-GFP were included as highly mobile and immobile controls, respectively (Figure 4b left panel). FRAP curves for GFP-PSIP1 are indicative of transient chromatin-binding (Figure 4b green curve), consistent with previously published results . Disruption of domain integrity (W21A), as well as mutations in the aromatic cage (M15A) or the DNA interaction surface (K70A, R74A), resulted in faster recovery of fluorescence after bleaching (Figure 4b center and right panels), demonstrating the requirement of both DNA and histone tail interaction surfaces for stable association with chromatin.
To examine the effect of PWWP mutations on the genomic distribution of PSIP1, the GFP-PSIP1 cell lines were used for chromatin immunoprecipitation followed by high-throughput sequencing (ChIPseq). To correlate binding of the PSIP1 proteins to H3K36 methylation, genes were divided into two groups: high H3K36me3- (red) or low H3K36me3- (blue) containing genes (Additional file 1: Figure S7). Wild type PSIP1 protein was selectively enriched on high H3K36me3 genes (Figure 4c and Additional file 1: Figure S7b), while aromatic cage mutant M15A showed no enrichment, in accordance with our in vitro data. Mutation of K70 did not significantly affect the genomic distribution of PSIP1 in correspondence with the mild effect on the affinity for nucleosomes of this mutant in EMSA.
Concerted binding of methylated histone tail and nucleosomal DNA
The PWWP-H3KC36me3 nucleosome interaction was further analyzed using state-of-the-art solution NMR tailored for large supramolecular complexes. Recently, it was shown that a comprehensive characterization of protein-nucleosome interactions can be obtained  using methyl-group based NMR (methyl-TROSY) , in which only the histone methyl groups of isoleucine, leucine and valine (ILV) residues are observed.
Here, we used MLA nucleosomes with ILV methyl-group-labeled histone H3 (spectrum in Additional file 1: Figure S8a) and monitored the H3V35 methyl groups to probe the PSIP1-PWWP interaction in a site-specific manner. Notably, the tri-methyl lysine mimic itself is not observable in these experiments as it is not isotope labeled. In the unbound state, the methyl group resonances of H3V35 (as well as H3L20) are very intense compared to those from the nucleosome core (Additional file 1: Figure S8b), reflecting the highly dynamic nature of the N-terminal tail. Addition of ILV-labeled PWWP domain resulted in a clear change in peak position of the methyl groups of H3V35, without affecting other methyl groups in the nucleosome core or the H3-tail (Figure 5a and Additional file 1: Figure S8a). Binding of PSIP1-PWWP also causes a local loss of flexibility of the H3 N-terminal tail around H3K36, as indicated by the sharp decrease in H3V35 peak intensity in the bound state, but not that of H3L20 (Figure 5b and Additional file 1: Figure S8b). For PSIP1-PWWP, the methyl group of I42, close to the aromatic cage, showed a distinct change in chemical shift during the titration experiment (Additional file 1: Figure S8a). Together, these changes demonstrate the specific interaction between the PSIP1-PWWP aromatic cage and the methylated H3 tail in the nucleosomal context.
Based on fitting the experimental line shapes of the H3V35a methyl group to a 1:2 (nucleosome:PWWP)-binding model, we find that the dissociation constant of PSIP1-PWWP-binding to the H3KC36me3 side chain within the nucleosome is 1.5 μM (Figure 5b, 5c and Additional file 1: Figure S9). The interaction is highly dynamic: the dissociation rate (koff) is ca. 500 s-1, corresponding to a lifetime of the complex (1/koff) of approximately 2 ms. The affinity found here at physiological ionic strength is comparable to the KD value estimated from the gel-shift essay (ca. 0.5 μM), recorded at lower ionic strength and temperature (Figure 4d). Strikingly, the affinity of PSIP1-PWWP for methylated nucleosomes is four orders of magnitude higher than for a methylated peptide (KD 17 mM) and two orders higher than for isolated DNA (KD 150 μM). The enhanced affinity is due to simultaneous binding of both methylated histone tail (see chemical shift perturbations of H3V35, Figure 5b) and nucleosomal DNA (see the binding defects of DNA interaction surface mutants, Figure 4a). The magnitude of such enhancement in binding affinity upon linking of two binding sites cannot simply be predicted from the affinities for the isolated binding sites, as it depends crucially on the relative orientation of the linked sites and the length and flexibility of the linker [30, 31]. Following the framework of Zhou , the enhancement may be expressed in the form of an effective concentration given by (KD,tail × KD,DNA)/KD,nucleosome, which in our case evaluates to 1.5 × 10− 4 × 1.7 × 10− 2/1.5 × 10− 6 = 1.7 M. This enhancement value is significantly higher than typical values in the mM range as found for linked DNA-binding domains or bivalent pharmaceuticals [31, 32]. This suggests that there are limited entropic losses and structural rearrangements upon binding. Thus, our data indicate that both the DNA and histone interaction surfaces of PSIP1-PWWP domain combine in a concerted manner to result in high-affinity binding to H3K36me3 nucleosomes.
Structure of PSIP1-PWWP-H3K36me3 nucleosome complex
Our experimentally observed chemical shift perturbations and mutational analysis were used to derive a structural model for the PSIP1-PWWP-H3K36me3 nucleosome complex using the docking program HADDOCK . In order to reliably sample a large conformational space for the flexible H3 tail, the flexible multi-domain docking protocol  was used. In addition, a large surface of DNA around the H3 tail exit site was systematically explored to sample all possible interaction sites on the nucleosomal DNA (Additional file 1: Figure S10). After clustering and cross-validation of the final ensemble of solutions (Additional file 1: Figure S11a, Additional file 1: Table S2), we find one cluster of solutions that is in agreement with the cross-validation data, with the lowest energy structure shown in Figure 6a. The PWWP domain bridges the two DNA gyres around super helical location (SHL) -1/+6, right at the H3 tail entry/exit site. There is an excellent electrostatic match with the nucleosomal DNA, while at the same time the H3K36me3 side chain is snugly captured by the aromatic cage (Figure 6b,c). Notably, the H3V35 methyl groups are more than 6 Å away from ILV methyl groups in the PWWP domain in the model, consistent with the absence of intermolecular NOEs (data not shown).
PWWP-DNA contacts made by the basic residues are mainly to the phosphate or sugar backbone (Figure 6c). The sequence-specific contacts of residue R74 seen in the lowest energy-structure are not conserved in the cluster of solutions. Overall, residues K73, R74 and K75 bind the DNA non-specifically around SHL +6, while K39 and K56 interact with the other DNA gyre at SHL −1. Residue K16 and K14 sit in between the two gyres and can interact with either. Notably, residues K67 and K70 do not mediate intermolecular interactions in many structures of this cluster, which may reflect the relatively minor binding defects of their alanine mutants in EMSA (see also Additional file 1: Table S2).
Here we determined the molecular basis of H3K36me nucleosome recognition by the PSIP1-PWWP domain. We show that the interaction with nucleosomal DNA is responsible for an approximately 10,000-fold enhancement in binding affinity into an in vivo relevant range. A similar conclusion was reached in the work of Eidahl et al. from a comparison of an NMR- based estimated binding affinity for H3K36me3 peptides and a pull-down assay-based measurement of the affinity for H3K36Cme3 nucleosomes . While full length PSIP1 contains additional DNA binding domains [16, 35], disruption of the PWWP basic surface markedly reduces in vivo chromatin binding ability of full-length PSIP1 as shown in this work and previously . Moreover, mutations in the DNA interaction surface were previously shown to result in a dramatically reduced HIV-infectivity in cells , underscoring the functional significance of the bipartite nucleosome-binding for HIV integration and other PSIP1-dependent cellular processes.
Most H3K36me3-binding proteins depend on a PWWP domain for proper chromatin binding, despite their low affinity for methylated peptides. Therefore, nucleosomal DNA-binding may be a general driving force for the recognition of H3K36me3 nucleosomes in vivo. Superposition of homologous PWWP domains onto the structural model of the complex shows that these domains all share similar configurations of an aromatic cage and basic patches that potentially enable concerted binding to both methylated H3K36 side chain and nucleosomal DNA (Figure 7a). This holds true both for the close homolog HDGF2-PWWP domain, and more distantly related PWWP domains of MSH6, DNMT3b, BRPF1 and WHSC1L1. These proteins are involved in the DNA damage response, DNA and histone methylation and acetylation. Moreover, the bipartite binding mode observed for PSIP1 may extend beyond PWWP domains to other domains that bind H3K36me as well as to the recognition of other histone modifications close to the nucleosomal DNA, such as H4K20me. The MRG15 chromodomain that binds H3K36me also shows appropriately configuration of an aromatic cage and basic regions (Figure 7b). Similarly, another chromodomain and PWWP domain that bind H4K20me have been shown to have a distinct basic binding surface for DNA [37, 38]. In contrast, PHD finger domains that specifically bind to H3K4me3 at the N-terminus of H3, have an extensive negative surface potential  and are therefore likely repelled from the nucleosomal DNA.
In conclusion, we propose that recognition of H3K36-methylated chromatin not only occurs through the methylated lysine side chain and its amino acid sequence context, but also through the nucleosomal DNA. We propose that this mechanism also applies to the recognition of other modifications close to the nucleosome core, such as H4K20me and H3K79me. This mechanism testifies to the fact that recognition of histone modifications relies on the binding to modified histone residues embedded in the chromatin fiber. Just as histones are not merely packaging material in chromatin, the nucleosomal DNA is not inert in the readout of the epigenetic modifications.
Protein expression and purification
The PWWP domain of human PSIP1 (3 to 100) or the PWWP+AT (3 to 207) were expressed as GST-fusions in BL21-DE3 or Rosetta 2 bacterial strains at 37°C in either LB medium or M9 minimal medium with 15NH4Cl and/or 13C-glucose. The protein was purified by binding to a glutathione agarose (GA) column (Sigma-Aldrich, St. Louis, MO) and eluted with 50 mM reduced glutathione (Sigma-Aldrich, St. Louis, MO). After thrombin digestion, PWWP was purified over a Sephadex-75 (HiLoad 16/60, GE Healthcare, Uppsale, Sweden) column in buffer A (50mM Tris-HCl pH 7, 100 mM KCl, 1 mM DTT, 1 mM EDTA, 0.5 mM PMSF and protease inhibitors), applied to a MonoS HR5/5 in buffer A and eluted using a linear gradient (0.1 to 1 M KCl).
Drosophila histones were expressed, purified and alkylated as previously described [26, 40]. Histones used for NMR studies were produced in M9 minimal medium containing desired isotopes. Methyl-labeling of Ile-δ1-[13CH3] and Val/Leu-[13CH3,12CD3] (ILV-labeling) followed the procedure of Tugarinov .
Antibodies and plasmids
α-GST (SC), α-PSIP1 (A300-848A, Bethyl, Montgomery, TX), α-H4 (07 to 108, Upstate), α-H3K36me2 (9758, Cellsignaling), α-H3K4me3 (ab8580,), α-H3K79me3 (ab2621), α-H3K36me1 (ab9048), α-H3K36me3 (ab9050; all “ab” antibodies obtained from Abcam, Cambridge, UK) and α-GFP (gift from Geert Kops) were used for ChIP and immunoblotting.
All GST fusions were cloned into pRPN265NB. PSIP1 cDNA was introduced into pEGFP-C using the Gateway system (Invitrogen, Carlsbad, CA). All point mutations were created using site directed mutagenesis (Stratagene, Santa Clara, CA). Stable GFP-tagged PSIP1 HeLa lines were created by cloning PSIP1 into pCDNA.5/FRT/TO (Invitrogen, Carlsbad, CA) and subsequent recombination into HeLa FRT cells carrying the Tet repressor for inducible expression .
Nucleosome and peptide pull-downs
Mono-nucleosomes were extracted from HeLa or yeast cells by MNase treatment of lysed cells as previously described . Histone H3 mutants were selected from a mutant library . GA beads (Sigma-Aldrich, St. Louis, MO) were covered with GST-fusion proteins, mixed with mono-nucleosomes and washed. Eluted proteins were analyzed by immunoblotting. HeLa mono-nucleosomes were incubated with premixed GFP-fusion protein and GFP binder beads (ChromoTek, Planegg-Martinsried, Germany) and analyzed in a similar way. Peptide pull-downs were performed as described previously .
Nucleosome reconstitution and EMSA
The 601-DNA ‘Widom’ template was amplified using PCR, purified using anion exchange chromatography and used for reconstitution using salt-gradient deposition. Nucleosomes were incubated with GA purified GST-PWWP protein in 0.2X TBE and analyzed by native 5% 60:1 acryl-bis gel electrophoresis. Either 1.5 or 3 pmol of nucleosome was used in all experiments and up to 3 molar equivalents of protein in 8 μL load volume. All steps were performed at 4°C. Gels were stained with ethidium-bromide and analyzed on a Gel-Doc XR+ system (Bio-Rad, Hercules, CA). If applicable, band densities corresponding to free, singly and doubly bound nucleosomes were quantified using ImageJ software package and subsequently fitted together to a 2:1 binding model using in-house written MatLAB routine (MATLAB version 7.13.0, The MathWorks Inc., Natick, MA).
FRAP studies were performed using a Zeiss 510 META confocal LSM (Zeiss, Oberkocken, Germany) as previously described . GFP protein expression was induced with 0.5 μg/ml doxycycline for five hours.
Chromatin preparation and ChIP were essentially performed as described [43, 44]. Libraries were sequenced on AB/SOLiD 5500XL, producing 48 bp reads. Sequencing reads were mapped with Burrows-Wheeler Aligner (BWA-0.5.8c) (settings: -c -l 25 -k 2 -n 10) . As a gene set, the known protein-coding genes as annotated in Ensembl 67 were used (http://www.ensembl.org). The number of reads mapped to each gene was normalized to the total number of reads mapping inside genes per sample. A separation of H3K36me3 enriched and non-enriched was made based on the density plot of the read density. Genes were filtered to have at least 50 sequencing reads in the GFP tagged PSIP1 ChIP-seq data. All plots were created using the R package (http://www.r-project.org/).
Samples used for assignment and structure calculation contained ca. 1 mM PWWP domain in 90/10% H2O/D2O with 20 mM NaPi buffer at pH 6.2. Interaction studies were done at 0.3 mM PWWP in 20 mM NaPi pH 7.0 with 100 mM NaCl. ILV-labeled H3KC36me3 nucleosome sample contained 116 μM nucleosome in 20 mM NaPi pH 7 with 100 mM NaCl.
Peptides were extensively lyophilized and dissolved in NMR buffer to a stock concentration of 110 mM. Cysteine peptides were alkylated according to the MLA protocol  and purified using a Sephadex G-10 (GE Healthcare, Uppsale, Sweden) column followed by cation exchange chromatography. The purity of the peptide was confirmed by NMR. Annealed DNA oligos (Eurogentec, Liege, Belgium) were lyophilized and dissolved in NMR buffer to a stock concentration of 11.5 mM. Titration of H3KC36me3 nucleosomes was done using a PWWP stock of 1.28 mM.
PSIP1-PWWP structure determination
NMR experiments for assignment, and structure calculation of the PSIP1-PWWP domain were carried out at 293K on a 600 or 750 MHz Bruker Avance II spectrometer (Bruker Biospin, Rheinstetten, Germany). Processing was done using the NMRPipe package . Spectra were analyzed using Sparky (Goddard and Kneller, UCSF, USA). Backbone assignments were obtained using MARS  based on HNCACB and CBCACONH spectra. Side chain resonances were assigned using CCH-TOCSY, CBHD and NOESY spectra. Overall assignment completeness was 97.1% for all non-labile protons. Backbone dihedral angle restraints were derived using TALOS+ . Distance restraints were derived from 13C- and 15N-edited 3D NOESY spectra (mixing time 120 ms). The NOE cross peaks were assigned and converted into distance restraints using CYANA 3.0 [49, 50]. First, 10 ensembles of 100 structures were calculated by using CYANA using different random number seeds. Out of the 10 resulting distance restraint lists, only the restraints that were reproduced in all cases were retained to produce a final restraint list. This final list was then used to calculate 100 structures in CNS 1.2 , which were subsequently refined in explicit water by using the RECOORD protocol . The final ensemble containing the 20 lowest-energy structures, contained neither distance violations > 0.5 A, nor dihedral angle violation > 5°, and was validated by using the iCing validation suite .
Titration experiments and data analysis
Interaction studies of the PSIP1-PWWP domain were carried out at 293K on a 600 MHz Bruker Avance II spectrometer. Nucleosome spectra were recorded at 308K on a 900 MHz Bruker Avance III spectrometer with a TCI cryo-probe. Titration data were fitted using MatLAB scripts either using the fast-exchange assumption in case of fitting CSP derived binding curves or using explicit evaluation of the exchange matrix, and subsequent calculation of the FID in case of line shape fitting (see supporting materials in Kato et al.  for details).
All molecular graphics were prepared using open-source PyMOL (The PyMOL Molecular Graphics System, Version 1.4, Schrödinger, LLC). Electrostatic surfaces were calculated using the adaptive Poisson-Boltzman solver  and the AMBER force field.
Docking protocol PSIP1-PWWP–nucleosome complex
We used our experimental chemical shift perturbation, transferred-NOESY and mutagenesis data, together with available literature data to create a structural model for the PSIP1-PWWP-nucleosome complex with Haddock version 2.1  and CNS 1.3 [51, 55]. In what follows, we describe the docking procedure.
In short, the docking was divided in two stages: i) docking of the H3 N-terminal tail to the PSIP1-PWWP domain guided by the chemical shift perturbation, transferred-NOESY and mutation data, and using homology-derived interaction restraints from the homologous BRPF1-H3K36me3, HDGF2-H4K20me3/H3K79me3 crystal structures; ii) docking of the PWWP-H3K36me3 complex to the nucleosome, again guided by the chemical shift perturbation and mutation data, together with restraints to enforce the covalent attachment of the H3-tail to the remainder of H3. This approach was based on the flexible multi-domain docking protocol described by Karaca et al. . It allows the efficient docking of the PSIP1-PWWP domain to both to K36me3 side chain in the H3 tail and to the nucleosomal DNA, and at the same time, to sample a large conformational space for the flexible H3 tail. This procedure is described in details in the Supplementary Material.
PDB accession codes
The solution structure of the PSIP1-PWWP domain is accessible from the Protein Data Bank, [PDB: ID 3ZEH]. The structural model of the H3K36me3-nucleosome-PSIP1-PWWP domain complex is deposited under [PDB: ID 3ZH1] and is also available from the author’s website: http://www.nmr.chem.uu.nl/~hugo.
Fischle W, Wang Y, Allis CD: Histone and chromatin cross-talk. Curr Opin Cell Biol. 2003, 15: 172-183. 10.1016/S0955-0674(03)00013-9.
Kouzarides T: Chromatin modifications and their function. Cell. 2007, 128: 693-705. 10.1016/j.cell.2007.02.005.
Barski A, Cuddapah S, Cui K, Roh T-Y, Schones DE, Wang Z, Wei G, Chepelev I, Zhao K: High-resolution profiling of histone methylations in the human genome. Cell. 2007, 129: 823-837. 10.1016/j.cell.2007.05.009.
Wagner EJ, Carpenter PB: Understanding the language of Lys36 methylation at histone H3. Nat Rev Mol Cell Biol. 2012, 13: 115-126. 10.1038/nrm3274.
Vermeulen M, Eberl HC, Matarese F, Marks H, Denissov S, Butter F, Lee KK, Olsen JV, Hyman AA, Stunnenberg HG, Mann M: Quantitative interaction proteomics and genome-wide profiling of epigenetic histone marks and their readers. Cell. 2010, 142: 967-980. 10.1016/j.cell.2010.08.020.
Maurer-Stroh S, Dickens NJ, Hughes-Davies L, Kouzarides T, Eisenhaber F, Ponting CP: The Tudor domain ‘Royal Family’: tudor, plant agenet, chromo, PWWP and MBT domains. Trends Biochem Sci. 2003, 28: 69-74. 10.1016/S0968-0004(03)00004-5.
Hughes RM, Wiggins KR, Khorasanizadeh S, Waters ML: Recognition of trimethyl-lysine by a chromodomain is not driven by the hydrophobic effect. Proc Natl Acad Sci USA. 2007, 104: 11184-11188. 10.1073/pnas.0610850104.
Vezzoli A, Bonadies N, Allen MD, Freund SMV, Santiveri CM, Kvinlaug BT, Huntly BJP, Göttgens B, Bycroft M: Molecular basis of histone H3K36me3 recognition by the PWWP domain of Brpf1. Nat Struct Mol Biol. 2010, 17: 617-619. 10.1038/nsmb.1797.
Wu H, Zeng H, Lam R, Tempel W, Amaya MF, Xu C, Dombrovski L, Qiu W, Wang Y, Min J: Structural and histone-binding ability characterizations of human PWWP domains. PLoS One. 2011, 6: e18919-10.1371/journal.pone.0018919.
Li H, Ilin S, Wang W, Duncan EM, Wysocka J, Allis CD, Patel DJ: Molecular basis for site-specific read-out of histone H3K4me3 by the BPTF PHD finger of NURF. Nature. 2006, 442: 91-95.
Peña PV, Davrazou F, Shi X, Walter KL, Verkhusha VV, Gozani OP, Zhao R, Kutateladze TG: Molecular mechanism of histone H3K4me3 recognition by plant homeodomain of ING2. Nature. 2006, 442: 100-103.
Luger K, Mäder AW, Richmond RK, Sargent DF, Richmond TJ: Crystal structure of the nucleosome core particle at 2.8 A resolution. Nature. 1997, 389: 251-260. 10.1038/38444.
Lukasik SM, Cierpicki T, Borloz M, Grembecka J, Everett A, Bushweller JH: High resolution structure of the HDGF PWWP domain: a potential DNA-binding domain. Protein Sci. 2006, 15: 314-323. 10.1110/ps.051751706.
Laguri C, Duband-Goulet I, Friedrich N, Axt M, Belin P, Callebaut I, Gilquin B, Zinn-Justin S, Couprie J: Human mismatch repair protein MSH6 contains a PWWP domain that targets double-stranded DNA. Biochemistry. 2008, 47: 6199-6207. 10.1021/bi7024639.
Llano M, Vanegas M, Hutchins N, Thompson D, Delgado S, Poeschla EM: Identification and characterization of the chromatin-binding domains of the HIV-1 integrase interactor LEDGF/p75. J Mol Biol. 2006, 360: 760-773. 10.1016/j.jmb.2006.04.073.
Turlure F, Maertens G, Rahman S, Cherepanov P, Engelman A: A tripartite DNA-binding element, comprised of the nuclear localization signal and two AT-hook motifs, mediates the association of LEDGF/p75 with chromatin in vivo. Nucleic Acids Res. 2006, 34: 1653-1665. 10.1093/nar/gkl052.
Pradeepa MM, Sutherland HG, Ule J, Grimes GR, Bickmore WA: Psip1/Ledgf p52 binds methylated histone H3K36 and splicing factors and contributes to the regulation of alternative splicing. PLoS Genet. 2012, 8: e1002717-10.1371/journal.pgen.1002717.
Ge H, Si Y, Roeder RG: Isolation of cDNAs encoding novel transcription coactivators p52 and p75 reveals an alternate regulatory mechanism of transcriptional activation. EMBO J. 1998, 17: 6723-6729. 10.1093/emboj/17.22.6723.
Ferris AL, Wu X, Hughes CM, Stewart C, Smith SJ, Milne TA, Wang GG, Shun M-C, Allis CD, Engelman A, Hughes SH: Lens epithelium-derived growth factor fusion proteins redirect HIV-1 DNA integration. Proc Natl Acad Sci USA. 2010, 107: 3135-3140. 10.1073/pnas.0914142107.
Gijsbers R, Vets S, De Rijck J, Ocwieja KE, Ronen K, Malani N, Bushman FD, Debyser Z: Role of the PWWP domain of lens epithelium-derived growth factor (LEDGF)/p75 cofactor in lentiviral integration targeting. J Biol Chem. 2011, 286: 41812-41825. 10.1074/jbc.M111.255711.
Yokoyama A, Cleary ML: Menin critically links MLL proteins with LEDGF on cancer-associated target genes. Cancer Cell. 2008, 14: 36-46. 10.1016/j.ccr.2008.05.003.
Daugaard M, Baude A, Fugger K, Povlsen LK, Beck H, Sørensen CS, Petersen NHT, Sorensen PHB, Lukas C, Bartek J, Lukas J, Rohde M, Jäättelä M: LEDGF (p75) promotes DNA-end resection and homologous recombination. Nat Struct Mol Biol. 2012, 19: 803-810. 10.1038/nsmb.2314.
Eidahl JO, Crowe BL, North JA, McKee CJ, Shkriabai N, Feng L, Plumb M, Graham RL, Gorelick RJ, Hess S, Poirier MG, Foster MP, Kvaratskhelia M: Structural basis for high-affinity binding of LEDGF PWWP to mononucleosomes. Nucleic Acids Res. 2013, 41: 3924-3936. 10.1093/nar/gkt074.
Vermeulen M, Mulder KW, Denissov S, Pijnappel WWMP, van Schaik FMA, Varier RA, Baltissen MPA, Stunnenberg HG, Mann M, Timmers HTM: Selective anchoring of TFIID to nucleosomes by trimethylation of histone H3 lysine 4. Cell. 2007, 131: 58-69. 10.1016/j.cell.2007.08.016.
Singh DP, Kubo E, Takamura Y, Shinohara T, Kumar A, Chylack LT, Fatma N: DNA-binding domains and nuclear localization signal of LEDGF: contribution of two helix-turn-helix (HTH)-like domains and a stretch of 58 amino acids of the N-terminal to the trans-activation potential of LEDGF. J Mol Biol. 2006, 355: 379-394. 10.1016/j.jmb.2005.10.054.
Simon MD, Chu F, Racki LR, de la Cruz CC, Burlingame AL, Panning B, Narlikar GJ, Shokat KM: The site-specific installation of methyl-lysine analogs into recombinant histones. Cell. 2007, 128: 1003-1012. 10.1016/j.cell.2006.12.041.
Hendrix J, Gijsbers R, De Rijck J, Voet A, Hotta J-I, McNeely M, Hofkens J, Debyser Z, Engelborghs Y: The transcriptional co-activator LEDGF/p75 displays a dynamic scan-and-lock mechanism for chromatin tethering. Nucleic Acids Res. 2011, 39: 1310-1325. 10.1093/nar/gkq933.
Kato H, van Ingen H, Zhou B-R, Feng H, Bustin M, Kay LE, Bai Y: Architecture of the high mobility group nucleosomal protein 2-nucleosome complex as revealed by methyl-based NMR. Proc Natl Acad Sci USA. 2011, 108: 12283-12288. 10.1073/pnas.1105848108.
Tugarinov V, Hwang PM, Ollerenshaw JE, Kay LE: Cross-correlated relaxation enhanced 1H[bond]13C NMR spectroscopy of methyl groups in very high molecular weight proteins and protein complexes. J Am Chem Soc. 2003, 125: 10420-10428. 10.1021/ja030153x.
Jencks WP: On the attribution and additivity of binding energies. Proc Natl Acad Sci USA. 1981, 78: 4046-4050. 10.1073/pnas.78.7.4046.
Zhou H-X: Polymer models of protein stability, folding, and interactions. Biochemistry. 2004, 43: 2141-2154. 10.1021/bi036269n.
Rees DC, Congreve M, Murray CW, Carr R: Fragment-based lead discovery. Nat Rev Mol Cell Biol. 2004, 3: 660-672.
Dominguez C, Boelens R, Bonvin AMJJ: HADDOCK: a protein-protein docking approach based on biochemical or biophysical information. J Am Chem Soc. 2003, 125: 1731-1737. 10.1021/ja026939x.
Karaca E, Bonvin AMJJ: A multidomain flexible docking approach to deal with large conformational changes in the modeling of biomolecular complexes. Structure. 2011, 19: 555-565. 10.1016/j.str.2011.01.014.
Tsutsui KM, Sano K, Hosoya O, Miyamoto T, Tsutsui K: Nuclear protein LEDGF/p75 recognizes supercoiled DNA by a novel DNA-binding domain. Nucleic Acids Res. 2011, 39: 5067-5081. 10.1093/nar/gkr088.
Shun M-C, Botbol Y, Li X, Di Nunzio F, Daigle JE, Yan N, Lieberman J, Lavigne M, Engelman A: Identification and characterization of PWWP domain residues critical for LEDGF/p75 chromatin binding and human immunodeficiency virus type 1 infectivity. J Virol. 2008, 82: 11555-11567. 10.1128/JVI.01561-08.
Kim D, Blus BJ, Chandra V, Huang P, Rastinejad F, Khorasanizadeh S: Corecognition of DNA and a methylated histone tail by the MSL3 chromodomain. Nat Struct Mol Biol. 2010, 17: 1027-1029. 10.1038/nsmb.1856.
Qiu Y, Zhang W, Zhao C, Wang Y, Wang W, Zhang J, Zhang Z, Li G, Shi Y, Tu X, Wu J: Solution structure of Pdp1 PWWP domain reveals its unique binding sites for methylated H4K20 and DNA. Biochem J. 2011, 442: 527-538.
van Ingen H, van Schaik FMA, Wienk H, Ballering J, Rehmann H, Dechesne AC, Kruijzer JAW, Liskamp RMJ, Timmers HTM, Boelens R: Structural insight into the recognition of the H3K4me3 mark by the TFIID subunit TAF3. Structure. 2008, 16: 1245-1256. 10.1016/j.str.2008.04.015.
Dyer PN, Edayathumangalam RS, White CL, Bao Y, Chakravarthy S, Muthurajan UM, Luger K: Reconstitution of nucleosome core particles from recombinant histones and DNA. Meth Enzymol. 2004, 375: 23-44.
Tighe A, Staples O, Taylor S: Mps1 kinase activity restrains anaphase during an unperturbed mitosis and targets Mad2 to kinetochores. J Cell Biol. 2008, 181: 893-901. 10.1083/jcb.200712028.
Dai J, Hyland EM, Yuan DS, Huang H, Bader JS, Boeke JD: Probing nucleosome function: a highly versatile library of synthetic histone H3 and H4 mutants. Cell. 2008, 134: 1066-1078. 10.1016/j.cell.2008.07.019.
de Graaf P, Mousson F, Geverts B, Scheer E, Tora L, Houtsmuller AB, Timmers HTM: Chromatin interaction of TATA-binding protein is dynamically regulated in human cells. J Cell Sci. 2010, 123: 2663-2671. 10.1242/jcs.064097.
Mokry M, Hatzis P, de Bruijn E, Koster J, Versteeg R, Schuijers J, van de Wetering M, Guryev V, Clevers H, Cuppen E: Efficient double fragmentation ChIP-seq provides nucleotide resolution protein-DNA binding profiles. PLoS One. 2010, 5: e15092-10.1371/journal.pone.0015092.
Li H, Durbin R: Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009, 25: 1754-1760. 10.1093/bioinformatics/btp324.
Delaglio F, Grzesiek S, Vuister GW, Zhu G, Pfeifer J, Bax A: NMRPipe: a multidimensional spectral processing system based on UNIX pipes. J Biomol NMR. 1995, 6: 277-293.
Jung YS, Zweckstetter M: Mars - robust automatic backbone assignment of proteins. J Biomol NMR. 2004, 30: 11-23.
Shen Y, Delaglio F, Cornilescu G, Bax A: TALOS+: a hybrid method for predicting protein backbone torsion angles from NMR chemical shifts. J Biomol NMR. 2009, 44: 213-223. 10.1007/s10858-009-9333-z.
López-Méndez B, Güntert P: Automated protein structure determination from NMR spectra. J Am Chem Soc. 2006, 128: 13112-13122. 10.1021/ja061136l.
Herrmann T, Güntert P, Wüthrich K: Protein NMR structure determination with automated NOE assignment using the new software CANDID and the torsion angle dynamics algorithm DYANA. J Mol Biol. 2002, 319: 209-227. 10.1016/S0022-2836(02)00241-3.
Brünger AT, Adams PD, Clore GM, DeLano WL, Gros P, Grosse-Kunstleve RW, Jiang JS, Kuszewski J, Nilges M, Pannu NS, Read RJ, Rice LM, Simonson T, Warren GL: Crystallography & NMR system: a new software suite for macromolecular structure determination. Acta Crystallogr D: Biol Crystallogr. 1998, 54: 905-921.
Nederveen AJ, Doreleijers JF, Vranken W, Miller Z, Spronk CAEM, Nabuurs SB, Güntert P, Livny M, Markley JL, Nilges M, Ulrich EL, Kaptein R, Bonvin AMJJ: RECOORD: a recalculated coordinate database of 500+ proteins from the PDB using restraints from the BioMagResBank. Proteins. 2005, 59: 662-672. 10.1002/prot.20408.
Doreleijers JF, da Silva Sousa AW, Krieger E, Nabuurs SB, Spronk CAEM, Stevens TJ, Vranken WF, Vriend G, Vuister GW: CING: an integrated residue-based structure validation program suite. J Biomol NMR. 2012, 54: 267-283. 10.1007/s10858-012-9669-7.
Baker NA, Sept D, Joseph S, Holst MJ, McCammon JA: Electrostatics of nanosystems: application to microtubules and the ribosome. Proc Natl Acad Sci USA. 2001, 98: 10037-10041. 10.1073/pnas.181342398.
Brunger AT: Version 1.2 of the crystallography and NMR system. Nat Protoc. 2007, 2: 2728-2733. 10.1038/nprot.2007.406.
We thank S Taylor, R Roeder, Y Bai and G Kops for reagents; P de Graaf for help with FRAP and the HADDOCK Team (ASJ Melquiond, E Karaca, M van Dijk and Prof. AMJJ Bonvin) for valuable discussions on the docking. This work was supported by the Chemical Sciences division of Dutch Science foundation NWO through a Veni fellowship to HvI (700.59.401), a NWO-Chemical Sciences TOP grant to MT (700.57.302) and by the Netherlands Proteomics Centre. RB was supported by NWO-Chemical Sciences for the 900 MHz NMR and the TCI cryoprobe.
The authors declare that they have no competing interests.
RvN carried out the biochemical and cell biology experiments, participated in its design and wrote the manuscript with HvI. RvS participated in the biochemical experiments. MS, SvH and EC participated in the ChIP-seq experiments and helped to draft the manuscript. RB participated in the design of the study and helped to draft the manuscript. MT and HvI conceived of the study, participated in its design and coordination and helped to draft the manuscript. HvI performed the NMR experiments, solved the structures and performed the modeling. All authors read and approved the final manuscript.
Electronic supplementary material
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.