Skip to main content

Systemic epigenetic response to recombinant lentiviral vectors independent of proviral integration



Lentiviral vectors (LV) are widely used for various gene transfer or gene therapy applications. The effects of LV on target cells are expected to be limited to gene delivery. Yet, human hematopoietic CD34+ cells respond to functional LVs as well as several types of non-integrating LVs by genome-wide DNA methylation changes.


A new algorithm for the analysis of 450K Illumina data showed that these changes were marked by de novo methylation. The same 4126 cytosines located in islands corresponding to 1059 genes were systematically methylated. This effect required cellular entry of the viral particle in the cells but not the genomic integration of the vector cassette. Some LV preparations induced only mild sporadic changes while others had strong effects suggesting that LV batch heterogeneity may be related to the extent of the epigenetic response.


These findings identify a previously uncharacterized but consistent cellular response to viral components and provide a novel example of environmentally modified epigenome.


The maintenance of a coordinated gene expression pattern is largely dependent on the chromatin organization in the cell nucleus. Epigenetic mechanisms such as DNA methylation play a key role in determining chromatin structure. In addition, these mechanisms represent an interface between environment and genome function [1].

Lentiviral vectors (LV) are convenient tools to achieve stable gene transfer in target cells for research, engineering or clinical applications such as gene therapy studies [2]. Integration-defective LV have also been designed for various transient gene delivery applications [3]. It is presumed that for all such uses, LV would not modify the function of transduced cells other than by effects of the transgene itself. Nonetheless several types of cellular responses have been caused by LV independently of the transgene. The proviral DNA was found to trigger transient innate immune responses in dendritic cells [4] and in mice [5] and its genomic insertion may potentially cause genotoxicity [2].

Recent studies have clearly demonstrated that LV can have a strong and long lasting epigenetic effects on transduced cells. For example, LV-s alone have been reported to epigenetically reprogram fibroblasts in culture independently of other external factors [6, 7]. We recently provided the first direct evidence that the in vitro transduction of hematopoietic stem and progenitor cells (HSC) with LVs induces DNA methylation of CpG nucleotides within 24 h after the first contact between the cells and vectors [8]. The epigenetic effects of LV have not been characterized in depth and it is not known if they are caused by the lentiviral provirus or by the viral components before integration of the viral genome.

The epigenetic alterations in HSC seem particularly important to assess given that self-renewal and differentiation of HSC are highly dependent on epigenetic mechanisms [913]. A potential impact of LV-induced epigenetic effects has to be considered when LV-s are used in gene therapy protocols based on LV-mediated gene transfer into patient-autologous CD34+ HSC to treat genetic diseases [2]. It is therefore important to investigate the exact nature of LV induced epigenetic changes in human HSCs in conditions relevant to gene therapy protocols.

The present study was conducted to determine the distribution of DNA methylation changes in the cellular genome in response to LV and the relation between them and the integration of the vector genome. We show that in addition to randomly occurring modifications a defined set of CpG islands associated to more than 1000 genes are systematically methylated. The DNA methylation changes occurred independently of the integration of the viral genome and only required cellular entry of the viral particle. Importantly, only some vector batches induce reproducible changes, other batches of the same vector produce only sporadic effects.


The Infinium Human Methylation 450K BeadChip (Illumina) provides a high resolution quantitative assessment of the CpG methylation level of relevant genomic regions. A major difficulty when comparing two methylation profiles with this method is to separate small but “true” changes from the background noise. Noise filtering can be performed intuitively by the use of pre-determine fixed threshold above which methylation changes are considered real. However, this traditional approach also eliminates small “true” changes. To overcome this difficulty, we have established an improved method for the detection of small methylation changes based on their repeated occurrence.

New method for the analysis of the methylation changes using the Infinium 450K array

A new algorithm called “double average technique” (DAT) was designed to identify very small, but reproducible differences between two series of samples after normalization of the raw data (see the workflow of the data analysis and representation on Fig. 1). This highly sensitive method identifies only reproducible differences in cytosine methylation level present in all samples compared to their reference controls. Sporadic differences are filtered. The method is implemented on triplicate experiments requiring the comparison of 3 samples with 3 corresponding controls. Thus, after hybridization on the 450K bead chip, 6 β-values are obtained for each CpG site tested. The sliding average (<β>) of the β-values of 3 consecutive CpG-s for each sample (t 1, t 2 and t 3) and for each of the corresponding control (c 1, c 2 and c 3) are calculated. The methylation of a given CpG site is considered to be different between the sample and control triplicates if all three values of the sliding averages of the samples are above (t 1, t 2, t 3 ≥ increased methylation) or below (t 1, t 2, t 3 ≤ decreased methylation) of the arithmetic average of the 6 <β>-values [(t 1 + t 2 + t 3 + c 1 + c 2 + c 3)/6] of the samples and controls. This constraint confers to the algorithm a very high stringency and allows the identification of all CpG sites with increased or decreased methylation independently of the extent of the difference and without applying the same fixed threshold to all CpG-s on the chip. The list of these CpG-s provides information on the total number of modified cytosines, their position in the genome and the extent of the increase or decrease of the methylation at each site.

Fig. 1
figure 1

Workflow of data analysis and representation. The normalized β-values of triplicate samples and their corresponding controls are analyzed using the DAT algorithm as described in the text. a First, a moving average <β> is calculated for each CpG position in each sample and control. The average of all six sample <β> values is calculated β. b The methylation level of a given CpG is considered as “increased” in the samples compared to controls if all the <β> values are higher then β or decreased if lower then β. The above operation is repeated for all CpG-s analyzed by the Illumina chip. c Three parameters are calculated on the basis of the lists of “increased” and “decreased” CpG-s: (1) the total number of CpG-s (N); (2) chi2 calculated on the basis of the observed distribution of the CpG cluster sizes compared to the simulated random distribution of cluster sizes obtained with an identical number of CpG sites; (3) the median delta-β value of all the N CpG-s

Global assessment of the difference between the control and experimental triplicates is provided by three parameters. The first parameter (N) is the total number of CpG-s with altered methylation compared to controls. The second parameter shows the clustering of methylation changes in the genome. The value of this parameter is equal to the chi2 when comparing the distribution of the modified CpG-s to the theoretical random distribution of the same number of sites. The third parameter is the median difference between the β-values (delta-β) of the experimental and control triplicates for increased or decreased CpG-s. The three parameters were calculated separately for CpG-s with increased and decreased methylation.

DNA methylation in CD34+ cells increases in response to LV transduction

First, we sought to determine if the previously observed changes in umbilical cord blood CD34+ cells using a Infinium 27K chip [8] could be confirmed with the Infinium 450K chip on a broader set of CpGs in CD34+ cells obtained from apheresed adult donors, a material more relevant to clinical use. In a pilot study we compared one of the samples examined in our previous report [8] with a DNA sample isolated from LV transduced CD34+ cells isolated from cytokine-mobilized human peripheral blood. We confirmed that CpG methylation changes in the cellular genome were induced in response to LV transduction in the CD34+ cells isolated from the two different sources (not shown). In all subsequent experiments we used CD34+ cells obtained from apheresed blood of adult individuals.

We then transduced CD34+ cells of 12 different donors using multiple batches of vector (Table 1) to examine the methylation profile of their genomic DNA using Infinium 450K chip runs. The characteristics of the vectors used and their potency is reported in Additional file 1. All 5 batches of LVs tested encoded GFP and were produced by transient transfection using second or third generation constructs. Three batches were concentrated by ultracentrifugation (LV1–LV3). In order to investigate the potential contribution of product contaminants, two batches (LV4–LV5) were purified and concentrated by ion exchange chromatography followed by ultrafiltration and gel filtration, a method that extensively removes protein and DNA contaminants from the vector preparation [14]. We analyzed the effects of several batches of non-integrating LV which were generated by completely removing the vector system HIV-1 integrase as a result of a stop codon in the pol gene (dINT) or by omitting the transfer plasmid leading to so-called “genome-empty” particles (dGEN). Two batches each of dINT and dGEN preparations were tested and verified to be unable to express a transgene and to integrate (Table 1). In addition, we also tested in these same experiments 3 batches of vectors unable to infect cells due to the omission of the envelope glycoprotein plasmid in the production process (dENV). All such defective vectors, except for the genome-deficient (dGEN) vectors included a transgene cassette coding for the green fluorescent protein (GFP). All were concentrated by ultracentrifugation (Additional file 1). They were tested in parallel with the fully functional LV using comparable amounts of particles on the basis of physical titer (Additional file 1).

Table 1 Effects of different batches of vectors and particles on CD34+ cells

Using a similar transduction protocol as described earlier [8], CD34+ cells were stimulated with early-acting cytokines for 20 h, cultured with LV (2 consecutive hits) during 24 h before genomic DNA was prepared for methylation analysis. Table 1 shows that all batches of functional LV resulted in similar transduction levels based on measures of the average vector copy number per cell (vcn) integrated and based on the percentage of cells expressing the GFP transgene after a short additional culture. No transduction was observed with the various defective vectors. Controls were set-up with cells from the same donors that were tested in parallel, cultured identically to the test cultures but without adding LV to the culture medium.

The genomic DNA from the infected cells was extracted, bisulfite-converted, subjected to Infinium 450K array hybridization and analyzed using the DAT method. Volcano plot representation of the median and maximal delta-β values of CpG-s with increased and decreased methylation in triplicate samples of some representative experiments are shown on Fig. 2a. We calculated the three parameters (Fig. 1) that characterized the differences between the sample triplicates and their matched controls. In order to compare the effects observed with different vector preparations, we performed hierarchical clustering of the results. Data obtained for increased and decreased methylation were submitted to independent analysis (Fig. 2b).

Fig. 2
figure 2

Analysis of DNA methylation changes. Analyses of 4 different chip runs (a, b, c, d) testing a total of 12 different batches of vector annotated accordingly (i.e. dENV1a and dENV1b indicates a repeat testing of dENV1 batch on chip a and chip b). a Volcano plots of representative experiments. From left to right: methylation differences observed in cells infected with an integrative lentiviral vector showing a high effect (LV2), integrase-deficient particles (dINT2), genome-deficient (dGEN2) and envelope-deficient (dENV1) particles as compared to control cells cultured but without any vector. The last plot on the right represents a control triplicate compared to another control triplicate (as in bottom line of b). Each point on a volcano plot represents the maximal delta-β value of a CpG site in a given triplicate as a function of the median of the three delta-β values of that triplicate. Only the CpG-s identified as displaying altered methylation between the samples and their corresponding controls are shown. Points representing CpG sites with increased and decreased methylation are displayed on the right and left sites of the plot. Note the higher number of points and higher median and maximal delta-β values with increased methylation in the first three samples. b Hierarchical classification of the experimental conditions depending on the extent of the increase (left) or the decrease (right) of the genomic DNA methylation in CD34+ cells. The classification was done using the three parameters indicated in the table: number of changed CpGs (N), chi2 of genomic distribution and median delta-β

When increased methylation is considered, the various LV-s segregated in two different clusters (Fig. 2b). The first cluster (“high”) included 2 of the 3 LV batches produced by ultracentrifugation and all of the integrase-deficient vectors and genome-deficient vector batches (Table 1). In addition to the relatively high number of methylated CpG-s, the distinguishing feature of methylation profiles in this group was the strong non-random distribution in the genome (high χ 2) and relatively high proportion of the cells harboring methylation at these sites (high delta-β). The second cluster (“low”) included the 2 chromatography-purified and one ultracentrifuged batches of LV and all of the dENV batches. The “low” group also contained the background control of the experiments (Ctrl) that consists in a comparison between two sets of uninfected cell triplicates. The “low” group was characterized by higher dispersion of the modified CpG-s (lower χ 2) and lower frequency in the cell population (lower delta-β). Repeated testing of the same LV batches provided the same categorization of their effects as seen for LV1, LV3, LV4.

Cluster analysis was also performed on the parameters for decreased CpG methylation. No clear segregation into two groups can be observed. The two batches (dGEN1 and dINT1) clustered separately from the others showing “high” effects on decreased methylation but this was not reproduced in a second independent experiment (Fig. 2b). Overall, the number of demethylated CpG-s and their incidence was often lower than the corresponding numbers for increased methylation in the same experimental conditions. This suggests that the cellular response to the different vectors involves principally de-novo methylation at a variable number of CpG sites and to variable extent. Nevertheless, some demethylation may also occur. The methylation changes are clearly unrelated to the integration of the viral genome in the host genome, because non-integrative vectors also induce methylation.

The genomic localization of the changes shows recurrent methylation pattern

To further characterize the DNA methylation changes, we compared the lists of CpG-s with increased and decreased methylation in the “high” and “low” clusters. The lists in the “low” cluster did not share common CpG-s. Similarly, the lists of CpG-s with decreased methylation did not overlap significantly with one another. By contrast, the comparison of the 8 lists of CpG-s with increased methylation obtained in the “high” group revealed a substantial overlap. The methylation of 4126 CpG sites became higher in all 8 conditions that involved different vector types (Fig. 3), but also CD34+ cells from different donors (Table 1). The probability to get such a high number of identical changes fortuitously is essentially zero (p < 10−60). These sites represent 7–20 % of all CpG-s with increased methylation that were detected in the various conditions and almost 1 % of all sites present on the Illumina array. This high number of recurrently de novo methylated CpG-s (commonly-modified CpG-s, CM-CpG) in a subset of genes suggests that the exposure of the cells to vector particles triggers the action of a common mechanism.

Fig. 3
figure 3

Venn diagram indicating the overlap between the eight different conditions identified as having “high” effect on increased methylation. 4126 is the number of CM-CpG sites. The different conditions were A dINT1b (76382 CpG-s); B dINTa (20337 CpG-s); C LV3a (29123 CpG-s); D LV3b (24818 CpG-s); E LV2 (29007 CpG-s); F dINT2 (57624 CpG-s); G dGEN2 (31059 CpG-s); H dGEN1b (20566 CpG-s), as defined on the Table 1

The majority (86 %) of the 4126 common modified CpG sites were found in CpG islands (CGI) (Fig. 4) in gene promoters, 5′UTRs, 1st exons and enhancers of 1059 genes (Additional file 2). The CpG-s in island “shores”, “shelves” or at non-assigned positions were strikingly underrepresented on our list (Fig. 4). The preferential localization of the CM-CpG-s to islands explains the high χ 2 score obtained when the clustered distribution of the modifications were calculated (Fig. 2b).

Fig. 4
figure 4

Genomic localization of the CM-CpG-s. a Distribution of all CpG-s interrogated by the array according to their sequence context as provided by the manufacturer. b Sequence environment of the CM-CpG-s

We compared the list of the 1059 genes with that of genes reported to be expressed in CD34+ cells [15]. Of the 928 genes associated with CM-CpG-s and represented on the chip only 265 genes were expressed and 663 were silent in pre-activated CD34+ cells (the remaining 131 genes were not interrogated by the chip). This is significantly higher than the proportion of the non-expressed genes in the dataset (529 genes expressed versus 399 silent genes).

The integration of the HIV-1-derived LV-s in the host cell’s genome is not random. Although the DNA methylation changes occur independently of the integration, it is possible that the potential integration sites and the location of the CM-CpG-s are somehow related. To clarify this we calculated the distances between each CM-CpG and the position of closest integration site in a dataset of more than 30,000 LV integration sites [15]. The distribution of the distances was found to be random, showing that the sites targeted by the methylation changes and by the integration are unrelated.

In order to determine if these genes belong to specific functional categories, we have verified their functional ontology annotation using the DAVID tool. No significant enrichment was found. The 5 best represented functional categories of genes were involved in: regulation of transcription (GO:0045449; 224 genes, p < 7 × 10−14, 1.6 fold enrichment), regulation of RNA metabolic processes (GO:0051252; 185 genes, p < 1.1 × 10−19, 1.9 fold enrichment), regulation of transcription DNA dependent (GO:0006355; 184 genes, p < 2.4 × 10−19, 1.9 fold enrichment), positive regulation of biosynthetic processes (GO:0009891, p < 9.6 × 10−11, 2.1 fold enrichment) and neuron differentiation (GO:0030182, 77 genes, p < 4.7 × 10−20, 3.2 fold enrichment). Although any of these categories showed significant enrichment, examples of genes found on the list are known to impact the normal or pathological cell differentiation: genes coding for zinc finger-, homeobox-containing transcription factors, nuclear receptors,—de novo DNA methyl transferase Dnmt3A and SWI/SNF a chromatin structure modifying enzyme. Other genes on the list are implicated specifically in hematopoietic differentiation (e.g. GATA2, VENTX or CEBPd) or have important immunologic function (CD8a and b or BCL2L11). We also compared our gene list to the comprehensive cancer gene list established on the basis of observations from several laboratories ( Importantly, the two lists contained 152 genes in common. These observations clearly show that the methylated CM-CpG islands are not targeted for the biological function of the genes they regulate. Nevertheless, if the expression of these genes is perturbed by the methylation reprogramming, the functional consequences may be important and the differentiation of the cells may be impacted.


The purpose of the present study was twofold: to characterize the genomic distribution of the DNA methylation changes in response to LV and to investigate if these changes are related to the genomic integration of the vector. To this end, a stringent algorithm was developed to detect reproducible and clustered methylation changes. The main finding in this paper is that CD34+ cells respond to some batches of LV by systematic de novo methylation of the same CpG islands in the proximity of 1059 genes. The highly significant reproducibility of the “strong” effect in cells of 24 different donors and 8 different vector batches underscores the importance of the cellular response. However, other LV batches induce a milder “low” effect characterized by fewer modified CpG-s in dispersed genomic locations without recurrent changes and in a smaller proportion of cells. These observations can be understood as a heterogeneous manifestation of the same type of cellular response: some LV batches pass the threshold and induce “high” cellular reaction, while others stay below and elicit only a mild ‘low” effect. Although the precise mechanisms for the establishment of de novo methylation patterns in the genome are not yet elucidated [1618], rapid changes are possible due to the highly dynamic maintenance of the DNA methylation [16, 18]. Various types of environmental stress are known to generate epigenetic response. For example, physiological stress related to under- or over-nutrition during fetal life is associated with global shifts towards DNA hypermethylation in CD34+ cells [19]. Stress related to in vitro culture of HSC-s also increases the methylation at specific sites in correlation with the loss of differentiation potential [13]. Our observations provide a new example of environmental stimulus or stress causing essentially de novo methylation.

An important issue is to evaluate the functional consequences of this response. In this respect, we observe that genes that become methylated belong to a wide range of functional pathways and not to specific categories. Therefore, these genes are probably not targeted for their function, but most likely because of their genomic location. Most of them are not expressed at the stage when the modification occurs, which reinforces this hypothesis. If the de novo methylation is maintained and perturbs gene expression at a later stage, the potential phenotypic effects are expected to be highly variable. This will make their study particularly difficult in future investigations.

The induced methylation changes were clearly independent of the integration of the vector genome in the cellular DNA and were not caused by proviral DNA because both the non-integrative dINT and dGEN vectors had the same effect. No difference in potency for CD34+ transduction or titer could distinguish the LV with “high” or “low” effects. Furthermore, the commonly methylated positions did not colocalize in a statistically significant manner with LV genomic integration sites reported in CD34+ cells by others [15]. We think that the observed changes represent an active cellular response to some vector components, but the identity of the trigger remains unknown. The HIV integrase is required for the steps of reverse transcription, for the formation of the preintegration complex and mediates genomic insertion by tethering the preintegration complex to the host cell chromatin and associated factors [20]. dINT particles which completely lack integrase due to a stop codon in the pol gene sequence exerted a strong DNA methylation effect on HSC which is therefore independent of any of all these steps described. The effect is also independent the accumulation of episomal LTR-circles which do not occur here. In some models, LV enhance chromatin remodelling and nuclear reprogramming via the stimulation of TLR3 pathway [6, 7]. It is unlikely that TLR3 is involved in our system, as double-stranded viral RNA that classically triggers TLR3 is not be produced significantly by dINT and dGEN vectors. Whereas we can exclude that the DNA methylation is caused by the integration of LV or by the provirus, there are other possibilities to consider based on the requirement for cell entry to generate the “high” cellular response. A single viral protein can cause genome-wide epigenetic changes as exemplified by adenovirus E1A [21, 22]. One possibility is that a minimal concentration of a viral component is necessary to induce the effect. A role of the VSVg envelope protein can be envisioned. VSVg is present on all LV tested even those with “low” effects but the number of VSVg molecules per particle are known to vary from 600 to 2200 per virion depending on the batches [23] and this could explain the variability of effects between LV batches. VSVg has been reported to constitute tubulovesicules in LV preparations that entrap plasmid DNA and induce TLR9-mediated innate immune responses in target cells [24]. In such case, standardized methods and the inclusion of a purification step should homogenize VSVg quality and quantity on LV and may be important to reduce a DNA methylation effect. Indeed, standardized LV preparations purified using ion exchange chromatography systematically induced “low” responses. Routine measurements of VSVg on LV batches will have to be developed to monitor this effect.

Another possible explanation is that the entry of large amounts of lentiviral vector components with non-integrative particles triggers the cellular response. All vector preparations, even infectious LV batches are heterogeneous, they contain varying levels of lentiviral protein components such as reverse transcriptase [23] and include no more than 1 % of infectious particles when comparing infectious and physical titers. Such inter-batch differences are not detected by infectivity measurements (Additional file 1). These non-integrative particles, when introduced in the cell, may trigger DNA methylation changes, as is observed with the dINT or dGEN. Additional studies will be needed to further address this point.

Altogether, our observations strongly suggest that LV batch quality is important, can be optimized at the upstream stages and may have a major impact on the transduced cells. Further studies are needed to develop quality control procedures to assess the DNA methylation effects of different preparations and to ensure that only LV with “low” properties would be used to transduce HSC.

LV-s are increasingly popular as gene transfer tools in the everyday laboratory practice, but also in therapeutic strategies where gene transfer is required. Integration-defective LV are also frequently preferred when transient expression of the transgene is required as, for example, in genome editing using CRISPR/Cas technology. Our observations which were made in clinically-relevant conditions (using apheresed CD34+ cells, the same cytokines and medium as used in clinical trials [25] and purified LV) point to the necessity of further investigating the mechanisms leading to the epigenetic changes. Understanding these mechanisms could open the way for further improving the safety of LV-s.


Using a sensitive pipeline for data analysis we identified a previously uncharacterized but consistent cellular response to viral components that involves a systematic de novo methylation of more than 4000 CpG-s associated with 1059 different genes in the genome. These observations provide a novel example of environmental stress modified epigenome.


Cell culture

Human mobilized peripheral blood cells were obtained with informed consent according to international ethical principles and according to the French national bioethic law (no 2011-814). The study no DC-2012-1655 was approved by the French Ministry of Research and Higher Education-Bioethics office in September 2012. The anonymous samples were provided to us by accredited laboratories including a biological resource center (MYOBANK-AFM in Paris ( code BB-0033-00012), the French blood bank (EFS) (Hospital Henri Mondor, Créteil), the cell therapy laboratories of Gustave Roussy Institute (Villejuif, France) or University Hospital Center of Amiens (Amiens, France). The CD34+ haematopoietic stem cell fraction was purified by Ficoll gradient centrifugation and magnetic bead cell sorting according to the manufacturer’s instructions (Miltenyi). Purified cells were systematically analyzed by FACS and the CD34+ population was always >95 %. In total 12 purified samples from 12 different donors were used in the present study.

LV transduction

CD34+ cells were transduced as described previously [8]. Briefly, after purification 1 × 106 cells were pre-activated overnight at 37 °C in X-VIVO culture medium, supplemented with antibiotics and cytokines (Flt3-ligand (300 ng/ml), stem cell factor (300 ng/ml), trombopoietin (100 ng/ml) and IL3 (20 ng/ml). Transduction was performed in the presence of 4 μg/ml protamine sulfate (Sigma) with 5 × 107 IG/ml (infectious genome) LV or equivalent amounts of p24 as specified in Additional file 1. A 2nd round of transduction was performed in the same conditions 6 h later. Transduction reaction was stopped 24 h after the beginning of the first round of transfection and cells were collected. After washing the cells a small fraction was put back to culture in preactivating culture supplemented with 10 % SVF and transfection efficiency was verified 8 days later by FACS of GFP+ cells. Integrated vector copy number was determined at the same time on the same fraction by qPCR.

Lentiviral vectors

HIV-1 derived lentiviral vectors (LV) expressing eGFP under the control of the human PGK promoter and pseudotyped with VSVg using a 3- or 4-plasmid system as indicated in Additional file 1 were produced by transient transfection of 293T cells. The 3-plasmid system used the p8.74 plasmid coding for HIV-1 gag pol rev tat accessory genes. The 4-plasmid system used the pKLgagpol and pKRev plasmids. The culture medium was collected, concentrated by ultracentrifugation or chromatography and aliquots were stored at −80 °C. The chromatography-purified preparations were produced as described [14]. Vector titers were determined as infectious genome/ml (IG/ml) by qPCR after infection of HCT116 cells or as physical titers based on measured of P24 content by ELISA. Several deficient vectors were used in this study in addition to LV. dENV and dGEN vectors were respectively produced by transient transfection as above omitting the VSV-G encoding plasmid or the transfer plasmid encoding for eGFP. dINT particles were obtained by using the p8.74-INTSTOP plasmid in which a stop codon mutation was introduced in the pol gene. Titers of dINT, dGEN and dENV particles was determined by P24 content.

Array based DNA methylation profiling and data analysis

LV transfection of the cells was followed by genomic DNA extraction. From each sample 500 ng genomic DNA was bisulfite treated using EZ DNA methylation kit and submitted to hybridization to the Infinium Human Mehtylation 450K BeadChip (Illumina) according to the manufacturer’s instruction. Results of the methylation assay were expressed as β values (ranging between 0—non-methylated, and 1—methylated) for each individual CpG present on the array. Raw data were extracted and normalized using Genome Studio. Only non-polymorphic CpG-s located on somatic chromosomes were considered. (The data are available under GEO accession number: GSE72867). We used the DAT algorithm to score individual sites for their methylation status. The DAT algorithm was implemented in “R”. The genes assigned to the CpGs were identified according to the array’s annotation. Functional annotation of the genes was performed using the online DAVID Functional annotation tool (



lentiviral vectors


hematopoietic stem and progenitor cells


double average technique


commonly modified CpG-s


green fluorescent protein


human immunodeficiency virus


vesicular stomatitis virus glycoprotein


vector copy number


  1. Jaenisch R, Bird A. Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals. Nat Genet. 2003;33(Suppl):245–54.

    Article  CAS  PubMed  Google Scholar 

  2. Kaufmann KB, Buning H, Galy A, Schambach A, Grez M. Gene therapy on the move. EMBO Mol Med. 2013;5:1642–61.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. Liu KC, Lin BS, Gao AD, Ma HY, Zhao M, Zhang R, Yan HH, Yi XF, Lin SJ, Que JW, et al. Integrase-deficient lentivirus: opportunities and challenges for human gene therapy. Curr Gene Ther. 2014;14:352–64.

    Article  PubMed  Google Scholar 

  4. Breckpot K, Escors D, Arce F, Lopes L, Karwacz K, Van Lint S, Keyaerts M, Collins M. HIV-1 lentiviral vector immunogenicity is mediated by Toll-like receptor 3 (TLR3) and TLR7. J Virol. 2010;84:5627–36.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  5. Agudo J, Ruzo A, Kitur K, Sachidanandam R, Blander JM, Brown BD. A TLR and non-TLR mediated innate response to lentiviruses restricts hepatocyte entry and can be ameliorated by pharmacological blockade. Mol Ther. 2012;20:2257–67.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Kane NM, Nowrouzi A, Mukherjee S, Blundell MP, Greig JA, Lee WK, Houslay MD, Milligan G, Mountford JC, von Kalle C, et al. Lentivirus-mediated reprogramming of somatic cells in the absence of transgenic transcription factors. Mol Ther. 2010;18:2139–45.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Lee J, Sayed N, Hunter A, Au KF, Wong WH, Mocarski ES, Pera RR, Yakubov E, Cooke JP. Activation of innate immunity is required for efficient nuclear reprogramming. Cell. 2012;151:547–58.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Yamagata Y, Parietti V, Stockholm D, Corre G, Poinsignon C, Touleimat N, Delafoy D, Besse C, Tost J, Galy A, et al. Lentiviral transduction of CD34(+) cells induces genome-wide epigenetic modifications. PLoS ONE. 2012;7:e48943.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Araki H, Yoshinaga K, Boccuni P, Zhao Y, Hoffman R, Mahmud N. Chromatin-modifying agents permit human hematopoietic stem cells to undergo multiple cell divisions while retaining their repopulating potential. Blood. 2007;109:3570–8.

    Article  CAS  PubMed  Google Scholar 

  10. Chaurasia P, Gajzer DC, Schaniel C, D’Souza S, Hoffman R. Epigenetic reprogramming induces the expansion of cord blood stem cells. J Clin Investig. 2014;124:2378–95.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. De Felice L, Tatarelli C, Mascolo MG, Gregorj C, Agostini F, Fiorini R, Gelmetti V, Pascale S, Padula F, Petrucci MT, et al. Histone deacetylase inhibitor valproic acid enhances the cytokine-induced expansion of human hematopoietic stem cells. Cancer Res. 2005;65:1505–13.

    Article  PubMed  Google Scholar 

  12. Ji H, Ehrlich LI, Seita J, Murakami P, Doi A, Lindau P, Lee H, Aryee MJ, Irizarry RA, Kim K, et al. Comprehensive methylome map of lineage commitment from haematopoietic progenitors. Nature. 2010;467:338–42.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Weidner CI, Walenda T, Lin Q, Wolfler MM, Denecke B, Costa IG, Zenke M, Wagner W. Hematopoietic stem and progenitor cells acquire distinct DNA-hypermethylation during in vitro culture. Sci Rep. 2013;3:3372.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Merten OW, Charrier S, Laroudie N, Fauchille S, Dugue C, Jenny C, Audit M, Zanta-Boussif MA, Chautard H, Radrizzani M, et al. Large-scale manufacture and characterization of a lentiviral vector produced for clinical ex vivo gene therapy application. Hum Gene Ther. 2011;22:343–56.

    Article  CAS  PubMed  Google Scholar 

  15. Cattoglio C, Pellin D, Rizzi E, Maruggi G, Corti G, Miselli F, Sartori D, Guffanti A, Di Serio C, Ambrosi A, et al. High-definition mapping of retroviral integration sites identifies active regulatory elements in human multipotent hematopoietic progenitors. Blood. 2010;116:5507–17.

    Article  CAS  PubMed  Google Scholar 

  16. Law JA, Jacobsen SE. Establishing, maintaining and modifying DNA methylation patterns in plants and animals. Nat Rev Genet. 2010;11:204–20.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Wu SC, Zhang Y. Active DNA demethylation: many roads lead to Rome. Nat Rev Mol Cell Biol. 2010;11:607–20.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Yamagata Y, Szabo P, Szuts D, Bacquet C, Aranyi T, Paldi A. Rapid turnover of DNA methylation in human cells. Epigenetics. 2012;7:141–5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Delahaye F, Wijetunga NA, Heo HJ, Tozour JN, Zhao YM, Greally JM, Einstein FH. Sexual dimorphism in epigenomic responses of stem cells to extreme fetal growth. Nat Commun. 2014;5:5187.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Craigie R, Bushman FD. HIV DNA integration. Cold Spring Harbor perspectives in medicine. 2012;2:a006890.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Ferrari R, Pellegrini M, Horwitz GA, Xie W, Berk AJ, Kurdistani SK. Epigenetic reprogramming by adenovirus e1a. Science. 2008;321:1086–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Horwitz GA, Zhang K, McBrian MA, Grunstein M, Kurdistani SK, Berk AJ. Adenovirus small e1a alters global patterns of histone modification. Science. 2008;321:1084–5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Denard J, Rundwasser S, Laroudie N, Gonnet F, Naldini L, Radrizzani M, Galy A, Merten OW, Danos O, Svinartchouk F. Quantitative proteomic analysis of lentiviral vectors using 2-DE. Proteomics. 2009;9:3666–76.

    Article  CAS  PubMed  Google Scholar 

  24. Pichlmair A, Reis e Sousa C. Innate recognition of viruses. Immunity. 2007;27:370–83.

    Article  CAS  PubMed  Google Scholar 

  25. Hacein-Bey Abina S, Gaspar HB, Blondeau J, Caccavelli L, Charrier S, Buckland K, Picard C, Six E, Himoudi N, Gilmour K, et al. Outcomes following gene therapy in patients with severe Wiskott-Aldrich syndrome. JAMA. 2015;313:1550–63.

    Article  PubMed  Google Scholar 

Download references

Authors’ contributions

TA and DS performed the experiments and prepared the figures. RY and CP produced the vectors. TW and GC performed the statistical analysis. TA, DS, AP, NT and JT performed the bioinformatic analysis. TA, DS, AG and AP analyzed the data. AP conceived the project. AP and AG designed the experiments and wrote the manuscript. All authors reviewed the manuscript. All authors read and approved the final manuscript.


Not applicable.

Competing interests

The authors declare that they have no competing interests.

Availability of data and materials

The data sets supporting the conclusions of this article are available in the NCBI GEO ( under Accession Number GSE72867.

Consent for publication

All authors read and approved the final manuscript.

Ethics approval

In France, an IRB approval is not needed to perform non-interventional, anonymized research experiments on blood products. We need to comply with the Bioethics law and to declare the use and storage of human materials to the government (Ministry of Research). Our declaration number is DC-2012-1655, as indicated in the methods. We obtain, under contractual agreement, our blood cells from accredited laboratories (such as the French Blood Bank) which comply with the law, and these laboratories are listed in the Methods section.


This work was supported by the ANR Grant Stochagene Grant No. BSV6 014 02 and Genethon. TA was supported by the OTKA K 100638 and RBUCE/UP Grants.

Author information

Authors and Affiliations


Corresponding authors

Correspondence to Anne Galy or Andràs Paldi.

Additional information

Tamas Aranyi and Daniel Stockholm contributed equally to this work

Additional files

Additional file 1. Characteristics of different batches of vector and particles tested on CD34+ cells.


Additional file 2. Table with the names and ID-s of the genes associated with CM-CpG-s and the number of CM-CpG-s in the island.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Aranyi, T., Stockholm, D., Yao, R. et al. Systemic epigenetic response to recombinant lentiviral vectors independent of proviral integration. Epigenetics & Chromatin 9, 29 (2016).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: