- Open Access
The methylome of the marbled crayfish links gene body methylation to stable expression of poorly accessible genes
Epigenetics & Chromatin volume 11, Article number: 57 (2018)
The parthenogenetic marbled crayfish (Procambarus virginalis) is a novel species that has rapidly invaded and colonized various different habitats. Adaptation to different environments appears to be independent of the selection of genetic variants, but epigenetic programming of the marbled crayfish genome remains to be understood.
Here, we provide a comprehensive analysis of DNA methylation in marbled crayfish. Whole-genome bisulfite sequencing of multiple replicates and different tissues revealed a methylation pattern that is characterized by gene body methylation of housekeeping genes. Interestingly, this pattern was largely tissue invariant, suggesting a function that is unrelated to cell fate specification. Indeed, integrative analysis of DNA methylation, chromatin accessibility and mRNA expression patterns revealed that gene body methylation correlated with limited chromatin accessibility and stable gene expression, while low-methylated genes often resided in chromatin with higher accessibility and showed increased expression variation. Interestingly, marbled crayfish also showed reduced gene body methylation and higher gene expression variability when compared with their noninvasive mother species, Procambarus fallax.
Our results provide novel insights into invertebrate gene body methylation and its potential role in adaptive gene regulation.
The marbled crayfish (Procambarus virginalis) represents a novel freshwater crayfish species that emerged in the German aquarium trade in 1995 . Marbled crayfish reproduce by apomictic parthenogenesis, thus producing large numbers of genetically identical offspring [2,3,4]. It is assumed that P. virginalis originated from a very recent evolutionary macromutation in the Florida slough crayfish Procambarus fallax [5, 6]. Comparative whole-genome sequencing of a diverse set of animals from various sources has shown that the global population can be considered a single genetic clone with negligible genetic variation .
Despite their genetic homogeneity, marbled crayfish have successfully invaded and colonized a variety of habitats in subtropical and temperate regions [8, 9]. This is exemplified by the rapid propagation of marbled crayfish on Madagascar, where the animals have increased their distribution area 100-fold over the past 10 years . Of note, the genetic homogeneity of the population precludes the selection of genetic variants as an explanation for rapid adaptation. As such, it is important to understand epigenetic regulation in marbled crayfish.
DNA methylation represents a conserved and well-established epigenetic modification [10,11,12]. It is mediated by the family of DNA methyltransferases (DNMTs) which catalyze the methylation of genomic cytosine residues in a wide range of organisms and provide an important toolkit for epigenetic regulation . Two previous studies have used capillary electrophoresis and mass spectrometry to demonstrate the presence of DNA methylation in marbled crayfish [4, 6]. In addition, changes in global DNA methylation levels have been linked to phenotypic variants . However, information about the DNA methylation toolkit, the patterning of DNA methylation and its potential function has been lacking.
Single-base resolution methylation maps have been generated for a large variety of species and have shown a surprising diversity of DNA methylation patterns in the animal kingdom [14,15,16], ranging from almost ubiquitous methylation to low levels or no methylation. Also, different species can show methylation in distinct subgenomic compartments, including promoters, repetitive elements and gene bodies [14, 15]. Interestingly, gene body methylation is often associated with housekeeping genes [17, 18].
The function of gene body methylation remains to be fully understood [19, 20]. The modification has been linked to various aspects of gene regulation, including transcriptional elongation, mRNA splicing, chromatin structure and the suppression of cryptic intragenic promoters in transcribed chromatin [21,22,23,24]. In invertebrates, the preferential methylation of highly conserved genes with housekeeping functions and their moderate but stable expression similarly suggests a regulating function for gene expression, possibly through the suppression of transcriptional noise or expression variation [17, 18, 25, 26]. It has also been shown that conflicting chromatin states, determined by the absence of active histone marks and presence of repressive histone marks, are associated with high levels of transcription noise in actively transcribed genes . However, a clear mechanistic understanding of the role of gene body methylation in gene expression variability remains lacking.
We have now used whole-genome bisulfite sequencing to establish high-resolution methylation maps of P. virginalis from several independent animals and tissues. We also performed RNA-seq and chromatin accessibility assay sequencing (ATAC-seq) to integratively analyze the effect of gene body methylation on chromatin accessibility states and gene expression. The results reveal a DNA methylation pattern that is characterized by tissue-invariant gene body methylation of housekeeping genes. While gene body methylation was negatively associated with chromatin accessibility, we also found that active genes with variable expression were less accessible than stably expressed genes. Comparing gene body methylation levels and gene expression variation levels in the marbled crayfish and its parent species P. fallax, we observe overall lower gene body methylation levels and higher gene expression variation levels in the marbled crayfish. Together, these findings establish the methylome of an emerging invasive animal and provide novel insight into invertebrate gene body methylation.
Identification of a conserved DNA methylation system in marbled crayfish
We have recently assembled the transcriptome of the marbled crayfish and also established a draft assembly of the complete genome sequence . Genome annotation identified single crayfish homologs for Dnmt1, Dnmt3 and a Tet hydroxymethylase . Virtual translation of the corresponding transcripts produced protein sequences with robust sequence conservation to functionally characterized honeybee and human homologs. A more detailed analysis of the predicted crayfish Dnmt1 homolog revealed a protein with a length of 1566 amino acids, which contained all the known Dnmt1 protein domains in the correct order (Fig. 1a). Furthermore, our analysis of the predicted marbled crayfish Dnmt3 protein identified a protein with 1112 amino acids with the known Dnmt3 protein domains (Fig. 1b). We also investigated the predicted crayfish Tet enzyme. This revealed a protein of 1313 amino acids with substantial sequence homology to honeybee and human Tet enzymes, including two conserved oxygenase domains (Fig. 1c). Together, these findings suggest the presence of a conserved DNA methylation and demethylation system in marbled crayfish.
To confirm the expression of the marbled crayfish DNA methylation system, we used qRT-PCR analysis of various developmental stages and dissected tissues from adult animals. Based on published data from Daphnia pulex  and an evaluation of five different housekeeping genes (Additional file 1), TATA-box-binding protein (TBP) mRNA was used as an internal reference. The results showed low mRNA levels for all three genes in early embryonic stages. Dnmt1 became strongly upregulated in embryonic stage 1.5, while Dnmt3 expression continuously increased during embryogenesis (Fig. 1d). Tet mRNA levels became strongly increased during mid-embryogenesis and remained high in juveniles (Fig. 1d). In adult tissues, Dnmt1 was stably expressed at moderate levels (Fig. 1e), which is consistent with a general maintenance methyltransferase function of the enzyme. The expression pattern of Dnmt3 appeared to be more tissue specific, with the highest level in hemocytes and lowest level in the ovary (Fig. 1e). Tet expression was high in most tissues, but low in the ovary (Fig. 1e). Together, these data show that the components of the DNA methylation system are dynamically expressed during marbled crayfish development and in adult tissues.
Characterization of the marbled crayfish methylome
We used whole-genome bisulfite sequencing to characterize methylation patterns at single-base resolution. Sequencing of hepatopancreas DNA at 9.8 × genome coverage (Additional file 2) uncovered a methylation pattern that was CpG specific, bimodal and symmetric (Additional file 3) and thus recapitulates key hallmarks of other known animal methylomes [14, 15]. A comparative analysis in 2-kb sliding windows (Fig. 2a) revealed that the marbled crayfish methylome showed substantially more highly methylated windows than other known crustacean methylomes [29, 30]. This suggests that the marbled crayfish genome is relatively highly methylated.
We also quantified CpG methylation levels for various subgenomic features, such as repeats, exons and introns. This revealed that repeats were relatively hypomethylated (Fig. 2b), while methylation was strongly enriched at gene bodies (Fig. 2b, c). More specifically, average methylation levels were found slightly increased over the genome average in 5’-UTRs and more strongly increased in exons, introns and 3’-UTRs (Fig. 2b). Interestingly, gene body methylation showed a bimodal distribution, with distinct populations of low-methylated and high-methylated genes (Fig. 2d). Further analysis revealed that methylation preferentially targets long, CpG-poor and evolutionarily conserved genes (Additional file 4). These characteristics represent defining features of housekeeping genes, and indeed, methylation levels of housekeeping genes were strongly elevated when compared with other genes (Fig. 2e). Together, our findings thus suggest that DNA methylation in marbled crayfish is enriched at gene bodies and identify housekeeping genes as an important methylation target.
The marbled crayfish methylome is largely tissue invariant
To further characterize DNA methylation in marbled crayfish, we performed whole-genome bisulfite sequencing on DNA samples from seven additional animals and tissues (Additional file 2). Our analysis included two distinct adult tissues (hepatopancreas and abdominal musculature), each from three independent animals and from various sources (laboratory stocks and wild catches). In addition, we also included single replicates from early embryos (stage 1.7) and a third adult tissue (hemocytes). Genome coverage ranged from 9 × to 23 × (Additional file 2), thus ensuring sufficient analytical power. Strikingly, a comparative analysis of all eight methylomes failed to reveal tissue-specific or developmental stage-specific changes (Fig. 3a). This was also confirmed by a Wilcoxon rank-sum test, which did not identify any significantly differentially methylated genes between abdominal musculature and hepatopancreas. A specific analysis of housekeeping genes confirmed their pronounced methylation and again suggested that the marbled crayfish methylome is largely tissue invariant (Fig. 3b). It remains possible that a subset of variably methylated genes show moderate context-dependent methylation changes, but a substantially greater number of samples will be required for their identification .
Further analysis also showed that the majority of repetitive elements was unmethylated, while some minor repeat classes, such as DNA transposons and TcMar-Tigger elements, showed higher methylation levels (Fig. 3c, Additional file 5). On the genome level, repeat methylation appeared strongly associated with gene body methylation, as repeats outside of genes had lower methylation levels compared to repeats that were located within genes (Fig. 3d). Of note, repeat methylation again appeared largely invariant between different tissues (Additional file 5), which is consistent with the methylation patterns observed for genes.
Gene body methylation, chromatin accessibility and gene expression variability
To investigate potential gene regulatory functions of DNA methylation in marbled crayfish, we integrated our methylation datasets with RNA-seq datasets that were obtained from three independent hepatopancreas and abdominal musculature samples, respectively (Additional file 6). The results showed a parabolic correlation between gene body methylation and gene expression levels, with the highest and lowest expression ranks being relatively undermethylated (Additional file 7). These results are similar to findings originally made in plants [22, 32]. For further insight, we also addressed the relationship between DNA methylation and chromatin accessibility. For this purpose, we used ATAC-seq  to generate high-resolution chromatin accessibility maps that could be analyzed together with DNA methylation maps. ATAC-seq was successfully established for marbled crayfish hemocytes (Additional file 8), which are isolated cells that are suitable for ATAC-seq analysis (Additional file 9). We also generated RNA-seq data from hemocytes (Additional file 10) and integrated the WGBS data from hemocytes into our analysis.
From the ATAC-seq datasets (N = 3), we identified 89,156 accessible peaks for hemocytes. Of these, 4558 overlapped with promoter regions. Heatmaps for methylated and unmethylated gene bodies show that chromatin accessibility around the transcription start site is more prominent when methylation is low (Additional file 11). Consistent with observations in mouse embryonic stem cells , the most expressed genes (quintile 5) had a considerably elevated level of chromatin accessibility (Fig. 4a). We also found low to moderately methylated genes (genes with a mean gene body methylation ratio of 0–0.4) to be distinctly more accessible than highly methylated genes (Fig. 4b). Consistently, housekeeping genes, which are often strongly methylated in marbled crayfish, were found in chromatin states with limited accessibility (Fig. 4c).
In further analyses, we also explored the relationship between gene body methylation, chromatin accessibility and gene expression variability. Consistent with earlier observations in other organisms [25, 26, 35], we observed that low-methylated genes show greater gene expression variability (Fig. 4d). This inverse correlation between gene body methylation and gene expression variability was also conserved in the two other tissues that were analyzed, hepatopancreas and abdominal muscle (Additional file 12). Also, convincing examples for high-methylated genes with low gene expression variability, and low-methylated genes with high expression variability were identified (Additional file 12). We next divided genes into low-methylated (average methylation ratio < 0.4) and high-methylated (average methylation ratio > 0.4) gene to understand the association between gene expression variation and chromatin accessibility. The results showed that low-methylated genes with stable expression were distinctly more accessible than low-methylated genes with high gene expression variability (Fig. 4e). In contrast, high-methylated genes showed no major difference in accessibility for stably and variably expressed genes (Fig. 4f). These findings suggest that gene body methylation promotes stable expression of poorly accessible genes.
Increased gene body methylation and reduced gene expression variability in P. fallax
The marbled crayfish is a recent clonal descendant of the sexually reproducing slough crayfish, P. fallax [5, 6]. However, P. fallax shows no evidence for invasiveness and populates a defined area in Florida and southern Georgia [36,37,38]. We therefore generated three P. fallax datasets (2 × hepatopancreas, 1 × abdominal musculature), with genome coverage ranging from 10 × to 11 × (Additional file 2) for a comparative analysis of P. fallax and marbled crayfish methylomes. As the two species are defined by a very close phylogenetic and genetic relationship , reads could be mapped to the marbled crayfish genome. The results revealed a methylation pattern that was overall similar to P. virginalis (Fig. 5a). However, we also identified 2357 genes with species-specific methylation differences, the majority of which (> 90%) appeared more methylated in P. fallax (Fig. 5b). Overall, gene body methylation levels were significantly reduced in marbled crayfish (Fig. 5c, P < 2.2e−16), consistent with earlier findings that suggested a moderate but significant reduction in global DNA methylation levels during the transition from P. fallax to marbled crayfish . Notably, gene expression variability was significantly elevated in marbled crayfish (Fig. 5d, P < 5.58e−13). Whether gene body hypomethylation facilitates the phenotypic adaptation of marbled crayfish through increased gene expression variability will have to be addressed in future studies.
DNA methylation is a highly conserved modification in the animal kingdom [14, 15]. However, relatively little is known about its potential function in epigenetic regulation outside of mammals. Our study provides an in-depth analysis of DNA methylation in marbled crayfish, an emerging model organism and new invasive species that is characterized by genetic uniformity and high adaptive potential.
A large number of arthropod methylomes have been published over the past years [14,15,16]. This includes the methylomes from two crustaceans, Daphnia  and P. hawaiensis . However, all known arthropod methylomes have so far been analyzed using DNA preparations from whole animals or a single, specific tissue. We carried out a direct comparison of arthropod methylation patterns from different tissues and from different animals. Our results show that the marbled crayfish methylome is highly conserved between different tissues and therefore substantially different from paradigmatic mammalian methylomes. Tissue-invariant methylation patterns have also been concluded from a comparison of single sperm and muscle methylomes from the tunicate Ciona intestinalis . However, this feature has not been investigated systematically yet and it will be interesting to determine its conservation in invertebrates.
The stability of DNA methylation levels and patterns in marbled crayfish contrasts the dynamic expression of the DNA methylation toolkit during development and in different adult tissues and may suggest additional non-catalytic functions of these enzymes . It should also be noted that the erasure of parental DNA methylation patterns is considered a fundamental requirement for the establishment of totipotency and organismal development in mammals [41, 42]. It is possible that DNA methylation reprogramming in marbled crayfish occurs before the earliest developmental stage that could be investigated by whole-genome bisulfite sequencing (stage 1.7). Alternatively, DNA methylation may not play a major role in the development of the parthenogenetic marbled crayfish.
The marbled crayfish methylome also showed several additional defining features. This includes the relatively sparse methylation of repeats. While similar observations have been made in a few other organisms, including honeybees , repeat methylation is a functionally important feature of many animal and plant genomes [44,45,46]. Also, while mammalian genomes are characterized by dynamic methylation at regulatory regions, such as promoters and enhancers [47,48,49], the marbled crayfish methylome is characterized by gene body methylation.
Gene body methylation is often associated with actively transcribed genes . However, we found that gene body methylation in the marbled crayfish does not show a clear correlation with gene expression levels. Our genome-wide chromatin accessibility analysis revealed that the most highly accessible genes were methylated at low levels and most strongly expressed. Similar findings were also recently published in mouse embryonic stem cells . Furthermore, and in agreement with previous observations in insects and in human tissues [25, 26, 35], our analyses showed that gene body methylation inversely correlates with gene expression variability. Finally, the results from our integrative analysis of whole-genome bisulfite sequencing, ATAC-seq and RNA-seq identified two distinct states and shed light on the potential function of gene body methylation (Fig. 6): high-methylated genes (such as housekeeping genes) reside in poorly accessible chromatin and are stably expressed, while low-methylated genes reside in open chromatin and are variably expressed. In this context, gene body methylation might function to increase the specificity of DNA-binding proteins, such as transcription factors , in poorly accessible chromatin structures and thus result in a more stable pattern of gene expression.
Of note, we also found a large number of genes to be hypomethylated in marbled crayfish when compared with its parent species, P. fallax. In addition, we also observed that gene expression variability was significantly increased in marbled crayfish. Variable gene expression has recently been identified as a key mechanism for coral adaptation to a variable environment . Furthermore, epigenetic variability has been shown to increase fitness in simulations with fluctuating environments . Altogether, our findings thus raise the interesting possibility that hypomethylation of gene bodies and its associated gene expression variability provide a mechanism for the adaptability and invasive potential of marbled crayfish.
Laboratory animals were bred as described before  and handled according to institutional guidelines for the care of laboratory animals. Wild animals were collected from lake Moosweiher (Germany, N48°01.844′E07°48.368′), Moramanga (Madagascar, S18°47.350′E48°14.764′) and lake Reilingen (Germany, N49°17.649′E08°32.672′), in compliance with local fishery regulations. Additional information is provided in Additional file 2.
Identification of P. virginalis Dnmt and Tet homologs
P. virginalis Dnmt1, Dnmt3 and Tet homologs were identified by using the respective annotated coding sequences from Daphnia pulex. Blast (v. 2.2.29 +) was used to align assembled marbled crayfish transcripts against the Daphnia pulex sequences. Candidate sequences were validated by searching with Blast against the non-redundant protein sequence database. Read coverage was analyzed by using Bowtie2 (v. 2.2.3)  to map transcriptome reads to the respective sequences. Finally, alignments were illustrated in IGV (v. 2.3.34) . In addition, the 3′ SMARTER RACE kit (Takara) was used to resolve remaining ambiguities at the 3′-end of Dnmt3.
RNA and DNA preparation
Samples of organs and tissues from adult crayfish for DNA and RNA preparation were taken under a dissection microscope, snap-frozen in liquid nitrogen and stored at − 80 °C until extraction of nucleic acids. Embryonated eggs and juveniles were snap-frozen in liquid nitrogen. Genomic DNA was isolated using the Blood and Cell Culture DNA Kit (Qiagen, Hilden, Germany), and total RNA was purified with Trizol (Invitrogen, Darmstadt, Germany).
For first-strand cDNA synthesis, RNA was reverse-transcribed using the QuantiTec Reverse Transcription Kit (Qiagen, Hilden, Germany). qRT-PCR analyses were performed on a LightCycler 480 Real-Time PCR System (Roche, Mannheim, Germany) using the Absolute QPCR SYBR Green Mix (Thermo Scientific, St. Leon-Rot, Germany). The expression levels of Dnmt1, Dnmt3 and Tet were determined by the mean crossing point (Cp) value of three technical replicates using the TATA-box-binding protein (TBP) as a reference gene. Primer sequences were as follows: DNMT1_for: GGGAGAAGGCACTGATTGG and DNMT1_rev: CGATCATCGTTGTTCACCAG; DNMT3_for: GAATGGAACATCAGCACCTGC and DNMT3_rev: CGGTGCTCTCATTCCACAATC; Tet_for: CCAGTAGAAGTGATCAACAGTG and Tet_rev: CCTCCAATATCTGGATCGTGG; TBP_for: CCACAGCTACAGAACATCG and TBP_rev: CTCATGATGACGGCTGC.
Whole-genome bisulfite sequencing
Genomic DNA was isolated as described above. The TruSeq PCR-Free Library Prep Kit (LT; Illumina, San Diego, US) was used for library preparation and the Epitect Kit (Qiagen) for bisulfite conversion. Library amplification was performed using the Kapa HiFi HotStart Uracil + ReadyMix (2 ×; Kapa Biosystems). Samples were then sequenced on an Illumina HiSeq platform.
Approximately 500 µL of hemolymph was collected from three independent animals using a 23-G needle inserted in the abdomen of a cold-anesthetized crayfish. One volume of anticoagulant solution (0.14 M and NaCl, 0.1 M glucose, 30 mM Na3 Citrate.2 H2O, 26 mM citric acid, 0.5 M EDTA) was added prior to centrifugation for 5 min at 300 x g and 4 °C. After washing the cell pellet twice with sterile and cold PBS 1 ×, 50.000 hemocytes were immediately used for the ATAC library preparation . The transposase reaction was optimized and a 20-min reaction was used to avoid DNA “over transposition.” The subsequent steps were as described in the original protocol . Libraries were sequenced on an Illumina HiSeq platform (Additional file 8).
Whole-genome bisulfite sequencing data analysis
Read pairs were quality trimmed (minimum quality value ≥ 15 and minimum length ≥ 36 bp), and both marbled crayfish and P. fallax data were mapped to the marbled crayfish genome assembly using BSMAP version 2.73 . Correctly mapped read pairs (appropriate orientation and distance to each other) with both reads mapping uniquely to the same scaffold were used for methylation calling. The methylation ratio (methylation calling) for each CpG was determined by the Python script distributed with the BSMAP package. The provided Python script was slightly changed to analyze only reads fulfilling the following additional criteria: (1) minimum quality value of the base at C position ≥ 30 and (2) minimum quality value of the two bases before and after the C position ≥ 20. Only C-positions with a minimum coverage of three reads per strand were used in further analyses. Bisulfite conversion rates and mapping rates are provided in Additional file 2.
Raw data for P. hawaiensis and D. pulex were downloaded from NCBI using accessions PRJNA306836 and GSE60475, respectively. Custom R scripts were used to determine methylation levels in the genome and genomic features. Violin plots were generated for individual methylomes by R’s ggplot2.violinplot function. Housekeeping genes were identified by blasting human housekeeping genes against the genome assembly. Heatmaps were generated by the heatmap.2 function of R. Data were tested for differential methylation using a Wilcoxon rank-sum test. This was applied to each gene as a paired difference test to see whether the means of the two tissue groups are significantly different from each other. The wilcox.test function in R was used with a p value cutoff of 0.1. Barplots for repeat count and methylation were generated in R using ggplot2′s geom_bar function.
RNA-seq data analysis
Rsem  was used to calculate expression levels (TPM values) for each sample of the RNA-seq datasets. TPM values were then used to calculate the coefficient of variation of expression levels per tissue and per species, methylation deciles and correlations between the two. For expression ranks, genes were grouped into quintiles and octiles by their TPM values. For hemocytes, correlation of expression levels (TPMs) between samples was confirmed (Additional file 10) and datasets were pooled for gene expression level analyses.
ATAC-seq data analysis
Raw sequencing data were trimmed and quality filtered (TrimGalore-v0.4.5). Reads were then mapped against the reference genome using Bowtie2 , and duplicates were removed with samtools . Broad ATAC peaks were called using MACS2 . After confirming a high correlation between the three biological replicates (Additional file 9), we pooled the three samples for downstream analyses. ATAC-seq coverage was directly extracted from the merged bam file for regions surrounding the transcription start site by using samtools’ bedcov function. Heatmaps and metagene plots were produced using the image function of R and the geom_smooth function of ggplot2.
Lyko F. The marbled crayfish (Decapoda: Cambaridae) represents an independent new species. Zootaxa. 2017;4363:544–52.
Scholtz G, Braband A, Tolley L, Reimann A, Mittmann B, Lukhaup C, Steuerwald F, Vogt G. Ecology: parthenogenesis in an outsider crayfish. Nature. 2003;421(6925):806.
Martin P, Kohlmann K, Scholtz G. The parthenogenetic Marmorkrebs (marbled crayfish) produces genetically uniform offspring. Naturwissenschaften. 2007;94(10):843–6.
Vogt G, Huber M, Thiemann M, van den Boogaart G, Schmitz OJ, Schubart CD. Production of different phenotypes from the same genotype in the same environment by developmental variation. J Exp Biol. 2008;211(Pt 4):510–23.
Martin P, Dorn NJ, Kawai T, van der Heiden C, Scholtz G. The enigmatic Marmorkrebs (marbled crayfish) is the parthenogenetic form of Procambarus fallax (Hagen, 1870). Contrib Zool. 2010;79:107–18.
Vogt G, Falckenhayn C, Schrimpf A, Schmid K, Hanna K, Panteleit J, Helm M, Schulz R, Lyko F. The marbled crayfish as a paradigm for saltational speciation by autopolyploidy and parthenogenesis in animals. Biol Open. 2015;4(11):1583–94.
Gutekunst J, Andriantsoa R, Falckenhayn C, Hanna K, Stein W, Rasamy JR, Lyko F. Clonal genome evolution and rapid invasive spread of the marbled crayfish. Nat Ecol Evol. 2018;2:567–73.
Jones JPG, Rasamy JR, Harvey A, Toon A, Oidtmann B, Randrianarison MH, Raminosoa N, Ravoahangimalala OR. The perfect invader: a parthenogenic crayfish poses a new threat to Madagascar’s freshwater biodiversity. Biol Invasions. 2009;11:1475–82.
Chucholl C, Morawetz K, Groß H. The clones are coming—strong increase in Marmorkrebs Procambarus fallax (Hagen, 1870) f. virginalis records from Europe. Aquat Invasions. 2012;7:511–9.
Law JA, Jacobsen SE. Establishing, maintaining and modifying DNA methylation patterns in plants and animals. Nat Rev Genet. 2010;11(3):204–20.
Smith ZD, Meissner A. DNA methylation: roles in mammalian development. Nat Rev Genet. 2013;14(3):204–20.
Schubeler D. Function and information content of DNA methylation. Nature. 2015;517(7534):321–6.
Lyko F. The DNA methyltransferase family: a versatile toolkit for epigenetic regulation. Nat Rev Genet. 2018;19:81–92.
Feng S, Cokus SJ, Zhang X, Chen PY, Bostick M, Goll MG, Hetzel J, Jain J, Strauss SH, Halpern ME, et al. Conservation and divergence of methylation patterning in plants and animals. Proc Natl Acad Sci USA. 2010;107(19):8689–94.
Zemach A, McDaniel IE, Silva P, Zilberman D. Genome-wide evolutionary analysis of eukaryotic DNA methylation. Science. 2010;328(5980):916–9.
Bewick AJ, Vogel KJ, Moore AJ, Schmitz RJ. Evolution of DNA methylation across Insects. Mol Biol Evol. 2017;34(3):654–65.
Suzuki MM, Bird A. DNA methylation landscapes: provocative insights from epigenomics. Nat Rev Genet. 2008;9(6):465–76.
Sarda S, Zeng J, Hunt BG, Yi SV. The evolution of invertebrate gene body methylation. Mol Biol Evol. 2012;29(8):1907–16.
Bewick AJ, Schmitz RJ. Gene body DNA methylation in plants. Curr Opin Plant Biol. 2017;36:103–10.
Zilberman D. An evolutionary case for functional gene body methylation in plants and animals. Genome Biol. 2017;18(1):87.
Lorincz MC, Dickerson DR, Schmitt M, Groudine M. Intragenic DNA methylation alters chromatin structure and elongation efficiency in mammalian cells. Nat Struct Mol Biol. 2004;11(11):1068–75.
Zilberman D, Gehring M, Tran RK, Ballinger T, Henikoff S. Genome-wide analysis of Arabidopsis thaliana DNA methylation uncovers an interdependence between methylation and transcription. Nat Genet. 2007;39(1):61–9.
Maunakea AK, Nagarajan RP, Bilenky M, Ballinger TJ, D’Souza C, Fouse SD, Johnson BE, Hong C, Nielsen C, Zhao Y, et al. Conserved role of intragenic DNA methylation in regulating alternative promoters. Nature. 2010;466(7303):253–7.
Neri F, Rapelli S, Krepelova A, Incarnato D, Parlato C, Basile G, Maldotti M, Anselmi F, Oliviero S. Intragenic DNA methylation prevents spurious transcription initiation. Nature. 2017;543(7643):72–7.
Wang X, Wheeler D, Avery A, Rago A, Choi JH, Colbourne JK, Clark AG, Werren JH. Function and evolution of DNA methylation in Nasonia vitripennis. PLoS Genet. 2013;9(10):e1003872.
Glastad KM, Gokhale K, Liebig J, Goodisman MA. The caste- and sex-specific DNA methylome of the termite Zootermopsis nevadensis. Sci Rep. 2016;6:37110.
Faure AJ, Schmiedel JM, Lehner B. Systematic analysis of the determinants of gene expression noise in embryonic stem cells. Cell Syst. 2017;5(5):471–484 e474.
Spanier KI, Leese F, Mayer C, Colbourne JK, Gilbert D, Pfrender ME, Tollrian R. Predator-induced defences in Daphnia pulex: selection and evaluation of internal reference genes for gene expression studies with real-time PCR. BMC Mol Biol. 2010;11:50.
Asselman J, De Coninck DI, Pfrender ME, De Schamphelaere KA. Gene body methylation patterns in daphnia are associated with gene family size. Genome Biol Evol. 2016;8(4):1185–96.
Kao D, Lai AG, Stamataki E, Rosic S, Konstantinides N, Jarvis E, Di Donfrancesco A, Pouchkina-Stancheva N, Semon M, Grillo M, et al. The genome of the crustacean Parhyale hawaiensis, a model for animal development, regeneration, immunity and lignocellulose digestion. Elife. 2016;5:e20062.
Lea AJ, Vilgalys TP, Durst PAP, Tung J. Maximizing ecological and evolutionary insight in bisulfite sequencing data sets. Nat Ecol Evol. 2017;1:1074–83.
Zhang X, Yazaki J, Sundaresan A, Cokus S, Chan SW, Chen H, Henderson IR, Shinn P, Pellegrini M, Jacobsen SE, et al. Genome-wide high-resolution mapping and functional analysis of DNA methylation in arabidopsis. Cell. 2006;126(6):1189–201.
Buenrostro JD, Giresi PG, Zaba LC, Chang HY, Greenleaf WJ. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat Methods. 2013;10(12):1213–8.
Clark SJ, Argelaguet R, Kapourani CA, Stubbs TM, Lee HJ, Alda-Catalinas C, Krueger F, Sanguinetti G, Kelsey G, Marioni JC, et al. scNMT-seq enables joint profiling of chromatin accessibility DNA methylation and transcription in single cells. Nat Commun. 2018;9(1):781.
Huh I, Zeng J, Park T, Yi SV. DNA methylation and transcriptional noise. Epigenetics Chromatin. 2013;6(1):9.
Hobbs HHJ. The crayfishes of Florida. Biol Sci Ser. 1942;3:1–179.
Hobbs HHJ. The crayfishes of Georgia. Smithson Contrib Zool. 1981;318:1–549.
Hendrix AN, Loftus WF. Distribution and relative abundance of the crayfishes Procambarus alleni (Faxon) and P. fallax (Hagen) in southern Florida. Wetlands. 2000;20:194–9.
Suzuki MM, Yoshinari A, Obara M, Takuno S, Shigenobu S, Sasakura Y, Kerr AR, Webb S, Bird A, Nakayama A. Identical sets of methylated and nonmethylated genes in Ciona intestinalis sperm and muscle cells. Epigenetics Chromatin. 2013;6(1):38.
Khoueiry R, Sohni A, Thienpont B, Luo X, Velde JV, Bartoccetti M, Boeckx B, Zwijsen A, Rao A, Lambrechts D, et al. Lineage-specific functions of TET1 in the postimplantation mouse embryo. Nat Genet. 2017;49(7):1061–72.
Feng S, Jacobsen SE, Reik W. Epigenetic reprogramming in plant and animal development. Science. 2010;330(6004):622–7.
Seisenberger S, Peat JR, Reik W. Conceptual links between DNA methylation reprogramming in the early embryo and primordial germ cells. Curr Opin Cell Biol. 2013;25(3):281–8.
Lyko F, Foret S, Kucharski R, Wolf S, Falckenhayn C, Maleszka R. The honey bee epigenomes: differential methylation of brain DNA in queens and workers. PLoS Biol. 2010;8(11):e1000506.
Walsh CP, Chaillet JR, Bestor TH. Transcription of IAP endogenous retroviruses is constrained by cytosine methylation. Nat Genet. 1998;20:116–7.
Miura A, Yonebayashi S, Watanabe K, Toyama T, Shimada H, Kakutani T. Mobilization of transposons by a mutation abolishing full DNA methylation in Arabidopsis. Nature. 2001;411(6834):212–4.
Bourc’his D, Bestor TH. Meiotic catastrophe and retrotransposon reactivation in male germ cells lacking Dnmt3L. Nature. 2004;431(7004):96–9.
Ziller MJ, Gu H, Muller F, Donaghey J, Tsai LT, Kohlbacher O, De Jager PL, Rosen ED, Bennett DA, Bernstein BE, et al. Charting a dynamic DNA methylation landscape of the human genome. Nature. 2013;500(7463):477–81.
Hon GC, Rajagopal N, Shen Y, McCleary DF, Yue F, Dang MD, Ren B. Epigenetic memory at embryonic enhancers identified in DNA methylation maps from adult mouse tissues. Nat Genet. 2013;45(10):1198–206.
Roadmap Epigenomics Consortium, Kundaje A, Meuleman W, Ernst J, Bilenky M, Yen A, Heravi-Moussavi A, Kheradpour P, Zhang Z, Wang J, et al. Integrative analysis of 111 reference human epigenomes. Nature. 2015;518(7539):317–30.
Jones PA. Functions of DNA methylation: islands, start sites, gene bodies and beyond. Nat Rev Genet. 2012;13(7):484–92.
Yin Y, Morgunova E, Jolma A, Kaasinen E, Sahu B, Khund-Sayeed S, Das PK, Kivioja T, Dave K, Zhong F, et al. Impact of cytosine methylation on DNA binding specificities of human transcription factors. Science. 2017;356(6337):eaaj2239.
Kenkel CD, Matz MV. Gene expression plasticity as a mechanism of coral adaptation to a variable environment. Nat Ecol Evol. 2016;1(1):14.
Feinberg AP, Irizarry RA. Evolution in health and medicine Sackler colloquium: Stochastic epigenetic variation as a driving force of development, evolutionary adaptation, and disease. Proc Natl Acad Sci USA. 2010;107(Suppl 1):1757–64.
Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9(4):357–9.
Thorvaldsdottir H, Robinson JT, Mesirov JP. Integrative genomics viewer (IGV): high-performance genomics data visualization and exploration. Brief Bioinform. 2013;14(2):178–92.
Corces MR, Buenrostro JD, Wu B, Greenside PG, Chan SM, Koenig JL, Snyder MP, Pritchard JK, Kundaje A, Greenleaf WJ, et al. Lineage-specific and single-cell chromatin accessibility charts human hematopoiesis and leukemia evolution. Nat Genet. 2016;48(10):1193–203.
Xi Y, Li W. BSMAP: whole genome bisulfite sequence MAPping program. BMC Bioinform. 2009;10:232.
Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinform. 2011;12:323.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R. Genome project data processing S: the sequence alignment/map format and SAMtools. Bioinformatics. 2009;25(16):2078–9.
Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nusbaum C, Myers RM, Brown M, Li W, et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 2008;9(9):R137.
FG, CF, JG, GR and VCC analyzed the data. KH and VCC performed experiments. FL conceived the study and wrote the paper with contributions from the coauthors. All authors read and approved the final manuscript.
We thank Stephan Wolf and the DKFZ Genomics and Proteomics Core Facility for sequencing services. We also thank Dr. Manuel Paredes-Rodriguez for help with ATAC-seq library preparation.
The authors declare that they have no competing interests.
Availability of data and materials
The coding sequences of P. virginalis Dnmt1, Dnmt3 and Tet are available in GenBank (accession nos. KM453737, KM453738 and KM453739). The whole-genome bisulfite sequencing, RNA-seq and ATAC-seq datasets are available in the NCBI Gene Expression Omnibus (GEO; http://www.ncbi.nlm.nih.gov/geo/) under accession no. GSE112411.
Consent for publication
Ethics approval and consent to participate
VCC is supported by a postdoctoral fellowship from CellNetworks.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file 1.
Housekeeping genes (HKG) expression during different developmental stages and tissues. qRT-PCR was performed using primers to five different HKG, TATA-box-binding protein (TBP), endoribonuclease gene (Dicer1), survival motor neuron protein (Smn), Histone H2A and glyceraldehyde 3-phosphate dehydrogenase (GAPDH).
Additional file 2.
Whole-genome bisulfite sequencing details.
Additional file 3.
General characteristics of the marbled crayfish methylome. a Logo plot for methylated cytosines. b Distribution of the average CpG methylation level (methylation ratio). c Strand-specific density of methylated CpGs (mCpG) across the scaffold 48720 (Watson strand: blue, Crick strand: red). The density was calculated by dividing the number of methylated CpGs (methylation ratio ≥ 0.8 and coverage ≥ 3) by the length using a 1-kb non-overlapping sliding window.
Additional file 4.
Correlation of gene body methylation levels with different gene features. a Normalized CpG content [amount of observed CpGs to amount of expected CpGs (o/e)] was classified as low (< 0.6), medium (≥ 0.6, < 1.2) and high (≥ 1.2). b Boxplot of average gene methylation by gene length in kb. c Predicted marbled crayfish genes were translated into protein sequences and mapped to different phylogenetic nodes with the leftmost representing the oldest and the rightmost the youngest groups.
Additional file 5.
Methylation of repetitive sequences. The heatmap shows average methylation levels of selected repeat classes in eight independent samples (columns). Only repeats with sufficient coverage in all eight samples are shown. The four most frequent repeat classes are shown (LINEs, SINEs, DNA transposons, LTRs), as well as TcMar-Tigger as an example of a highly methylated repeat class, and rRNAs as an example for a non-transposon repeat class. Methylation levels are indicated on a scale from 0 (blue) to 1 (red). E1.7: stage 1.7 embryos, hep.: hepatopancreas, musc.: abdominal musculature, hem.: hemocytes. Colors denote individual animals.
Additional file 6.
RNA sequencing details.
Additional file 7.
Correlation between DNA methylation and gene expression levels. Scatter plots show the promoter methylation (left) and gene body methylation (middle) levels in relationship to gene expression levels. Boxplots (right) show the relationship between gene body methylation and gene expression ranks. Results are shown for all genes (a) and for housekeeping genes (b).
Additional file 8.
ATAC sequencing details.
Additional file 9.
ATAC-seq quality controls showing library quality (top left), pairwise comparisons of ATAC peak intensities from three independent libraries and a representative Genome Browser screen shot of read enrichment (bottom panel).
Additional file 10.
Reproducibility between three independent hemocyte RNA-seq datasets. Correlation of log10 TPM values for pairwise RNA-seq replicates.
Additional file 11.
Heatmaps of chromatin accessibility for high-methylated (methylation level > 0.5, left) and low-methylated (methylation level < 0.5, right) genes around transcription start sites (TSS).
Additional file 12.
Correlation between DNA methylation and gene expression variation in hepatopancreas and abdominal musculature. a Low methylation levels correlate with higher gene expression coefficient of variation for hepatopancreas and abdominal musculature. Methylation rank 0 represents completely unmethylated genes. b Representative genome browser tracks and expression levels (red bars) for a highly methylated gene with low gene expression variation (top) and a lowly methylated gene with high gene expression variation (bottom).
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Cite this article
Gatzmann, F., Falckenhayn, C., Gutekunst, J. et al. The methylome of the marbled crayfish links gene body methylation to stable expression of poorly accessible genes. Epigenetics & Chromatin 11, 57 (2018). https://doi.org/10.1186/s13072-018-0229-6
- Marbled Crayfish
- Gene Body Methylation
- Whole-genome Bisulfite Sequencing
- Chromatin Accessibility