Well-positioned nucleosomes punctuate polycistronic pol II transcription units and flank silent VSG gene arrays in Trypanosoma brucei
© The Author(s) 2017
Received: 14 March 2017
Accepted: 14 March 2017
Published: 20 March 2017
The compaction of DNA in chromatin in eukaryotes allowed the expansion of genome size and coincided with significant evolutionary diversification. However, chromatin generally represses DNA function, and mechanisms coevolved to regulate chromatin structure and its impact on DNA. This included the selection of specific nucleosome positions to modulate accessibility to the DNA molecule. Trypanosoma brucei, a member of the Excavates supergroup, falls in an ancient evolutionary branch of eukaryotes and provides valuable insight into the organization of chromatin in early genomes.
We have mapped nucleosome positions in T. brucei and identified important differences compared to other eukaryotes: The RNA polymerase II initiation regions in T. brucei do not exhibit pronounced nucleosome depletion, and show little evidence for defined −1 and +1 nucleosomes. In contrast, a well-positioned nucleosome is present directly on the splice acceptor sites within the polycistronic transcription units. The RNA polyadenylation sites were depleted of nucleosomes, with a single well-positioned nucleosome present immediately downstream of the predicted sites. The regions flanking the silent variant surface glycoprotein (VSG) gene cassettes showed extensive arrays of well-positioned nucleosomes, which may repress cryptic transcription initiation. The silent VSG genes themselves exhibited a less regular nucleosomal pattern in both bloodstream and procyclic form trypanosomes. The DNA replication origins, when present within silent VSG gene cassettes, displayed a defined nucleosomal organization compared with replication origins in other chromosomal core regions.
Our results indicate that some organizational features of chromatin are evolutionarily ancient, and may already have been present in the last eukaryotic common ancestor.
KeywordsGenome-wide nucleosome positions Trypanosoma brucei Polycistronic transcription units Silent variant surface glycoproteins VSG Bloodstream expression sites Evolution MNase-seq
African trypanosomes lie in an ancient evolutionary branch of the Excavates supergroup, which split from the last eukaryotic common ancestor (LECA) along with the SAR, Archaeplastida, and Unikonta supergroups some 2 billion years ago [1, 2]. Despite this early divergence, Trypanosoma brucei encodes an extensive repertoire of proteins associated with chromatin structure, modification, and functional regulation [3–6]. The presence of an epigenome in trypanosomes is perhaps expected, given the evolutionary origin of functional core histones in the ancestral Archaea , and the presence of linker histone homologs in evolutionarily distant bacteria . Although Archaea lack multi-domain chromatin remodelers , the SNF2 domain, which has DNA-dependent ATPase activity and is present in a broad range of chromatin remodelers , is identifiable in bacterial helicases. The occurrence of histone modification enzymes, and a functional role for modified DNA packaging proteins, was also demonstrated in Archaea , suggesting that the regulation of chromatin structure and the epigenetic definition of different functional states of chromatin predates LECA.
Nucleosomes can influence diverse DNA functions [12–14], and the precise nucleosome positions present in regulatory elements are functionally crucial . Nucleosomes can also assume a similarly defined distribution around RNA polymerase II (pol II) transcription start sites (TSSs) in diverse eukaryotes from the Unikonta supergroup, including Dictyostelium discoideum, Saccharomyces cerevisiae, and Homo sapiens. Incredibly, this nucleosomal arrangement appears evolutionarily ancient, as a similar nucleosome-depleted region bracketed by positioned −1 and +1 “nucleosomes” was observed upstream of genes in the archaeal Haloferax volcanii. This is despite the fact that the structural chromatin unit in this archaeal cell is composed of a tetramer instead of an octamer of histones .
There is currently a lack of insight into the genome-wide nucleosomal organization of genomes from organisms that are evolutionarily far removed from the Unikonta supergroup of eukaryotes, yet encode conventional nucleosomes composed of a conserved octamer of histones. In this regard, T. brucei represents a very intriguing subject. Here, all four core histones are present, and the classic conservation benchmark, canonical H4, is 79% similar to that of H. sapiens. The most recent ancestor shared between T. brucei and the Unikonta supergroup is therefore, in all likelihood, LECA.
Trypanosoma brucei is a unicellular parasite that is transmitted to humans by one of several Glossina fly species, and causes human African trypanosomiasis (HAT) . Upon initial human infection, T. brucei invades interstitial spaces, the lymph system, and the bloodstream. With prolonged infection, the parasite crosses the blood–brain barrier and invades the central nervous system . Without treatment HAT is often fatal and although the number of cases is declining, more than 1.8 million people are still thought to be at high risk of the disease . As the parasite cycles between the mammalian host and the insect vector, it differentiates into different life cycle stages including the bloodstream form (BF) in the mammal or the procyclic form (PF) in the midgut of the tsetse fly .
In the bloodstream of the mammalian host, T. brucei escapes clearance by the immune system by periodically switching a mono-allelically expressed variant surface glycoprotein (VSG), an abundant cell surface protein that masks invariant cell surface proteins [21, 22]. The active VSG is expressed from a single pol I-transcribed subtelomeric VSG expression site (ES) . The expressed VSG gene can be switched through multiple mechanisms . First of all, a transcriptional switch can result in silencing of the active ES and the activation of one of approximately 15 other silent ESs. Alternatively, DNA recombination can be involved. Gene conversion can result in all or part of the active VSG gene being swapped with sequences from a different silent VSG cassette, present on a variety of types of chromosomes. T. brucei contains 11 megabase chromosomes (>1 Mb), ~5 intermediate chromosomes (200–900 kb), and ~100 mini-chromosomes (30–150 kb), and all of these contain silent VSGs . Lastly, VSGs can be switched through telomere exchange with another VSG-containing telomere.
ESs are telomeric transcription units. There is a relatively localized telomeric silencing gradient extending up to 10 kb from the telomere end, although this is not implicated in the ES regulation involved in antigenic variation [26–29]. The telomeric repression observed in the immediate vicinity of the telomeres of the silent ESs appears superficially reminiscent of that observed in yeast and Drosophila in that it requires RAP1, among other factors . However, SIR2, which plays an important role in telomere position effect in eukaryotes, appears to also have unrelated functions in T. brucei . Additional repressive mechanisms appear to operate on the ES promoter itself. These is about 40–60-kb upstream from the chromosome end and is effectively silenced, even though distance-wise it would be expected to escape the effects of typical telomere position effect . A number of proteins including the chromatin remodelers ISWI [31, 32], ORC1 , FACT , and HDAC3  among others, play a role in ES promoter silencing. In addition, T. brucei histone H1, similar to the C-terminal tail of the H1 from metazoans, renders chromatin in the BF stage more resistant to nucleases, presumably due to a more closed chromatin conformation. H1 is also required for full transcriptional repression of silent ESs [36, 37]. Another unusual feature of ESs is that they are transcribed in a mono-allelic fashion by pol I, which in eukaryotes normally exclusively transcribes ribosomal DNA (rRNA) .
Unusually for a eukaryote, protein-coding genes in T. brucei are arranged in extensive, polycistronic transcription units (PTUs). These are constitutively transcribed by pol II from poorly defined promoters and can span up to several hundred kilobases . There is no transcriptional regulation of pol II in T. brucei, and virtually all regulation of mRNA and protein levels appears to occur posttranscriptionally . Pol II transcription initiation typically occurs in the strand switch regions (SSRs) between two divergently transcribed PTUs, but pol II start sites have also been identified between head-to-tail aligned PTUs . Various epigenetic marks are located at SSRs and are likely to functionally define pol II regulatory regions . The pol II transcription start sites are enriched for the H2A.V and H2B.V histone variants, as well as for modified H3K4me3 and H4K10ac [42, 43]. Termination of pol II transcription also occurs in the SSRs, for example, between two convergent PTUs. These termination regions are enriched for the H3.V and H4.V histone variants and modified H3K76me1/2 [42, 44]. In addition, the modified thymidine base, β-d-glucosyl-hydroxymethyluracil (base J), is also located at termination sites. Base J was shown to contribute to transcriptional termination in both T. brucei [45, 46] and Leishmania tarentolae , and the knockdown of base J and H3.V in T. brucei results in transcriptional read-through, and the appearance of downstream anti-sense RNA . In addition, pol II termination is often also associated with a tRNA gene .
In this study, we mapped the genome-wide nucleosome positions in two different life cycle stages (BF and PF) of T. brucei 427 using MNase-seq. We report the nucleosomal organization at pol I and pol III promoters, as well as in regions flanking pol II-transcribed PTUs, including the adjacent transcription start and termination regions. We find that the pol II PTUs are punctuated internally by well-positioned nucleosomes at regions involved in RNA processing and analyzed the possible contribution of DNA sequence elements in the nucleosomal positions that were observed. In addition, we find that the silent VSG gene arrays are flanked by regions of well-positioned nucleosomes. This could play a role in suppressing fortuitous initiation by pol II, and ensuring mono-allelic expression of a single VSG from the active ES.
Distribution of positioned nucleosomes in the T. brucei genome
There was not a significant difference in nucleosome density between the BF and PF life cycle stages, and coding sequences contained 0.22 and 0.21 dyads/bp in both BF and PF T. brucei, respectively (Fig. 1d). The intergenic regions (SSRs as well as noncoding regions between individual genes in a PTU) had a nucleosomal dyad density of 0.22 dyads/bp in both the BF and PF life cycle stages (Fig. 1e). A slight extension of the bin population to higher value bins (or more well-positioned nucleosomes at a given genomic region) was visible in the intergenic, compared to the coding regions (compare Fig. 1d, e). This increase is statistically significant (Whitney–Mann U test; p < 0.001), showing that highly positioned nucleosomes were present more often in intergenic regions. No statistically significant difference could be detected between the distribution of nucleosomes in BF and PF forms in either the coding sequences or intergenic regions (Whitney–Mann U test; p > 0.05). This result suggests that the bulk nucleosome density and proportion of well-positioned nucleosomes were comparable in BF and PF T. brucei.
In the case of the predominantly pol III-transcribed tRNA genes, an average dyad density of 0.1 dyads/bp was found, with a normalized bins distribution as shown in Fig. 1f. The absence of high-value dyad bins indicates that nucleosomes are generally weakly positioned on the tRNA genes, which may be related to the transcriptional activity of tRNA genes and the size of the pol III transcription complex compared to the size of the tRNA gene itself. In yeast, tRNA genes are occupied by the TFIIIB–TFIIIC complex, displaying a distinctive occupancy pattern termed a “bootprint,” and are generally nucleosome free . A TFIIIC ortholog has not been identified in T. brucei, and it is not clear whether any transcription-related protein complex binds to the putative A-box . rRNA genes had a dyad density of 0.34 dyads/bp and accommodated more well-positioned nucleosomes (Fig. 1g) compared to the genome average (see Fig. 1c; indicated by the gray diagonal in Fig. 1g). This may indicate the presence of a subpopulation of inactive rRNA genes, which is a common feature of the rDNA transcription units in other eukaryotes , assuming the number of rRNA genes annotated in the reference genome is accurate. The bulk nucleosome repeat length, or average nucleosome dyad to dyad distance, was determined at 194 bp in both the BF and PF life stages (data not shown).
5S rRNA genes
A nucleosome-depleted region (NDR) was observed upstream of the 5S rRNA gene, preceded by a well-positioned nucleosome at approximately position −400, which was equivalent to the second downstream nucleosome in the repeating 5S unit (labeled III′ and III, respectively, in Fig. 2a). The eight 5S rRNA genes in the T. brucei 427 reference genome encode transcripts that are 100% identical, and the 620-bp intergenic regions are 98% identical at nucleotide level. The alignment of sequence reads to the reference genome was therefore averaged between the individual genes by the mapping procedure. The distribution of the 3 identified nucleosomes was at sterically allowed distances (nucleosome I–II, 185 bp, and nucleosome II–III, 160 bp). There was no detectable difference in the nucleosomal organization of the 5S genes in the BF and PF life cycle stages (Fig. 2a; Additional file 1: Fig. S2A).
Trypanosoma brucei 427 contains two larger rRNA gene clusters with 24% identity at the nucleotide level located on chromosomes 2 and 3, and six smaller clusters of rRNA genes on chromosomes 6, 7, 8, and 9. The small number of annotated rRNA gene clusters precludes an informative alignment of the nucleosomal dyads at these loci.
We next aligned the 64 tRNA encoding genes at the transcript start positions. A clear NDR was again discernible directly upstream of the tRNA genes (Fig. 2b). This region was bracketed by groups of nucleosomes that appear to be located in multiple, overlapping frames. Three groups of nucleosomes were typically visible downstream of the tRNA gene and may represent three abutting nucleosomes in multiple phases. Upstream of the tRNA gene another group of overlapping nucleosome positions was typically visible. The nucleosomal arrangement at the tRNA genes appears functionally important, since the NDR, and the single upstream and three downstream nucleosomes were independently visible on the Watson and Crick strands (Additional file 1: Fig. S2B; Kendall’s tau correlation; τ = 0. 56, p < 0.001), suggesting that they were arranged relative to the direction of transcription of the tRNA gene. However, the nucleosomal dyad distributions shown in Fig. 2b represent the contribution from both active and inactive genes, and the first downstream nucleosome may be unstable or disrupted in active tRNA genes, thereby not contributing substantially to the dyad distribution.
Nucleosomal organization of pol II PTU transcription initiation regions
In other eukaryotes, a general picture has emerged for nucleosomal organization associated with the upstream regions of pol II-transcribed genes, where well-positioned nucleosomes flank an NDR that largely overlaps with the TSS . This NDR is thought to allow assembly of the pol II pre-initiation complex in the region of the TSS . However, in T. brucei pol II-mediated transcription is unusual in the sense that conventional pol II promoters are absent or undefined, and transcription commences at divergent SSRs and at a few other unique non-SSR locations . Sequence analysis has failed to identify conventional TATA boxes or initiator sequences in the pol II transcription start regions , and it is thought that T. brucei lacks the many transcriptional activators, basal factors and cis-elements associated with pol II gene expression in eukaryotes from the Unikonta supergroup . Although the pol II transcription start region is enriched for the H2A.V and H2B.V histone variants, and epigenetic marks associated with transcriptional activation in model eukaryotes , it is not understood how these histone variants and modifications are targeted to these precise chromatin regions. We were, therefore, interested in the nucleosomal organization of these pol II transcription start regions, and the idea that the local nucleosomal landscape may define functional pol II TSSs in T. brucei.
As the pol II TSSs remain unmapped in T. brucei, we aligned the nucleosomal dyads of PTUs relative to the most upstream splice acceptor site (SAS) which was identified in a T. brucei transcriptomic study . The SAS is located upstream of each coding sequence in a PTU and is where the 39-nt sequence spliced leader RNA is trans-spliced onto the nascent RNA. The SAS is functionally related to the 3′ end of introns at intron–exon junctions in metazoans . In the case of the SAS of the first gene of the PTU, this would be the genomic feature that is closest to the pol II transcription initiation region. When aligned relative to the first SAS (Fig. 2c), a region that was weakly depleted of nucleosome dyads was visible. This region was depleted of nucleosomal dyads in an approximately 100-bp region upstream of the SAS of the first gene of the PTU on both the Watson and the Crick strand (Fig. 2d), suggesting that this nucleosomal organization was sensitive to the direction of transcription (Kendall’s tau correlation p < 0.001).
Although evidence of positioned nucleosomes downstream of the NDR was visible, these nucleosomes did not form a single phased array relative to a defined genomic feature, such as a +1 nucleosome, as is seen in other eukaryotes [14, 55, 56]. The observed peaks were less than the allowed nucleosome-to-nucleosome distance, indicating the absence of a regularly spaced nucleosome array, in both BF and PF cell lines (Additional file 1: Figs. S3 and S4). Although this nucleosomal arrangement is reminiscent of nucleosomes that were mapped downstream of the pol II TSSs in S. cerevisiae  and Drosophila , we note that the pol II PTUs in T. brucei are constitutively expressed, and a well-defined nucleosomal array in this region would be unexpected. A similar pattern of nucleosomal distribution surrounding pol II TSS has been observed in a related kinetoplastid, Leishmania major . A high nucleosome occupancy (defined as the average nucleosome dyad density per base pair in a specified region), but little positioning (defined as the number of co-aligned dyads at a specific base pair), was observed across the constitutively transcribed pol II PTUs . The nucleosomes downstream of the first SAS appeared less organized in the PF life cycle stage (Additional file 1: Fig. S4A–D). In our experiments, we did not partially crosslink the nucleosomes chemically. However, the less-ordered nucleosomal arrangement in these regions is unlikely to be due to nucleosome sliding, since we do observe stable and strongly positioned nucleosomes at the 5S rRNA genes.
These results showed that a region displaying a weak nucleosomal depletion was present upstream of the first SAS of pol II PTUs in T. brucei. This NDR was not as well defined as that seen at pol II TSSs in other eukaryotes such as in S. cerevisiae , and its functional association with the TSS, which remains unmapped in T. brucei, is also uncertain. If this NDR is indeed linked to transcription initiation, it suggests that this organizational feature, probably inherited from an Archaeal ancestor, was not maintained to the same extent in T. brucei compared with other eukaryotes. A clear enrichment of the H2A.Z histone variant was seen in nucleosome +1 bordering the NDR at the pol II TSSs in eukaryotic model organisms from the Unikonta supergroup . The structurally less stable H2A.Z-containing nucleosome was proposed to facilitate elongation by pol II . We therefore mapped H2A.V ChIP-seq data from T. brucei (kindly made available by N. Siegel) to our MNase-seq data. Although a clear enrichment in H2A.V-containing nucleosomes was seen in the upstream regions of PTUs and overlapping with the first gene, as was previously reported , this broad enrichment of nucleosomes did not map to any single, well-positioned nucleosome (data not shown).
The NDR is not a feature of all SAS regions
We next assessed whether the nucleosomal organization observed for the region surrounding the SAS of the first gene in pol II PTUs was specific to this region, or simply reflected the nucleosomal organization of all upstream SAS regions for all genes within a PTU. We therefore repeated the dyad alignment analysis for all genes with assigned SAS sites, excluding that of the first gene in a PTU (3178 genes). The result is shown in Fig. 2e. In stark contrast to the arrangement seen at the SAS upstream of the first gene in a PTU, a striking enrichment of nucleosomal dyad axes that were aligned with the mapped SAS was seen. This was observed in the biological duplicates in both cell lines of the BF and PF life cycle stages (Additional file 1: Fig. S5) and was also independently seen (Kendall’s tau correlation p < 0.001) on the assigned SAS of both the Watson (1545 genes) and Crick (1633 genes) strands (Fig. 2f). This suggests that there is a well-positioned nucleosome preferentially aligned with each internal SAS in T. brucei, contrary to the NDR observed at internal SASs in L. major . When studying the nucleosomal arrangement at the SAS at the level of individual genes, the consistent, tight positioning of a nucleosome in the direct vicinity of the SAS is often observed, with a more random distribution of nucleosomes in the regions adjacent to the SAS (Additional file 1: Fig. S6). In contrast, bordering nucleosomes appeared disorganized.
Nucleosomal organization at pol II termination regions
In addition to the SAS, the individual genes in a PTU typically contain a 3′ UTR with an average length of 676 bp, terminating at the polyadenylation site (PAS) . The genomic region downstream of the PAS of the last gene of the PTU does not appear to contain defined transcription termination sequences . The pol II transcription termination regions are enriched for the histone variants H3.V and H4.V , and both H3.V and base J deposition in this region appear to act synergistically to mediate efficient pol II termination [46, 60]. We were, therefore, interested in possible specialized nucleosomal arrangements in the regions of pol II transcription termination.
We first aligned the nucleosomal dyad axes relative to the PAS elements of all genes within PTUs  to get an overview of the general PAS structure. In contrast to the average nucleosomal organization around the internal SAS element, the PAS was clearly depleted of nucleosomes, with a single well-positioned nucleosome present immediately downstream of the PAS (Fig. 2g). Again, similar nucleosomal arrangements were independently observed on the Watson (1364 genes) and Crick (1448 genes) strands (Kendall’s tau correlation; τ = 0.46, p < 0.001, Fig. 2h), suggesting that the nucleosomal arrangement was important to a directional process on the DNA molecule (presumably transcription). This nucleosomal arrangement is very similar to that seen in human and yeast genomes, where nucleosomes are also depleted on polyadenylation sites [61, 62]. A similar organization was also seen in L. major, although the observed NDR was present immediately upstream of the PAS . However, the relatively small number of terminal PAS elements (n = 35) precluded any statistically meaningful analysis of the average nucleosomal organization at this genomic position.
Genomic distribution of nucleosome refractory sequences
It has previously been shown that oligo-d(A·T) runs are generally excluded from central locations of isolated chicken nucleosome cores , and it was suggested that this depletion is due to the inherently rigid structure of A·T tracts due to a series of bifurcated hydrogen bonds . In T. brucei, runs of oligo-d(A·T) of 7 bp and longer are present in nucleosomes at approximately 70% of that expected for a random distribution (Additional file 1: Fig. S7A). Interestingly, runs of oligo-d(G·C) of up to 4 nucleotides were present in nucleosomes more often or the same as that expected from a random distribution, with a striking absence of runs longer than 7 bp (Additional file 1: Fig. S7B). Oligo-d(A-T) and oligo-d(T-A) are markedly depleted in T. brucei nucleosomes (Additional file 1: Fig. S8), even though these sequence runs occur at very high frequencies in the genome (Additional file 1: Table S1). This depletion might possibly be due to the destruction of nucleosomal fragments containing these sequences during nucleosome core preparation and subsequent rarefaction in the sequencing sample. It therefore appears that runs of oligo-d(A·T) and oligo-d(G·C) contribute to the relative absence of nucleosomes in specific regions of the T. brucei genome.
The nucleosomal organization in the region of the first SAS of a PTU compared to the average organization of all PTUs differed significantly (see Fig. 2c, e). A region weakly depleted of nucleosomes was observed upstream of the first SAS (Fig. 2c, d), and a well-positioned nucleosome was observed at all internal SAS sites (Fig. 2e, f). The region of nucleosome depletion at position −50 to −150 does not precisely align with the region enriched for T·A runs at position −70 to −20. However, the overlap in these regions, and the observation that oligo A runs in excess of 7 bp are generally depleted of nucleosomes, makes it highly likely that the presence of the oligo-d(A·T) runs upstream of the SAS contributes to the appearance of an NDR. However, the positional mismatch between the oligo-d(A·T) run and the NDR suggests the involvement of additional factors in establishing the NDR. The second region of oligo-dA enrichment at position +100 relative to the SAS of the first gene may serve to position the first nucleosome of the first gene of a PTU (Fig. 2c, d). The absence of this second oligo-dA-enriched area at the remaining genes in a PTU may contribute to the absence of positioned nucleosomes downstream of the nucleosome positioned on the SAS (Fig. 2e, f).
A striking enrichment of oligo-dA runs is present directly upstream of the PASs (Fig. 3c), partially overlapping with the NDR observed in this region (Fig. 2g). In higher eukaryotes, a highly conserved AATAAA hexanucleotide that signals polyadenylation is located upstream of the PAS and may be refractory to nucleosomes . However, this cis-acting element seems to be absent in T. brucei  and the role of the observed oligo-dA run abutting the PASs is unclear, but might contribute to the observed NFR at PASs.
Nucleosome positioning signals at the initial SAS, internal SASs and internal PASs
Travers and colleagues showed that the rotational position of isolated nucleosome cores is defined by a distribution of dinucleotides at a periodicity equal to that of the DNA helix . This was interpreted in terms of the structural constraints imposed on the rotational freedom of specific dinucleotide steps, and the ability to accommodate a narrowed or expanded minor groove. It was subsequently shown in genome-wide studies that positioned nucleosomes were often associated with di- and trinucleotide distributions equal to the DNA periodicity . We therefore investigated whether the positioned nucleosomes upstream of the first gene in a T. brucei PTU, as well as those present at internal SASs and at the PASs, was positioned by underlying sequence periodicity.
We analyzed the distribution of the A–A dinucleotide by plotting the Fourier amplitude in a 128-nt window at consecutive settings across sites of interest in the genome, showing the strength of a periodic distribution of a given nucleotide. In the case of the first gene of a PTU, the highest Fourier amplitude is seen at approximately position +200 bp (Fig. 3d), indicating the likely presence of a strong nucleosome positioning sequence in this region. This aligns with the nucleosome immediately downstream of the NDR (see Fig. 2c) of the first SAS in a PTU. Interestingly, this nucleosome fits into the region forming a saddle between the two peaks of enrichment for oligo-dA (Fig. 3a), and the NDR (Fig. 2c), and partially overlaps with the peak at −50 in the distribution of oligo-dA (Fig. 3a). Therefore, the nucleosome distribution in the region of the SAS upstream of the first gene of a PTU can be explained in terms of the enrichment and depletion of specific sequence elements in this region and the known preference of nucleosome cores for such elements.
When looking at the internal SASs, a region generally enriched for 10-bp A–A periodicities is seen downstream of the SAS (Fig. 3e). Upstream of the SAS, centered at approximately position −80 bp, a region depleted of 10-bp A–A periodicities is evident. This aligns almost perfectly with the region enriched for oligo-dA tracts (Fig. 3b). The average nucleosomal structure in the region showed a nucleosome positioned at approximately position 0 (Fig. 2e, f). Thus, this nucleosome position is also consistent with the distribution of 10-bp A–A periodicities and oligo-dA tracts in these regions.
Looking at the Fourier amplitude of all annotated PASs (Fig. 3f), a region slightly enriched for a 10-bp AA periodicity is seen downstream of position +150. Comparing this distribution to that of oligo-dA runs (Fig. 3c), a strong peak of oligo-dA runs is typically seen directly upstream of the PAS. Thus, there appears to be a sequence arrangement that discourages nucleosome formation from position 0 of the PAS and is more facilitative of nucleosome deposition downstream of the PAS. This is exactly what was seen in the average nucleosomal organization surrounding the PAS elements (Fig. 2g, h), where position 0 was depleted of nucleosomes, and a strongly positioned nucleosome was visible centered at approximately position +150. Again, the nucleosomal organization can be explained in terms of the sequence elements present at the PAS.
It was previously shown that nucleosome positions in vivo are directed by sequences as well as DNA binding proteins that may initiate “statistical positioning,” as well as by chromatin remodelers . Our results do not exclude the contribution from agents other than sequence, and the NDR upstream of the first SAS of a PTU cannot be fully attributed only to the polypyrimidine tract, which overlaps only partially with the NDR, thus implying the involvement of other factors.
Nucleosome organization in BF and PF T. brucei life cycle stages is highly comparable
There is little evidence for pol II transcriptional control in T. brucei, and the life-cycle-specific control of most genes occurs posttranscriptionally [40, 69, 70]. However, some pol I-transcribed loci are differentially transcribed in the different T. brucei life cycle stages. For example, only one of about 15 VSG expression sites is active in BF T. brucei, whereas all ESs are repressed in the procyclic form. In contrast, the procyclin genes are repressed in the BF and active in the PF stage. We were therefore interested in establishing whether there were any clear differences in the nucleosomal organization of the T. brucei genome in the BF or PF life cycle stages that could explain this differential expression.
We next investigated the chromatin structure of transcription units which are expressed in a life-cycle-specific fashion. The EP1 and EP2 procyclin genes are not expressed in BF T. brucei. A larger number of nucleosomal dyads were present at these loci in BF compared with the PF T. brucei (Fig. 4b) and extended further downstream into the procyclin coding sequence in the BF cells. This indicated that the EP1 and EP2 procyclin genes have structurally more compact chromatin in BF cells, possibly impeding transcription of these procyclin loci in BF, compared with the PF cells. ESs are only transcribed at a high rate in BF T. brucei. Unfortunately, the high degree of sequence identity of the 15 ESs precluded unique read mapping and the analysis of nucleosome organization at active and inactive ESs. However, the BF HNI_VO2 cell line contains unique sequences in the form of neomycin and hygromycin resistance genes in the active and inactive ESs, respectively. Looking at the nucleosomal distributions on these single copy, unique sequence markers, fewer well-positioned nucleosomes were present on the active drug resistance gene compared with the inactive one (Fig. 4c, d). In fact, 0.07 (53 dyads per 782-bp gene) dyads were assigned per base pair in the case of the transcriptionally active neomycin gene, as opposed to 0.11 (109 dyads per 1031-bp gene) dyads per base pair in the case of the inactive hygromycin gene. Although this is in agreement with previous observations [71, 72], it is not clear to what extent sequence differences contributed to this difference in nucleosome occupancy. In summary, subtle differences in the nucleosomal organization of life-cycle-specific genes were observed, which could be an effect of active transcription. However, we saw no difference in the organization of chromatin over extensive genomic regions between the two T. brucei life cycle stages (Fig. 4a; Additional file 1: Fig. S9).
Nucleosomal organization at the silent VSG arrays
The observed well-positioned nucleosome structures flanking the silent VSG arrays could result in repressing fortuitous transcription initiation of these silent VSGs, thereby maintaining mono-allelic expression of the active VSG. The more poorly positioned nucleosomes covering the VSG arrays may represent an open chromatin structure more amenable to DNA recombination events, or may be due to a repressive chromatin structure resulting in decreased MNase cleavage.
TbORC1 and DNA replication origins
It had previously been shown that ORC1, a component of the origin recognition complex, binds to numerous regions in the T. brucei genome, many of which act as origins of DNA replication . It has also previously been shown in S. cerevisiae that ORC1, apart from its role in DNA replication, is also a component of silencing complexes assembled at silencing elements such as the E-element of the Mat alpha silent mating-type locus. Consequently, we were interested in establishing whether the T. brucei ORC1 was similarly involved in a specialized nucleosomal organization which could be implicated in gene silencing.
We mapped ORC1 sites identified in T. brucei 927  to the equivalent sequences in the genome of T. brucei 427. The ORC1 binding sequences originally identified by Tiengwe and colleagues  ranged from 65 bp to 3 kb. This mapping was therefore at a low resolution, below that of single nucleosomes. Nevertheless, we utilized this dataset to investigate the nucleosomal organization in the vicinity of assigned ORC1 binding sites.
We first investigated the DNA region surrounding ORC1 sites in the core region of chromosomes (994 sites), which contain the constitutively expressed housekeeping genes. Here, very little nucleosomal organization relative to the center of the assigned DNA replication origin was discernible (Fig. 5c). We next investigated the subtelomeric regions (119 sites), which are defined as regions adjacent to telomeric ends. These contain the silent VSG gene and pseudogene arrays, expression site-associated genes (ESAGs), and the highly repetitive retrotransposon hotspot proteins (RHS) gene family. Here, a very clear nucleosomal organization was evident (Fig. 5d). The differences in organization of nucleosomes within the coding and flanking regions of the silent VSG arrays are consistent with the presence of specialized bordering silencing complexes, as also suggested by McCulloch and colleagues . Intriguingly, of the 15 high confidence (false discovery rate (FDR) <0.05; ) ORC1 binding sites identified in the subtelomeric region of chromosome 9, 14 were present in the silent VSG array. This result strongly suggests that ORC1 binding sites, and presumably ORC1 itself are involved in the demarcation of specialized chromatin domains associated with silencing of the VSG gene and pseudogene arrays, as is the case in S. cerevisiae silent mating-type loci . This suggests that the role of ORC1 in arranging surrounding chromatin structure and repressing regional transcription is evolutionarily ancient, and was likely present in LECA.
Trypanosoma brucei regulates the expression of most of its protein-coding genes at the posttranscriptional level. However, the T. brucei genome encodes homologs for putative chromatin writers, readers and erasers, as well as chromatin remodeling enzymes and histone variants [3–6]. It has been shown that specific histone variants demarcate the borders of PTUs . In addition, various epigenetic players were shown to be important in the regulation of the mono-allelic expression of the active VSG ES, and the concomitant repression of the approximately 14 silent ESs [28, 33, 36, 37, 42, 44, 71, 74–76].
In this study, we present a whole genome analysis of nucleosomal positioning in T. brucei. We provide clear evidence for locally organized nucleosomal structures in both BF and PF T. brucei. A general feature of the transcription initiation region of pol II-transcribed genes in model organisms from the Unikonta supergroup includes an NDR region overlapping the TSS. Although a weak NDR was observed upstream of the first gene of pol II-transcribed PTUs in T. brucei, the pol II TSS remains unmapped. It is therefore unclear whether this NDR is functionally related to the transcription process or is due to a polypyrimidine tract that is required by the splicing mechanism, but is refractory to nucleosomes.
In the case of the PASs, the oligo-dA abutting the end of the PAS would in theory be refractory for nucleosomes, whereas the presence of a weak A–A distribution at a 10-bp periodicity would accommodate the well-positioned nucleosome in its observed location (Fig. 6c). The nucleosomal organization seen at metazoan intron–exon boundaries and at polyadenylation sites, compared to that seen at T. brucei SASs and PASs, is thus highly similar. This could be a consequence of the evolutionarily conserved sequence elements that direct local nucleosome placement. Alternatively, this could be a nucleosome arrangement that is required for the respective genetic mechanisms involving these elements. The nucleosome organization at the SASs and PASs therefore appears evolutionarily ancient and was probably already established before the evolutionary divergence of the Excavata from the other eukaryotic supergroups [1, 80]. The aforementioned is supported by recent findings where comparable nucleosomal patterns were observed in L. major, a related kinetoplastid with genomic synteny to T. brucei . The apparent organization of nucleosomes relative to these SAS and PAS sequence elements is intriguing, as these sequence elements, and notably the oligo-dA runs, are functionally relevant for RNA processing rather than transcription. Although the nucleosomal organization in the region of the first SAS is related to that seen at TSSs in other eukaryotes, this could be a consequence of the role of these sequences in RNA processing. It is possible that these sequences have dual functions in contributing both to the organization of the TSS region, as well as to spliceosome binding on the nascent RNA.
The functional relevance of the nucleosomal organization seen at internal SASs is less clear. One possibility is that the concerted placement of a nucleosome on internal SASs would slow the RNA polymerase at this position, ensuring time for splicing at the SAS and subsequent splicing and polyadenylation at the upstream PAS . The juxtaposition of a poly-d(A·T) tract upstream of the nucleosome-associated internal SASs makes this an intriguing possibility, as the polypyrimidine tract is known to affect both trans-splicing and polyadenylation of adjacent genes . It is also theoretically possible that factors involved in RNA processing are preloaded onto the DNA and then transferred to the growing, nascent RNA by the elongating pol II. In fact, a physical interaction between the SF3a60 spliceosome factor and the largest subunit of pol II was shown using a two-hybrid approach , suggesting a possible link between the elongating pol II and RNA splicing in T. brucei. In addition, TbRRM1, an RNA-binding SR nucleoprotein, has been shown to directly interact with histones and modulate chromatin structure, maintaining permissive chromatin to facilitate transcription and RNA processing, and may also be involved in splicing commitment for a subset of transcripts .
Alternatively, the nucleosomal presence at internal SASs (or the 3′ acceptor region of a metazoan intron) could limit DNA recombination in this area [85, 86]. This would protect exon units over evolutionary time, or limit mutation of the SAS, and thus conserve a mechanistically functional SAS. In support of this, the rate of C → T hydrolytic deamination was reported to be reduced by twofold in nucleosomally wrapped DNA in S. cerevisiae, C. elegans, and Oryzias . Given that a single point mutation in the 3′ AG acceptor sequence can functionally destroy the entire downstream gene, it is likely that protection of such sites would be evolutionarily advantageous. The PAS sites themselves showed an NDR and a well-positioned nucleosome downstream, similar to the organization seen in other organisms . This arrangement may also reflect a requirement for transient pausing of pol II transcription elongation.
Chromatin can play an important role in silencing areas of eukaryotic genomes. The regions flanking the silent VSG arrays showed a unique nucleosomal organization, with an extensive array of well-positioned nucleosomes. The nucleosomes on the silent VSG coding regions themselves appeared less well-positioned. We propose that the bordering chromatin structure of well-positioned nucleosomes serves to limit cryptic pol II initiation. For antigenic variation to work, it is imperative for T. brucei to maintain the silent VSG arrays in a transcriptionally repressed state. Promiscuous expression of these silent VSGs would allow the host immune system to develop an immune response to a wide variety of VSGs and therefore facilitate immune clearance of the parasite. This repressive nucleosomal arrangement found at the silent VSG arrays is reminiscent to that found at the silent mating-type loci HML and HMR in S. cerevisiae. In contrast, the less defined nucleosomal structure on the silent VSG genes themselves could be more permissive for gene conversion events copying the silent VSG to the active ES. In this way, the chromatin structure at these silent VSG arrays could facilitate antigenic variation in African trypanosomes.
In T. brucei, ORC1 and RAP1 were shown to be required for full repression of the silent ESs, and ORC1 was shown to bind at positions bordering many silent VSG cassettes. The similarity between silencing at the S. cerevisiae HM loci and telomeres, and silencing at the T. brucei ESs and the VSG gene arrays is striking. However, no Sir-related protein other than SIR2RP1 has been identified in T. brucei, and although SIR2RP1 is involved in silencing the immediate regions of the telomeres , it does not appear to play a direct role in ES silencing. In S. cerevisiae, ORC1 is thought to be the ancestral gene of SIR3, which arose after a genome duplication event . However, unlike yeast ORC1, the T. brucei ORC1 does not contain a BAH domain. It is therefore unclear whether it can substitute for SIR3 in T. brucei and therefore contribute to the propagation of a repressive heterochromatic structure at the telomeres and silent VSG arrays. Of all the mapped T. brucei ORC1 binding sites, 38% were found in the chromosomal cores, localizing to transcription boundaries, with the remaining 62% localizing to subtelomeric and silent VSG arrays. All active DNA origins of replication (ORI) were found in the chromosomal cores with no evidence of ORIs originating from subtelomeric sites or silent VSG arrays . It is possible that these non-replicative ORC1 binding sites have a repressive function at transcriptionally silent VSG-containing subtelomeric regions in T. brucei. A gradient of repression extends up to 10 kb from the telomeric ends in T. brucei , implying the propagation of a repressive chromatin structure. However, the proteins that participate in the establishment of this repressive structure remain unknown.
In summary, our genome-wide nucleosomal analysis revealed striking correlations, as well as stark differences, in the nucleosomal architecture of T. brucei compared to other model eukaryotes studied to date. These similarities suggest that some chromatin features, like the weak NDR upstream of PTUs, the strong positioning of nucleosomes on the internal SASs, the depletion of nucleosomes from PASs, and the nucleosomal organization at silent VSG arrays and ORC1 binding sites, were already established in LECA, before the divergence of the eukaryotic super groups. These findings, in conjunction with the co-localization of histone variants, histone and DNA modifications, and chromatin modulators, indicate the presence and importance of a functional epigenome in T. brucei, possibly providing a regulatory interface to genome regulation.
Trypanosome strains and culturing
Bloodstream form T. brucei Lister 427 was cultured in HMI-9 medium as previously described  supplemented with 15% fetal calf serum (FCS) and appropriate drugs at 37 °C under 5% CO2. Procyclic form trypanosomes were cultured in SDM-79 medium supplemented with 10% (v/v) FCS, 5 µg/ml hemin, and appropriate drugs at 27 °C . Two cell lines were chosen for both the BF and the PF life cycle stages to account for possible cell line-specific differences. These are the BF (HNI_VO2 and RYT3) and PF (Amsterdam wild-type and 221BsrDsRed) cell lines. HNI_VO2 cells have a hygromycin resistance gene in the silent VSG221 ES and a neomycin resistance gene in the active VO2 ES, providing single copy sequences which allow the differentiation of the silent and active VSG ESs . The RYT3 cell line has a blasticidin resistance gene in the active VSGT3 ES and an eGFP gene and a puromycin resistance gene in the silent VSG221 ES . The PF 221BsrDsRed cell line has a blasticidin resistance gene and a DsRed gene in the silent 221 ES .
Core particle preparation
MNase digestion of chromatin was performed as described . Briefly, 5 × 107 cells/sample were harvested by centrifugation (1200g, 5 min) and permeabilized with 400 µM digitonin for 5 min at room temperature (Sigma-Aldrich). Chromatin was digested with 1–32 units MNase (Worthington Biochemicals) for 5 min at 37 °C, followed by phenol–chloroform extraction and ethanol precipitation. Recovered DNA was resolved on a 2% (w/v) agarose gel. DNA fragments of ~147 bp were extracted from the gel using the freeze-and-squeeze technique (Bio-Rad laboratories, Hercules) and 50 nt from each end was sequenced by paired-end methodology (Illumina) as previously described .
Alignment to reference genome
The FASTQ sequence files were aligned to version 4.2 of the T. brucei 427 genome using Bowtie 2  allowing no mismatches, limiting alignments to 7 ambiguous (N) bases in the reference genome, and filtering for convergent primer pairs separated by between 145 and 155 bp, end-to-end. The detail of the alignment output is shown in Additional file 1: Table S2.
Dyad value files
The file of dyad values, listing the number of dyads assigned to each nucleotide position of each chromosome, was normalized to chromosome length to allow inter-chromosome comparisons, as well as normalized between different experiments, to allow analysis between different experimental conditions. We assumed that the median nucleosome density was the same between experimental conditions and chromosomes. The genomic positions of tandem repeats, previously identified with the TRF program , were downloaded from TriTrypDB, and the dyad values of all nucleotides that fell within tandem repeat sequences were set to “−1” with the program strip_tandem_repeats, and ignored in the calculation of average dyad densities. To mask the contribution of unmatched, N-rich sequences to the generation of spurious NDRs, all possible dyad positions of fragments between 145 and 155 bp, where either of the aligned 50-bp paired-end fragments exceeded the maximum cutoff for base ambiguity, were set to “−2” in the dyad value files, and ignored in subsequent calculations.
A binning analysis was performed with the program dyad_bins using the normalized dyad values. The lowest and highest numbers of dyad axes co-localized on single nucleotides, representing the minimum and maximum bin values, were identified. Dyad values associated with AT-rich repeats were masked as indicated. Bins intermediate to the minimum and maximum bins were defined at an increment of 1, and the number of occurrences of dyad values equal to each bin value identified in each file of dyad values. The number of members in each bin was normalized to the genome size to allow direct comparison between regions of different sizes. The number of samples in each bin was plotted with Gnuplot version 4.6.3.
Normalized dyad densities
Differences in the nucleosomal organization in BF and PF T. brucei
Alignment of dyads
The alignment of dyads relative to specific genomic positions was performed with the program align_dyads, which aligned the dyads in an orientation depending on whether the feature was on the Watson or the Crick strand, and reported the average value in a 50-bp running window.
The Kendall’s tau correlation and Pearson product moment correlation test were performed with R scripts.
Mapping of T. brucei 927 genes to T. brucei 427
The list of translated coding sequences of T. brucei strain 927 (Tb927) was downloaded from TriTypDB and BLASTed against a database of T. brucei 427 protein sequences (Tb427). This had been generated with version 7 of the fastA format translated CDS files downloaded from tritrypdb.org, using the makeblastdb. The homologous sequences were listed, and entries with E values <10−10 selected. Where Tb927 genes were mapped to multiple Tb427 genes, only the match with the smallest E value was selected. Where a Tb927 gene had more than one splice acceptor site, the site furthest upstream from the coding sequence was chosen as it was likely to be closest to the site of transcription initiation. The position of the splice acceptor sites in the genome sequence of the Tb427 was mapped using the program 927_to_427_map.
Mapping of ORC1 binding sites from Tb927 to Tb427
The ORC1 binding sites assigned with a signal ratio >1.0 and a FDR <0.05 were selected from the list of ORC1 sites (kindly provided by R. McCulloch) , and the corresponding sequences in Tb927 retrieved from TriTrypDB. The retrieved sequences were BLASTed against version 4.2 of the T. brucei Lister 427 genome, and the single, best hit for each query sequence recovered. Only target sequences that were >95% identical and with E values <10−10 were retained (1114 sites, Additional file 2: Table S3). The ORC1 site was defined as the center of each such recovered sequence.
Definition of chromosome regions
Subtelomeric regions were identified as the terminal regions of chromosomes enriched for retrotransposon hotspot proteins (RHS), expression site-associated genes (ESAGs), VSG genes and pseudogenes, and leucine-rich repeat protein (LRRP) genes. Subtelomeric regions were further divided into telomere–proximal (≤10% of the chromosomal length from the telomere) and chromosome internal/core (>10% of the chromosomal length away from telomere) subtelomeric regions.
Alignment to resistance markers
The hygromycin resistance gene (Hygromycin-B 4-O-kinase; accession number V01499) as well as the neomycin resistance marker (aminoglycoside 3′-phosphotransferase; accession number P00551) was indexed using Bowtie 2 with default settings, and the sequenced pairs for the BF HNI_VO2 aligned to the indexed sequences, allowing accordant alignments of sequence pairs resulting in fragment lengths of between 145 and 155 bp.
Probability of sequence repeats
Probability of finding an A10, G10, and AT10 repeat
Specific sequences were recovered from the genome of T. brucei using the program get_sequences. The enrichment of specific nucleotide oligomers was determined in aligned sequences using the program polyA_enrich. The number of sequence runs was normalized to the number of sequences analyzed, smoothed with a 50-nt running average, and expressed as a percentage of the smoothing window size. The probability of finding 10-bp homo- or hetero-oligomer motifs in the coding and noncoding regions of T. brucei 427 was calculated with a hypergeometric distribution function using the coding frequencies shown in Additional file 1: Table S4.
The transcription start regions identified in Tb927 were mapped to the equivalent genomic positions in Tb427 using the correlation list derived from the BLAST analysis explained above. The presence of upstream t-, r- or snRNA genes was verified for internal start regions. The list of mapped transcription start regions is shown in Additional file 3: Table S5.
Discrete Fourier analysis of dinucleotide frequencies
The presence of strong 10-bp A–A dinucleotide periodicities was assessed from the Fourier magnitude of the distribution in specific regions using the program hp_fftw that utilizes the FFTW library (www.fftw.org).
All software was written in C++ (ISO/IEC 14882:2011) and compiled with the g++ version 4.8.2 64-bit compiler (gcc.gnu.org) using the mingw-w64 version 4.8.2 (mingw-w64.sourceforge.net) toolchain on a Windows version 8.1 operating system platform. The program hp_fftw was compiled with g++ version 4.8 on Linux openSUSE version 12.3 (www.opensuse.org). All plots were prepared with scripts using gnuplot version 4.6 (www.gnuplot.info). The source code for software developed and used in this study is freely available (sourceforge.net/projects/nucpos/).
last eukaryotic common ancestor
- pol II:
RNA polymerase II
transcription start site
human African trypanosomiasis
variant surface glycoprotein
polycistronic transcription unit
strand switch region
- Base J:
splice acceptor site
- SL RNA:
spliced leader RNA
retrotransposon hotspot protein
expression site-associated gene
leucine-rich repeat protein
false discovery rate
origins of replication
fetal calf serum
HGP conceived and coordinated the study; JPM performed the experimental work; MLP performed experimental work; GR provided the strains and facilities for the experimental work; DJC performed the DNA repair and paired-end sequencing; HGP wrote the software; HGP and JPM performed the analysis and drafted the manuscript. All authors read and approved the final manuscript.
The authors thank Robert Schall for his advice on statistical analyses, Razvan Chereji for his assistance with the analysis of the nucleosome organization at TSSs, the NHLBI Sequencing Core Facility (Yan Luo, Poching Lu, Yoshi Wakabayashi and Jun Zhu), Nicolai Siegel for making his H2A.V ChIP-seq data available to us before publication, and Richard McCulloch for providing ORC1 data.
Johannes Petrus Maree and Hugh-George Patterton are members of the H3Africa Consortium.
The authors declare that they have no competing interests.
Availability of data and materials
The Trypanosoma brucei 427 reference genome (version 4.2), coding sequences, and genomic positions of tandem repeats are available at www.tritrypdb.org. The data files analyzed in this study are available from the GEO database Accession Number GSE90593.
This work was supported by the H3Africa program of the National Institutes of Health [Grant 1U01HG007465, to HGP] and in part by the Intramural Research Program of the National Institutes of Health [to DJC]. G.R. is a Wellcome Senior Research Fellow funded by the Wellcome Trust. The funding bodies did not contribute to the design of the study, collection, analysis, and interpretation of data, or to writing the manuscript.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- He D, Fiz-Palacios O, Fu C-J, Fehling J, Tsai C-C, Baldauf SL. An alternative root for the eukaryote tree of life. Curr Biol. 2014;24:465–70.View ArticlePubMedGoogle Scholar
- Cavalier-Smith T. Kingdoms Protozoa and Chromista and the eozoan root of the eukaryotic tree. Biol Lett. 2010;6:342–5.View ArticlePubMedGoogle Scholar
- Figueiredo LM, Cross GAM, Janzen CJ. Epigenetic regulation in African trypanosomes: a new kid on the block. Nat Rev Microbiol. 2009;7:504–13.View ArticlePubMedGoogle Scholar
- Maree JP, Patterton HG. The epigenome of Trypanosoma brucei: a regulatory interface to an unconventional transcriptional machine. Biochim Biophys Acta. 2014;1839:743–50.View ArticlePubMedPubMed CentralGoogle Scholar
- Duraisingh MT, Horn D. Epigenetic regulation of virulence gene expression in parasitic protozoa. Cell Host Microbe. 2016;19:629–40.View ArticlePubMedPubMed CentralGoogle Scholar
- Günzl A, Kirkham JK, Nguyen TN, Badjatia N, Park SH. Mono-allelic VSG expression by RNA polymerase I in Trypanosoma brucei: expression site control from both ends? Gene. 2015;556:68–73.View ArticlePubMedGoogle Scholar
- Reeve JN, Sandman K, Daniels CJ. Archaeal histones, nucleosomes, and transcription initiation. Cell. 1997;89:999–1002.View ArticlePubMedGoogle Scholar
- Kasinsky HE, Lewis JD, Dacks JB, Ausio J. Origin of H1 linker histones. FASEB J. 2001;15:34–42.View ArticlePubMedGoogle Scholar
- Sandman K, Reeve JN. Archaeal chromatin proteins: different structures but common function? Curr Opin Microbiol. 2005;8:656–61.View ArticlePubMedGoogle Scholar
- Peterson CL, Workman JL. Promoter targeting and chromatin remodeling by the SWI/SNF complex. Curr Opin Genet Dev. 2000;10:187–92.View ArticlePubMedGoogle Scholar
- Marsh VL, Peak-Chew SY, Bell SD. Sir2 and the acetyltransferase, Pat, regulate the archaeal chromatin protein, Alba. J Biol Chem. 2005;280:21122–8.View ArticlePubMedGoogle Scholar
- Simpson RT. Nucleosome positioning can affect the function of a cis-acting DNA element in vivo. Nature. 1990;343:387–9.View ArticlePubMedGoogle Scholar
- Hinz JM, Czaja W. Facilitation of base excision repair by chromatin remodeling. DNA Repair. 2015;36:91–7.View ArticlePubMedPubMed CentralGoogle Scholar
- Jiang C, Pugh BF. Nucleosome positioning and gene regulation: advances through genomics. Nat Rev Genet. 2009;10:161–72.View ArticlePubMedPubMed CentralGoogle Scholar
- Bai L, Morozov AV. Gene regulation by nucleosome positioning. Trends Genet. 2010;26:476–83.View ArticlePubMedGoogle Scholar
- Ammar R, Torti D, Tsui K, Gebbia M, Durbic T, Bader GD, et al. Chromatin is an ancient innovation conserved between Archaea and Eukarya. eLife. 2012;1:e00078.View ArticlePubMedPubMed CentralGoogle Scholar
- Fèvre EM, Wissmann BV, Welburn SC, Lutumba P. The burden of human African trypanosomiasis. In: Brooker S, editor. PLoS Negl Trop Dis. 2008;2:e333.
- Mogk S, Meiwes A, Boßelmann CM, Wolburg H, Duszenko M. The lane to the brain: how African trypanosomes invade the CNS. Trends Parasitol. 2014;30:470–7.View ArticlePubMedGoogle Scholar
- Simarro PP, Cecchi G, Franco JR, Paone M, Diarra A, Priotto G, et al. Monitoring the progress towards the elimination of gambiense human African trypanosomiasis. PLoS Negl Trop Dis. 2015;9:e0003785.View ArticlePubMedPubMed CentralGoogle Scholar
- Fenn K, Matthews KR. The cell biology of Trypanosoma brucei differentiation. Curr Opin Microbiol. 2007;10:539–46.View ArticlePubMedPubMed CentralGoogle Scholar
- Horn D. Antigenic variation in African trypanosomes. Mol Biochem Parasitol. 2014;195:123–9.View ArticlePubMedPubMed CentralGoogle Scholar
- Taylor JE, Rudenko G. Switching trypanosome coats: what’s in the wardrobe? Trends Genet. 2006;22:614–20.View ArticlePubMedGoogle Scholar
- Hertz-Fowler C, Figueiredo LM, Quail MA, Becker M, Jackson A, Bason N, et al. Telomeric expression sites are highly conserved in Trypanosoma brucei. PLoS ONE. 2008;3:e3527.View ArticlePubMedPubMed CentralGoogle Scholar
- Vink C, Rudenko G, Seifert HS. Microbial antigenic variation mediated by homologous DNA recombination. FEMS Microbiol Rev. 2012;36:917–48.View ArticlePubMedPubMed CentralGoogle Scholar
- Daniels J-P, Gull K, Wickstead B. Cell biology of the trypanosome genome. Microbiol Mol Biol Rev. 2010;74:552–69.View ArticlePubMedPubMed CentralGoogle Scholar
- Perrod S, Gasser SM. Long-range silencing and position effects at telomeres and centromeres: parallels and differences. Cell Mol Life Sci. 2003;60:2303–18.View ArticlePubMedGoogle Scholar
- Renauld H, Aparicio OM, Zierath PD, Billington BL, Chhablani SK, Gottschling DE. Silent domains are assembled continuously from the telomere and are defined by promoter distance and strength, and by SIR3 dosage. Genes Dev. 1993;7:1133–45.View ArticlePubMedGoogle Scholar
- Yang X, Figueiredo LM, Espinal A, Okubo E, Li B. RAP1 is essential for silencing telomeric variant surface glycoprotein genes in Trypanosoma brucei. Cell. 2009;137:99–109.View ArticlePubMedPubMed CentralGoogle Scholar
- Alsford S, Kawahara T, Isamah C, Horn D. A sirtuin in the African trypanosome is involved in both DNA repair and telomeric gene silencing but is not required for antigenic variation. Mol Microbiol. 2007;63:724–36.View ArticlePubMedGoogle Scholar
- Rudenko G. Epigenetics and transcriptional control in African trypanosomes. Essays Biochem. 2010;48:201–19.View ArticlePubMedGoogle Scholar
- Stanne TM, Kushwaha M, Wand M, Taylor JE, Rudenko G. TbISWI regulates multiple polymerase I (Pol I)-transcribed loci and is present at Pol II transcription boundaries in Trypanosoma brucei. Eukaryot Cell. 2011;10:964–76.View ArticlePubMedPubMed CentralGoogle Scholar
- Hughes K, Wand M, Foulston L, Young R, Harley K, Terry S, et al. A novel ISWI is involved in VSG expression site downregulation in African trypanosomes. EMBO J. 2007;26:2400–10.View ArticlePubMedPubMed CentralGoogle Scholar
- Benmerzouga I, Concepción-Acevedo J, Kim HS, Vandoros AV, Cross GAM, Klingbeil MM, et al. Trypanosoma brucei Orc1 is essential for nuclear DNA replication and affects both VSG silencing and VSG switching. Mol Microbiol. 2013;87:196–210.View ArticlePubMedGoogle Scholar
- Denninger V, Fullbrook A, Bessat M, Ersfeld K, Rudenko G. The FACT subunit TbSpt16 is involved in cell cycle specific control of VSG expression sites in Trypanosoma brucei. Mol Microbiol. 2010;78:459–74.View ArticlePubMedPubMed CentralGoogle Scholar
- Wang QP, Kawahara T, Horn D. Histone deacetylases play distinct roles in telomeric VSG expression site silencing in African trypanosomes. Mol Microbiol. 2010;77:1237–45.View ArticlePubMedPubMed CentralGoogle Scholar
- Povelones ML, Gluenz E, Dembek M, Gull K, Rudenko G. Histone H1 plays a role in heterochromatin formation and VSG expression site silencing in Trypanosoma brucei. In: Ullu E, editor. PLoS Pathog. 2012;8:e1003010.
- Pena AC, Pimentel MR, Manso H, Vaz-Drago R, Pinto-Neves D, Aresta-Branco F, et al. T rypanosoma brucei histone H1 inhibits RNA polymerase I transcription and is important for parasite fitness in vivo. Mol Microbiol. 2014;93:645–63.View ArticlePubMedPubMed CentralGoogle Scholar
- Günzl A, Bruderer T, Laufer G, Schimanski B, Tu L, Chung H, et al. RNA polymerase I transcribes procyclin genes and variant surface glycoprotein gene expression sites in Trypanosoma brucei. Eukaryot Cell. 2003;2:542–51.View ArticlePubMedPubMed CentralGoogle Scholar
- Berriman M, Ghedin E, Hertz-Fowler C, Blandin G, Renauld H, Bartholomeu DC, et al. The genome of the African trypanosome Trypanosoma brucei. Science. 2005;309:416–22.View ArticlePubMedGoogle Scholar
- Clayton CE. Networks of gene expression regulation in Trypanosoma brucei. Mol Biochem Parasitol. 2014;195:96–106.View ArticlePubMedGoogle Scholar
- Kolev NG, Franklin JB, Carmi S, Shi H, Michaeli S, Tschudi C. The transcriptome of the human pathogen Trypanosoma brucei at single-nucleotide resolution. In: Beverley SM, editor. PLoS Pathog. 2010;6:e1001090.
- Siegel TN, Hekstra DR, Kemp LE, Figueiredo LM, Lowell JE, Fenyo D, et al. Four histone variants mark the boundaries of polycistronic transcription units in Trypanosoma brucei. Genes Dev. 2009;23:1063–76.View ArticlePubMedPubMed CentralGoogle Scholar
- Wright JR, Siegel TN, Cross GAM. Histone H3 trimethylated at lysine 4 is enriched at probable transcription start sites in Trypanosoma brucei. Mol Biochem Parasitol. 2010;172:141–4.View ArticlePubMedPubMed CentralGoogle Scholar
- Mandava V, Fernandez JP, Deng H, Janzen CJ, Hake SB, Cross GAM. Histone modifications in Trypanosoma brucei. Mol Biochem Parasitol. 2007;156:41–50.View ArticlePubMedPubMed CentralGoogle Scholar
- Schulz D, Zaringhalam M, Papavasiliou FN, Kim H-S. Base J and H3.V regulate transcriptional termination in Trypanosoma brucei. PLoS Genet. 2016;12:e1005762.View ArticlePubMedPubMed CentralGoogle Scholar
- Reynolds D, Hofmeister BT, Cliffe L, Alabady M, Siegel TN, Schmitz RJ, et al. Histone H3 variant regulates RNA polymerase II transcription termination and dual strand transcription of siRNA loci in Trypanosoma brucei. In: Figueiredo L, editor. PLoS Genet. 2016;12:e1005758.
- Van Luenen HGAM, Farris C, Jan S, Genest PA, Tripathi P, Velds A, et al. Glucosylated hydroxymethyluracil, DNA base J, prevents transcriptional readthrough in Leishmania. Cell. 2012;150:909–21.View ArticlePubMedPubMed CentralGoogle Scholar
- Nagarajavel V, Iben JR, Howard BH, Maraia RJ, Clark DJ. Global “bootprinting” reveals the elastic architecture of the yeast TFIIIB–TFIIIC transcription complex in vivo. Nucleic Acids Res. 2013;41:8135–43.View ArticlePubMedPubMed CentralGoogle Scholar
- Schimanski B, Nguyen TN, Gunzl A. Characterization of a multisubunit transcription factor complex essential for spliced-leader RNA gene transcription in Trypanosoma brucei. Mol Cell Biol. 2005;25:7303–13.View ArticlePubMedPubMed CentralGoogle Scholar
- Birch JL, Zomerdijk JCBM. Structure and function of ribosomal RNA gene chromatin. Biochem Soc Trans. 2008;36:619–24.View ArticlePubMedGoogle Scholar
- Siegel TN, Hekstra DR, Wang X, Dewell S, Cross GAM. Genome-wide analysis of mRNA abundance in two life-cycle stages of Trypanosoma brucei and identification of splicing and polyadenylation sites. Nucleic Acids Res. 2010;38:4946–57.View ArticlePubMedPubMed CentralGoogle Scholar
- Giaever G, Chu AM, Ni L, Connelly C, Riles L, Véronneau S, et al. Functional profiling of the Saccharomyces cerevisiae genome. Nature. 2002;418:387–91.View ArticlePubMedGoogle Scholar
- Palenchar JB, Bellofatto V. Gene transcription in trypanosomes. Mol Biochem Parasitol. 2006;146:135–41.View ArticlePubMedGoogle Scholar
- Kogan S, Trifonov EN. Gene splice sites correlate with nucleosome positions. Gene. 2005;352:57–62.View ArticlePubMedGoogle Scholar
- Brogaard K, Xi L, Wang J-P, Widom J. A map of nucleosome positions in yeast at base-pair resolution. Nature. 2012;486:496–501.PubMedPubMed CentralGoogle Scholar
- Mavrich TN, Jiang C, Ioshikhes IP, Li X, Venters BJ, Zanton SJ, et al. Nucleosome organization in the Drosophila genome. Nature. 2008;453:358–62.View ArticlePubMedPubMed CentralGoogle Scholar
- Lombraña R, Álvarez A, Fernández-Justel JM, Almeida R, Poza-Carrión C, Gomes F, et al. Transcriptionally driven DNA replication program of the human Parasite Leishmania major. Cell Rep. 2016;16:1774–86.View ArticlePubMedGoogle Scholar
- Jin C, Felsenfeld G. Nucleosome stability mediated by histone variants H3.3 and H2A.Z. Genes Dev. 2007;21:1519–29.View ArticlePubMedPubMed CentralGoogle Scholar
- Martínez-Calvillo S, Vizuet-De-Rueda JC, Florencio-Martínez LE, Manning-Cela RG, Figueroa-Angulo EE. Gene expression in trypanosomatid parasites. J Biomed Biotechnol. 2010;2010:525241.View ArticlePubMedPubMed CentralGoogle Scholar
- Ling X, Harkness TAA, Schultz MC, Fisher-Adams G, Grunstein M. Yeast histone H3 and H4 amino termini are important for nucleosome assembly in vivo and in vitro: redundant and position-independent functions in assembly but not in gene regulation. Genes Dev. 1996;10:686–99.View ArticlePubMedGoogle Scholar
- Huang H, Liu H, Sun X. Nucleosome distribution near the 3′ ends of genes in the human genome. Biosci Biotechnol Biochem. 2013;77:2051–5.View ArticlePubMedGoogle Scholar
- Mavrich TN, Ioshikhes IP, Venters BJ, Jiang C, Tomsho LP, Qi J, et al. A barrier nucleosome model for statistical positioning of nucleosomes throughout the yeast genome. Genome Res. 2008;18:1073–83.View ArticlePubMedPubMed CentralGoogle Scholar
- Satchwell SC, Drew HR, Travers AA. Sequence periodicities in chicken nucleosome core DNA. J Mol Biol. 1986;191:659–75.View ArticlePubMedGoogle Scholar
- Nelson HC, Finch JT, Luisi BF, Klug A. The structure of an oligo(dA).oligo(dT) tract and its biological implications. Nature. 1987;330:221–6.View ArticlePubMedGoogle Scholar
- Vazquez MP, Mualem D, Bercovich N, Stern MZ, Nyambega B, Barda O, et al. Functional characterization and protein–protein interactions of trypanosome splicing factors U2AF35, U2AF65 and SF1. Mol Biochem Parasitol. 2009;164:137–46.View ArticlePubMedGoogle Scholar
- Lee T-Y, Huang H-D, Hung J-H, Huang H-Y, Yang Y-S, Wang T-H. dbPTM: an information repository of protein post-translational modification. Nucleic Acids Res. 2006;34:D622–7.View ArticlePubMedGoogle Scholar
- Liang X, Haritan A, Uliel S. Trans and cis splicing in trypanosomatids: mechanism, factors, and regulation. Eukaryot Cell. 2003;2:830–40.View ArticlePubMedPubMed CentralGoogle Scholar
- Struhl K, Segal E. Determinants of nucleosome positioning. Nat Struct Mol Biol. 2013;20:267–73.View ArticlePubMedPubMed CentralGoogle Scholar
- Clayton CE. Life without transcriptional control? From fly to man and back again. EMBO J. 2002;21:1881–8.View ArticlePubMedPubMed CentralGoogle Scholar
- Kramer S. Developmental regulation of gene expression in the absence of transcriptional control: the case of kinetoplastids. Mol Biochem Parasitol. 2012;181:61–72.View ArticlePubMedGoogle Scholar
- Stanne TM, Rudenko G. Active VSG expression sites in Trypanosoma brucei are depleted of nucleosomes. Eukaryot Cell. 2010;9:136–47.View ArticlePubMedGoogle Scholar
- Figueiredo LM, Cross GAM. Nucleosomes are depleted at the VSG expression site transcribed by RNA polymerase I in African trypanosomes. Eukaryot Cell. 2010;9:148–54.View ArticlePubMedGoogle Scholar
- Haber JE. Mating-type genes and MAT switching in Saccharomyces cerevisiae. Genetics. 2012;191:33–64.View ArticlePubMedPubMed CentralGoogle Scholar
- Tiengwe C, Marcello L, Farr H, Dickens N, Kelly S, Swiderski M, et al. Genome-wide analysis reveals extensive functional interaction between DNA replication initiation and transcription in the genome of Trypanosoma brucei. Cell Rep. 2012;2:185–97.View ArticlePubMedPubMed CentralGoogle Scholar
- Narayanan MS, Rudenko G. TDP1 is an HMG chromatin protein facilitating RNA polymerase I transcription in African trypanosomes. Nucleic Acids Res. 2013;41:2981–92.View ArticlePubMedPubMed CentralGoogle Scholar
- Pandya UM, Sandhu R, Li B. Silencing subtelomeric VSGs by Trypanosoma brucei RAP1 at the insect stage involves chromatin structure changes. Nucleic Acids Res. 2013;41:7673–82.View ArticlePubMedPubMed CentralGoogle Scholar
- McCall M, Brown T, Kennard O. The crystal structure of d(G-G-G-G-C-C-C-C). A model for poly(dG).poly(dC). J Mol Biol. 1985;183:385–96.View ArticlePubMedGoogle Scholar
- Ruan J-P, Arhin GK, Ullu E, Tschudi C. Functional characterization of a Trypanosoma brucei TATA-binding protein-related factor points to a universal regulator of transcription in trypanosomes. Mol Cell Biol. 2004;24:9610–8.View ArticlePubMedPubMed CentralGoogle Scholar
- Schwartz S, Meshorer E, Ast G. Chromatin organization marks exon–intron structure. Nat Struct Mol Biol. 2009;16:990–5.View ArticlePubMedGoogle Scholar
- Adl SM, Simpson AGB, Lane CE, Lukeš J, Bass D, Bowser SS, et al. The revised classification of eukaryotes. J Eukaryot Microbiol. 2012;59:429–514.View ArticlePubMedPubMed CentralGoogle Scholar
- Ullu E, Matthews KR, Tschudi C. Temporal order of RNA-processing reactions in trypanosomes: rapid trans splicing precedes polyadenylation of newly synthesized tubulin transcripts. Mol Cell Biol. 1993;13:720–5.View ArticlePubMedPubMed CentralGoogle Scholar
- Matthews KR, Tschudi C, Ullu E. A common pyrimidine-rich motif governs trans-splicing and polyadenylation of tubulin polycistronic pre-mRNA in trypanosomes. Genes Dev. 1994;8:491–501.View ArticlePubMedGoogle Scholar
- Nyambega B, Helbig C, Masiga DK, Clayton C, Levin MJ. Proteins associated with SF3a60 in T. brucei. PLoS ONE. 2014;9:e91956.View ArticlePubMedPubMed CentralGoogle Scholar
- Naguleswaran A, Gunasekera K, Schimanski B, Heller M, Hemphill A, Ochsenreiter T, et al. Trypanosoma brucei RRM1 is a nuclear RNA-binding protein and modulator of chromatin structure. MBio. 2015;6:e00114–5.View ArticlePubMedPubMed CentralGoogle Scholar
- Baumann M, Mamais A, McBlane F, Xiao H, Boyes J. Regulation of V(D)J recombination by nucleosome positioning at recombination signal sequences. EMBO J. 2003;22:5197–207.View ArticlePubMedPubMed CentralGoogle Scholar
- Getun IV, Wu ZK, Bois PRJ. Organization and roles of nucleosomes at mouse meiotic recombination hotspots. Nucleus. 2012;3:244–50.View ArticlePubMedPubMed CentralGoogle Scholar
- Chen X, Chen Z, Chen H, Su Z, Yang J, Lin F, et al. Nucleosomes suppress spontaneous mutations base-specifically in eukaryotes. Science. 2012;335:1235–8.View ArticlePubMedGoogle Scholar
- Hickman MA, Rusche LN. Transcriptional silencing functions of the yeast protein Orc1/Sir3 subfunctionalized after gene duplication. Proc Natl Acad Sci USA. 2010;107:19384–9.View ArticlePubMedPubMed CentralGoogle Scholar
- Hirumi H, Hirumi K. Continuous cultivation of Trypanosoma brucei blood stream forms in a medium containing a low concentration of serum protein without feeder cell layers. J Parasitol. 1989;75:985–9.View ArticlePubMedGoogle Scholar
- Brun R, Schönenberger. Cultivation and in vitro cloning or procyclic culture forms of Trypanosoma brucei in a semi-defined medium. Short communication. Acta Trop. 1979;36:289–92.PubMedGoogle Scholar
- Rudenko G, Chaves I, Dirks-Mulder A, Borst P. Selection for activation of a new variant surface glycoprotein gene expression site in Trypanosoma brucei can result in deletion of the old one. Mol Biochem Parasitol. 1998;95:97–109.View ArticlePubMedGoogle Scholar
- Cole HA, Howard BH, Clark DJ. Genome-wide mapping of nucleosomes in yeast using paired-end sequencing. Methods Enzymol. 2012;513:145–68.View ArticlePubMedGoogle Scholar
- Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9:357–9.View ArticlePubMedPubMed CentralGoogle Scholar
- Benson G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 1999;27:573–80.View ArticlePubMedPubMed CentralGoogle Scholar