Academia.eduAcademia.edu

Chromatin and extracellular vesicle associated sperm RNAs

2015, Nucleic acids research

A diverse pool of RNAs remain encapsulated within the transcriptionally silent spermatozoon despite the dramatic reduction in cellular and nuclear volume following cytoplasm/nucleoplasm expulsion. The impact of this pronounced restructuring on the distribution of transcripts inside the sperm essentially remains unknown. To define their compartmentalization, total RNA >100 nt was extracted from sonicated (SS) mouse spermatozoa and detergent demembranated sucrose gradient fractionated (Cs/Tx) sperm heads. Sperm RNAs predominately localized toward the periphery. The corresponding distribution of transcripts and thus localization and complexity were then inferred by RNA-seq. Interestingly, the number of annotated RNAs in the CsTx sperm heads exhibiting reduced peripheral enrichment was restricted. However this included Cabyr, the calcium-binding tyrosine phosphorylation-regulated protein encoded transcript. It is present in murine zygotes prior to the maternal to the zygotic transiti...

Published online 15 June 2015 Nucleic Acids Research, 2015, Vol. 43, No. 14 6847–6859 doi: 10.1093/nar/gkv591 Chromatin and extracellular vesicle associated sperm RNAs Graham D. Johnson1 , Paula Mackie2 , Meritxell Jodar1,3 , Sergey Moskovtsev2,4 and Stephen A. Krawetz1,3,* 1 Center for Molecular Medicine and Genetics, Wayne State University School of Medicine, Detroit, MI 48201, USA, CReATe Fertility Centre, Toronto, ON, M5G 1N8, Canada, 3 Department of Obstetrics and Gynecology, Wayne State University School of Medicine, Detroit, MI 48201, USA and 4 Department of Obstetrics and Gynaecology, University of Toronto, ON, M5G 1E2, Canada 2 Received November 7, 2014; Revised May 16, 2015; Accepted May 23, 2015 ABSTRACT INTRODUCTION A diverse pool of RNAs remain encapsulated within the transcriptionally silent spermatozoon despite the dramatic reduction in cellular and nuclear volume following cytoplasm/nucleoplasm expulsion. The impact of this pronounced restructuring on the distribution of transcripts inside the sperm essentially remains unknown. To define their compartmentalization, total RNA >100 nt was extracted from sonicated (SS) mouse spermatozoa and detergent demembranated sucrose gradient fractionated (Cs/Tx) sperm heads. Sperm RNAs predominately localized toward the periphery. The corresponding distribution of transcripts and thus localization and complexity were then inferred by RNA-seq. Interestingly, the number of annotated RNAs in the CsTx sperm heads exhibiting reduced peripheral enrichment was restricted. However this included Cabyr, the calciumbinding tyrosine phosphorylation-regulated protein encoded transcript. It is present in murine zygotes prior to the maternal to the zygotic transition yet absent in oocytes, consistent with the delivery of internally positioned sperm-borne RNAs to the embryo. In comparison, transcripts enriched in sonicated sperm contributed to the mitochondria and exosomes along with several nuclear transcripts including the metastasis associated lung adenocarcinoma transcript 1 (Malat1) and several small nucleolar RNAs. Their preferential peripheral localization suggests that chromatin remodeling during spermiogenesis is not limited to nucleoproteins as part of the nucleoprotein exchange. Following spermatogenesis the mature male gamete remains in a transcriptionally and translationally quiescent state yet harbors a unique population of RNAs (1–4). Many of these transcripts possess a clearly deined role during spermatogenesis providing a rich source for the discovery of male fertility biomarkers (5). Still other sperm RNAs await functional assignment. Notably, these sequences in sperm may not require translation to be of functional importance within the cell or following fertilization (6,7). Regardless of their coding potential attributing function to sperm RNAs has been complicated by a poor understanding of basic principles governing these transcripts including their cellular localization. Some RNAs likely remain bound to protein chaperones as ribonucleoprotein particles that are observed following their transcription in round spermatids (8,9). Yet, others, may complex with the paternal chromatin (10,11). Nevertheless, assigning functional roles to all sperm transcripts necessitates understanding their cellular localization. Various studies have used luorescence in situ hybridization to establish the presence of speciic RNAs within the context of the spermatozoon (12,13). However, localizing an internally positioned target within the mature sperm head is challenging due to the extreme compaction of this structure. The stability of proteins relative to RNAs offers greater lexibility in their immunological localization as chemical agents can be employed to increase penetrance and thereby the chance of detection (14). However, microscopy is not conducive to identifying the major sites of sperm RNA compartmentalization in a high-throughput manner. As illustrated in Figure 1, the extra-nuclear and intranuclear compartments provide at least two sites within the limited volume of the sperm head that should be capable of harboring RNA. Polymerase chain reaction (PCR) analysis of others has shown that speciic transcripts are lost when sperm nuclei are demembranated (15). This is consistent * To whom correspondence should be addressed. Tel: +1 313 577 6770; Fax: +1 313 577 8554; Email: [email protected]  C The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact [email protected] 6848 Nucleic Acids Research, 2015, Vol. 43, No. 14 Figure 1. Potential sites of RNA localization in sperm. RNAs extracted from sonicated sperm (SS) can localize to three generalized compartments: (i) Mitochondria; (ii) extra-nuclear compartment that includes the plasma membrane, the acrosome and associated membranes; (iii) the intra-nuclear compartment which includes the nucleus, the nuclear envelope and the perinuclear theca. Fractionation and demembranation produces a population of sperm heads that remain associated with the perinuclear theca but lack the extra-nuclear compartment as well as the mitochondrial sheath and tail. RNA-seq analyses of transcripts extracted from SS or Cs/TX heads were used to identify compartment speciic patterns of RNA enrichment. with previous detergent based sperm puriication methods which are expected to compromise membranes, impacting RNA-seq proiles (16). Fractionation of nuclei and subsequent isolation and analysis by RNA-seq is a routine procedure in somatic cells (17) but has yet to be attempted in sperm. To discern the global pattern of transcript compartmentalization in mouse spermatozoa, total RNA was extracted from sonicated sperm (SS) and detergent demembranated gradient fractionated (Cs/TX) sperm heads. RNA quantiication demonstrated that the majority of RNA in sperm is associated with the peripheral membranes which are lost following treatment with detergent. To identify sperm transcripts that exhibited preferential enrichment within the intra- or extra-nuclear compartments (Figure 1) RNAs extracted from SS and Cs/TX sperm heads were subjected to RNA-seq analysis. The Cs/TX heads exhibited suppressed coverage of annotated RNAs supporting their general depletion from the nucleus and perinuclear theca and localization within the outer sperm membranes. As expected and observed by RNA-seq, transcripts enriched in these samples displayed a reduced peripheral enrichment when evaluated by RT-PCR and included RNAs associated with the cytoskeleton and spermatogenesis. In comparison, within the SS samples, RNA-seq analysis highlighted a set of preferentially extra-nuclear localized RNAs, many of which are associated with extracellular vesicles. This association was supported by a cross-species comparison of the mouse and human homologs enriched in the sperm extra-nuclear compartment and exosomes recovered from semen. These re- sults contribute to the growing evidence for the presence of exosomes on the surfaces of the male gamete (18–24). Additional classes of RNAs also appeared to be associated with the outer sperm membranes including nuclear-encoded mitochondrial transcripts and some nuclear RNAs. Their preferential peripheral localization suggests that chromatin remodeling during spermiogenesis is not limited to nucleoproteins as described in the following. MATERIALS AND METHODS Sonicated and Cs/Tx sperm head preparation and RNA extraction Sperm fractions were prepared and RNAs extracted essentially as described (Figure 1;25,26,27). In brief, mature spermatozoa from transgenic line HP3.1 (28) were isolated from cauda epididymides and vas deferens harvested from individual four month old transgenic mice on ice into 50 mM Tris–HCl, pH 7.4, buffer. The cells were washed twice following iltration through an 80 micron mesh, resuspended in 0.5 ml 50 mM Tris–HCl, pH 7.4, buffer and subjected to sonication with a TekMar TM-50 sonic disruptor (TekMar, Cincinnati, OH, USA) at 70% maximum output for 2 min on ice to separate heads from tails and to lyse potential cellular contaminants. The sonicated sperm suspension was washed twice and 1 – 5 × 107 sperm per sample were diluted to a volume of 7.5 ml in 1 M sucrose buffered with 50 mM Tris–HCl, pH 7.4, containing 5 mM MgCl2 . A triplestep gradient was prepared by overlaying diluted samples onto cushions consisting of 2 M sucrose buffered with 50 Nucleic Acids Research, 2015, Vol. 43, No. 14 6849 mM Tris–HCl, pH 7.4, containing 5 mM MgCl2 and 0.45 g/ml CsCl buffered with 25 mM Tris–HCl, pH 7.4, buffer containing 0.5% Triton X-100. Sperm heads were recovered by ultracentrifugation at 75 600 × rcf for 45 min at 4◦ C and subsequently washed twice with 25 mM Tris–HCl, pH 7.4, buffer containing 0.5% Triton X-100. Following resuspension in 0.5 ml RLT, a guanidine-thiocyanate lysis buffer (Qiagen) supplemented with 1.5% ␤-mercaptoethanol (Amresco), 0.2 mm stainless steel beads were added to the samples and the Cs/TX sperm heads mechanically lysed with a Disruptor Genie (Scientiic Industries, Inc., Bohemia, NY, USA). After the addition of an equal volume of Qiazol (Qiagen) nucleic acids were recovered using the RNeasy system (Qiagen). Following sonication and washing sonicated sperm RNAs were equivalently extracted. Total RNAs were DNased (Turbo DNase, Ambion) and subsequently subjected to RT-PCR with intron-spanning primers to Prm2 (Supplementary Dataset 1; 28). Isolation of RNA from human sperm and seminal vesicles Normozoospermic semen (WHO 2010) was collected from healthy donors with proven fertility by masturbation after 2–5 days of sexual abstinence. Following analysis excess semen was stored at −80◦ C. Samples were thawed at room temperature for 20 min and exosomes were isolated from the seminal luid by differential centrifugation at 4◦ C as follows: 3,000 rcf for 10 min to pellet sperm fraction which was stored at −80◦ C, followed by 12,000 rcf for 45 min to pellet cellular debris and larger vesicles. The supernatant was centrifuged at 110,000 rcf (Beckman Coulter Optimax MLA-13 rotor) for 70 min. The resulting exosomal pellet was washed with 1 ml Phosphate buffered saline (Dulbecco’s Phopshate Buffered saline), pelleted again at 110 000 rcf for 70 min and stored at −80◦ C until RNA extraction. The sperm pellet was thawed and processed through a 50% PureSperm (Nidacon) cushion prior to RNA extraction. Exosomal and sperm RNA-seq libraries were prepared as outlined below. Transmission electron microscopy Aliquots of SS and Cs/TX murine sperm were subjected to transmission electron microscopy at the Microscopy Imaging Laboratory (University of Toronto, ON, USA) using standard protocols. Briely, cells were pelleted by centrifugation, ixed in a 4% paraformaldehyde with 1% glutaraldehyde buffered with 0.1 M sodium phosphate monobasic buffer (NaH2 PO4 ), pH 7.2, at 4◦ C and subsequently postixed with 1% osmium tetroxide buffered with 0.1 M sodium phosphate dibasic buffer (Na2 HPO4 ), pH 7.2, for 1 h. Dehydration of the cells was achieved with a series of increasing concentrations of ethanol to allow for iniltration of the Epon embedding resin. The resin was sectioned on an ultramicrotome (Leica) and transferred to a copper transmission electron microscopy (TEM) grid. Reynolds lead citrate and 5% uranyl acetate was used for contrast staining and micrographs were taken on a Hitachi H 7000 at 75 Kv (25uA beam current) using an AMT XR-60 digital camera. Sequencing library preparation and analysis Stock synthetic spike-in External RNA Controls Consortium ERCC (ERCC) RNAs (Invitrogen) were diluted 1:10 000 and pooled with 5 ng of total sperm RNAs prior to reverse transcription and ampliication using the Seqplex system (Sigma; 29). Pre-ampliied cDNA libraries were subjected to sequencing library construction (DNA UltraLow, NEB) followed by 50 cycles of paired-end sequencing on the Illumina Hi-Seq 2500 platform. Sequencing reads were aligned simultaneously to the mouse genome assembly Mm10 in addition to the ERCC FASTA sequences with Tophat2 (version 2.0.12; 30) using the following parameters: tophat2 -r 30 –mate-std-dev50 –no-coverage-search -G genes.gtf. Novel transcript structures were assembled using Cuflinks (version 2.1.1; 31) after removal of PCRduplicates with Picard tools (http://picard.sourceforge.net). Assembled transcripts exceeding twice the average fragment length (135 bp) were combined with UCSC gene annotations (Mm10). Preliminary analysis of aligned sequencing reads was carried out with the Samtools (version 0.1.19; 32) and Bedtools (version v2.19.1–2; 33) suites. Ranked predicted fold change and RPKM values for enrichment analyzes were calculated using uniquely aligned reads and the GFold package (34). A lower limit of detection was determined by analysis of ERCC coverage. Log normalized ERCC RPKM values were well correlated across samples (r2 > 0.90; Supplementary Figure S1) demonstrating that inter-sample comparisons of RNAs exceeding 116 RPKM were reliable. To derive enrichment predictions the Cs/TX samples were independently compared to the SS sample and the intersection of transcripts with an absolute GFold value greater than zero and a RPKM value exceeding the ERCC cutoff were considered for RT-PCR validation. Differential enrichment analysis of the human ejaculate exosomes and human sperm RNA-seq samples as well as mouse embryo and oocyte analyses were performed with both with the HT-Seq and DEseq2 software packages (35–37) as well as GFold (38). Plots and browser graphics were generated using the ggplot2 R package and the UCSC genome browser, respectively. Ontological analyzes were performed using GOrilla and associations presented as False Discovery Rate corrected q-values (39). A hypergeometric probability function in R was used to assess the likelihood of overlap observed between sperm datasets and nuclear encoded mitochondrial transcripts taken from the Mitocarta database (40) and human ejaculate RNA-seq datasets. Genomic coordinates of repeat elements were retrieved from the Repeat Master track of the UCSC genome browser and analyzed with Bedtools. To determine the relative coverage of repeat families unmapped and multiply-mapped reads were aligned to the rodent RepBase v19.0 repeat sequences (41). Following alignment the number of uniquely aligned sequencing reads corresponding to each repeat family was summed and those values combined with those obtained from the initial unique alignments from the same sample. Estimates of subfamily abundance of sperm LINEs were determined by equivalently subsampling and aligning FASTQ iles to canonical sequences for each of the actively 6850 Nucleic Acids Research, 2015, Vol. 43, No. 14 transcribed mouse LINE elements retrieved from RepBase (42). Only uniquely aligned sequencing reads were considered. RT-PCR validation Samples (n = 6) used for RT-PCR validation of the RNAseq results were isolated as above with minor changes. Sonicated and washed sperm from a single mouse were counted on a hemocytometer and split into two equal aliquots. Subsequently sperm were either subjected to ultracentrifugation through a triple-step gradient or left on ice prior to nucleic acid extraction. SS and Cs/TX pellets were divided into two aliquots and used for separate RNA and DNA extractions. Prior to RNA extraction sperm samples were lyzed with RLT buffer supplemented with 1.5% ␤-mercaptoethanol and Qiazol as above and following disruption with 0.2 mm stainless steel beads stored at −80◦ C. Sperm pellets were digested with proteinaseK in the presence of ␤-mercaptoethanol and DNAs recovered by phenol–chloroform extraction with phase-lock tubes (5Prime) followed by ethanol precipitation with linear acrylamide. DNAs were quantiied with Picogreen (Invitrogen) and used to equilibrate the stored RNA extractions with respect to sperm concentration per volume of lysate. For each individual animal equal volumes of the SS and Cs/TX lysate were passed through the RNeasy column such that RNA was recovered from equivalent number of match sonicated and demembranated sperm heads. Subsequently, eluted RNAs were DNased as above prior to quantiication with Ribogreen (Invitrogen). First-strand cDNA was synthesized using equal volumes of total RNA with the addition of 1 ␮l of diluted ERCC, (1:1000), 40 units of RNase Block (Agilent), 100 ng of random primers (Invitrogen) and the Superscript III kit (Invitrogen). Diluted SS and Cs/TX cDNA products were analyzed in parallel triplicate reactions by real-time PCR using Hot-Star Taq Polymerase system (Qiagen) and 0.5 mM of each forward and reverse oligonucleotide primer (Supplemntary Datatset 1) for 50 cycles. Delta-Ct values are calculated as follows: Ct = Ct Cs/TX − Ct SS . RESULTS AND DISCUSSION Isolation of compartmentalized RNAs The highly condensed mature spermatozoon possesses distinct structural features and organelles that may harbor sitespeciic populations of RNAs. To elucidate whether patterns of RNA retention vary between the peripheral structures contained in the extra-nuclear compartment of the gamete and those found within the intra-nuclear compartment, sperm samples were briely sonicated and aliquots subjected to gradient fractionation in the presence of detergent. Electron microscopy conirmed the separation of sperm heads and tails following sonication as well as the absence of contaminating cell types (Figure 2). Despite the absence of an intact acrosome the sonicated sperm (SS) heads retained the inner acrosomal membrane (IAM) as well as portions of the outer acrosomal membrane (OAM; 43). Ultracentrifugation of the SS heads through a sucrose cesium chloride step gradient and detergent demembranation produced sperm heads (Cs/TX) free of tail and mitochondrial sheath remnants. In addition, the Cs/TX heads lacked all peripheral membranes while retaining the perinuclear theca. This network of cytosolic and nuclear proteins attaches the inner acrosomal membrane to the nuclear envelope and is resistant to extraction with both ionic and non-ionic detergents such as Triton X-100 used in this study (43,44). Isolation of RNAs from SS and Cs/TX heads demonstrated that the long RNA fraction (>100 nt) in sperm is largely peripherally localized (Figure 3). Considering the RNA recovered from Cs/TX sperm head fraction, 61.45% of the 87 femtograms of RNA retained in each spermatozoon must be found within the membranes removed during isolation of the nucleus and perinuclear theca (Figures 1 and 2). This supports prior work demonstrating that treatment with detergents alters the transcript proile of the sperm cell (16). Resolution of whether the RNAs recovered from the Cs/TX heads reside within the nucleus and/or the perinuclear theca remains a technical challenge. To better understand patterns of transcript localization throughout the male gamete RNA from the SS and Cs/TX sperm heads was subjected to RNA-seq analysis. Library characteristics Mitochondrial encoded transcripts. The eficacy of the sample fractionation established by microscopy was conirmed by the reduced sequencing coverage of the mitochondrial genome in the Cs/TX sperm heads (Figure 4A). In contrast, the SS heads exhibited a 10-fold enrichment in coverage of mitochondrial RNAs corresponding to >7% of all uniquely aligning reads. This value is within the range (3.5–36%) observed in other mouse sperm RNA-seq datasets (3,4,45) indicative of signiicant differences in sample and library preparation that has been resolved (16). Ribosomal RNAs. In addition to a greater proportion of mitochondrial transcripts, the SS heads possessed greater amounts of ribosomal RNAs (rRNAs, Figure 4A). Together these transcripts and their associated pseudogenes constituted 28 and 17% of all unique sequencing reads in the SS and Cs/TX libraries, respectively, even though unlike other cell types, sperm rRNAs are fragmented (46,47). It is not obvious why the ribosomal transcripts persist at elevated levels in both sperm fractions. Within the SS samples the elevated levels of rRNA relect the additional contribution of membrane-associated ribosomal transcripts in addition to those retained following isolation of the Cs/TX sperm head structures (Figures 1 and 2). The abundance of the rRNAs in SS relative to demembranated Cs/TX samples may relect entrapment of cytoplasmic constituents during condensation and concurrent membrane restructuring and/or be acquired from or maintained in external structures such as membrane-bound extracellular vesicles. These structures have been observed in vivo in physical association with spermatozoa (18,21,23,24). Unlike cytoplasmic droplets which are osmotically sensitive structures formed by maturing spermatids as they expel their cytoplasm and often lost during routine sperm processing (48), Nucleic Acids Research, 2015, Vol. 43, No. 14 6851 Figure 2. Cs/TX fractionated sperm heads lack peripheral membranes. (A and B) Sonication separates the tail from the sperm head while preserving the outer membrane structures. Fragments of the sperm tail (PP) and mitochondrial sheath (MS) are observed following sonication whereas potential somatic contaminants are not. (C and D) Fractionation through a sucrose–cesium chloride gradient in the presence of Triton X-100 enriches a population of sperm heads free from tail and mitochondrial remnants. The Cs/TX sperm nucleus remains surrounded by the perinuclear theca (PT) as evidenced by the dorsal tip (DT) and ventral spur (VS) structures but lacks the peripheral membranes observed in the sonicated sample. Dashed boxes in (A and C) (Magniication × 12 000; scale bar = 500 nm) are presented at higher magniication in (B and D) (Magniication × 50 000; scale bar = 100 nm). Equatorial segment (ES); inner acrosomal membrane (IAM); mitochondrial sheath (MS); nucleus (N); outer acrosomal membrane (OAM); perinuclear theca (PT) postacrosomal sheath of the PT (PAS-PT), principle piece (PP) of the sperm tail; subacrosomal layer of the PT (SAL-PT). sperm-bound exosomes fuse with the gamete (22). Further support that the elevated levels of ribosomal transcripts in the SS cells relative to that observed in the Cs/TX heads may result from the packaging of these sequences in extracellular vesicles is the recent observation that this class of RNA is the most abundant observed in exosomes retrieved from human semen (20). Greater than 17% of the uniquely aligned reads in the Cs/TX RNA-seq libraries corresponded to the ribosomal transcripts. Considering that rRNAs are not commonly observed in somatic nuclear RNA (17) and in the absence of evidence for their translocation into the nucleus these transcripts are presumed to be intimately associated with but likely outside of the highly condensed nucleus. In this manner these fragmented transcripts might persist within partially assembled ribosomes as has been observed in cytoplasmic droplets (46). The abundance of the rRNAs and their associated proteins would suggest that they are not 6852 Nucleic Acids Research, 2015, Vol. 43, No. 14 Figure 3. The majority of the RNA retained in sperm is localized to the sperm periphery. RNAs were quantiied after recovery from equal numbers of sperm heads from the same animal (n = 9) following sonication and Cs/TX fractionation. The SS cells contained signiicantly more RNA than matched Cs/TX heads (P = 0.007, two-tailed paired t-test). After accounting for the presence of the intra-nuclear compartment within each SS sperm cell ∼53.47 fg of RNA is associated within the peripheral structures of the extra-nuclear compartment. Center lines show the medians (87.03 and 33.56 fg of RNA per sperm head, SS and Cs/TX, respectively); box limits indicate the 25th and 75th percentiles as determined by R software; whiskers extend 1.5 times the interquartile range from the 25th and 75th percentiles, outliers are represented by dots and represent matched SS and Cs/TX samples from the same animal. completely expelled during the inal stages of spermiogenesis. This may lead to the sequestration of the translational machinery within the perinuclear theca as has been observed in proteomic analysis of detergent-treated sperm nuclei (49). As these proteins are not known to contribute to this cytoskeletal nucleus-acrosomal interface they may be passively retained along with bound rRNAs through prior association. Whether this mechanism resolves other RNA associations within the sperm nucleus (15) remains to be established. ERCC control RNAs. To determine a lower limit of detection synthetic ERCC (External RNA Controls Consortium) RNAs were added in equal amounts to all samples prior to cDNA synthesis. Coverage of the control transcripts was highly correlated across all samples (r2 > 0.90; Supplementary Figure S1) while their absolute levels varied between fractionation methods. Of all uniquely aligning sequencing reads approximately, 25% Cs/TX and 16% SS sperm head reads corresponded to the ERCC transcripts in the RNA-seq libraries, respectively. The increased representation of these synthetic sequences in the Cs/TX sequencing libraries was likely a consequence of the reduced availability of biological and or accessible RNA templates following the loss of the outer sperm membranes. Supporting this observation the Cs/TX heads exhibited a 10% increase in the percentage of reads aligned to the nuclear genome with a concomitant decrease in coverage of annotated regions relative to the SS heads (Figure 4B). This is reminiscent of the observation that the use of detergents to remove somatic cell contaminants from human sperm negatively impacts transcript levels compared to samples puriied by other means (16). Representation of intronic sequences were equivalently suppressed in all libraries (Figure 4B) relecting the cessa- Figure 4. Sequencing coverage and enrichment in annotated regions vary between the SS and Cs/TX sperm heads. (A) RNAs extracted from SS or Cs/TX samples were subjected to RNA-seq. Following alignment the percentage of uniquely aligned sequencing reads attributed to speciic classes of sequences was determined. The largest contribution of alignments in both the Cs/TX and SS RNA-seq libraries corresponded to the nuclear genome in a treatment dependent manner. Coverage of the rRNAs and the ERCC control RNAs in libraries also varied by preparation. The reduced sequencing coverage of the mitochondrial genome in the Cs/TX sperm heads conirmed the eficacy of the sample fractionation. (B) SS cells exhibited a greater enrichment of uniquely aligned sequencing reads in exonic regions (coding and non-coding) relative to that observed for the Cs/TX samples. Fold enrichment values for uniquely aligned sequencing reads within exons, introns and intragenic regions are presented for SS and Cs/TX RNA-seq libraries. Enrichment was determined by dividing the percentage of genomic bases in an element class by the percentage of unique sequencing reads aligned to that class. tion of nascent RNA production prior to nuclear condensation. Identiication of putative SS and Cs/TX enriched transcripts Mature spermatozoa possess a unique morphology and a comparatively small quantity of RNA. These limitations required that sonicated sperm containing nuclei surrounded by perinuclear theca be compared to the demembranated Cs/TX heads rather than independently sampling compartments as can be accomplished for somatic cells (17). By mass, ∼39% of the RNA within a sonicated sperm cell can be attributed to the structures contained within a Cs/TX head (perinuclear theca and nucleus; Figures 1 and Nucleic Acids Research, 2015, Vol. 43, No. 14 6853 2). Therefore transcripts enriched within the more external regions of the spermatozoon will be diluted by RNAs present within the mitochondrial sheath, the perinuclear theca and nucleus, reducing their representation when resolved by RNA-seq. Consequently, the sampling methods employed herein are expected to under- and over-report putative externally (SS) and internally (Cs/TX) localized transcripts, respectively. In addition, the limited data suggested an alternative to standard differential expression analysis pipelines was necessary to identify patterns of RNA localization within the mature spermatozoon. Accordingly, GFold was used to rank transcripts by the posterior distribution of their normalized expression values and candidate transcripts putatively enriched within either the SS or Cs/TX heads were selected for downstream RT-PCR analysis. The GFOLD analysis identiied a total of 308 transcripts that were preferentially enriched in the Cs/TX RNA-seq libraries. Together this set of RNAs included 184 annotated and 116 novel transcripts as well as all ERCC control transcripts exceeding the minimum threshold of detection (Supplementary Dataset 2). Ontological analysis of the annotated RNAs enriched in these samples identiied terms related to spermatogenesis and cytoskeleton (Supplementary Dataset 3). Within this set of RNAs, the Prm1 transcript, as well as Erich2 and Fam71e2 (formerly 4933404M02Rik and 4930401F20Rik, respectively), were previously identiied within the mouse sperm nucleus by RT-PCR though their preferential localization within the gamete could not be inferred from that study (15). As expected fewer transcripts were predicted to be enriched in the SS RNA-seq libraries relative to the Cs/TX heads (n = 152; Supplementary Dataset 4). This set of putatively enriched sonicated sperm RNAs included many nuclear-encoded mitochondrial protein mRNAs (n = 20; P < 1.9 × 10−20 ; Figure 5B). These RNAs comprised 17% of the transcripts enriched in the SS RNA-seq libraries and only 2% of the Cs/TX enriched RNAs (n = 4; 40). This association was supported by ontological analysis of the predicted SS RNAs which revealed terms related to cytochrome-c oxidase activity and the mitochondrial membrane (Supplementary Dataset 5). Together this likely represents the evolutionarily conserved shuttling of cytosolic ribosomes bound with nuclearencoded mitochondrial mRNAs to the surface of the mitochondria prior to condensation (50–52). These transcripts are expected to be lost along with the mitochondrial sheath and proximal structures during fractionation of the Cs/TX heads. Additional ontological categories were also signiicantly associated with the RNAs enriched in the SS sample. These included terms related to exosomes (q < 1.05 × 10−9 ; Supplementary Dataset 5). The enrichment of exosomalassociated RNAs in the SS RNA-seq libraries supports prior observations that the outer sperm membranes are associated with extracellular vesicles. This set of transcripts includes guanine nucleotide binding protein (G protein), gamma 5 (Gng5; Figure 5A) for which both the RNA and protein products have been observed in exosomes recovered from multiple tissues in human and mouse (53–55). Interestingly, of the 42 SS enriched transcripts associated with the exosome ontological category, only three were present above background in both Cs/TX samples. This set of SS RNAs exhibited a median fold increase of 5.9 (range = 2.6 – 192.9) relative to the Cs/TX samples demonstrating that these transcripts are externally localized relative to the perinuclear theca and nucleus and therefore may be delivered to the gamete by exosomes and incorporated during epididymal maturation or within the vas deferens. Further, >45% of all mouse homologs enriched in the SS RNA-seq library were also differentially expressed in human ejaculate exosomes relative to sperm collected from the same semen samples (P < 6.4 × 10−19 , hypergeometric probability test; Supplementary Figure S2). By comparison only 8% of the RNAs predicted to be enriched in the Cs/TX libraries were signiicantly enriched in the human exosome dataset (P > 0.8). The depletion of these transcripts following exposure to Triton-X100 relects the loss of external sperm membranes that serve as sites of exosome attachment and likely harbor RNAs some of which may possess an origin other than sperm (19). To some extent this resolves the exosomal RNA concentration abundance question (56). To test the proposed link between the detection of exosomal RNAs and the presence of external sperm membranes GFold was used to identify transcripts predicted to be differentially enriched in either the SS sample or in a previously published mouse sperm RNA-seq dataset prepared following detergent incubation (4). Differentially expressed human ejaculate exosome homologs were present in the set of SS enriched RNAs at levels signiicantly exceeding that expected by chance (P < 1.2 × 10−26 , hypergeometric probability test). This relationship was not observed in the detergent treated sperm (P = 0.99). Similarly the human exosome homologs enriched in the SS sample were present at elevated levels (P < 2.2 × 10−16 , Mann–Whitney U-test; Supplementary Figure S3) and exhibited a stronger ontological enrichment in exosome associated terms than their counterparts enriched in the detergent treated sample (q < 3.83 × 10−49 and q < 3.84 × 10−16 , respectively). Analogous to the Cs/TX sperm head fractionation used in the current study, detergent treatment (4) would have been expected to produce a sperm nucleus surrounded by the perinuclear theca at the expense of external membranes. Since these external structures harbor the majority of spermatozoal RNA in addition to serving as the site of exosome attachment their loss would be expected. Several of the SS transcripts included well known somatic nuclear RNAs such as Small Nucleolar RNAs, H/ACA Box (Snora23, Snora52, Snora81), Small Cajal Body-Speciic RNA 13 (Scarna13) and the long non-coding RNA (lncRNA) metastasis associated lung adenocarcinoma transcript 1 (Malat1; Figure 5C). The persistence of an elevated level of Malat1 in all sperm samples and its extra-nuclear enrichment can be ascribed to the presence of a triple helix structure at its 3′ end which stabilizes the transcript (57). Association of this nuclear RNA with more external regions of the spermatozoon suggests Malat1, Snora RNAs, Scarna13 and potentially other ncRNAs are expelled along with the nucleoplasm during condensation. Bereft of a nucleous or Cajal bodies these transcripts appear as if they may not be essential to the mature male gamete or the next generation (15). However, the retention of a minor pool of nuclear-associated Malat1 tran- 6854 Nucleic Acids Research, 2015, Vol. 43, No. 14 Figure 5. RNAs representative of speciic cellular functions and organelles are enriched within the peripheral sperm membranes. Coverage of uniquely aligned sequencing reads are presented for three transcripts on the UCSC genome browser. Reads per million uniquely aligned reads (RPM) are presented on the Y-axis for each sample with the maximum value corresponding to that observed in the sonicated sample (SS) for each RNA panel. The direction of transcription is depicted by arrows. (A) Cox6x is nuclear-encoded mitochondrial transcript. (B) The Gng5 RNA and corresponding membrane-associated G protein product are enriched in exosomes from several tissues and species. (C) Malat1 is a conserved long non-coding RNA that is enriched at active loci and within paraspeckles where it recruits splicing factors. scripts (Figure 5C) may result from the direct interaction of these lncRNAs with regions of the sperm genome which remain in a ‘poised’ chromatin coniguration (58–61) and clariication is being actively pursued. A series of oligonucleotide primers were designed to validate patterns of preferential transcript compartmentalization in sperm by RT-PCR. RNAs from individual mice (n = 6) were extracted from equal numbers of SS or Cs/TX sperm heads and reverse transcribed with random primers. The addition of ERCC RNAs to the irst-strand synthesis reaction served as a loading and synthesis control (Figure 6A). Although the fold-changes estimated from the RNAseq datasets and delta-Ct values were well correlated (r2 = 0.86; Figure 6B) all transcripts queried were found to be enriched in the sonicated sperm samples suggesting that globally, RNAs are peripherally localized in the mature male gamete (Figure 2) consistent with mass distribution. This included the irst and third most enriched protein-coding transcripts in the Cs/TX RNA-seq libraries, Ankyrin repeat and MYND domain containing 1 (Ankmy1) and ecotropic viral integration site 5-like (Evi5l; Supplementary Dataset 2). However, the RNAs predicted by GFold to be enriched in the Cs/TX heads exhibited signiicantly reduced deltaCt values compared to those observed for the set of transcripts predicted to be enriched in the sonicated sperm (P < 0.004, Mann–Whitney U-test; Figure 6C). These results suggest that while the majority of RNA within the mature spermatozoon can be localized to the periphery of the cell their relative proportions vary between transcripts as some are expected to be differentially retained within the intranuclear compartment. On one hand, the relative depletion of Malat1 and Snora81 RNAs (Table 1) from the Cs/TX samples predicts that in the male gamete these transcripts are unlikely to contribute to nuclear organization in a manner reminiscent of that observed in somatic cells. On the other hand, some Malat1 remains within the Cs/TX heads and may be suficient to contribute to the packaging of the limited number of histone-associated promoters present in Nucleic Acids Research, 2015, Vol. 43, No. 14 6855 Figure 6. RT-PCR validation of peripheral sperm RNA enrichment. (A) To validate patterns of transcript enrichment predicted by RNA-seq RNAs from individual mice (n = 6) were extracted from equal numbers of sonicated or Cs/TX sperm heads and subjected to RT-PCR analysis. The transcript encoding lysosome-associated membrane glycoprotein 2 (Lamp2) is enriched in sonicated sperm indicating the preferential localization of this RNA within the outer sperm membranes. ERCC spike-in RNAs served as a loading control an exhibited no preferential enrichment. (B) Estimated RNA-seq fold-changes and delta-Ct values were well correlated (r2 = 0.86). Transcripts exceeding the RPKM threshold (>116 RPKM; Supplementary Figure S1) and predicted by RNA-seq to be enriched in the extra- or intra-nuclear fractions are presented as solid circles or triangles, respectively. Lamp2 and NADH dehydrogenase [ubiquinone] 1 subunit C1 (Ndufc1) partitioned to the extra-nuclear fraction but did not exceed the threshold (∼85 RPKM in SS) and are presented as open circles. (C) Transcripts predicted by RNA-seq to be enriched in the SS samples (n = 6) exhibited signiicantly greater delta-Ct values (P = 0.004, Mann–Whitney U-test) than transcripts predicted to be enriched in the Cs/TX heads (n = 5). Center lines show the median delta-Ct values of the SS- and Cs/TX-predictions from RNA-seq (9.23 and 3.72, respectively); box limits indicate the 25th and 75th percentiles as determined by R software; whiskers extend 1.5 times the interquartile range from the 25th and 75th percentiles, outliers are represented by dots. sperm chromatin (60,61). Alternatively, other RNAs such as the repetitive RNAs may participate in packaging the paternal genome (Supplementary Tables SI, SII). Repetitive RNAs in mouse sperm Retained repetitive RNAs are of particular interest as they may contribute to nuclear structure as has been observed in somatic cells or may participate in genome confrontation and consolidation observed in plants and animals whereby parental genome compatibility is ensured (5,62). To better understand the role and abundance of these RNAs, the coverage of all individual repetitive elements was determined initially using uniquely aligned sequencing reads (Supplementary Tables SI, SII). This was complimented by realigning multiply mapped and unmapped sequencing reads to the RepBase repeat annotations. These alignments were combined with the uniquely aligned reads to determine the relative abundance of each repeat family within the sperm RNAs (Supplementary Dataset 6). The Cs/TX libraries were enriched in repeat sequences such as satellite, simple and low complexity repeats (Supplementary Table SI). The presence of satellite RNAs may relect opportunistic transcription as heterochromatic structures relax prior to protamine deposition. A series of com- Table 1. Sperm RNAs are enriched in the extra-nuclear compartment 6856 Nucleic Acids Research, 2015, Vol. 43, No. 14 Figure 7. Detection of sperm dervied RNAs in fertilized oocytes prior to zygotic genome activation. Cabyr is observed in sperm (blue; Supplementary Figure S4) absent from oocytes and present in pronuclei embryos (black and red respectively; 37) following natural mating. Coverage for each sample is presented as reads per million uniquely aligned reads (RPM) on the Y-axis. The direction of transcription is depicted under the gene model. plimentary sequences were consistently enriched in the Cs/TX sperm head RNA-seq libraries relative to sonicated sperm (Supplementary Dataset 6). The representation of these sequences within the Cs/TX samples was well above their genomic background resolving as a pattern of poly-purine or -pyrimidine RNA enrichment in these samples (Supplementary Table SII). Interestingly, GAA repeatcontaining RNAs (GRC-RNAs) have been observed previously in the several somatic cell types wherein they contributed to chromatin structure by directly binding nuclear matrix proteins (63). These chromatin-associated RNAs form discrete DNase insensitive foci that are lost following RNase treatment. Conirmation that the nuclear-retained simple repeat transcripts identiied in this study are polypurine in nature would suggest that they may correspond to the population of nuclear matrix associated RNAs previously observed in sperm (11). Though assigning a similar nuclear function and even localization to the simple repeat sperm RNAs is an intriguing possibility, their lack of sequence complexity precludes RT-PCR validation and precise mapping. Reminiscent of the results obtained for annotated and novel RNAs long interspersed nuclear elements (LINEs) exhibited greater RNA-seq coverage in the Cs/TX samples relative to sonicated sperm yet RT-PCR suggested that they were approximately eighty times more enriched in the SS samples (data not shown). Alignment to the LINE-1 (L1) repeat class on average contributed to more than 5% of all unique reads in the Cs/TX samples. Realignment to all canonical repeat sequences showed that ∼25% of all uniquely aligned reads corresponded to the L1 family in the Cs/TX samples (Supplementary Table SI). This was nearly twice the amount observed for the next most abundant family. To determine which of the L1 repeats was most actively transcribed during the inal stages of spermatogenesis the RNA-seq libraries were realigned to canonical sequences for each active subfamily (42). In all samples the most abundant L1 transcripts corresponded to subfamily III of the L1MdTf lineage. The potential paternal contribution of L1 RNAs to the zygote, thought to perhaps spur autoregulated embryonic transcription of such elements, has been discussed (5,64). LINE transcripts have also recently been shown to directly contribute to somatic chromatin structure in an RNase sensitive manner (65). However, this as- sociation is not observed in the condensed somatic heterochromatin. Perhaps during the later stages of spermatogenesis the LINE transcripts are depleted either passively or actively from the sperm nucleus contributing to sperm chromatin condensation as is observed following RNase treatment in interphase cells (65,66). In the present study this would resolve as and be consistent with their observed extra-nuclear enrichment. Identiication of sperm-borne RNAs post-fertilization The delivery of RNAs by human sperm to an oocyte has been shown (1,67). To ascertain whether mouse spermatozoa may potentially fulill a similar function murine oocyte and pronuclei embryos sampled prior to zygotic genome activation were subjected to differential expression analysis (37). A total of 88 differentially enriched embryonic transcripts were identiied which exhibited no coverage in the oocyte but were detected in all three sperm RNAseq datasets in the current study. A previous mouse sperm RNA-seq study proposed that the Wnt4 and Foxg1 transcripts may be delivered by sperm to the oocyte (68). These RNAs were not detected in the embryonic datasets nor were they consistently present in sperm RNA-seq studies from others (Supplementary Figure S4B). Though the majority of the differentially enriched mouse embryo RNAs identiied above were present at low levels in sperm (median RPKM ∼1.4) the transcript encoding the Calcium-binding tyrosine phosphorylation-regulated protein (Cabyr; Figure 7 and Supplementary Figure S4A) was consistently detected at elevated levels in both SS and Cs/TX samples and was observed in all available mouse sperm RNA-seq datasets (3,4,45). Interestingly, this transcript was enriched in the Cs/Tx heads by RNA-seq (Supplementary Dataset 2) suggesting a reduced peripheral enrichment similar to that observed for the set of predicted Cs/TX RNAs evaluated by RT-PCR (Figure 6 and Table ). These transcripts likely persist within either the perinuclear theca or the nucleus and would therefore be expected to reach the oocyte cytoplasm following fertilization (69). It is not clear whether paternal Cabyr RNA would be of functional importance considering the established role of its protein product in modulating intracellular calcium levels during capacitation (70), although one could speculate a signaling function as part of Nucleic Acids Research, 2015, Vol. 43, No. 14 6857 the ion lux upon fertilization. Irrespective, in human males, it is considered of diagnostic value for idiopathic male factor infertility (71). Nevertheless, detection of a testis-speciic RNAs encoding a protein restricted to the post-meiotic male germline in naturally fertilized embryos prior to zygotic genome activation supports the view that sperm RNA persists throughout the normal lifecycle of the paternal gamete and are delivered to the oocyte. The distribution of RNAs within the spermatozoon The mature spermatozoon is host to a cadre of RNAs that are not evenly distributed throughout the limited volume of the sperm head. By mass ∼66% of the RNAs greater than ∼200 nt in length were enriched within the limited volume of the extra-nuclear compartment (plasma membrane, the acrosome and associated membranes as well as the sperm tail and mitochondrial sheath) of the sperm cell. Approximately one-third of the RNAs observed in sperm are expected to be within the nucleus/perinuclear theca. As determined by RT-PCR the strength of their preferential compartmentalization varied in accord with the RNA-seq results. Together these results provide a list of candidate extra- and intra-nuclear associated transcripts which serve as the foundation for future studies delineating their function within the mature sperm. Intriguingly, the set of RNAs exhibiting the strongest external localization included many transcripts linked to exosomes and transcripts commonly localized to speciic organelles in somatic cells including the nucleus and mitochondria. Whether these transcripts are passively retained within the external sperm membranes following expulsion of the cytoplasm or are packaged within extracellular vesicles to serve a later function will be resolved when the contents of sperm-bound exosomes are fully described. The peripheral enrichment of non-coding RNAs such as Malat1 and the LINE transcripts which are known to modulate somatic cell chromatin structure suggests that nuclear remodeling in the sperm is not limited to nucleoprotein exchange. The fraction of these RNAs that remain associated with the intra-nuclear compartment may be restricted to the few poised histone-bound accessible regions that persist following protamination. Representative of the population of transcripts is testis-speciic RNA Cabyr. This RNA is likely delivered by sperm to the embryo and is a diagnostic element indicative of idiopathic male infertility (71). ACCESSION NUMBER Raw sequencing reads and processed data are deposited in the Gene Expression Omnibus repository (GEO Accession Number: GSE62874). SUPPLEMENTARY DATA Supplementary Data are available at NAR Online. FUNDING Charlotte B. Failing Professorship (to S.A.K.; in part). Funding for open access charge: Charlotte B. Failing Professorship (to S.A.K.; in part). Conlict of interest statement. None declared. REFERENCES 1. Ostermeier,G.C., Miller,D., Huntriss,J.D., Diamond,M.P. and Krawetz,S.A. (2004) Reproductive biology: delivering spermatozoan RNA to the oocyte. Nature, 429, 154. 2. Sendler,E., Johnson,G.D., Mao,S., Goodrich,R.J., Diamond,M.P., Hauser,R. and Krawetz,S.A. (2013) Stability, delivery and functions of human sperm RNAs at fertilization. Nucleic Acids Res., 41, 4104–4117. 3. Soumillon,M., Necsulea,A., Weier,M., Brawand,D., Zhang,X., Gu,H., Barthes,P., Kokkinaki,M., Nef,S., Gnirke,A. et al. (2013) Cellular source and mechanisms of high transcriptome complexity in the mammalian testis. Cell Rep., 3, 2179–2190. 4. Hammoud,S.S., Low,D.H., Yi,C., Carrell,D.T., Guccione,E. and Cairns,B.R. (2014) Chromatin and transcription transitions of Mammalian adult germline stem cells and spermatogenesis. Cell Stem Cell, 15, 239–253. 5. Jodar,M., Selvaraju,S., Sendler,E., Diamond,M.P., Krawetz,S.A. and Reproductive Medicine,N. (2013) The presence, role and clinical use of spermatozoal RNAs. Hum. Reprod. Update, 19, 604–624. 6. Liu,W.M., Pang,R.T., Chiu,P.C., Wong,B.P., Lao,K., Lee,K.F. and Yeung,W.S. (2012) Sperm-borne microRNA-34c is required for the irst cleavage division in mouse. Proc. Natl. Acad. Sci. U.S.A., 109, 490–494. 7. Gapp,K., Jawaid,A., Sarkies,P., Bohacek,J., Pelczar,P., Prados,J., Farinelli,L., Miska,E. and Mansuy,I.M. (2014) Implication of sperm RNAs in transgenerational inheritance of the effects of early trauma in mice. Nat. Neurosci., 17, 667–669. 8. Kleene,K.C. (1989) Poly(A) shortening accompanies the activation of translation of ive mRNAs during spermiogenesis in the mouse. Development, 106, 367–373. 9. Kwon,Y.K. and Hecht,N.B. (1993) Binding of a phosphoprotein to the 3′ untranslated region of the mouse protamine 2 mRNA temporally represses its translation. Mol. Cell. Biol., 13, 6547–6557. 10. Miller,D., Ostermeier,G.C. and Krawetz,S.A. (2005) The controversy, potential and roles of spermatozoal RNA. Trends Mol. Med., 11, 156–163. 11. Lalancette,C., Miller,D., Li,Y. and Krawetz,S.A. (2008) Paternal contributions: new functional insights for spermatozoal RNA. J. Cell. Biochem., 104, 1570–1579. 12. Sarrate,Z. and Anton,E. (2009) Fluorescence in situ hybridization (FISH) protocol in human sperm. J. Vis. Exp., 31, 1–2. 13. Modi,D., Shah,C., Sachdeva,G., Gadkar,S., Bhartiya,D. and Puri,C. (2005) Ontogeny and cellular localization of SRY transcripts in the human testes and its detection in spermatozoa. Reproduction, 130, 603–613. 14. Johnson,G.D., Lalancette,C., Linnemann,A.K., Leduc,F., Boissonneault,G. and Krawetz,S.A. (2011) The sperm nucleus: chromatin, RNA, and the nuclear matrix. Reproduction, 141, 21–36. 15. Yan,W., Morozumi,K., Zhang,J., Ro,S., Park,C. and Yanagimachi,R. (2008) Birth of mice after intracytoplasmic injection of single puriied sperm nuclei and detection of messenger RNAs and MicroRNAs in the sperm nuclei. Biol. Reprod., 78, 896–902. 16. Mao,S., Goodrich,R.J., Hauser,R., Schrader,S.M., Chen,Z. and Krawetz,S.A. (2013) Evaluation of the effectiveness of semen storage and sperm puriication methods for spermatozoa transcript proiling. Syst. Biol. Reprod. Med., 59, 287–295. 17. Zaghlool,A., Ameur,A., Nyberg,L., Halvardson,J., Grabherr,M., Cavelier,L. and Feuk,L. (2013) Eficient cellular fractionation improves RNA sequencing analysis of mature and nascent transcripts from human tissues. BMC Biotechnol., 13, 99, 1–10. 18. Sullivan,R. and Saez,F. (2013) Epididymosomes, prostasomes, and liposomes: their roles in mammalian male reproductive physiology. Reproduction, 146, R21–R35. 19. Cossetti,C., Lugini,L., Astrologo,L., Saggio,I., Fais,S. and Spadafora,C. (2014) Soma-to-germline transmission of RNA in mice xenografted with human tumour cells: possible transport by exosomes. PLoS One, 9, e101629. 20. Vojtech,L., Woo,S., Hughes,S., Levy,C., Ballweber,L., Sauteraud,R.P., Strobl,J., Westerberg,K., Gottardo,R., Tewari,M. et al. (2014) Exosomes in human semen carry a distinctive repertoire 6858 Nucleic Acids Research, 2015, Vol. 43, No. 14 21. 22. 23. 24. 25. 26. 27. 28. 29. 30. 31. 32. 33. 34. 35. 36. 37. 38. 39. 40. 41. 42. of small non-coding RNAs with potential regulatory functions. Nucleic Acids Res., 42, 7290–7304. Ronquist,G., Nilsson,B.O. and Hjerten,S. (1990) Interaction between prostasomes and spermatozoa from human semen. Arch. Androl., 24, 147–157. Arienti,G., Carlini,E. and Palmerini,C.A. (1997) Fusion of human sperm to prostasomes at acidic pH. J. Membr. Biol., 155, 89–94. Frenette,G. and Sullivan,R. (2001) Prostasome-like particles are involved in the transfer of P25b from the bovine epididymal luid to the sperm surface. Mol. Reprod. Dev., 59, 115–121. Rejraji,H., Vernet,P. and Drevet,J.R. (2002) GPX5 is present in the mouse caput and cauda epididymidis lumen at three different locations. Mol. Reprod. Dev., 63, 96–103. Ward,W.S. (2013) Isolation of sperm nuclei and nuclear matrices from the mouse, and other rodents. Methods Mol. Biol., 927, 437–444. Goodrich,R., Johnson,G. and Krawetz,S.A. (2007) The preparation of human spermatozoal RNA for clinical analysis. Arch. Androl., 53, 161–167. Goodrich,R.J., Anton,E. and Krawetz,S.A. (2013) Isolating mRNA and small noncoding RNAs from human sperm. Methods Mol. Biol., 927, 385–396. Johnson,G.D., Platts,A.E., Lalancette,C., Goodrich,R., Heng,H.H. and Krawetz,S.A. (2011) Interrogating the transgenic genome: development of an interspecies tiling array. Syst. Biol. Reprod. Med., 57, 54–62. Mao,S., Sendler,E., Goodrich,R.J., Hauser,R. and Krawetz,S.A. (2014) A comparison of sperm RNA-seq methods. Syst. Biol. Reprod. Med., 60, 308–315. Kim,D., Pertea,G., Trapnell,C., Pimentel,H., Kelley,R. and Salzberg,S.L. (2013) TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol., 14, R36, 1–13. Roberts,A., Pimentel,H., Trapnell,C. and Pachter,L. (2011) Identiication of novel transcripts in annotated genomes using RNA-Seq. Bioinformatics, 27, 2325–2329. Li,H., Handsaker,B., Wysoker,A., Fennell,T., Ruan,J., Homer,N., Marth,G., Abecasis,G., Durbin,R. and Genome Project Data Processing,S. (2009) The sequence alignment/map format and SAMtools. Bioinformatics, 25, 2078–2079. Quinlan,A.R. and Hall,I.M. (2010) BEDTools: a lexible suite of utilities for comparing genomic features. Bioinformatics, 26, 841–842. Feng,J., Meyer,C.A., Wang,Q., Liu,J.S., Shirley Liu,X. and Zhang,Y. (2012) GFOLD: a generalized fold change for ranking differentially expressed genes from RNA-seq data. Bioinformatics, 28, 2782–2788. Anders,S., McCarthy,D.J., Chen,Y., Okoniewski,M., Smyth,G.K., Huber,W. and Robinson,M.D. (2013) Count-based differential expression analysis of RNA sequencing data using R and Bioconductor. Nat. Potoc., 8, 1765–1786. Anders,S., Pyl,P.T. and Huber,W. (2014) HTSeq–– a Python framework to work with high-throughput sequencing data. Bioinformatics, 2, 1–4. Xue,Z., Huang,K., Cai,C., Cai,L., Jiang,C.Y., Feng,Y., Liu,Z., Zeng,Q., Cheng,L., Sun,Y.E. et al. (2013) Genetic programs in human and mouse early embryos revealed by single-cell RNA sequencing. Nature, 500, 593–597. Love,M.I., Huber,W. and Anders,S. (2014) Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol.,15, 550, 1–21. Eden,E., Navon,R., Steinfeld,I., Lipson,D. and Yakhini,Z. (2009) GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists. BMC Bioinformatics, 10, 48, 1–7. Pagliarini,D.J., Calvo,S.E., Chang,B., Sheth,S.A., Vafai,S.B., Ong,S.E., Walford,G.A., Sugiana,C., Boneh,A., Chen,W.K. et al. (2008) A mitochondrial protein compendium elucidates complex I disease biology. Cell, 134, 112–123. Jurka,J., Kapitonov,V.V., Pavlicek,A., Klonowski,P., Kohany,O. and Walichiewicz,J. (2005) Repbase Update, a database of eukaryotic repetitive elements. Cytogenet. Genome Res., 110, 462–467. Sookdeo,A., Hepp,C.M., McClure,M.A. and Boissinot,S. (2013) Revisiting the evolution of mouse LINE-1 in the genomic era. Mob. DNA, 4, 3, 1–15. 43. Ferrer,M., Xu,W. and Oko,R. (2012) The composition, protein genesis and signiicance of the inner acrosomal membrane of eutherian sperm. Cell Tissue Res., 349, 733–748. 44. Korley,R., Pouresmaeili,F. and Oko,R. (1997) Analysis of the protein composition of the mouse sperm perinuclear theca and characterization of its major protein constituent. Biol. Reprod., 57, 1426–1432. 45. Kobayashi,H., Sakurai,T., Imai,M., Takahashi,N., Fukuda,A., Yayoi,O., Sato,S., Nakabayashi,K., Hata,K., Sotomaru,Y. et al. (2012) Contribution of intragenic DNA methylation in mouse gametic DNA methylomes to establish oocyte-speciic heritable marks. PLoS Genet., 8, e1002440. 46. Cappallo-Obermann,H., Schulze,W., Jastrow,H., Baukloh,V. and Spiess,A.N. (2011) Highly puriied spermatozoal RNA obtained by a novel method indicates an unusual 28S/18S rRNA ratio and suggests impaired ribosome assembly. Mol. Hum. Reprod., 17, 669–678. 47. Johnson,G.D., Sendler,E., Lalancette,C., Hauser,R., Diamond,M.P. and Krawetz,S.A. (2011) Cleavage of rRNA ensures translational cessation in sperm at fertilization. Mol. Hum. Reprod., 17, 721–726. 48. Cooper,T.G., Yeung,C.H., Fetic,S., Sobhani,A. and Nieschlag,E. (2004) Cytoplasmic droplets are normal structures of human sperm but are not well preserved by routine procedures for assessing sperm morphology. Hum. Reprod., 19, 2283–2288. 49. de Mateo,S., Castillo,J., Estanyol,J.M., Ballesca,J.L. and Oliva,R. (2011) Proteomic characterization of the human sperm nucleus. Proteomics, 11, 2714–2726. 50. Kellems,R.E., Allison,V.F. and Butow,R.A. (1974) Cytoplasmic type 80 S ribosomes associated with yeast mitochondria. II. Evidence for the association of cytoplasmic ribosomes with the outer mitochondrial membrane in situ. J. Biol. Chem., 249, 3297–3303. 51. Sylvestre,J., Margeot,A., Jacq,C., Dujardin,G. and Corral-Debrinski,M. (2003) The role of the 3′ untranslated region in mRNA sorting to the vicinity of mitochondria is conserved from yeast to human cells. Mol. Biol. Cell, 14, 3848–3856. 52. Weis,B.L., Schleiff,E. and Zerges,W. (2013) Protein targeting to subcellular organelles via MRNA localization. Biochim. Biophys. Acta, 1833, 260–273. 53. Buschow,S.I., van Balkom,B.W., Aalberts,M., Heck,A.J., Wauben,M. and Stoorvogel,W. (2010) MHC class II-associated proteins in B-cell exosomes and potential functional implications for exosome biogenesis. Immunol. Cell Biol., 88, 851–856. 54. Gonzales,P.A., Pisitkun,T., Hoffert,J.D., Tchapyjnikov,D., Star,R.A., Kleta,R., Wang,N.S. and Knepper,M.A. (2009) Large-scale proteomics and phosphoproteomics of urinary exosomes. J. Am. Soc. Nephrol., 20, 363–379. 55. Valadi,H., Ekstrom,K., Bossios,A., Sjostrand,M., Lee,J.J. and Lotvall,J.O. (2007) Exosome-mediated transfer of mRNAs and microRNAs is a novel mechanism of genetic exchange between cells. Nat. Cell Biol., 9, 654–659. 56. Chevillet,J.R., Kang,Q., Ruf,I.K., Briggs,H.A., Vojtech,L.N., Hughes,S.M., Cheng,H.H., Arroyo,J.D., Meredith,E.K., Gallichotte,E.N. et al. (2014) Quantitative and stoichiometric analysis of the microRNA content of exosomes. Proc. Natl. Acad. Sci. U.S.A., 111, 14888–14893. 57. Brown,J.A., Valenstein,M.L., Yario,T.A., Tycowski,K.T. and Steitz,J.A. (2012) Formation of triple-helical structures by the 3′ -end sequences of MALAT1 and MENbeta noncoding RNAs. Proc. Natl. Acad. Sci. U.S.A., 109, 19202–19207. 58. Arpanahi,A., Brinkworth,M., Iles,D., Krawetz,S.A., Paradowska,A., Platts,A.E., Saida,M., Steger,K., Tedder,P. and Miller,D. (2009) Endonuclease-sensitive regions of human spermatozoal chromatin are highly enriched in promoter and CTCF binding sequences. Genome Res., 19, 1338–1349. 59. Hammoud,S.S., Nix,D.A., Zhang,H., Purwar,J., Carrell,D.T. and Cairns,B.R. (2009) Distinctive chromatin in human sperm packages genes for embryo development. Nature, 460, 473–478. 60. West,J.A., Davis,C.P., Sunwoo,H., Simon,M.D., Sadreyev,R.I., Wang,P.I., Tolstorukov,M.Y. and Kingston,R.E. (2014) The long noncoding RNAs NEAT1 and MALAT1 bind active chromatin sites. Mol. Cell, 55, 791–802. 61. Carone,B.R., Hung,J.H., Hainer,S.J., Chou,M.T., Carone,D.M., Weng,Z., Fazzio,T.G. and Rando,O.J. (2014) High-resolution mapping of chromatin packaging in mouse embryonic stem cells and sperm. Dev. Cell, 30, 11–22. Nucleic Acids Research, 2015, Vol. 43, No. 14 6859 62. Bourc’his,D. and Voinnet,O. (2010) A small-RNA perspective on gametogenesis, fertilization, and early zygotic development. Science, 330, 617–622. 63. Zheng,R., Shen,Z., Tripathi,V., Xuan,Z., Freier,S.M., Bennett,C.F., Prasanth,S.G. and Prasanth,K.V. (2010) Polypurine-repeat-containing RNAs: a novel class of long non-coding RNA in mammalian cells. J. Cell Sci., 123, 3734–3744. 64. Fadloun,A., Le Gras,S., Jost,B., Ziegler-Birling,C., Takahashi,H., Gorab,E., Carninci,P. and Torres-Padilla,M.E. (2013) Chromatin signatures and retrotransposon proiling in mouse embryos reveal regulation of LINE-1 by RNA. Nat. Struct. Mol. Biol., 20, 332–338. 65. Hall,L.L., Carone,D.M., Gomez,A.V., Kolpa,H.J., Byron,M., Mehta,N., Fackelmayer,F.O. and Lawrence,J.B. (2014) Stable C0T-1 repeat RNA is abundant and is associated with euchromatic interphase chromosomes. Cell, 156, 907–919. 66. Nickerson,J.A., Krochmalnic,G., Wan,K.M. and Penman,S. (1989) Chromatin architecture and nuclear RNA. Proc. Natl. Acad. Sci. U.S.A., 86, 177–181. 67. Avendano,C., Franchi,A., Jones,E. and Oehninger,S. (2009) Pregnancy-speciic {beta}-1-glycoprotein 1 and human leukocyte antigen-E mRNA in human sperm: differential expression in fertile 68. 69. 70. 71. and infertile men and evidence of a possible functional role during early development. Hum. Reprod., 24, 270–277. Fang,P., Zeng,P., Wang,Z., Liu,M., Xu,W., Dai,J., Zhao,X., Zhang,D., Liang,D., Chen,X. et al. (2014) Estimated diversity of messenger RNAs in each murine spermatozoa and their potential function during early zygotic development. Biol. Reprod., 90, 94, 1–11. Sutovsky,P., Oko,R., Hewitson,L. and Schatten,G. (1997) The removal of the sperm perinuclear theca and its association with the bovine oocyte surface during fertilization. Dev. Biol., 188, 75–84. Naaby-Hansen,S., Mandal,A., Wolkowicz,M.J., Sen,B., Westbrook,V.A., Shetty,J., Coonrod,S.A., Klotz,K.L., Kim,Y.H., Bush,L.A. et al. (2002) CABYR, a novel calcium-binding tyrosine phosphorylation-regulated ibrous sheath protein involved in capacitation. Dev. Biol., 242, 236–254. Garrido,N., Martinez-Conejero,J.A., Jauregui,J., Horcajadas,J.A., Simon,C., Remohi,J. and Meseguer,M. (2009) Microarray analysis in sperm from fertile and infertile men without basic sperm analysis abnormalities reveals a signiicantly different transcriptome. Fertil. Steril., 91, 1307–1310.