Birth and expression evolution of mammalian microRNA genes

Julien Meunier, Frédéric Lemoine, Magali Soumillon, Angélica Liechti, Manuela Weier, Katerina Guschanski, Haiyang Hu, Philipp Khaitovich, Henrik Kaessmann, Julien Meunier, Frédéric Lemoine, Magali Soumillon, Angélica Liechti, Manuela Weier, Katerina Guschanski, Haiyang Hu, Philipp Khaitovich, Henrik Kaessmann

Abstract

MicroRNAs (miRNAs) are major post-transcriptional regulators of gene expression, yet their origins and functional evolution in mammals remain little understood due to the lack of appropriate comparative data. Using RNA sequencing, we have generated extensive and comparable miRNA data for five organs in six species that represent all main mammalian lineages and birds (the evolutionary outgroup) with the aim to unravel the evolution of mammalian miRNAs. Our analyses reveal an overall expansion of miRNA repertoires in mammals, with threefold accelerated birth rates of miRNA families in placentals and marsupials, facilitated by the de novo emergence of miRNAs in host gene introns. Generally, our analyses suggest a high rate of miRNA family turnover in mammals with many newly emerged miRNA families being lost soon after their formation. Selectively preserved mammalian miRNA families gradually evolved higher expression levels, as well as altered mature sequences and target gene repertoires, and were apparently mainly recruited to exert regulatory functions in nervous tissues. However, miRNAs that originated on the X chromosome evolved high expression levels and potentially diverse functions during spermatogenesis, including meiosis, through selectively driven duplication-divergence processes. Overall, our study thus provides detailed insights into the birth and evolution of mammalian miRNA genes and the associated selective forces.

Figures

Figure 1.
Figure 1.
Birth and death rates of miRNA families. Phylogeny of the six studied amniote species and estimated rates of miRNA family gain and loss, as inferred by a maximum likelihood procedure (Methods), are shown. Note that the estimated number of families in the amniote ancestor that have been completely lost during evolution (i.e., with no representative in extant species) is zero. Branch lengths reflect evolutionary divergence times in million of years (MY). Number of gained (+) and lost (–) families (in black) as well as the net gain rate of miRNA families per MY (in red) are indicated next to each branch. The therian net gain rate was computed based on the gain and loss of miRNA families across the entire therian clade (light blue box). The net number of families that have been gained since the bird-mammal split are indicated in the orange box, and the total number of families for each species are indicated above.
Figure 2.
Figure 2.
Age of miRNA families relative to their expression levels and numbers of predicted target genes. (A) Expression level distributions of miRNA families of different ages. Expression values for each miRNA family were computed as the median expression levels of all family members across all tissues. Also, common expression values associated with two or more miRNA loci with highly similar mature sequences were divided by the number of loci involved. Age categories of miRNA families are represented from the most recent (far left) to the most ancient (far right) for each species based on their phylogenetic distribution (see Methods). (B) Number of predicted target genes of miRNA families of different ages divided by the number of predicted targets for mock miRNAs from random intronic sequences (Methods) using PITA. Number of target genes per miRNA family was computed as the median number of targets of all family members. (C) Evolution of mature miRNA sequences. The percentage of miRNA families displaying one or more modifications in the mature sequence (substitutions in the seed or in the rest of the mature sequence; shifts in the 5′ or 3′ end of the mature miRNA) is shown together with 95% confidence intervals.
Figure 3.
Figure 3.
Spatial expression patterns of miRNAs and 3′-UTR structures of predicted target genes. (A) Age of miRNA families and their relative expression by tissues. Relative expression values for each family were calculated as the sum of expression values of all family members in a given tissue divided by their total expression across all tissues. Colored symbols indicate the median relative expression value of miRNA families. Ancient/recent families: families that originated before/after the mammal-bird split. Samples sizes (n) are indicated (note that miRNA families with low expression levels were filtered out in this analysis; see Methods for details). For all but the ancient macaque and recent platypus families, the difference between the maximum and minimum median values is significantly higher than expected by chance (permutation test on tissue labels, corrected P < 0.05). (B) 3′-UTR lengths of protein-coding genes. Protein-coding genes were classified according to the tissue in which they are most highly expressed. (C) Numbers of miRNA families targeting protein-coding genes (using PITA predictions). A miRNA family was considered to target a gene if one or more of the miRNAs were predicted to target the gene's 3′ UTR. Patterns are shown for all miRNA families. Notably, similar results are obtained for ancient and recent families when these are analyzed separately and using TargetScan target predictions (Supplemental Fig. S15).
Figure 4.
Figure 4.
Evolution of miRNA precursor sequences. (A) Comparisons of phyloP score category frequencies between human miRNA precursor sequences and the genomic background. miRNA/genome frequency ratios >1 indicate a higher frequency of sites with a given phyloP score category in miRNAs relative to the genomic background (and vice versa). Primate-based phyloP scores: rapidly evolving sites (score < −0.5); slowly evolving sites (score > 0.5). Error bars: 95% confidence intervals. The age (i.e., phylogenetic distribution; Methods) of miRNAs and the total number of miRNA sites considered (n) are indicated at the bottom of the panel. (B) Sequence evolution of miRNA families on the X chromosome and autosomes. To limit biases due to age variations, only eutherian-specific miRNA families predating the human-mouse split were considered.
Figure 5.
Figure 5.
Expression patterns of sex chromosome-linked and autosomal miRNAs. (A) Spatial expression pattern of miRNA genes on the therian X, platypus X1–X5, and bird Z chromosomes. Expression level distributions of miRNA genes shared among and specific to eutherians, marsupials, monotremes, or specific to chicken (see Methods for miRNA age definitions) are shown. The sample size, n, corresponds to the number of independent expression values. Note that we define expression levels of miRNA genes in somatic tissues as their median expression levels across all four somatic tissues. Also, common expression values associated with two or more miRNA loci with highly similar mature sequences were divided by the number of loci involved. (B) Expression of miRNA genes shared among and specific to eutherians in mouse spermatogenic cells. Sample sizes are the same as indicated for mouse in panel A.

References

    1. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ 1990. Basic local alignment search tool. J Mol Biol 215: 403–410
    1. Bartel DP 2009. MicroRNAs: Target recognition and regulatory functions. Cell 136: 215–233
    1. Benjamini Y, Hochberg Y 1995. Controlling the false discovery rate: A practical and powerful approach to multiple testing. JR Stat Soc 57: 289–300
    1. Berezikov E 2011. Evolution of microRNA diversity and regulation in animals. Nat Rev Genet 12: 846–860
    1. Bernhart SH, Tafer H, Mückstein U, Flamm C, Stadler PF, Hofacker IL 2006. Partition function and base pairing probabilities of RNA heterodimers. Algorithms Mol Biol 1: 3 doi: 10.1186/1748-7188-1-3
    1. Brawand D, Soumillon M, Necsulea A, Julien P, Csardi G, Harrigan P, Weier M, Liechti A, Aximu-Petri A, Kircher M, et al. 2011. The evolution of gene expression levels in mammalian organs. Nature 478: 343–348
    1. Buard J, Barthes P, Grey C, de Massy B 2009. Distinct histone modifications define initiation and repair of meiotic recombination in the mouse. EMBO J 28: 2616–2624
    1. Burroughs AM, Ando Y, de Hoon MJ, Tomaru Y, Nishibu T, Ukekawa R, Funakoshi T, Kurokawa T, Suzuki H, Hayashizaki Y, et al. 2010. A comprehensive survey of 3′ animal miRNA modification events and a possible role for 3′ adenylation in modulating miRNA targeting effectiveness. Genome Res 20: 1398–1410
    1. Campo-Paysaa F, Sémon M, Cameron RA, Peterson KJ, Schubert M 2011. microRNA complements in deuterostomes: Origin and evolution of microRNAs. Evol Dev 13: 15–27
    1. Carthew RW, Sontheimer EJ 2009. Origins and mechanisms of miRNAs and siRNAs. Cell 136: 642–655
    1. Chen K, Rajewsky N 2007. The evolution of gene regulation by transcription factors and microRNAs. Nat Rev Genet 8: 93–103
    1. Christodoulou F, Raible F, Tomer R, Simakov O, Trachana K, Klaus S, Snyman H, Hannon GJ, Bork P, Arendt D 2010. Ancient animal microRNAs and the evolution of tissue identity. Nature 463: 1084–1088
    1. Csuros M 2010. Count: Evolutionary analysis of phylogenetic profiles with parsimony and likelihood. Bioinformatics 26: 1910–1912
    1. Dannemann M, Nickel B, Lizano E, Burbano HA, Kelso J 2012. Annotation of primate miRNAs by high throughput sequencing of small RNA libraries. BMC Genomics 13: 116 doi: 10.1186/1471-2164-13-116
    1. Devor EJ, Samollow PB 2008. In vitro and in silico annotation of conserved and nonconserved microRNAs in the genome of the marsupial Monodelphis domestica. J Hered 99: 66–72
    1. Duret L, Galtier N 2009. Biased gene conversion and the evolution of mammalian genomic landscapes. Annu Rev Genomics Hum Genet 10: 285–311
    1. Edgar RC 2004. MUSCLE: Multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32: 1792–1797
    1. Fahlgren N, Jogdeo S, Kasschau KD, Sullivan CM, Chapman EJ, Laubinger S, Smith LM, Dasenko M, Givan SA, Weigel D, et al. 2010. MicroRNA gene evolution in Arabidopsis lyrata and Arabidopsis thaliana. Plant Cell 22: 1074–1089
    1. Friedländer MR, Chen W, Adamidi C, Maaskola J, Einspanier R, Knespel S, Rajewsky N 2008. Discovering microRNAs from deep sequencing data using miRDeep. Nat Biotechnol 26: 407–415
    1. Grabske RJ, Lake S, Gledhill BL, Meistrich ML 1975. Centrifugal elutriation: Separation of spermatogenic cells on the basis of sedimentation velocity. J Cell Physiol 86: 177–189
    1. Griffiths-Jones S, Grocock RJ, van Dongen S, Bateman A, Enright AJ 2006. miRBase: microRNA sequences, targets and gene nomenclature. Nucleic Acids Res 34: D140–D144
    1. Grimson A, Farh KK, Johnston WK, Garrett-Engele P, Lim LP, Bartel DP 2007. MicroRNA targeting specificity in mammals: Determinants beyond seed pairing. Mol Cell 27: 91–105
    1. Guioli S, Lovell-Badge R, Turner JM 2012. Error-prone ZW pairing and no evidence for meiotic sex chromosome inactivation in the chicken germ line. PLoS Genet 8: e1002560 doi: 10.1371/journal.pgen.1002560
    1. Guo X, Su B, Zhou Z, Sha J 2009. Rapid evolution of mammalian X-linked testis microRNAs. BMC Genomics 10: 97 doi: 10.1186/1471-2164-10-97
    1. Guo H, Ingolia NT, Weissman JS, Bartel DP 2010. Mammalian microRNAs predominantly act to decrease target mRNA levels. Nature 466: 835–840
    1. Heimberg AM, Sempere LF, Moy VN, Donoghue PCJ, Peterson KJ 2008. MicroRNAs and the advent of vertebrate morphological complexity. Proc Natl Acad Sci 105: 2946–2950
    1. Hertel J, Lindemeyer M, Missal K, Fried C, Tanzer A, Flamm C, Hofacker IL, Stadler PF, The students of Bioinformatics Computer Labs of 2004 and 2005 2006. The expansion of the metazoan microRNA repertoire. BMC Genomics 7: 25 doi: 10.1186/1471-2164-7-25
    1. Hubbard TJ, Aken BL, Ayling S, Ballester B, Beal K, Bragin E, Brent S, Chen Y, Clapham P, Clarke L, et al. 2009. Ensembl 2009. Nucleic Acids Res 37: D690–D697
    1. Julien P, Brawand D, Soumillon M, Necsulea A, Liechti A, Schütz F, Daish T, Grützner F, Kaessmann H 2012. Mechanisms and evolutionary patterns of Mammalian and avian dosage compensation. PLoS Biol 10: e1001328 doi: 10.1371/journal.pbio.1001328
    1. Kertesz M, Iovino N, Unnerstall U, Gaul U, Segal E 2007. The role of site accessibility in microRNA target recognition. Nat Genet 39: 1278–1284
    1. Khil PP, Smirnova NA, Romanienko PJ, Camerini-Otero RD 2004. The mouse X chromosome is enriched for sex-biased genes not subject to selection by meiotic sex chromosome inactivation. Nat Genet 36: 642–646
    1. Kim VN, Han J, Siomi MC 2009. Biogenesis of small RNAs in animals. Nat Rev Mol Cell Biol 10: 126–139
    1. King M, Wilson A 1975. Evolution at two levels in humans and chimpanzees. Science 188: 107–116
    1. Kozomara A, Griffiths-Jones S 2011. miRBase: Integrating microRNA annotation and deep-sequencing data. Nucleic Acids Res 39: D152–D157
    1. Landgraf P, Rusu M, Sheridan R, Sewer A, Iovino N, Aravin A, Pfeffer S, Rice A, Kamphorst AO, Landthaler M, et al. 2007. A mammalian microRNA expression atlas based on small RNA library sequencing. Cell 129: 1401–1414
    1. Langmead B, Trapnell C, Pop M, Salzberg SL 2009. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10: R25 doi: 10.1186/gb-2009-10-3-r25
    1. Lercher MJ, Urrutia AO, Hurst LD 2003. Evidence that the human X chromosome is enriched for male-specific but not female-specific genes. Mol Biol Evol 20: 1113–1116
    1. Li J, Liu Y, Dong D, Zhang Z 2010. Evolution of an X-linked primate-specific micro RNA cluster. Mol Biol Evol 27: 671–683
    1. Lu J, Shen Y, Wu Q, Kumar S, He B, Shi S, Carthew RW, Wang SM, Wu CI 2008. The birth and death of microRNA genes in Drosophila. Nat Genet 40: 351–355
    1. Morin RD, O'Connor MD, Griffith M, Kuchenbauer F, Delaney A, Prabhu AL, Zhao Y, McDonald H, Zeng T, Hirst M, et al. 2008. Application of massively parallel sequencing to microRNA profiling and discovery in human embryonic stem cells. Genome Res 18: 610–621
    1. Mueller JL, Mahadevaiah SK, Park PJ, Warburton PE, Page DC, Turner JMA 2008. The mouse X chromosome is enriched for multicopy testis genes showing postmeiotic expression. Nat Genet 40: 794–799
    1. Murchison EP, Kheradpour P, Sachidanandam R, Smith C, Hodges E, Xuan Z, Kellis M, Grützner F, Stark A, Hannon GJ 2008. Conservation of small RNA pathways in platypus. Genome Res 18: 995–1004
    1. Niwa R, Slack FJ 2007. The evolution of animal microRNA function. Curr Opin Genet Dev 17: 145–150
    1. Pollard KS, Hubisz MJ, Rosenbloom KR, Siepel A 2010. Detection of nonneutral substitution rates on mammalian phylogenies. Genome Res 20: 110–121
    1. Ramsköld D, Wang ET, Burge CB, Sandberg R 2009. An abundance of ubiquitously expressed genes revealed by tissue transcriptome sequence data. PLoS Comput Biol 5: e1000598 doi: 10.1371/journal.pcbi.1000598
    1. Rice WR 1984. Sex chromosomes and the evolution of sexual dimorphism. Evolution 38: 735–742
    1. Robinson MD, McCarthy DJ, Smyth GK 2010. edgeR: A bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26: 139–140
    1. Roux J, Gonzalez-Porta M, Robinson-Rechavi M 2012. Comparative analysis of human and mouse expression data illuminates tissue-specific evolutionary patterns of miRNAs. Nucleic Acids Res 40: 5890–5900
    1. Scarpino S, Morena AR, Petersen C, Froysa B, Soder O, Boitani C 1998. A rapid method of Sertoli cell isolation by DSA lectin, allowing mitotic analyses. Mol Cell Endocrinol 146: 121–127
    1. Sin HS, Ichijima Y, Koh E, Namiki M, Namekawa SH 2012. Human postmeiotic sex chromatin and its impact on sex chromosome evolution. Genome Res 22: 827–836
    1. Song R, Ro S, Michaels JD, Park C, McCarrey JR, Yan W 2009. Many X-linked microRNAs escape meiotic sex chromosome inactivation. Nat Genet 41: 488–493
    1. Stefani G, Slack FJ 2008. Small non-coding RNAs in animal development. Nat Rev Mol Cell Biol 9: 219–230
    1. Turner JMA 2007. Meiotic sex chromosome inactivation. Development 134: 1823–1831
    1. van der Ven PF, Speel EJ, Albrechts JC, Ramaekers FC, Hopman AH, Furst DO 1999. Assignment of the human gene for endosarcomeric cytoskeletal M-protein (MYOM2) to 8p23.3. Genomics 55: 253–255
    1. Wang PJ, McCarrey JR, Yang F, Page DC 2001. An abundance of X-linked genes expressed in spermatogonia. Nat Genet 27: 422–426
    1. Xiao C, Rajewsky K 2009. MicroRNA control in the immune system: Basic principles. Cell 136: 26–36
    1. Zhang H, Lee JY, Tian B 2005. Biased alternative polyadenylation in human tissues. Genome Biol 6: R100 doi: 10.1186/gb-2005-6-12-r100
    1. Zhang R, Peng Y, Wang W, Su B 2007. Rapid evolution of an X-linked microRNA cluster in primates. Genome Res 17: 612–617
    1. Zhang YE, Landback P, Vibranovski MD, Long M 2011. Accelerated recruitment of new brain development genes into the human genome. PLoS Biol 9: e1001179 doi: 10.1371/journal.pbio.1001179

Source: PubMed

3
Subscribe