| Literature DB >> 21854559 |
Marilyn B Renfree1, Anthony T Papenfuss, Janine E Deakin, James Lindsay, Thomas Heider, Katherine Belov, Willem Rens, Paul D Waters, Elizabeth A Pharo, Geoff Shaw, Emily S W Wong, Christophe M Lefèvre, Kevin R Nicholas, Yoko Kuroki, Matthew J Wakefield, Kyall R Zenger, Chenwei Wang, Malcolm Ferguson-Smith, Frank W Nicholas, Danielle Hickford, Hongshi Yu, Kirsty R Short, Hannah V Siddle, Stephen R Frankenberg, Keng Yih Chew, Brandon R Menzies, Jessica M Stringer, Shunsuke Suzuki, Timothy A Hore, Margaret L Delbridge, Hardip R Patel, Amir Mohammadi, Nanette Y Schneider, Yanqiu Hu, William O'Hara, Shafagh Al Nadaf, Chen Wu, Zhi-Ping Feng, Benjamin G Cocks, Jianghui Wang, Paul Flicek, Stephen M J Searle, Susan Fairley, Kathryn Beal, Javier Herrero, Dawn M Carone, Yutaka Suzuki, Sumio Sugano, Atsushi Toyoda, Yoshiyuki Sakaki, Shinji Kondo, Yuichiro Nishida, Shoji Tatsumoto, Ion Mandiou, Arthur Hsu, Kaighin A McColl, Benjamin Lansdell, George Weinstock, Elizabeth Kuczek, Annette McGrath, Peter Wilson, Artem Men, Mehlika Hazar-Rethinam, Allison Hall, John Davis, David Wood, Sarah Williams, Yogi Sundaravadanam, Donna M Muzny, Shalini N Jhangiani, Lora R Lewis, Margaret B Morgan, Geoffrey O Okwuonu, San Juana Ruiz, Jireh Santibanez, Lynne Nazareth, Andrew Cree, Gerald Fowler, Christie L Kovar, Huyen H Dinh, Vandita Joshi, Chyn Jing, Fremiet Lara, Rebecca Thornton, Lei Chen, Jixin Deng, Yue Liu, Joshua Y Shen, Xing-Zhi Song, Janette Edson, Carmen Troon, Daniel Thomas, Amber Stephens, Lankesha Yapa, Tanya Levchenko, Richard A Gibbs, Desmond W Cooper, Terence P Speed, Asao Fujiyama, Jennifer A M Graves, Rachel J O'Neill, Andrew J Pask, Susan M Forrest, Kim C Worley.
Abstract
BACKGROUND: We present the genome sequence of the tammar wallaby, Macropus eugenii, which is a member of the kangaroo family and the first representative of the iconic hopping mammals that symbolize Australia to be sequenced. The tammar has many unusual biological characteristics, including the longest period of embryonic diapause of any mammal, extremely synchronized seasonal breeding and prolonged and sophisticated lactation within a well-defined pouch. Like other marsupials, it gives birth to highly altricial young, and has a small number of very large chromosomes, making it a valuable model for genomics, reproduction and development.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21854559 PMCID: PMC3277949 DOI: 10.1186/gb-2011-12-8-r81
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Phylogeny of the marsupials. Phylogenetic relationships of the orders of Marsupialia. Top: the placement of the contemporary continents of South America and Australia within Gondwanaland and the split of the American and Australian marsupials. Relative divergence in millions of years shown to the left in the context of geological periods. The relationship of the Macropodide within the Australian marsupial phylogeny shown is in purple with estimated divergence dates in millions of years [5,162,163]. Representative species from each clade are illustrated. Inset: phylogeny of the genus Macropus within the Macropodidae showing the placement of the model species M. eugenii (purple) based on [59]. Outgroup species are Thylogale thetis and Petrogale xanthopus.
Comparison of Meug genome assemblies
| Assembly version | |||
|---|---|---|---|
| 1.0 | 1.1 | 2.0 | |
| Contigs (million) | 1.211 | 1.174 | 1.111 |
| N50 (kb) | 2.5 | 2.6 | 2.91 |
| Bases (Mb) | 2546 | 2,536 | 2,574 |
| Scaffolds | 616,418 | 277,711 | 379,858 |
| Max scaffold size | NA | 472,108 | 324,751 |
| Gaps (Mb) | NA | 539 | 619 |
| N50 (kb) | NA | 41.8 | 34.3 |
| Complex scaffolds | NA | 128,563 | 124,674 |
| Singleton scaffolds | NA | 149,148 | 255,184 |
| Co-linear with BACs | NA | 87.2% (418) | 93.4% (298) |
| Co-linear with ESTs | NA | 82.3% (704) | 86.7% (454) |
Summary statistics for the tammar genome assemblies. These statistics indicate the extension and merging of contigs done to improve the assembly. The larger number of scaffolds and smaller scaffold N50 is a consequence of higher stringency in the 2.0 scaffolding workflow. The higher stringency isolated many contigs. However, the number of complex (that is, useful) scaffolds is similar between the assemblies. For co-linear estimates, the scaffolds were linearized and BACs and cDNA libraries were mapped against them. The 1.1 and 2.0 assemblies were validated against 169 BAC contigs and 84,718 ESTs (that were not incorporated into either genome assembly). We determined the percentage of contigs where the scaffolding matched the order and orientation when compared to BACs or ESTs (co-linear with BACs/ESTs). Parentheses indicate the total number of contigs identified after alignment to BAC contigs or ESTs.
Figure 2Homology of tammar regions to the human karyotype, and location of major histocompatibility complex, classical class I genes and olfactory receptor gene. Colored blocks represent the syntenic blocks with human chromosomes as shown in the key. A map of the locations of the tammar major histocompatibility complex (MHC) is shown on the right-hand side of each chromosome. The rearranged MHCs are on chromosome 2 and clusters of MHC class I genes (red) near the telomeric regions of chromosomes 1, 4, 5, 6, and 7. MHC class II genes are shown in blue, olfactory receptors are shown in orange and Kangaroo endogenous retroviral elements found within these clusters are shown in green. The location of the conserved mammalian OR gene clusters in the tammar genome are shown on the left-hand side of each chromosome. OR genes are found on every chromosome, except for chromosome 6 but including the X. The location of the OR gene clusters (numbers) are shown, and their approximate size is represented by lines of different thickness.
Comparison of repeat landscape in tammar and other mammals
| Tammar | Opossum | Platypus | Human | Mouse | |
|---|---|---|---|---|---|
| Total assembly size (Gb) | 2.7 | 3.48 | 2.3 | 2.88 | 2.55 |
| Interspersed repeats (%) | |||||
| Total | 52.8 | 52.2 | 44.6 | 45.5 | 40.9 |
| LINE/non-LTR retroelements | 28.6 | 29.2 | 21.0 | 20.0 | 19.6 |
| SINE | 11.7 | 10.4 | 22.4 | 12.6 | 7.2 |
| ERV | 3.9 | 10.6 | 0.47 | 8.1 | 9.8 |
| DNA transposon | 2.9 | 1.7 | 1.1 | 2.8 | 0.8 |
| C+G (%) | 38.8 | 37.7 | 45.5 | 40.9 | 41.8 |
| CpG (%) | 3.5 | 2.3 | NA | 3.7 | 3.9 |
Comparative analyses of the interspersed repeat content in the tammar and other sequenced mammalian genomes. Repeat modeller combined dataset includes ab initio annotation of de novo repeats. ERV, endogenous retroviral element; LTR, long terminal repeat; NA, not available.
Figure 3Classification of novel tammar genes. Summary of protein domains contained within translated novel ESTs isolated from the tammar transcriptomes. A large proportion of unique genes contain receptor or transcriptional regulator domains. The next largest classes of unique ESTs were immune genes, whey acidic protein and lipid domain containing genes. These findings suggest a rapid diversification of genes associated with immune function and lactation in the tammar.
Figure 4A survey of both conserved and novel small RNAs in the tammar genome. (a) Size ranges of the major classes of small RNAs. The x-axis shows number of reads mapped to the tammar genome while the size of the read in nucleotides is on the y-axis. Boxes denote each major class analyzed in the tammar. Classes targeted for sequencing and full annotation include the miRNAs (18 to 22 nucleotides), the piRNAs (28 to 32 nucleotides) and the newly discovered crasiRNAs (35 to 45 nucleotides). (b) Five tammar miRNA libraries (brain, liver, fibroblast, ovary and testis) were pooled and mapped to the tammar genome. miRNAs with a complete overlap with miRBase entries mapped to the tammar genome were considered conserved and annotated according to species. Heat map showing the frequency of conserved mirBase entries per tissue and per species as identified in the tammar. A high degree of overlap (that is, conservation) was observed between tammar and human for fibroblast and testis, but a relatively low degree of overlap was observed for the brain. (c) The complex tammar centromere. Genome browser view of chromatin immunoprecipitation-sequencing (ChIP-Seq) for DNA bound by the centromere-specific histone CENP-A mapped to a centromeric contig (top, blue). Nucleotide position on the contig is shown on the x-axis and depth of reads shown on the y-axis. Tracks illustrated: MACs peak (model-based analyses of Chip-Seq (black); locations for mapped reads of crasiRNAs (red); location of annotated centromere sequences (in this example, the centromeric LINE L6; purple); modeler repeat prediction track (green). crasiRNAs co-localize to DNA found in CENP-A-containing nucleosomes and are enriched in regions containing known centromere sequences.
Figure 5Comparative map of X and Y chromosomes. Comparison of X/Y shared gene locations on the tammar wallaby, grey short-tailed opossum and human X chromosomes. Blue represents the X conserved region, which is common to all therian X chromosomes. Green represents the X added region, which is on the X in eutherian mammals, but autosomal in marsupial mammals. Ten genes have been identified on the short arm of the tammar Y chromosome, all with a partner on the X, and an orthologue on the Tasmanian devil Y. In contrast, only four genes on the human Y have a partner on the conserved region of the X.
Figure 6. A phylogenetic tree showing the relationship of the SHH, IHH, DHH, and fish desert-like genes. Each group is composed of representatives from mammalian and non-mammalian species. The mammalian DHH group (green) clusters tightly and forms a separate linage to the fish DHH-like genes (red), which are no more closely related to DHH than they are to vertebrate IHH (yellow) and SHH (blue). Hs, human; Tt, dolphin; Xt, Xenopus; Gag, chicken; Mum, mouse; Me, tammar.
Figure 7. mVISTA comparison of partial HOXC cluster highlights conserved HOX genes and non-coding RNAs between human and tammar. In the coding regions, HOXC11 and HOXC10 are highly conserved between human and tammar. In the intergenic regions, some conserved regions shown are non-coding RNAs (long non-coding RNA such as HOTAIR, and miRNAs such as mir-196) or unknown motifs participating in gene expression and regulation. The percentage of identities (50 to 100%) (Vertical axis) is displayed in the coordinates of the genomic sequence (horizontal axis).
Figure 8Olfaction in the tammar. (a) The olfactory apparatus of the tammar showing the pattern of vomeronasal receptor projections to the accessory olfactory bulb with the VN2 receptor cells (expressing Goα) projecting to all parts of the vomeronasal nerve layer (which may also be the case for the VN1 receptor cells (expressing Giα2). This projection pattern may reflect an intermediate type to the 'segregated type' and the 'uniform type' so far described. AOB, accessory olfactory bulb; GL, glomerular layer; GRL, granule cell layer; MOB, main olfactory bulb; MTL, mitral tufted cell layer; VNL, vomeronasal nerve layer; VNO, vomeronasal organ; VN1R and VN2R, vomeronasal receptors 1 and 2. (b) Olfactory receptor (OR) gene family in the tammar. The families of the OR gene repertoire. Neighbor joining tree of 456 full-length functional OR genes was rooted with opossum adrenergic β receptor. Only a few OR gene families (14, 51 and 52) have members that are most closely related to each other, whilst most other families have a high degree of relatedness to other families.