Tuesday, 25 January 2022

Horizontal transposon transfer and its implications for the ancestral ecology of hydrophiine snakes

The first of the BABS Genome papers has finally arrived, featuring our two 10x Genomics Supernova snake genomes. Such is the speed that genomics is moving, the snake assemblies themselves have moved on quite a bit since then and we hope to release chromosome-level versions soon. (The goalposts for a genome paper moved faster than they could be written up - always a challenge without dedicated researchers working on assemblies! Do get in touch if they’d be useful and we can collaborate.)

Rather than a pure genome paper, this paper makes use of our two elapid genomes to ask some interesting questions about possible horizontal transfer of transposable (mobile genetic) elements during the evolution of sea snakes - our two elapids provided good sister (mainland tiger snake) and outgroup (eastern brown snake) taxa for the olive sea snake, which was the focus of the study. It was doubly pleasing to collaborate on a transposable elements paper, as they were the subject of my PhD (albeit in bacteria, see here and here).

This paper is part of a special issue, Mobile Elements in Phylogenomic Reconstructions, and features some interesting examples of probable horiztonal transfer of mobile elements that provide insights into the evolutionary history of these species.


Galbraith JD, Ludington AJ, Sanders KL, Amos TG, Thomson VA, Enosi Tuipulotu D, Dunstan N, Edwards RJ, Suh A, Adelson DL (2022): Horizontal transposon transfer and its implications for the ancestral ecology of hydrophiine snakes. Genes 13(2):217. [Genes] [PDF] [bioRxiv]

Abstract

Transposable elements (TEs), also known as jumping genes, are sequences able to move or copy themselves within a genome. As TEs move throughout genomes they often act as a source of genetic novelty, hence understanding TE evolution within lineages may help in understanding environmental adaptation. Studies into the TE content of lineages of mammals such as bats have uncovered horizontal transposon transfer (HTT) into these lineages, with squamates often also containing the same TEs. Despite the repeated finding of HTT into squamates, little comparative research has examined the evolution of TEs within squamates. Here we examine a diverse family of Australo–Melanesian snakes (Hydrophiinae) to examine if the previously identified, order-wide pattern of variable TE content and activity holds true on a smaller scale. Hydrophiinae diverged from Asian elapids ~30 Mya and have since rapidly diversified into six amphibious, ~60 marine and ~100 terrestrial species that fill a broad range of ecological niches. We find TE diversity and expansion differs between hydrophiines and their Asian relatives and identify multiple HTTs into Hydrophiinae, including three likely transferred into the ancestral hydrophiine from fish. These HTT events provide the first tangible evidence that Hydrophiinae reached Australia from Asia via a marine route.

Friday, 14 January 2022

The Waratah genome paper is out!

The final version of the waratah genome paper now out in Molecular Ecology Resources. This was a fun collaboration with the Royal Botanic Gardens and Domain Trust as one of the pilot genomes for BioPlatforms Australia’s Genomics for Australian Plants (GAP) initiative.

You can read the press release here, or our piece in the Conversation, We’ve unveiled the waratah’s genetic secrets, helping preserve this Australian icon for the future.

In this paper, we present a chromosome-level assembly for the NSW State Floral Emblem, the New South Wales waratah, Telopea speciosissima. This joins macadamia as the 2nd reference genome for the Proteaceae family & should help future studies for the remaining ca. 1700 species.

The genome was assembled from a ONT chassis, scaffolded with 10x Genomics linked reads and Phase Genomics HiC - made possible thanks to quality data from AGRF and the Ramaciotti Centre for Genomics. The final assembly was chromosome-level, with 94.1% on the 11 chromosomes (2n = 22).

As well as the assembly itself, the paper presents a three genomics tools that we hope will be helpful for other assemblies:

1. DepthSizer uses long-read depths and BUSCO predictions to estimate genome size. We estimated the waratah genome to be ca. 900 Mbp - bigger than kmer estimates, but smaller than flow cytometry of Tasmanian waratah.

2. Diploidocus builds on Purge Haplotigs, combining read depths, kmer frequencies & BUSCO predictions to classify and curate/filter assembly scaffolds. This decreases false duplications & contamination, and flags collapsed repeats for closer inspection.

3. DepthKopy uses BUSCO Complete genes to establish sequencing depth (like DepthSizer) and then estimates copy number for regions (e.g. genes), scaffolds & sliding windows of the assembly. This showed that most “Duplicated” BUSCOs are real duplicates.


Chen SH, Rossetto M, van der Merwe M, Lu-Irving P, Yap JS, Sauquet H, Bourke G, Amos TG, Bragg JG & Edwards RJ (accepted): Chromosome-level de novo genome assembly of Telopea speciosissima (New South Wales waratah) using long-reads, linked-reads and Hi-C. Molecular Ecology Resources.
[Mol Ecol Res] [bioRxiv]

Abstract

Telopea speciosissima, the New South Wales waratah, is an Australian endemic woody shrub in the family Proteaceae. Waratahs have great potential as a model clade to better understand processes of speciation, introgression and adaptation, and are significant from a horticultural perspective. Here, we report the first chromosome-level genome for T. speciosissima. Combining Oxford Nanopore long-reads, 10x Genomics Chromium linked-reads and Hi-C data, the assembly spans 823 Mb (scaffold N50 of 69.0 Mb) with 97.8% of Embryophyta BUSCOs “Complete”. We present a new method in Diploidocus (https://github.com/slimsuite/diploidocus) for classifying, curating and QC-filtering scaffolds, which combines read depths, k-mer frequencies and BUSCO predictions. We also present a new tool, DepthSizer (https://github.com/slimsuite/depthsizer), for genome size estimation from the read depth of single-copy orthologues and estimate the genome size to be approximately 900 Mb. The largest 11 scaffolds contained 94.1% of the assembly, conforming to the expected number of chromosomes (2n = 22). Genome annotation predicted 40,158 protein-coding genes, 351 rRNAs and 728 tRNAs. We investigated CYCLOIDEA (CYC) genes, which have a role in determination of floral symmetry, and confirm the presence of two copies in the genome. Read depth analysis of 180 “Duplicated” BUSCO genes using a new tool, DepthKopy (https://github.com/slimsuite/depthkopy), suggests almost all are real duplications, increasing confidence in the annotation and highlighting a possible need to revise the BUSCO set for this lineage. The chromosome-level T. speciosissima reference genome (Tspe_v1) provides an important new genomic resource of Proteaceae to support the conservation of flora in Australia and further afield.

If you want a read and don’t have access, please get it touch or check out the bioRxiv preprint.