After a great presentation this morning by Emma de Jong on our High-Quality Genomes for Australian Lutjanidae Species (abstract below),
if you’re at ABACBS2024 then please drop by Poster #102 to find out about some of the work we’re doing with ONT
data.
Abstracts
Improving phased Hifiasm assemblies with 20 kb ONT reads
Richard J Edwards, Adrianne Doran, Emma de Jong, Lara Parata, Shannon Corrigan
The quality and quantity of genome assembly has improved dramatically over recent years. Many large-scale genome
projects combine assembly of HiFi and HiC reads using Hifiasm to produce contiguous phased assemblies, scaffolded
to chromosome-level. Nevertheless, HiFi reads are typically under 25 kb and can still struggle to assemble long, low
diversity repeat regions. Obtaining ultra-long (100 kb or longer) ONT reads to solve this problem remains a
significant challenge due to technical constraints and DNA sample requirements. Here, we explore the utility of using
standard ONT long reads (20 kb or more) as “ultra-long” input to improve phased Hifiasm assemblies for 22
species of bony fish (Genome Size, 627 Mb 1.54 Gb). We also explore whether the new --telo-m
mode in Hifiasm
v0.9.0 improves telomere prediction. Incorporating 20kb+ ONT reads (7.8X 93.5X) significantly increased assembly
contiguity. BUSCO Completeness was not significantly altered, although there was some re-partitioning of BUSCO
genes between phased haplotypes for some species. Improvement did not strongly correlate with read depth (either
HiFi or ONT), suggesting that the underlying read length distributions and/or specific genome features are more
important for determining the outcome. Hifiasm --telo-m
mode significantly increased telomere recovery,
assembling over six times the number of gapless telomere-to-telomere chromosomes when combined with 20kb+
ONT reads. Verification of how these results translate to the ease of curation and/or quality of final HiC-scaffolded
chromosome-level assemblies is ongoing, with a goal to determine whether the additional sample preparation and
sequencing in the lab is cost-effective.
High-Quality Genomes for Australian Lutjanidae Species
Emma de Jong, Lara Parata, Philipp E Bayer, Shannon Corrigan, Richard J Edwards
Lutjanidae (snappers) are highly valued in commercial and recreational fisheries worldwide and serve as indicator
species of the health of marine environments and fishery bioregions in Western Australia. Comprehensive genomic
mapping of immune gene families of Lutjanidae species are lacking, but this information is critical for understanding
disease vulnerability, the impact of environmental stress, improving aquaculture efforts and to provide insights into
the health of wild populations. Despite their importance, only 3 out of 113 Lutjanid species currently have available
reference genomes, two of which are highly fragmented (>11,000 and >200,000 contigs), impacting studies on gene
families relevant to aquaculture. In this study, we present high-quality chromosome-level reference genomes for 14
Australian Lutjanidae species across seven genera, generated using HiFi and HiC data. We present initial comparative
genomic analyses, including immune gene content and chromosomal synteny analyses across species. These analyses
provide insights into the genomic architecture and evolutionary relationships within Lutjanidae. Ongoing work aims
to comprehensively map and compare the immune gene family repertoire across Lutjanidae genera, as well as
Lethrinidae species as an outgroup, to determine genus-specific changes in genes (e.g. loss, selection, duplication)
important for pathogen detection, antigen presentation, inflammation, and immune memory. These genome
assemblies will serve as a foundational resource to the wider scientific community interested in Lutjanidae