Edwards Lab: myrtle rust

Showing posts with label myrtle rust. Show all posts

Friday, 15 December 2023

A high-quality pseudo-phased genome for Melaleuca quinquenervia shows allelic diversity of NLR-type resistance genes

Chen SH, Martino AM, Luo Z, Schwessinger B, Jones A, Tolessa T, Bragg JG, Tobias PA, Edwards RJ (2023): A high-quality pseudo-phased genome for Melaleuca quinquenervia shows allelic diversity of NLR-type resistance genes. GigaScience 12:giad102. [Gigascience] [PubMed]

Background. Melaleuca quinquenervia (broad-leaved paperbark) is a coastal wetland tree species that serves as a foundation species in eastern Australia, Indonesia, Papua New Guinea, and New Caledonia. While extensively cultivated for its ornamental value, it has also become invasive in regions like Florida, USA. Long-lived trees face diverse pest and pathogen pressures, and plant stress responses rely on immune receptors encoded by the nucleotide-binding leucine-rich repeat (NLR) gene family. However, the comprehensive annotation of NLR encoding genes has been challenging due to their clustering arrangement on chromosomes and highly repetitive domain structure; expansion of the NLR gene family is driven largely by tandem duplication. Additionally, the allelic diversity of the NLR gene family remains largely unexplored in outcrossing tree species, as many genomes are presented in their haploid, collapsed state.

Results. We assembled a chromosome-level pseudo-phased genome for M. quinquenervia and described the allelic diversity of plant NLRs using the novel FindPlantNLRs pipeline. Analysis reveals variation in the number of NLR genes on each haplotype, distinct clustering patterns, and differences in the types and numbers of novel integrated domains.

Conclusions. The high-quality M. quinquenervia genome assembly establishes a new framework for functional and evolutionary studies of this significant tree species. Our findings suggest that maintaining allelic diversity within the NLR gene family is crucial for enabling responses to environmental stress, particularly in long-lived plants.

Monday, 1 February 2021

Dr Collin Ahrens (Postdoc)

Dr Collin Ahrens joined the Edwards lab as a postdoctoral researcher in January of 2021. He studies local adaptation to environmental variation (e.g. pathogens, drought, heatwaves etc.), and investigates how these adaptive patterns can be used to manage plant populations.

Local adaptation is often driven by differential physiological responses to environmental stress, controlled by genetic mechanisms. Therefore, he focuses on the E + G = P paradigm to ask questions such as how do populations evolve such different responses to different environmental conditions? And how do species evolve such different responses to the same environmental conditions? To answer these fundamental questions, he leverages several computational techniques to disentangle patterns of adaptation. At the Edwards lab, he will use whole genome sequencing, quantitative genetics, and physiological experimentation to explore how myrtle rust resistance segregates within Melaleuca quinquenervia populations to assist in broader conservation programs, including applied outcomes such as seed collection and ex situ breeding programs.

Tuesday, 10 December 2019

Edwards Lab at GIW/ABACBS2019

The Edwards Lab and close affiliates have five posters at GIW-ABACBS2019 this year. Come and check them out during the two poster sesstions:

Poster #26: Åsa Pérez-Bercoff. Comparative performance of long-read whole genome assembly tools in diploid eukaryotes
Poster #33: Tim Amos. Estimating genome size using long read depth profiles and single copy regions of draft genome assemblies
Poster #92: Kat Stuart. Using genomics to reveal drivers of invasion success
Poster #93: Stephanie Chen. Advancing genomic resources for myrtle rust research and management
Poster #168: Rich Edwards. BUSCOMP: BUSCO Compilation and Comparison for Assessing Completeness in Multiple Genome Assemblies

Poster #26: Comparative performance of long-read whole genome assembly tools in diploid eukaryotes

Åsa Pérez-Bercoff, Paris Thompson & Richard J. Edwards [PDF]

As long-read (single molecule) sequencing from Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT) is getting more common, more long-read assemblers are emerging. Whilst a few benchmarking studies have been performed, there is no clear “best” assembly tool, and the choice is often a combination of compute resource availability and anecdotal reports of relative performance in similar organisms. Here, we have de novo assembled PacBio long-read sequencing data from 13 diploid Saccharomyces cerevisiae (baker’s yeast) strains (11 diploid, 1 haploid and 1 tetraploid) using four different assemblers: Canu v1.8, Flye v2.4.2, WTDBG2 (a.k.a. Redbean) v2.4 and Ra v20181211. Assembly performance statistics have been generated by comparing assemblies to the reference yeast genome (SGD R64.2.1) using QUAST v5.0.2, and rating each assembly for accuracy, coverage, and contiguity.

The latest generation of assembly tools decide how to process reads based on a genome size parameter. For heterozygous diploid organisms, it is not clear whether this should be the haploid or diploid genome size, or something in-between based on heterozygosity. In this study, we used three genome size settings: haploid (13.1Mb), diploid (26.2Mb) and half-way in-between (19.65Mb). This will enable us to establish how the genome size parameter influences the quality of the resulting genome assembly.

Poster #33: Estimating genome size using long read depth profiles and single copy regions of draft genome assemblies

Timothy G Amos, Ziying Zhang & Richard J Edwards [PDF]

Estimating the size of a eukaryote genome is a fundamental task in genome assembly. As well as informing decisions on sequencing technology and depth, greater accuracy in genome size prediction can assist in assessing the completeness and duplication of a genome assembly. Genome sizes can be estimated through both experimental and genome sequencing approaches. Experimental methods include densitometry, flow cytometry analysis of stained nuclei and quantitative PCR (qPCR) of single copy genes. Genome sequencing approaches include short-read k-mer distributions. However, these methods can give variable results, and are prone to inaccurate predictions for genomes with abundant repetitive sequences.

Here, we show that the read depth of long reads in a draft genome can be used to provide a relatively accurate estimate of genome size across various model organisms. In general, diploid genome assemblies will consist of regions of haploid, diploid and incorrect (or multi-copy organelle) depth. Repeats and assembly errors are likely to be highly variable in terms of genome vs assembly copy number and, therefore, coverage. The modal read depth of conserved single copy orthologues should therefore approximate the sequencing depth of the input data, which can be extrapolated to estimate genome size. We found that our method can provide comparable, if not more accurate, estimates than short-read k-mer distributions.

Poster #92: Using genomics to reveal drivers of invasion success

Katarina Stuart, Lee Ann Rollins, William Sherwin, Richard Edwards, Natalie Hofmeister & Yuanyuan Cheng [PDF]

Invasive species are a global concern due to their negative impacts on the economy and local ecosystems. However, well-documented invasions provide a useful system in which to pose biologically interesting questions regarding short time scale evolution. Answering these questions will further our knowledge of evolutionary mechanisms, as well as inform specific management strategies for the invasive population. The European starling (Sturnus vulgaris) is a global pest that was introduced into Australia’s south-eastern states in the 1860’s and has since greatly expanded its range. Previous research on multiple introduced starling populations has demonstrated that their morphology has undergone subtle shifts following colonisation. My research applies a range of sequencing techniques to investigate genomic variation across Australia’s starling population. We are combining long-, short- and linked-read whole genome sequencing to assemble a high-quality starling reference genome. This genome will be annotated using Iso-seq long-read (PacBio) whole transcriptome sequencing in order to properly identify putatively evolving loci. Population sequencing data (whole genome and DArTSeq) will be mapped onto this reference to identify functional SNPs and reveal potential drivers of the rapid phenotypic divergences across the Australian population.

Poster #93: Advancing genomic resources for myrtle rust research and management

Stephanie Chen, Jason Bragg & Richard Edwards [PDF]

Myrtle rust is a plant disease caused by an invasive fungal pathogen (Austropuccinia psidii) first detected in Australia in 2010. Over 350 native species from the family Myrtaceae, which includes eucalypts, paperbarks, tea-trees, and lillipillies, are known hosts. Detailed genetic information needed for an effective and coordinated response encompassing conservation management is lacking. We performed de novo genome assembly and annotation for two species which exhibit a spectrum of resistance – Rhodamnia argentea (malletwood) and Syzygium oleosum (blue lilly pilly). Reference genomes have been generated using 10X linked reads, and are being supplemented with long reads and a hybrid assembly approach. These genomes will complement reduced representation sequencing (DArTseq) and rust resistance assays from different genotypes across the landscape. Together, these data will facilitate the characterisation of genetic structure and rust resistance across space as well as within and among species. This research is crucial for the management of at-risk species through optimising methods of improving disease resistance and adaptation to future climates in addition to increasing our understanding of myrtle rust which is a pressing concern to native biodiversity.

Poster #168: BUSCOMP: BUSCO Compilation and Comparison for Assessing Completeness in Multiple Genome Assemblies

Richard J. Edwards [PDF]

Advances in DNA sequencing technology and bioinformatics tools have placed de novo genome assembly of complex organisms firmly in the domain of individual labs and small consortia. Nevertheless, the assemblies produced are often fragmented and incomplete. Optimal assembly depends on the size, repeat landscape, ploidy and heterozygosity of the genome, which are often unknown. It is therefore common practice to try multiple strategies, and there is a bottleneck in assessing and comparing assemblies.

BUSCO [1] is a powerful and popular tool that estimates genome completeness using gene prediction and curated models of single-copy protein orthologues. However, results can be counterintuitive: adding/removing scaffolds can alter BUSCO predictions elsewhere in the assembly, while low sequence quality may reduce “completeness” scores and miss genes that are present in the assembly [2].

BUSCOMP (BUSCO Compilation and Comparison) complements BUSCO to identify/overcome these issues. BUSCOMP compiles a non-redundant set of the highest-scoring single-copy BUSCO complete sequences, rapidly searches these against assemblies, and robustly re-rates genes as Complete (Single/Duplicated), Fragmented/Partial or Missing. On test data from three organisms (yeast, cane toad and mainland tiger snake), BUSCOMP (1) gives consistent results when re-running the same assembly, (2) is not affected by adding or removing non-BUSCO-containing scaffolds, and (3) is minimally affected by assembly quality. This makes BUSCOMP ideal to run alongside BUSCO when trying to compare and rank genome assemblies, even in the absence of error-correction.

Available at: https://github.com/slimsuite/buscomp.

Simão FA et al. (2015) Bioinformatics 31:3210–3212
Edwards RJ et al. (2018) GigaScience 7:giy095

Tuesday, 9 July 2019

We’ve been funded! ARC Linkage - Optimising plant populations for ecological restoration and resilience

We are very happy to report another successful ARC Linkage grant application.

LP180100721: Optimising plant populations for ecological restoration and resilience

Dr Richard Edwards; Professor Justin Borevitz; Dr Jason Bragg; Dr Maurizio Rossetto; Dr Brett Summerell; Dr Marlien van der Merwe

When choosing individual plants for restoration populations, there is potentially a trade-off between maximising genetic diversity (‘adaptability’) and selection for desirable properties (‘adaptation’). This project aims to develop pioneering methods to quantify this trade-off, and facilitate the design of optimised populations, with a focus on two Australian rainforest trees that are being impacted by myrtle rust infection: Rhodamnia argentea and Rhodamnia rubescens. By studying the genetic variation in each species, and how this relates to myrtle rust resistance and climate, this project aims to design populations that are genetically diverse, maximally resistant to myrtle rust, and adapted to future climate.

We are collaborating with the Royal Botanic Garden and Domain Trust to apply genomics to challenges of conservation for rainforest trees in the face of climate change and invasive pathogens.

There will be job and studentship opportunities associated with this grant, so watch this space (or get in touch)!

Pages