Tuesday, 17 December 2024

Parental assigned chromosomes for cultivated cacao provides insights into genetic architecture underlying resistance to vascular streak dieback

As a big fan of chocolate, it was great to have the opportunity to help out with a cacao genome. This study presents the first diploid, fully scaffolded, and parentally phased genome resource for Theobroma cacao L. to provide insights into the genetic architecture underlying resistance and susceptibility to vascular streak dieback (VSD), a significant threat to cacao production in Southeast Asia and Melanesia. By analyzing NLR gene clusters and other disease response gene candidates in proximity to informative QTLs, the research identifies structural variants within NLRs inherited from resistant and susceptible parents, offering potential breeding targets for VSD resistance.

Tobias PA, Downs J, Epaina P, Singh G, Park RF, Edwards RJ, Brugman E, Zulkifli A, Muhammad J, Purwantara A & Guest DI (2024): Parental assigned chromosomes for cultivated cacao provides insights into genetic architecture underlying resistance to vascular streak dieback. The Plant Genome doi: 10.1002/tpg2.20524. [Plant Genome] [bioRxiv] [PubMed]

Abstract

Diseases of Theobroma cacao L. (Malvaceae) disrupt cocoa bean supply and economically impact growers. Vascular streak dieback (VSD), caused by Ceratobasidium theobromae, is a new encounter disease of cacao currently contained to southeast Asia and Melanesia. Resistance to VSD has been tested with large progeny trials in Sulawesi, Indonesia, and in Papua New Guinea with the identification of informative quantitative trait loci (QTLs). Using a VSD susceptible progeny tree (clone 26), derived from a resistant and susceptible parental cross, we assembled the genome to chromosome-level and discriminated alleles inherited from either resistant or susceptible parents. The parentally phased genomes were annotated for all predicted genes and then specifically for resistance genes of the nucleotide-binding site leucine-rich repeat class (NLR). On investigation, we determined the presence of NLR clusters and other potential disease response gene candidates in proximity to informative QTLs. We identified structural variants within NLRs inherited from parentals. We present the first diploid, fully scaffolded, and parentally phased genome resource for T. cacao L. and provide insights into the genetics underlying resistance and susceptibility to VSD.

#AusEvol2024 - Depth-based correction of gene duplications and losses in genome assemblies

The Australasian Evolution Society conference has always been one of my favourites, due to its laid back culture of inclusivity and kindness. (And low cost!) It therefore feels quite fitting that my last conference as an Aussie academic was AES2024.

This talk was a bit of an update from my AES2021 presentation. This showcased some of the latest additions to DepthKopy, including depth-based copy number correction of genome features, such as rDNA genes, repeat families, or multicopy genes. This includes a feature that classifies multicopy “Duplicated” genes identified by BUSCO as true (biological) or false (artefactual) duplicates. TL/DR version: analysis of draft genome assemblies for 45 species of fish across five different depths/qualities indicates that DepthKopy can correct the copy number and total length of multicopy features to within 10% of the true number. (The lower-quality raw assemblies ranged from a 30% under-estimate to a 60% over-estimate.)

This will be of most importance when low quality draft genomes are included in a comparative genomics analysis. However, even the best genome assemblies appear to have some “collapsed” or duplicated loci where the copy number in the assembly does not accurately reflect the copy number in the genome. DepthKopy is useful for exploring the magnitude of such disparities, and can help to identify and correct specific disrepancies in genes or features of interest.

Tuesday, 3 December 2024

So long, Ocean Genomes... and thanks for all the fish!

After a successful couple of years, today was my last day at UWA. I am proud of the team that I helped to build at the Minderoo Oceanomics Centre at UWA, and the things we have accomplished together. The UWA Oceans Institute has been a fantastic place to work, and I look forward to completing some exciting ongoing collaborations in my capacity as adjunct. It's been exciting to see Ocean Genomes grow from a concept with a largely empty lab to a fully-fledged genome factory capable of generating multiple high-quality genomes a week. The associated publications should hopefully be following soon, and I look forward to continued collaboration with the team as an Oceans Institute adjunct.

Developments in DNA sequencing technology over the past few years have been immense, but the most impressive part for me has been witnessing the laboratory technical team optimising the sample preparations for sequencing. Everything gets so much harder when you move from human samples (the focus of most methods development and testing) into non-model organisms, and I am convinced that the quality of genomes we’ve been producing is in large part due to the quality of the DNA going into the sequencers.

I am now looking for my next challenge and am officially Open For Work. We’ll be moving back to Dublin at the end of January. If you are based in Ireland and need an experienced interdisciplinary problem solver with broad expertise across bioinformatics and biomolecular science, please get in touch! Academic and non-academic opportunities are welcome.