k-mer-Based GWAS Unlock Causal Variant Discovery in Soybean

Genome‐wide association studies (GWAS) allow the discovery of loci associated with traits of interest in crops by identifying statistical associations between molecular markers and traits. Many studies use single‐nucleotide polymorphisms (SNPs) as molecular markers, but observed variation in traits can be due to other types of genetic changes such as structural variants (SVs). GWAS frequently fail to identify them.
Researchers tested if short sequence motifs, known as k‐mers, could identify causal variants regardless of if they are SNPs or SVs. Using a population of 363 cultivated soybean lines, they applied SNP‐, SV‐, and k‐mer‐based GWAS to 13 traits. The researchers found that k‐mers could pinpoint known causal variants at four loci, while identifying promising causal genes for several other traits. These analyses can assess all types of variants in a single analytic framework.
Such analyses hold promise for speeding up the application of genomics to plant breeding. The framework and computational tools developed by the authors for downstream analysis of k‐mer‐based GWAS will facilitate the adoption of this method.
Adapted from Lemay, M.‐A., de Ronne, M., Bélanger, R., & Belzile, F. (2023). k‐mer‐based GWAS enhances the discovery of causal variants and candidate genes in soybean. The Plant Genome, e20374. https://doi.org/10.1002/tpg2.20374
Text © . The authors. CC BY-NC-ND 4.0. Except where otherwise noted, images are subject to copyright. Any reuse without express permission from the copyright owner is prohibited.