Landscape of genomic diversity and trait discovery in soybean

Babu Valliyodan, Qiu Dan Qiu, Gunvant Patil, Peng Zeng, Jiaying Huang, Lu Dai, Chengxuan Chen, Yanjun Li, Trupti Joshi, Li Song, Tri D. Vuong, Theresa A. Musket, Dong Xu, J. Grover Shannon, Cheng Shifeng, Xin Liu, Henry T. Nguyen

Research output: Contribution to journalArticlepeer-review

120 Scopus citations


Cultivated soybean [Glycine max (L.) Merr.] is a primary source of vegetable oil and protein. We report a landscape analysis of genome-wide genetic variation and an association study of major domestication and agronomic traits in soybean. A total of 106 soybean genomes representing wild, landraces, and elite lines were re-sequenced at an average of 17x depth with a 97.5% coverage. Over 10 million high-quality SNPs were discovered, and 35.34% of these have not been previously reported. Additionally, 159 putative domestication sweeps were identified, which includes 54.34 Mbp (4.9%) and 4,414 genes; 146 regions were involved in artificial selection during domestication. A genome-wide association study of major traits including oil and protein content, salinity, and domestication traits resulted in the discovery of novel alleles. Genomic information from this study provides a valuable resource for understanding soybean genome structure and evolution, and can also facilitate trait dissection leading to sequencing-based molecular breeding.

Original languageEnglish
Article number23598
JournalScientific reports
StatePublished - Mar 31 2016


Dive into the research topics of 'Landscape of genomic diversity and trait discovery in soybean'. Together they form a unique fingerprint.

Cite this