Genomic Selection in Plant Breeding: A Comparison of Models

ABSTRACT Simulation and empirical studies of genomic selection (GS) show accuracies sufficient to generate rapid genetic gains. However, with the increased popularity of GS approaches, numerous models have been proposed and no comparative analysis is available to identify the most promising ones. Us...

Full description

Saved in:

Bibliographic Details
Published in	Crop science Vol. 52; no. 1; pp. 146 - 160
Main Authors	Heslot, Nicolas, Yang, Hsiao‐Pei, Sorrells, Mark E., Jannink, Jean‐Luc
Format	Journal Article
Language	English
Published	Madison, WI The Crop Science Society of America, Inc 01.01.2012 Crop Science Society of America American Society of Agronomy
Subjects	Accuracy Agronomy. Soil science and plant productions Algorithms Arabidopsis thaliana artificial intelligence barley Biological and medical sciences breeding value corn correlation data collection Fundamental and applied biological sciences. Psychology Genetic improvement Genetics and breeding of economic plants Genomes Genomics Hordeum vulgare marker-assisted selection phenotypic variation Plant breeding Plant breeding: fundamental aspects and methodology Studies Subpopulations Triticum aestivum Wheat Zea mays Modeling Genomics Genetic improvement
Online Access	Get full text
ISSN	0011-183X 1435-0653
DOI	10.2135/cropsci2011.06.0297

Cover

More Information
Summary:	ABSTRACT Simulation and empirical studies of genomic selection (GS) show accuracies sufficient to generate rapid genetic gains. However, with the increased popularity of GS approaches, numerous models have been proposed and no comparative analysis is available to identify the most promising ones. Using eight wheat (Triticum aestivum L.), barley (Hordeum vulgare L.), Arabidopsis thaliana (L.) Heynh., and maize (Zea mays L.) datasets, the predictive ability of currently available GS models along with several machine learning methods was evaluated by comparing accuracies, the genomic estimated breeding values (GEBVs), and the marker effects for each model. While a similar level of accuracy was observed for many models, the level of overfitting varied widely as did the computation time and the distribution of marker effect estimates. Our comparisons suggested that GS in plant breeding programs could be based on a reduced set of models such as the Bayesian Lasso, weighted Bayesian shrinkage regression (wBSR, a fast version of BayesB), and random forest (RF) (a machine learning method that could capture nonadditive effects). Linear combinations of different models were tested as well as bagging and boosting methods, but they did not improve accuracy. This study also showed large differences in accuracy between subpopulations within a dataset that could not always be explained by differences in phenotypic variance and size. The broad diversity of empirical datasets tested here adds evidence that GS could increase genetic gain per unit of time and cost.
Bibliography:	All rights reserved. No part of this periodical may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, recording, or any information storage and retrieval system, without permission in writing from the publisher. Permission for printing and for reprinting the material contained herein has been obtained by the publisher. SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 14 ObjectType-Article-1 ObjectType-Feature-2 content type line 23
ISSN:	0011-183X 1435-0653
DOI:	10.2135/cropsci2011.06.0297