Display options
Share it on

Front Plant Sci. 2015 Nov 17;6:941. doi: 10.3389/fpls.2015.00941. eCollection 2015.

Genomic Prediction in Pea: Effect of Marker Density and Training Population Size and Composition on Prediction Accuracy.

Frontiers in plant science

Nadim Tayeh, Anthony Klein, Marie-Christine Le Paslier, Françoise Jacquin, Hervé Houtin, Céline Rond, Marianne Chabert-Martinello, Jean-Bernard Magnin-Robert, Pascal Marget, Grégoire Aubert, Judith Burstin

Affiliations

  1. INRA, UMR1347 Agroécologie Dijon, France.
  2. INRA, US1279 Etude du Polymorphisme des Génomes Végétaux, CEA-IG/Centre National de Génotypage Evry, France.

PMID: 26635819 PMCID: PMC4648083 DOI: 10.3389/fpls.2015.00941

Abstract

Pea is an important food and feed crop and a valuable component of low-input farming systems. Improving resistance to biotic and abiotic stresses is a major breeding target to enhance yield potential and regularity. Genomic selection (GS) has lately emerged as a promising technique to increase the accuracy and gain of marker-based selection. It uses genome-wide molecular marker data to predict the breeding values of candidate lines to selection. A collection of 339 genetic resource accessions (CRB339) was subjected to high-density genotyping using the GenoPea 13.2K SNP Array. Genomic prediction accuracy was evaluated for thousand seed weight (TSW), the number of seeds per plant (NSeed), and the date of flowering (BegFlo). Mean cross-environment prediction accuracies reached 0.83 for TSW, 0.68 for NSeed, and 0.65 for BegFlo. For each trait, the statistical method, the marker density, and/or the training population size and composition used for prediction were varied to investigate their effects on prediction accuracy: the effect was large for the size and composition of the training population but limited for the statistical method and marker density. Maximizing the relatedness between individuals in the training and test sets, through the CDmean-based method, significantly improved prediction accuracies. A cross-population cross-validation experiment was further conducted using the CRB339 collection as a training population set and nine recombinant inbred lines populations as test set. Prediction quality was high with mean Q (2) of 0.44 for TSW and 0.59 for BegFlo. Results are discussed in the light of current efforts to develop GS strategies in pea.

Keywords: GenoPea 13.2K SNP Array; genomic selection; marker density; pea (Pisum sativum L.); prediction accuracy; training set

References

  1. PLoS One. 2015 Mar 31;10(3):e0120610 - PubMed
  2. New Phytol. 2012 Apr;194(1):116-28 - PubMed
  3. J Stat Softw. 2010;33(1):1-22 - PubMed
  4. Genetics. 2012 Apr;190(4):1503-10 - PubMed
  5. BMC Genomics. 2014 Aug 29;15:740 - PubMed
  6. Front Plant Sci. 2015 Nov 27;6:1037 - PubMed
  7. Bioinformatics. 2012 Aug 1;28(15):2086-7 - PubMed
  8. Theor Appl Genet. 2013 Oct;126(10):2575-86 - PubMed
  9. PLoS Genet. 2015 Feb 17;11(2):e1004982 - PubMed
  10. Theor Popul Biol. 1988 Feb;33(1):54-78 - PubMed
  11. Mol Breed. 2014;34(4):1843-1852 - PubMed
  12. Brief Funct Genomics. 2010 Mar;9(2):166-77 - PubMed
  13. BMC Genomics. 2015 Feb 21;16:105 - PubMed
  14. J Dairy Sci. 2009 Jan;92(1):16-24 - PubMed
  15. PLoS One. 2013 Sep 05;8(9):e74612 - PubMed
  16. Theor Appl Genet. 2014 Jun;127(6):1263-91 - PubMed
  17. Genetics. 2001 Apr;157(4):1819-29 - PubMed
  18. Genetics. 2012 Oct;192(2):715-28 - PubMed
  19. Plant J. 2015 Dec;84(6):1257-73 - PubMed
  20. Bioinformatics. 2001 Jun;17(6):520-5 - PubMed
  21. BMC Genomics. 2014 Jul 04;15:556 - PubMed

Publication Types