Author: Osval Antonio Montesinos-Lopez

Deep kernel and deep learning for genomic-based prediction

Jose Crossa, Paulino Pérez-Rodríguez, Juan Burgueño, Ravi Singh, Philomin Juliana, Osval Antonio Montesinos-Lopez, Jaime Cuevas (2019)

Deep learning (DL) is a promising method for genomic prediction, which allows individuals to be selected early without measuring their phenotypes. In this paper we compare the genome-based prediction performance of the DL method, the deep kernel (arc-cosine kernel, AK) method, the Gaussian kernel (GK) method, and the conventional kernel method (Genomic Best Linear Unbiased Predictor, GBLUP, GB). We used two real wheat data sets to benchmark these methods. We found that the GK and deep kernel AK methods outperformed the DL and conventional GB methods. Although the gain in prediction performance of AK and GK was not very large, these methods have the advantage that no tuning parameters are required. Furthermore, although AK and GK had similar genomic-based prediction performance, the deep kernel AK is easier to implement than the GK. For this reason, our results suggest that AK is an alternative to DL models with the advantage that no tuning process is required.
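To make the three kernel families named in the abstract concrete, the following is a minimal NumPy sketch, not the authors' implementation. It assumes a centered marker matrix, uses the one-layer, degree-1 arc-cosine kernel, and fixes the Gaussian bandwidth by scaling squared distances by their median. The function names, the `bandwidth` argument, the median scaling, and the random marker data are all illustrative assumptions.

```python
import numpy as np

def linear_kernel(X):
    """GBLUP-style linear kernel: X X' / p for a centered marker matrix X."""
    return X @ X.T / X.shape[1]

def gaussian_kernel(X, bandwidth=1.0):
    """Gaussian kernel on squared Euclidean distances.

    Distances are scaled by their median (an assumption of this sketch).
    """
    sq_norms = np.sum(X ** 2, axis=1)
    d2 = sq_norms[:, None] + sq_norms[None, :] - 2.0 * (X @ X.T)
    d2 = np.maximum(d2, 0.0)  # guard against tiny negative round-off
    off_diag = d2[np.triu_indices_from(d2, k=1)]
    return np.exp(-bandwidth * d2 / np.median(off_diag))

def arc_cosine_kernel(X):
    """One-layer arc-cosine kernel of degree 1.

    k(x, y) = ||x|| ||y|| (sin t + (pi - t) cos t) / pi, with t the angle
    between x and y. Rows of X must be non-zero.
    """
    norms = np.linalg.norm(X, axis=1)
    norm_prod = np.outer(norms, norms)
    cos_theta = np.clip((X @ X.T) / norm_prod, -1.0, 1.0)
    theta = np.arccos(cos_theta)
    return norm_prod * (np.sin(theta) + (np.pi - theta) * cos_theta) / np.pi

# Synthetic markers coded 0/1/2 for 6 lines and 40 markers (illustration only).
rng = np.random.default_rng(1)
markers = rng.integers(0, 3, size=(6, 40)).astype(float)
X = markers - markers.mean(axis=0)  # center each marker column

G = linear_kernel(X)
GK = gaussian_kernel(X)
AK = arc_cosine_kernel(X)
```

The arc-cosine kernel depends only on the norms of, and the angle between, two marker vectors, so this form has no bandwidth to set. Any of the three matrices could then be used as the covariance structure of a kernel-based prediction model.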

Dataset

AGRICULTURAL SCIENCES AND BIOTECHNOLOGY

Replication Data for: Multimodal Deep Learning Methods Enhance Genomic Prediction of Wheat Breeding

Carolina Rivera-Amado, Francisco Pinto, Francisco Javier Pinera-Chavez, David González-Diéguez, Paulino Pérez-Rodríguez, Huihui Li, Osval Antonio Montesinos-Lopez, Jose Crossa (2023)

In plant breeding research, several statistical machine learning methods have been developed and studied for assessing the genomic prediction (GP) accuracy of unobserved phenotypes. To increase the GP accuracy of unobserved phenotypes while simultaneously accounting for the complexity of genotype × environment interaction (GE), deep learning (DL) neural networks have been developed. These analyses can potentially include phenomics data obtained through imaging. The two datasets included in this study contain phenomic, phenotypic, and genotypic data for a set of wheat materials. They have been used to compare a novel DL method with conventional GP models. The results of these analyses are reported in the accompanying journal article.

Dataset

AGRICULTURAL SCIENCES AND BIOTECHNOLOGY

Replication Data for: Optimizing sparse testing for genomic prediction of plant breeding crops

Osval Antonio Montesinos-Lopez, Carolina Saint Pierre, Brandon Alejandro Mosqueda González, Alison Bentley, Yoseph Beyene, Manje Gowda, Leonardo Abdiel Crespo Herrera, Jose Crossa (2022)

In plant breeding, sparse testing methods have been suggested to improve the efficiency of the genomic selection methodology. The data provided in this dataset were used to evaluate four methods for allocating lines to environments for sparse testing in multi-environment trials. The analysis was conducted under both multi-trait and uni-trait frameworks. The accompanying article describes the results of the evaluation as well as a cost-benefit analysis to identify the benefits that can be obtained using sparse testing methods.

Dataset

AGRICULTURAL SCIENCES AND BIOTECHNOLOGY

Prediction models for canopy hyperspectral reflectance in wheat breeding data

Osval Antonio Montesinos-Lopez, Jose Crossa, Gustavo de los Campos, Gregorio Alvarado, Suchismita Mondal, Jessica Rutkoski, Lorena González Pérez, Juan Burgueño (2016)

Vegetation indices (VIs) generated from a few bands of hyperspectral cameras are used as predictors of primary traits. This study proposes models that use all available bands as predictors of primary traits. The proposed models were ordinary least squares (OLS), Bayes B, principal components with Bayes B, functional B-spline, functional Fourier, and functional partial least squares (PLS). The results were compared with OLS using each of the eight VIs as predictors, individually and combined. The data set comes from CIMMYT’s Global Wheat Program and comprises 1170 genotypes evaluated for grain yield in five environments, with reflectance data measured in 250 discrete narrow bands ranging between 492 and 851 nm at 9 time-points of the crop cycle. Results show that using all the bands simultaneously produced better predictions than using one VI alone or all the VIs together. In the Drought environment, predictions improved when only the bands with heritabilities > 0.5 were used, while in the rest of the environments using all the bands simultaneously produced slightly better prediction accuracies. The models with the highest prediction accuracy when using all bands were functional B-spline and Fourier. Time-point 6 gave promising prediction accuracies for wheat lines before harvesting.
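The contrast between a single vegetation index and all bands as predictors can be sketched as follows. This is an illustration on synthetic data, not the study's analysis. NDVI is used as an example index, and the record does not say whether it is among the eight VIs. The 670 nm and 800 nm band choices, the sample size, and the helper names `nearest_band` and `ols_fit` are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
n_lines, n_bands = 400, 250

# 250 narrow bands spanning the 492-851 nm range described in the abstract.
wavelengths = np.linspace(492.0, 851.0, n_bands)

# Synthetic reflectance and trait values, for illustration only.
reflectance = rng.uniform(0.05, 0.6, size=(n_lines, n_bands))
grain_yield = rng.normal(size=n_lines)

def nearest_band(target_nm):
    """Index of the band whose wavelength is closest to target_nm."""
    return int(np.argmin(np.abs(wavelengths - target_nm)))

def ols_fit(predictors, response):
    """Ordinary least squares with an intercept; returns the coefficients."""
    design = np.column_stack([np.ones(len(response)), predictors])
    coef, *_ = np.linalg.lstsq(design, response, rcond=None)
    return coef

# A vegetation index built from two bands (assumed red and near-infrared).
red = reflectance[:, nearest_band(670.0)]
nir = reflectance[:, nearest_band(800.0)]
ndvi = (nir - red) / (nir + red)

coef_vi = ols_fit(ndvi, grain_yield)          # intercept + 1 slope
coef_all = ols_fit(reflectance, grain_yield)  # intercept + 250 slopes
```

The index model compresses the spectrum into one predictor, while the all-band model estimates one coefficient per band. The functional B-spline and Fourier models named in the abstract instead represent those coefficients through a smaller set of basis functions.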

Dataset

AGRICULTURAL SCIENCES AND BIOTECHNOLOGY
