The icardaFIGSr package provides tools for applying the Focused Identification of Germplasm Strategy (FIGS) to plant genetic resources data. With FIGS, users can subset collections efficiently to identify promising accessions based on traits, environmental data, and statistical workflows. This package is designed to support researchers and genebank managers in the identification of targeted germplasm subsets for plant breeding, research & developpement, and conservation purposes.
-
Data Retrieval:
- Access and preprocess genebank and environmental datasets.
- Handle climatic data and crop-specific parameters effectively.
-
Modeling and Analysis:
- Train machine learning models with flexible workflows for classification and regression.
- Generate variable importance metrics and predictions.
- Evaluate model performance using tools like ROC curves and confusion matrices.
-
Built-in Datasets:
- Access preloaded datasets such as
DurumWheatDHEWC,BarleyRNOWC, andFIGSsubsets, among others.
- Access preloaded datasets such as
Install the latest release from CRAN:
install.packages("icardaFIGSr")Or, install the development version from GitHub:
devtools::install_github("icarda/icardaFIGSr")library(icardaFIGSr)data("DurumWheatDHEWC")
head(DurumWheatDHEWC)# Train a regression model on the loaded dataset
model <- tuneTrain(data = DurumWheatDHEWC, y = 'DHE', method = 'rf', summary = defaultSummary, classProbs = FALSE)
# Evaluate variable importance
var_imp <- varimpPred(newdata = model$`Test Data`, y = 'DHE', model = model$Training)
var_imp$VariableImportancePlot# Extract onset and climatic data for durum wheat
durum <- getAccessions(crop = 'Durum wheat', coor = FALSE)
onset_data <- getOnset(sites = unique(durum$SiteCode), crop = 'ICDW',
var = c('tavg', 'prec'), cv = TRUE)
# Climate data
head(onset_data[[1]])
# Onset and phenological data
head(onset_data[[2]])# Map accessions by population type
mapAccessions(df = durum, long = "Longitude", lat = "Latitude", y = "PopulationType")More details and examples are available as vignettes:
- Accessing Crop-Related Data:
vignette("CropData") - Predictive Modeling using tuneTrain():
vignette(ML_Workflows) - Extracting Sites Climate Data:
vignette(Sites_climate)
To view vignettes locally:
browseVignettes("icardaFIGSr")This package was developed with contributions from:
- Zakaria Kehel (Maintainer and Author)
- Chafik Analy (Author)
- Khadija Aouzal (Author)
- Khadija Aziz (Author)
- Bancy Ngatia (Author)
- Zainab Azough, Amal Ibnelhobyb, Fawzy Nawar (Contributors)
For questions, please contact : Khadija Aouzal k.aouzal@cgiar.org or Zakaria Kehel z.kehel@cgiar.org