icardaFIGSr: A Toolkit for Focused Identification of Germplasm Strategy (FIGS)

Overview

The icardaFIGSr package provides tools for applying the Focused Identification of Germplasm Strategy (FIGS) to plant genetic resources data. With FIGS, users can subset collections efficiently to identify promising accessions based on traits, environmental data, and statistical workflows. This package is designed to support researchers and genebank managers in the identification of targeted germplasm subsets for plant breeding, research & developpement, and conservation purposes.

Features

Data Retrieval:
- Access and preprocess genebank and environmental datasets.
- Handle climatic data and crop-specific parameters effectively.
Modeling and Analysis:
- Train machine learning models with flexible workflows for classification and regression.
- Generate variable importance metrics and predictions.
- Evaluate model performance using tools like ROC curves and confusion matrices.
Built-in Datasets:
- Access preloaded datasets such as DurumWheatDHEWC, BarleyRNOWC, and FIGS subsets, among others.

Installation

Install the latest release from CRAN:

install.packages("icardaFIGSr")

Or, install the development version from GitHub:

devtools::install_github("icarda/icardaFIGSr")

Getting Started

Load the Package

library(icardaFIGSr)

Example Workflow

1. Load a Built-in Dataset

data("DurumWheatDHEWC")
head(DurumWheatDHEWC)

2. Model Training and Variable Importance

# Train a regression model on the loaded dataset
model <- tuneTrain(data = DurumWheatDHEWC, y = 'DHE', method = 'rf', summary = defaultSummary, classProbs = FALSE)

# Evaluate variable importance
var_imp <- varimpPred(newdata = model$`Test Data`, y = 'DHE', model = model$Training)
var_imp$VariableImportancePlot

3. Extract Onset Data

# Extract onset and climatic data for durum wheat
durum <- getAccessions(crop = 'Durum wheat', coor = FALSE)
onset_data <- getOnset(sites = unique(durum$SiteCode), crop = 'ICDW',
                var = c('tavg', 'prec'), cv = TRUE)
# Climate data
head(onset_data[[1]])

# Onset and phenological data
head(onset_data[[2]])

4. Visualize Spatial Data

# Map accessions by population type
mapAccessions(df = durum, long = "Longitude", lat = "Latitude", y = "PopulationType")

Vignettes

More details and examples are available as vignettes:

Accessing Crop-Related Data: vignette("CropData")
Predictive Modeling using tuneTrain(): vignette(ML_Workflows)
Extracting Sites Climate Data: vignette(Sites_climate)

To view vignettes locally:

browseVignettes("icardaFIGSr")

Acknowledgments

This package was developed with contributions from:

Zakaria Kehel (Maintainer and Author)

Chafik Analy (Author)

Khadija Aouzal (Author)

Khadija Aziz (Author)

Bancy Ngatia (Author)

Zainab Azough, Amal Ibnelhobyb, Fawzy Nawar (Contributors)

Contact

For questions, please contact : Khadija Aouzal k.aouzal@cgiar.org or Zakaria Kehel z.kehel@cgiar.org

Name		Name	Last commit message	Last commit date
Latest commit History 55 Commits
.github/workflows		.github/workflows
R		R
data		data
inst		inst
man		man
pkgdown		pkgdown
tests		tests
vignettes		vignettes
.DS_Store		.DS_Store
.Rbuildignore		.Rbuildignore
.gitignore		.gitignore
DESCRIPTION		DESCRIPTION
LICENSE		LICENSE
NAMESPACE		NAMESPACE
README.md		README.md
_pkgdown.yml		_pkgdown.yml
icardaFIGSr.Rproj		icardaFIGSr.Rproj

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

icardaFIGSr: A Toolkit for Focused Identification of Germplasm Strategy (FIGS)

Overview

Features

Installation

Getting Started

Load the Package

Example Workflow

1. Load a Built-in Dataset

2. Model Training and Variable Importance

3. Extract Onset Data

4. Visualize Spatial Data

Vignettes

Acknowledgments

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors 3

Languages

Folders and files

Latest commit

History

Repository files navigation

icardaFIGSr: A Toolkit for Focused Identification of Germplasm Strategy (FIGS)

Overview

Features

Installation

Getting Started

Load the Package

Example Workflow

1. Load a Built-in Dataset

2. Model Training and Variable Importance

3. Extract Onset Data

4. Visualize Spatial Data

Vignettes

Acknowledgments

Contact

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors 3

Languages

Packages