Skip to contents

caretMultimodal extends the caret framework to support late fusion workflows in R, enabling users to train models independently across multiple data modalities and combine their predictions into a single meta-model. Designed for R developers, data scientists, and biomedical researchers, caretMultimodal makes late fusion ensemble modelling as accessible and flexible as single-dataset workflows in caret.

Example late fusion workflow using cross-validation

caretMultimodal cross-validation workflow

Key Features

  • Includes all the functionality of caret, giving users full control over sampling strategies, training methods, hyperparameter tuning, and more
  • Default cross-validation structure with careful handling to prevent data leakage across modalities
  • Late fusion ensembling using stacked generalization
  • Parallelization for faster training across models and datasets
  • Model trimming to reduce memory usage for large ensembles
  • Built-in evaluation tools for performance assessment, ROC curves, and variable importance
  • Detailed error messages to simplify debugging

Documentation

Full API documentation is available at compbio-lab.github.io/caretMultimodal

Installation

The package can be installed using devtools

devtools::install_github("CompBio-Lab/caretMultimodal")

Acknowledgements

The project structure is inspired by Zach Mayer’s caretEnsemble package, which is used for stacking multiple models on a single dataset.