From MaRDI portal
Revision as of 10:38, 24 April 2024 by Stephan Haug (talk | contribs) (→‎Team)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Task Area 3: Statistics and Machine Learning


In the context of MaRDI, and the NFDI more broadly, our task area seeks to address the needs that the Statistics and ML community has in the management of its research data. These research data range from literature, statistical models and algorithms to benchmark data sets and software. The task area will follow FAIR principles in initiating libraries of curated datasets (OpenML | Zenodo)that will be connected to software and research literature by providing an associated library of statistical analyses (mlr-org | Zenodo). The task area will further support FAIR development of new methods by creating workflows and a demonstration platform for how to evaluate, compare or also tune methods through empirical analyses and simulation studies. Finally, the task area will cooperate with journal partners to establish standards for quality control and reproducibility of numerical experiments in the scientific publication process.


mlr3 is an open-source collection of R packages providing a unified interface for machine learning in the R language.

You can make your algorithm available via the mlr3 interface, by creating a pull request in our community repository mlr3extralearners.

Start by checking out our website or by reading our book.


OpenML is a FAIR platform for sharing machine learning research data. You can use OpenML with mlr3, using interface R package mlr3oml.

Zenodo community: Graphical Modelling and Causal Inference

This community offers a space for collecting and sharing datasets and software products that are related to graphical modelling and causal discovery/inference. Graphical models are statistical models that facilitate refined yet tractable data exploration by using graphs to represent complex stochastic independence structures between considered variables. Models based on directed graphs, in particular, provide the state-of-the-art approach for detailed exploration of cause-effect relationships. The different instances of graphical models also go by names such as Bayesian networks, Markov random fields, probabilistic graphical models, structural causal or structural equation models.