Portal/TA3: Difference between revisions

From MaRDI portal
Line 5: Line 5:
the Statistics and ML community has in the management of its research data. These research data
the Statistics and ML community has in the management of its research data. These research data
range from literature, statistical models and algorithms to benchmark data sets and software. The
range from literature, statistical models and algorithms to benchmark data sets and software. The
task area will follow FAIR principles in initiating libraries of curated datasets that will be connected to
task area will follow FAIR principles in initiating libraries of curated datasets ([https://www.openml.org/search?type=study&study_type=task&id=353&sort=tasks_included OpenML] | [https://zenodo.org/communities/mardigmci/records?q=&l=list&p=1&s=10&sort=newest Zenodo])that will be connected to
software and research literature by providing an associated library of statistical analyses. The task
software and research literature by providing an associated library of statistical analyses ([https://mlr-org.com/gallery mlr-org] | [https://doi.org/10.5281/zenodo.10625451 Zenodo]). The task
area will further support FAIR development of new methods by creating workflows and a demonstration  
area will further support FAIR development of new methods by creating workflows and a demonstration  
platform for how to evaluate, compare or also tune methods through empirical analyses and
platform for how to evaluate, compare or also tune methods through empirical analyses and
simulation studies. Finally, the task area will cooperate with journal partners to establish standards
simulation studies. Finally, the task area will cooperate with journal partners to establish standards
for quality control and reproducibility of numerical experiments in the scientific publication process.
for quality control and reproducibility of numerical experiments in the scientific publication process.


== mlr3 ==
== mlr3 ==

Revision as of 09:35, 24 April 2024

Task Area 3: Statistics and Machine Learning

Mission

In the context of MaRDI, and the NFDI more broadly, our task area seeks to address the needs that the Statistics and ML community has in the management of its research data. These research data range from literature, statistical models and algorithms to benchmark data sets and software. The task area will follow FAIR principles in initiating libraries of curated datasets (OpenML | Zenodo)that will be connected to software and research literature by providing an associated library of statistical analyses (mlr-org | Zenodo). The task area will further support FAIR development of new methods by creating workflows and a demonstration platform for how to evaluate, compare or also tune methods through empirical analyses and simulation studies. Finally, the task area will cooperate with journal partners to establish standards for quality control and reproducibility of numerical experiments in the scientific publication process.

mlr3

mlr3 is an open-source collection of R packages providing a unified interface for machine learning in the R language.

You can make your algorithm available via the mlr3 interface, by creating a pull request in our community repository mlr3extralearners.

Start by checking out our website or by reading our book.

OpenML

OpenML is a FAIR platform for sharing machine learning research data. You can use OpenML with mlr3, using interface R package mlr3oml.

Zenodo community: Graphical Modelling and Causal Inference

This community offers a space for collecting and sharing datasets and software products that are related to graphical modelling and causal discovery/inference. Graphical models are statistical models that facilitate refined yet tractable data exploration by using graphs to represent complex stochastic independence structures between considered variables. Models based on directed graphs, in particular, provide the state-of-the-art approach for detailed exploration of cause-effect relationships. The different instances of graphical models also go by names such as Bayesian networks, Markov random fields, probabilistic graphical models, structural causal or structural equation models.

Team