

OpenML564 (MaRDI QID: Q6033298)

OpenML dataset with id 564

No author found.

Full work available at URL:

Upload date: 3 October 2014

Dataset Characteristics

Number of classes: 0
Number of features: 11 (numeric: 11, symbolic: 0, binary: 0)
Number of instances: 40,768
Number of instances with missing values: 0
Number of missing values: 0

Author:
Source: Unknown - Date unknown
Please cite:

This is an artificial data set used in Friedman (1991) and also described in Breiman (1996, p.139). The cases are generated as follows: generate the values of 10 attributes, X1, ..., X10, independently, each uniformly distributed over [0,1]. Then obtain the value of the target variable Y using the equation:

Y = 10 * sin(pi * X1 * X2) + 20 * (X3 - 0.5)^2 + 10 * X4 + 5 * X5 + sigma(0,1)
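The generation method above can be sketched in a few lines of Python. This is a minimal illustration, not the procedure that produced the published file: it assumes that sigma(0,1) denotes standard normal noise, the function names `friedman_target` and `generate_cases` are hypothetical, and the seed is arbitrary, so the values will not match the OpenML data row for row.

```python
import math
import random


def friedman_target(x, noise=0.0):
    """Target Y for one case x = [X1, ..., X10], plus an optional noise value."""
    x1, x2, x3, x4, x5 = x[:5]  # X6..X10 do not appear in the equation
    return (
        10 * math.sin(math.pi * x1 * x2)
        + 20 * (x3 - 0.5) ** 2
        + 10 * x4
        + 5 * x5
        + noise
    )


def generate_cases(n_cases=40768, noise_sd=1.0, seed=0):
    """Draw n_cases rows of 10 uniform attributes and compute Y for each."""
    rng = random.Random(seed)  # arbitrary seed, for repeatability only
    rows, targets = [], []
    for _ in range(n_cases):
        x = [rng.random() for _ in range(10)]  # each attribute uniform on [0, 1)
        # Assumption: sigma(0,1) is read as Gaussian noise with mean 0, sd 1.
        targets.append(friedman_target(x, rng.gauss(0.0, noise_sd)))
        rows.append(x)
    return rows, targets


rows, targets = generate_cases()
print(len(rows), len(rows[0]), len(targets))
```

Only X1 through X5 enter the equation, so the remaining five attributes carry no information about Y. Together with Y, the 10 attributes account for the 11 continuous attributes listed in the characteristics.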

Source: collection of regression datasets by Luis Torgo.
Original source: Breiman (1996, p.139).
Characteristics: 40768 cases, 11 continuous attributes.


BREIMAN, L. (1996): Bagging Predictors. Machine Learning, 24(2), 123--140. Kluwer Academic Publishers.
FRIEDMAN, J. (1991): Multivariate Adaptive Regression Splines. Annals of Statistics, 19(1), 1--141.