Compute correlated t-statistic and p-value for repeated k-fold cross-validated results
Source: R/repkfold_ttest.R
repkfold_ttest.Rd
Arguments
- data: data.frame of values for model A and model B over repeated k-fold cross-validation. Four named columns are expected: "model", "values", "k", and "r"
- n1: integer denoting train set size
- n2: integer denoting test set size
- k: integer denoting number of folds used in k-fold
- r: integer denoting number of repeats per fold
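To make the roles of the arguments concrete, here is a minimal base-R sketch of the corrected repeated k-fold t-statistic as described by Bouckaert and Frank (2004): the mean paired difference divided by sqrt((1/(k*r) + n2/n1) * variance), with k*r - 1 degrees of freedom. The helper name corrected_repkfold_t, the two-sided p-value, and the assumption that both models share the same fold and repeat layout are illustrative choices, not the package's own source code.

```r
# Hypothetical helper for illustration only; not the package implementation.
corrected_repkfold_t <- function(data, n1, n2, k, r) {
  # Sort so that rows of both models line up by repeat and fold
  data <- data[order(data$model, data$r, data$k), ]
  models <- unique(data$model)
  stopifnot(length(models) == 2)

  a <- data$values[data$model == models[1]]
  b <- data$values[data$model == models[2]]
  stopifnot(length(a) == k * r, length(b) == k * r)

  d  <- a - b          # paired differences, one per fold per repeat
  m  <- mean(d)
  s2 <- stats::var(d)  # sample variance, denominator k * r - 1

  # The usual 1 / (k * r) term is inflated by n2 / n1 to account for the
  # overlap between training sets across folds and repeats
  t_stat <- m / sqrt((1 / (k * r) + n2 / n1) * s2)
  df <- k * r - 1

  # Assumption: two-sided p-value
  p_val <- 2 * stats::pt(abs(t_stat), df = df, lower.tail = FALSE)
  data.frame(statistic = t_stat, p.value = p_val)
}

# Simulated results: 5-fold cross-validation repeated twice on 100 samples,
# so each training set has 80 rows and each test set has 20
set.seed(1)
cv <- expand.grid(k = 1:5, r = 1:2, model = c("A", "B"))
cv$values <- c(stats::rnorm(10, mean = 0.6, sd = 0.05),
               stats::rnorm(10, mean = 0.4, sd = 0.05))

res <- corrected_repkfold_t(cv, n1 = 80, n2 = 20, k = 5, r = 2)
print(res)
```

Under these assumptions, the same simulated data frame has the four columns that the data argument expects, so it could also be passed to repkfold_ttest with n1 = 80, n2 = 20, k = 5, and r = 2.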
References
Nadeau, C., and Bengio, Y. Inference for the Generalization Error. Machine Learning, 52 (2003).
Bouckaert, R. R., and Frank, E. Evaluating the Replicability of Significance Tests for Comparing Learning Algorithms. Advances in Knowledge Discovery and Data Mining, PAKDD 2004. Lecture Notes in Computer Science, 3056 (2004).