Helper function to select only the relevant columns for statistical testing
Source: R/select_stat_cols.R
select_stat_cols.Rd
Arguments
- data
  data.frame of classification accuracy results
- by_set
  Boolean specifying whether you want to compare feature sets (if TRUE) or individual features (if FALSE).
- metric
  character denoting the classification performance metric to use in statistical testing. Can be one of "accuracy", "precision", "recall", "f1". Defaults to "accuracy".
- hypothesis
  character denoting whether p-values should be calculated for each feature set or feature (depending on the by_set argument) individually relative to the null if use_null = TRUE in tsfeature_classifier through "null", or whether pairwise comparisons between each set or feature should be conducted on main model fits only through "pairwise".
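
To illustrate how the four arguments above interact, here is a minimal sketch in base R. It is not the package's implementation: the function name select_cols_sketch, the column names (model_type, feature_set, feature, and the metric columns), and the "Main"/"Null" labels are all assumptions made for illustration. The real results table produced by tsfeature_classifier may be laid out differently.

```r
# Hypothetical sketch: NOT the package's implementation.
# Column names (model_type, feature_set, feature, metric columns) and the
# "Main"/"Null" labels are assumptions made purely for illustration.
select_cols_sketch <- function(data, by_set = TRUE,
                               metric = c("accuracy", "precision", "recall", "f1"),
                               hypothesis = c("null", "pairwise")) {
  stopifnot(is.data.frame(data), is.logical(by_set), length(by_set) == 1)
  metric <- match.arg(metric)          # errors on an unsupported metric
  hypothesis <- match.arg(hypothesis)  # errors on an unsupported hypothesis

  # Pairwise comparisons are run on main model fits only
  if (hypothesis == "pairwise") {
    data <- data[data$model_type == "Main", , drop = FALSE]
  }

  # Identify rows by feature set or by individual feature
  id_col <- if (by_set) "feature_set" else "feature"

  # Keep only the columns needed for statistical testing
  data[, c("model_type", id_col, metric), drop = FALSE]
}

# Toy results table with made-up values
results <- data.frame(
  model_type  = c("Main", "Null", "Main", "Null"),
  feature_set = c("setA", "setA", "setB", "setB"),
  feature     = c("feat_mean", "feat_mean", "feat_var", "feat_var"),
  accuracy    = c(0.80, 0.50, 0.70, 0.50),
  precision   = c(0.78, 0.48, 0.69, 0.51),
  recall      = c(0.81, 0.52, 0.72, 0.49),
  f1          = c(0.79, 0.50, 0.70, 0.50),
  stringsAsFactors = FALSE
)

# Compare feature sets against the null using F1
out_null <- select_cols_sketch(results, by_set = TRUE,
                               metric = "f1", hypothesis = "null")

# Compare individual features pairwise using accuracy (main fits only)
out_pair <- select_cols_sketch(results, by_set = FALSE,
                               metric = "accuracy", hypothesis = "pairwise")
```

In this sketch, hypothesis = "null" keeps both the main and null rows so each set or feature can be tested against its null distribution, while hypothesis = "pairwise" drops the null rows before selecting columns.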