
Helper function to select only the relevant columns for statistical testing
Source: R/select_stat_cols.R
Arguments
- data
  data.frame of classification accuracy results
- by_set
  Boolean specifying whether you want to compare feature sets (if TRUE) or individual features (if FALSE).
- metric
  character denoting the classification performance metric to use in statistical testing. Can be one of "accuracy", "precision", "recall", "f1". Defaults to "accuracy".
- hypothesis
  character denoting which comparison the p-values should be calculated for. Use "null" to compare each feature set or feature (depending on the by_set argument) individually against the null, which applies if use_null = TRUE in tsfeature_classifier. Use "pairwise" to conduct pairwise comparisons between each set or feature on the main model fits only.
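
The way these arguments interact can be illustrated with a minimal sketch. It is not the package's implementation: the function name select_stat_cols_sketch, the column names model_type, feature_set and names, the "Main" label for main model fits, and the default value for hypothesis are all assumptions made for illustration.

```r
# Hypothetical sketch only: column names and labels below are assumptions,
# not taken from the package source.
select_stat_cols_sketch <- function(data, by_set = TRUE,
                                    metric = c("accuracy", "precision", "recall", "f1"),
                                    hypothesis = c("null", "pairwise")) {
  stopifnot(is.data.frame(data), is.logical(by_set), length(by_set) == 1)

  # match.arg() validates the choice and falls back to the first option,
  # so metric defaults to "accuracy" as documented
  metric <- match.arg(metric)
  hypothesis <- match.arg(hypothesis)

  # Assumed identifier column: feature sets vs. individual features
  id_col <- if (by_set) "feature_set" else "names"
  keep <- c("model_type", id_col, metric)

  missing_cols <- setdiff(keep, names(data))
  if (length(missing_cols) > 0) {
    stop("Missing columns: ", paste(missing_cols, collapse = ", "))
  }

  out <- data[, keep, drop = FALSE]

  # Pairwise comparisons use main model fits only, so drop any null fits
  if (hypothesis == "pairwise") {
    out <- out[out$model_type == "Main", , drop = FALSE]
  }
  out
}

# Toy input with made-up values and assumed column names
results <- data.frame(
  model_type  = c("Main", "Null", "Main", "Null"),
  feature_set = c("setA", "setA", "setB", "setB"),
  accuracy    = c(0.81, 0.52, 0.77, 0.49),
  f1          = c(0.80, 0.50, 0.75, 0.47)
)

pairwise_cols <- select_stat_cols_sketch(results, by_set = TRUE,
                                         metric = "f1", hypothesis = "pairwise")
null_cols <- select_stat_cols_sketch(results, by_set = TRUE, hypothesis = "null")
```

In this sketch, "pairwise" keeps only the rows assumed to be main model fits, while "null" keeps both main and null fits so each set or feature can be compared against its own null distribution.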