Generates a dictionary using results from different binary classification tasks, for example, using different thresholds
output
dictionary containing the name of statistic as a key and a list of that statistic for the data subsets.