stackex_cooking

mldr.datasets::get.mldr("stackex_cooking")

Select your download

Partitions: select your desired partitioning strategy, validation and format

Random Stratified Iterative stratified
Hold out MULAN MEKA LibSVM KEEL mldr MULAN MEKA LibSVM KEEL mldr MULAN MEKA LibSVM KEEL mldr
2x5-fold cross validation MULAN MEKA LibSVM KEEL mldr MULAN MEKA LibSVM KEEL mldr MULAN MEKA LibSVM KEEL mldr
10-fold cross validation MULAN MEKA LibSVM KEEL mldr MULAN MEKA LibSVM KEEL mldr MULAN MEKA LibSVM KEEL mldr

Summary

Instances 10491
Attributes 977
Inputs 577
Labels 400
Labelsets 6386
Single labelsets 5276
Max frequency 134
Cardinality 2.2248
Density 0.0056
Mean IR 37.8576
SCUMBLE 0.1933
TCS 21.1112

Citation

Charte, Francisco; Rivera, Antonio J.; del Jesus, Maria J.; Herrera, Francisco (2015). QUINTA: A question tagging assistant to improve the answering ratio in electronic forums. In EUROCON 2015 - International Conference on Computer as a Tool (EUROCON), IEEE, 1-6.
@inproceedings{,
  title="QUINTA: A question tagging assistant to improve the answering ratio in electronic forums",
  author="Charte, Francisco and Rivera, Antonio J. and del Jesus, Maria J. and Herrera, Francisco",
  booktitle="EUROCON 2015 - International Conference on Computer as a Tool (EUROCON), IEEE",
  year="2015",
  pages="1-6",
  month="Sept"
}

Concurrence plot

In this concurrence plot, sectors represent labels and links between them depict label co-occurrences. SCUMBLE is a measure designed to assess the concurrence among imbalanced labels.