stackex_chemistry

mldr.datasets::get.mldr("stackex_chemistry")

Select your download

Partitions: select your desired partitioning strategy, validation and format

                            Random                         Stratified                     Iterative stratified
Hold out                    MULAN MEKA LibSVM KEEL mldr    MULAN MEKA LibSVM KEEL mldr    MULAN MEKA LibSVM KEEL mldr
2x5-fold cross validation   MULAN MEKA LibSVM KEEL mldr    MULAN MEKA LibSVM KEEL mldr    MULAN MEKA LibSVM KEEL mldr
10-fold cross validation    MULAN MEKA LibSVM KEEL mldr    MULAN MEKA LibSVM KEEL mldr    MULAN MEKA LibSVM KEEL mldr

Summary

Instances 6961
Attributes 715
Inputs 540
Labels 175
Labelsets 3032
Single labelsets 2331
Max frequency 318
Cardinality 2.1093
Density 0.0121
Mean IR 56.8779
SCUMBLE 0.1867
TCS 19.4733
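
Some of the figures in the summary are related to one another, and a quick check can help when reading them. The sketch below uses Python rather than the mldr R package, and it assumes the usual definitions: attributes are the inputs plus the labels, and density is cardinality divided by the number of labels. The variable names are illustrative only and do not come from any library.

```python
# Values copied from the Summary table above.
attributes = 715
inputs = 540
labels = 175
cardinality = 2.1093  # mean number of active labels per instance

# Assumption: every attribute is either an input feature or a label.
assert inputs + labels == attributes

# Assumption: density is defined as cardinality / number of labels.
density = cardinality / labels

print(f"density = {density:.4f}")
```

Under these assumptions the computed density agrees with the reported value of 0.0121 to four decimal places, which suggests the summary is internally consistent.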

Citation

Charte, Francisco; Rivera, Antonio J.; del Jesus, Maria J.; Herrera, Francisco (2015). QUINTA: A question tagging assistant to improve the answering ratio in electronic forums. In EUROCON 2015 - International Conference on Computer as a Tool (EUROCON), IEEE, 1-6.
@inproceedings{,
  title="QUINTA: A question tagging assistant to improve the answering ratio in electronic forums",
  author="Charte, Francisco and Rivera, Antonio J. and del Jesus, Maria J. and Herrera, Francisco",
  booktitle="EUROCON 2015 - International Conference on Computer as a Tool (EUROCON), IEEE",
  year="2015",
  pages="1-6",
  month="Sept"
}

Concurrence plot

In this concurrence plot, sectors represent labels and links between them depict label co-occurrences. SCUMBLE is a measure designed to assess the concurrence among imbalanced labels.