Ueda, N.; Saito, K. (2002). Parametric mixture models for multi-labeled text. In Advances in neural information processing systems, 721--728.
yahoo_science
mldr.datasets::get.mldr("yahoo_science")
Summary
Instances | 6428 |
---|---|
Attributes | 37227 |
Inputs | 37187 |
Labels | 40 |
Labelsets | 457 |
Single labelsets | 252 |
Max frequency | 1200 |
Cardinality | 1.4498 |
Density | 0.0362 |
Mean IR | 52.6318 |
SCUMBLE | 0.0575 |
TCS | 20.3373 |
Citation
Ueda, N.; Saito, K. (2002). Parametric mixture models for multi-labeled text. In Advances in neural information processing systems, 721--728.
@inproceedings{,
title="Parametric mixture models for multi-labeled text",
author="Ueda, N. and Saito, K.",
booktitle="Advances in neural information processing systems",
pages="721--728",
year="2002"
}
Concurrence plot
In this concurrence plot, sectors represent labels and links between them depict label co-occurrences. SCUMBLE is a measure designed to assess the concurrence among imbalanced labels.