Ueda, N.; Saito, K. (2002). Parametric mixture models for multi-labeled text. In Advances in neural information processing systems, 721--728.
yahoo_reference
mldr.datasets::get.mldr("yahoo_reference")
Summary
Instances | 8027 |
---|---|
Attributes | 39712 |
Inputs | 39679 |
Labels | 33 |
Labelsets | 275 |
Single labelsets | 144 |
Max frequency | 3038 |
Cardinality | 1.1744 |
Density | 0.0356 |
Mean IR | 461.8628 |
SCUMBLE | 0.0486 |
TCS | 19.7019 |
Citation
Ueda, N.; Saito, K. (2002). Parametric mixture models for multi-labeled text. In Advances in neural information processing systems, 721--728.
@inproceedings{,
title="Parametric mixture models for multi-labeled text",
author="Ueda, N. and Saito, K.",
booktitle="Advances in neural information processing systems",
pages="721--728",
year="2002"
}
Concurrence plot
In this concurrence plot, sectors represent labels and links between them depict label co-occurrences. SCUMBLE is a measure designed to assess the concurrence among imbalanced labels.