Abstract
Data reduction plays an important role in machine learning and pattern recognition with a high-dimensional data. In real-world applications data usually exists with hybrid formats, and a unified data reducing technique for hybrid data is desirable. In this paper, an information measure is proposed to computing discernibility power of a crisp equivalence relation or a fuzzy one, which is the key concept in classical rough set model and fuzzy-rough set model. Based on the information measure, a general definition of significance of nominal, numeric and fuzzy attributes is presented. We redefine the independence of hybrid attribute subset, reduct, and relative reduct. Then two greedy reduction algorithms for unsupervised and supervised data dimensionality reduction based on the proposed information measure are constructed. Experiments show the reducts found by the proposed algorithms get a better performance compared with classical rough set approaches.
| Original language | English |
|---|---|
| Pages (from-to) | 414-423 |
| Number of pages | 10 |
| Journal | Pattern Recognition Letters |
| Volume | 27 |
| Issue number | 5 |
| DOIs | |
| State | Published - 1 Apr 2006 |
Keywords
- Attribute reduction
- Fuzzy-rough set
- Hybrid data
- Information measure
Fingerprint
Dive into the research topics of 'Information-preserving hybrid data reduction based on fuzzy-rough techniques'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver