A hierarchical multi-class classification system for face and text datasets
Files
Date
2025-06-20
Embargo
Advisor
Coadvisor
Journal Title
Journal ISSN
Volume Title
Publisher
Frontiers Media S.A.
Language
English
Alternative Title
Abstract
In an era of rapidly growing multimedia data, the need for robust and efficient classification systems has become critical, specifically the identification of class names and poses or styles. This study provides an understanding of the organization of data, and feature selection (i.e., edge) using the k-means segmentation technique is explained. Furthermore, for the optimization of features, the linear regression technique is used. The optimized features can be directly used with classifiers, but to reduce the noise, outliers are identified and removed from the training data. The classifiers are involved in training and recognizing the face or text class label. After the prediction of class labels, the distance matrix-based technique is used to identify the style or pose name. Finally, the experiments are conducted with the help of the ORL dataset (40 classes and 10 poses in each class) and character dataset (36 characters and 10 font styles in each character). The experimental results indicated that the proposed methodology accurately classifies hierarchically organized data and demonstrates superiority over KNN and Bayesian-based classification when compared to support vector machine (SVM). The system provides classification outcomes with up to 100% accuracy for outlier-removed data, and up to 98% for basic features. Unlike traditional flat classification approaches, our system leverages hierarchical structures to enhance classification accuracy, scalability, and interpretability.
Keywords
Data mining, support vector machine, Bayes classifier, k-nearest neighbor, machine learning
Document Type
Journal article
Version
Publisher Version
Citation
Saini, A., Gill, N. S., Gulia, P., Singh, K., & Moreira, F. (2025). A hierarchical multi-class classification system for face and text datasets. Frontiers in Computer Science, 7, 1550453, 1-12. https://doi.org/10.3389/fcomp.2025.1550453. Repositório Institucional UPT. https://hdl.handle.net/11328/6401
Identifiers
TID
Designation
Access Type
Open Access