Interpretable classification of Wiki-review streams

dc.contributor.author: García-Méndez, Silvia
dc.contributor.author: Malheiro, Benedita
dc.contributor.author: Burguillo-Rial, Juan Carlos
dc.contributor.author: Leal, Fátima
dc.date.accessioned: 2023-12-18T17:44:45Z
dc.date.available: 2023-12-18T17:44:45Z
dc.date.issued: 2023-12-13
dc.description.abstract: Wiki articles are created and maintained by a crowd of editors, producing a continuous stream of reviews. Reviews can take the form of additions, reverts, or both. This crowdsourcing model is exposed to manipulation since neither reviews nor editors are automatically screened and purged. To protect articles against vandalism or damage, the stream of reviews can be mined to classify reviews and profile editors in real time. The goal of this work is to anticipate and explain which reviews to revert. This way, editors are informed why their edits will be reverted. The proposed method employs stream-based processing, updating the profiling and classification models on each incoming event. The profiling uses side and content-based features employing Natural Language Processing, and editor profiles are incrementally updated based on their reviews. Since the proposed method relies on self-explainable classification algorithms, it is possible to understand why a review has been classified as a revert or a non-revert. In addition, this work contributes an algorithm for generating synthetic data for class balancing, making the final classification fairer. The proposed online method was tested with a real data set from Wikivoyage, which was balanced through the aforementioned synthetic data generation. The results attained near-90% values for all evaluation metrics (accuracy, precision, recall, and F-measure).
dc.identifier.citation: García-Méndez, S., Leal, F., Malheiro, B., & Burguillo-Rial, J. C. (2023). Interpretable classification of Wiki-review streams. IEEE Access, (Published online: 13 December 2023), 1-15. 10.1109/ACCESS.2023.3342472. Repositório Institucional UPT. https://hdl.handle.net/11328/5292
dc.identifier.doi: 10.1109/ACCESS.2023.3342472
dc.identifier.issn: 2169-3536
dc.identifier.uri: https://hdl.handle.net/11328/5292
dc.language.iso: eng
dc.publisher: IEEE
dc.relation.hasversion: https://ieeexplore.ieee.org/document/10356073
dc.rights: restricted access
dc.rights.uri: http://creativecommons.org/licenses/by/4.0/
dc.subject: Data reliability and fairness
dc.subject: Data-stream processing and classification
dc.subject: Synthetic data
dc.subject: Transparency
dc.subject: Vandalism
dc.subject: Wikis
dc.title: Interpretable classification of Wiki-review streams
dc.type: journal article
dspace.entity.type: Publication
oaire.citation.endPage: 15
oaire.citation.issue: Published online: 13 December 2023
oaire.citation.startPage: 1
oaire.citation.title: IEEE Access
person.affiliation.name: REMIT – Research on Economics, Management and Information Technologies
person.familyName: Leal
person.givenName: Fátima
person.identifier.ciencia-id: 2211-3EC7-B4B6
person.identifier.orcid: 0000-0003-4418-2590
person.identifier.rid: Y-3460-2019
person.identifier.scopus-author-id: 57190765181
relation.isAuthorOfPublication: 8066078f-1e30-4b0a-aa84-3b6a2af4185c
relation.isAuthorOfPublication.latestForDiscovery: 8066078f-1e30-4b0a-aa84-3b6a2af4185c

Files

Main
Name: Interpretable_Classification_of_Wiki-Review_Streams.pdf
Size: 2 MB
Format: Adobe Portable Document Format