Simulation, modelling and classification of wiki contributors: Spotting the good, the bad, and the ugly

García-Méndez, Silvia; Malheiro, Benedita; Burguillo-Rial, Juan Carlos; Veloso, Bruno; Chis, Adriana E.; González-Vélez, Horacio; Leal, Fátima

dc.contributor.author	García-Méndez, Silvia
dc.contributor.author	Malheiro, Benedita
dc.contributor.author	Burguillo-Rial, Juan Carlos
dc.contributor.author	Veloso, Bruno
dc.contributor.author	Chis, Adriana E.
dc.contributor.author	González-Vélez, Horacio
dc.contributor.author	Leal, Fátima
dc.date.accessioned	2022-06-27T10:56:39Z
dc.date.available	2022-06-27T10:56:39Z
dc.date.issued	2022-06
dc.description.abstract	Data crowdsourcing is a data acquisition process where groups of voluntary contributors feed platforms with highly relevant data ranging from news, comments, and media to knowledge and classifications. It typically processes user-generated data streams to provide and refine popular services such as wikis, collaborative maps, e-commerce sites, and social networks. Nevertheless, this modus operandi raises severe concerns regarding ill-intentioned data manipulation in adversarial environments. This paper presents a simulation, modelling, and classification approach to automatically identify human and non-human (bots) as well as benign and malign contributors by using data fabrication to balance classes within experimental data sets, data stream modelling to build and update contributor profiles and, finally, autonomic data stream classification. By employing WikiVoyage – a free worldwide wiki travel guide open to contribution from the general public – as a testbed, our approach proves to significantly boost the confidence and quality of the classifier by using a class-balanced data stream, comprising both real and synthetic data. Our empirical results show that the proposed method distinguishes between benign and malign bots as well as human contributors with a classification accuracy of up to 92 %.	pt_PT
dc.identifier.citation	García-Méndez, S., Leal, F., Malheiro, B., Burguillo-Rial, J. C., Veloso, B., Chis, A. E., & González-Vélez, H. (2022). Simulation, modelling and classification of wiki contributors: Spotting the good, the bad, and the ugly. Simulation Modelling Practice and Theory, 120, 102616, 1-13. https://doi.org/10.1016/j.simpat.2022.102616. Repositório Institucional UPT. http://hdl.handle.net/11328/4289	pt_PT
dc.identifier.doi	https://doi.org/10.1016/j.simpat.2022.102616	pt_PT
dc.identifier.issn	1569-190X (Print)
dc.identifier.uri	http://hdl.handle.net/11328/4289
dc.language.iso	eng	pt_PT
dc.peerreviewed	yes	pt_PT
dc.publisher	Elsevier	pt_PT
dc.rights	open access	pt_PT
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/	pt_PT
dc.subject	Classification	pt_PT
dc.subject	Data reliability	pt_PT
dc.subject	Stream processing	pt_PT
dc.subject	Synthetic data	pt_PT
dc.subject	Data fabrication	pt_PT
dc.subject	Wiki contributors	pt_PT
dc.title	Simulation, modelling and classification of wiki contributors: Spotting the good, the bad, and the ugly	pt_PT
dc.type	journal article	pt_PT
degois.publication.firstPage	1	pt_PT
degois.publication.lastPage	13	pt_PT
degois.publication.title	Simulation Modelling Practice and Theory	pt_PT
degois.publication.volume	120	pt_PT
dspace.entity.type	Publication	en
person.affiliation.name	REMIT – Research on Economics, Management and Information Technologies
person.familyName	Leal
person.givenName	Fátima
person.identifier.ciencia-id	2211-3EC7-B4B6
person.identifier.orcid	0000-0003-4418-2590
person.identifier.rid	Y-3460-2019
person.identifier.scopus-author-id	57190765181
relation.isAuthorOfPublication	8066078f-1e30-4b0a-aa84-3b6a2af4185c
relation.isAuthorOfPublication.latestForDiscovery	8066078f-1e30-4b0a-aa84-3b6a2af4185c

Files

Original bundle

Name: SIMPAT 2022.pdf
Size: 1023.84 KB
Format: Adobe Portable Document Format

Name: Imagem1.png
Size: 219.08 KB
Format: Portable Network Graphics

Collections

REMIT - Artigos em Revistas Internacionais / Papers in International Journals