Automatic POI Matching using an Outlier Detection Based Approach

dc.contributor.authorGomes, Rui Jorge Reis
dc.contributor.authorAlmeida, Alexandre
dc.contributor.authorAlves, Ana
dc.date.accessioned2025-07-11T09:50:12Z
dc.date.available2025-07-11T09:50:12Z
dc.date.issued2018-10-05
dc.description.abstractPoints of Interest (POI) are widely used in many applications nowadays mainly due to the increasing amount of related data available online, notably from volunteered geographic information (VGI) sources. Being able to connect these data from different sources is useful for many things like validating, cor- recting and also removing duplicated data in a database. However, there is no standard way to identify the same POIs across different sources and doing it manually could be very expensive. Therefore, automatic POI matching has been an attractive research topic. In our work, we propose a novel data-driven machine learning approach based on an outlier detection algorithm to match POIs automatically. Surprisingly, works that have been presented so far do not use data-driven machine learning approaches. The reason for this might be that such approaches need a training dataset to be constructed by manually matching some POIs. To mitigate this, we have taken advantage of the Crosswalk API, available at the time we started our project, which allowed us to retrieve already matched POI data from different sources in US territory. We trained and tested our model with a dataset containing Factual, Facebook and Foursquare POIs from New York City and were able to successfully apply it to another dataset of Facebook and Foursquare POIs from Porto, Portugal, finding matches with an accuracy around 95%. These are encouraging results that confirm our approach as an effective way to address the problem of automatically matching POIs. They also show that such a model can be trained with data available from multiple sources and be applied to other datasets with different locations from those used in training. Furthermore, as a data-driven machine learning approach, the model can be continuously improved by adding new validated data to its training dataset
dc.identifier.citationAlmeida, A., Alves, A., Gomes, R. (2018). Automatic POI Matching Using an Outlier Detection Based Approach. In W. Duivesteijn, A. Siebes, A. Ukkonen (Eds.), Advances in Intelligent Data Analysis XVII: 17th International Symposium, IDA 2018 Proceedings. Lecture Notes in Computer Science, ’s-Hertogenbosch, Netherlands, 24-26 October 2018, (vol. 11191, pp. 40-51). https://doi.org/10.1007/978-3-030-01768-2_4. Repositório Institucional UPT. https://hdl.handle.net/11328/6446
dc.identifier.isbn978-3-030-01767-5
dc.identifier.isbn978-3-030-01768-2
dc.identifier.urihttps://hdl.handle.net/11328/6446
dc.language.isoeng
dc.publisherSpringer
dc.relation.hasversionhttps://doi.org/10.1007/978-3-030-01768-2_4
dc.rightsrestricted access
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectMachine Learning
dc.subjectOutlier Detection
dc.subjectPoint-Of-Interest
dc.subjectGIS
dc.subject.fosCiências Naturais - Ciências da Computação e da Informação
dc.subject.ods11 - sustainable cities and communities
dc.titleAutomatic POI Matching using an Outlier Detection Based Approach
dc.typeconference paper
dcterms.referenceshttps://link.springer.com/chapter/10.1007/978-3-030-01768-2_4
dspace.entity.typePublication
oaire.citation.conferenceDate2018-10-24
oaire.citation.conferencePlace’s-Hertogenbosch, Netherlands
oaire.citation.endPage51
oaire.citation.startPage40
oaire.citation.titleAdvances in Intelligent Data Analysis XVII: 17th International Symposium, IDA 2018 Proceedings
oaire.citation.volume11191
oaire.versionhttp://purl.org/coar/version/c_ab4af688f83e57aa
person.affiliation.nameREMIT – Research on Economics, Management and Information Technologies
person.familyNameGomes
person.givenNameRui Jorge Reis
person.identifier.ciencia-idBD1D-F316-C1AA
person.identifier.gsidhttps://scholar.google.pt/citations?user=SAHc0xsAAAAJ&hl=pt-PT
person.identifier.orcid0000-0001-7233-0736
person.identifier.ridN-7429-2018
person.identifier.scopus-author-id55938890400
relation.isAuthorOfPublication0f0e295b-09de-4caa-9534-42d59e6b94a2
relation.isAuthorOfPublication.latestForDiscovery0f0e295b-09de-4caa-9534-42d59e6b94a2

Files

Original bundle

Now showing 1 - 1 of 1
Name:
ida_2018.pdf
Size:
203.66 KB
Format:
Adobe Portable Document Format