Automatic POI Matching using an Outlier Detection Based Approach

Date

2018-10-05

Publisher

Springer

Language

English

Abstract

Points of Interest (POIs) are widely used in many applications, mainly because of the increasing amount of related data available online, notably from volunteered geographic information (VGI) sources. Connecting these data across different sources is useful for tasks such as validating, correcting and deduplicating the records in a database. However, there is no standard way to identify the same POI across different sources, and doing so manually can be very expensive. Automatic POI matching has therefore become an attractive research topic. In our work, we propose a novel data-driven machine learning approach, based on an outlier detection algorithm, to match POIs automatically. Surprisingly, the works presented so far do not use data-driven machine learning approaches. One possible reason is that such approaches need a training dataset, which has to be constructed by manually matching some POIs. To mitigate this, we took advantage of the Crosswalk API, available at the time we started our project, which allowed us to retrieve already matched POI data from different sources in the United States. We trained and tested our model on a dataset containing Factual, Facebook and Foursquare POIs from New York City and were able to apply it successfully to another dataset of Facebook and Foursquare POIs from Porto, Portugal, finding matches with an accuracy of around 95%. These encouraging results confirm our approach as an effective way to address the problem of automatically matching POIs. They also show that such a model can be trained with data available from multiple sources and applied to datasets covering locations other than those used in training. Furthermore, as a data-driven machine learning approach, the model can be continuously improved by adding new validated data to its training dataset.

Keywords

Machine Learning, Outlier Detection, Point-Of-Interest, GIS

Document type

Conference paper

Citation

Almeida, A., Alves, A., Gomes, R. (2018). Automatic POI Matching Using an Outlier Detection Based Approach. In W. Duivesteijn, A. Siebes, A. Ukkonen (Eds.), Advances in Intelligent Data Analysis XVII: 17th International Symposium, IDA 2018 Proceedings. Lecture Notes in Computer Science, ’s-Hertogenbosch, Netherlands, 24-26 October 2018 (vol. 11191, pp. 40-51). https://doi.org/10.1007/978-3-030-01768-2_4. Repositório Institucional UPT. https://hdl.handle.net/11328/6446

Identifiers

978-3-030-01767-5
978-3-030-01768-2

Access type

Restricted access
