Search and integration of business license data is a key process for many entities, whether to consult a reliable source of data on potential business partners or for studies related to urban development. Despite being available as open data, currently there is no established data standard and there is no free platform to consult them. In this direction, this work in progress uses open data from business licenses provided by the Municipality of Curitiba and data from Federal Revenue of Brazil to carry out the search for Legal Entity data. Through the new approach, using entity matching with Sørensen-Dice and Jaccard Similarity algorithms, the objective is to improve the textual search and present the result through a web interface.
Guillen, Bruno; de Santana, Gabriel V.; and Kozievitch, Nádia P., "Dados de Alvarás - Uma Abordagem de Integração para Busca Textual" (2023). ISLA 2023 Proceedings. 5.