Search Engine for Open Geospatial Consortium Web Services Improving Discoverability through Natural Language Processing-Based Processing and Ranking
-
Published:2024-04-12
Issue:4
Volume:13
Page:128
-
ISSN:2220-9964
-
Container-title:ISPRS International Journal of Geo-Information
-
language:en
-
Short-container-title:IJGI
Author:
Ferrari Elia1, Striewski Friedrich1, Tiefenbacher Fiona1, Bereuter Pia1ORCID, Oesch David2, Di Donato Pasquale2
Affiliation:
1. Institute of Geomatics, FHNW University of Applied Sciences and Arts Northwestern Switzerland, 4132 Muttenz, Switzerland 2. Federal Office of Topography Swisstopo, 3084 Wabern, Switzerland
Abstract
The improvement of search engines for geospatial data on the World Wide Web has been a subject of research, particularly concerning the challenges in discovering and utilizing geospatial web services. Despite the establishment of standards by the Open Geospatial Consortium (OGC), the implementation of these services varies significantly among providers, leading to issues in dataset discoverability and usability. This paper presents a proof of concept for a search engine tailored to geospatial services in Switzerland. It addresses challenges such as scraping data from various OGC web service providers, enhancing metadata quality through Natural Language Processing, and optimizing search functionality and ranking methods. Semantic augmentation techniques are applied to enhance metadata completeness and quality, which are stored in a high-performance NoSQL database for efficient data retrieval. The results show improvements in dataset discoverability and search relevance, with NLP-extracted information contributing significantly to ranking accuracy. Overall, the GeoHarvester proof of concept demonstrates the feasibility of improving the discoverability and usability of geospatial web services through advanced search engine techniques.
Funder
Federal Coordination Body for Geoinformation Swiss Conference of Directors of Construction, Planning and Environment
Reference45 articles.
1. Ma, J., Co, J.E., and Quintanilla, A. (2010, January 5–7). A Semantic Index Structure for Integrating OGC Services in a Spatial Search Engine. Proceedings of the 2010 IEEE Conference on Open Systems (ICOS 2010), Kuala Lumpur, Malaysia. 2. De la Beaujardiere, J. (2023, November 11). OpenGIS® Web Map Server Implementation Specification 2006. Available online: https://portal.ogc.org/files/?artifact_id=14416. 3. Maso, J., Pomakis, K., and Julià, N. (2023, November 11). OpenGIS® Web Map Tile Service Implementation Standard 2010. Available online: https://portal.ogc.org/files/?artifact_id=35326. 4. Vretanos, P.A. (2023, November 11). Web Feature Service Implementation Specification 2005. Available online: https://portal.ogc.org/files/?artifact_id=8339. 5. Yue, P., Di, L., Zhao, P., Yang, W., Yu, G., and Wei, Y. (August, January 31). Semantic Augmentations for Geospatial Catalogue Service. Proceedings of the 2006 IEEE International Symposium on Geoscience and Remote Sensing, Denver, CO, USA.
|
|