Integrating deep learning image analysis into Web GIS applications: A Hybrid Flask - Spring Boot architecture for automated place detection
Article Sidebar
Issue Vol. 38 (2026)
-
Comparative analysis of Next.js and Astro frameworks
Patryk Gieda, Marek Miłosz1-5
-
Behavioral analysis of ransomware threats to ESXi Hypervisors: a machine learning-based predictive model
Upakar Bhatta6-10
-
Performance and usability evaluation of a VR virtual museum application
Kamil Gabrysiewicz, Krzysztof Dziedzic11-18
-
Comparative analysis of selected data visualization methods
Damian Węcławski, Radosław Tomczyk, Paweł Powroźnik19-25
-
A performance comparison of web programming interfaces: GraphQL, gRPC and Thrift
Piotr Rożek, Mariusz Dzieńkowski26-31
-
A review of security mechanisms in electronic payment systems
Omniah ALibrahim, Suhair Alshehri32-42
-
Clustering methods in machine learning
Bartłomiej Głuszczak, Paweł Powroźnik43-50
-
Comparative analysis of interpretable artificial intelligence methods
Aleksandra Kuszewska, Małgorzata Charytanowicz51-58
-
Comparative analysis of machine learning classifiers
Łukasz Krukowski, Grzegorz Kozieł59-65
-
Analysis of the impact of machine learning algorithms on the quality of generated sounds
Krzysztof Pedrycz, Mateusz Pikula66-72
-
Comparative analysis of the functionalities of applications supporting the self-control process of anticoagulation therapy
Marcin Furmaga, Vitalii Baida73-80
-
Application of machine learning for predicting Formula 1 race results
Sylwia Krzysztoń, Jakub Smołka81-86
-
Analysis of latency reduction and performance improvement methods in selected VR applications
Mateusz Czapczyński, Krzysztof Dziedzic87-94
-
Integrating deep learning image analysis into Web GIS applications: A Hybrid Flask - Spring Boot architecture for automated place detection
Medjon HYSENAJ95-101
Main Article Content
DOI
Sustainable Development Goals (SDG)
- Industry, Innovation, Technology and Infrastructure
Authors
Abstract
This paper presents a hybrid web GIS architecture that integrates deep learning–based image recognition with a robust spatial data management interface. Developed using a combination of Spring Boot (Java), Leaflet (JavaScript), and PostgreSQL/PostGIS, the application provides users with a rich interface for route planning, layer visualization, and CRUD operations on categorized places and groups. At the core of this study is the novel integration of a lightweight Python Flask microservice, which leverages OpenAI’s CLIP model and EXIF metadata extraction to automate the classification and geolocation of uploaded images. This hybrid system allows users to add new places either manually by entering coordinates, selecting categories, and uploading an image or automatically, using Smart Detection mode. In this second mode, the image becomes the primary input source, from which the application semantically infers the appropriate group (e.g., museum, park, church) and extracts GPS coordinates from the image’s EXIF data. This dual-mode input architecture enhances both user flexibility and data accuracy while demonstrating a practical fusion of deep learning and GIS through modern web frameworks.
Keywords:
References
[1] F. Su, et al., Advancing intelligent geography: Current status, innovations, and future prospects, Geogr. Sustain. 5 (2025) 100375, https://doi.org/10.1016/j.geosus.2025.100375. DOI: https://doi.org/10.1016/j.geosus.2025.100375
[2] J. Estima, M. Painho, Investigating the potential of OpenStreetMap for land use/land cover production: A case study for continental Portugal, in OpenStreetMap in GIScience, Springer, 2015, 273-293. DOI: https://doi.org/10.1007/978-3-319-14280-7_14
[3] M. Haklay, C. Tobón, Usability evaluation and PPGIS: Towards a user-centred design approach, Int. J. Geogr. Inf. Sci. 17(6) (2003) 577-592, https://doi.org/10.1080/1365881031000114107. DOI: https://doi.org/10.1080/1365881031000114107
[4] R. Sieber, Public participation geographic information systems: A literature review and framework, Ann. Assoc. Am. Geogr. 96(3) (2006) 491-507, https://doi.org/10.1111/j.1467-8306.2006.00702.x. DOI: https://doi.org/10.1111/j.1467-8306.2006.00702.x
[5] V. Antoniou, A. Skopeliti, Measures and indicators of VGI quality: An overview, ISPRS Ann. Photogramm. Remote Sens. Spatial Inf. Sci. II-3/W5 (2015) 345-351, https://doi.org/10.5194/isprsannals-II-3-W5-345-2015. DOI: https://doi.org/10.5194/isprsannals-II-3-W5-345-2015
[6] X. X. Zhu, et al., Deep learning in remote sensing: A comprehensive review and list of resources, IEEE Geosci. Remote Sens. Mag. 5(4) (2017) 8-36, https://doi.org/10.1109/MGRS.2017.2762307. DOI: https://doi.org/10.1109/MGRS.2017.2762307
[7] A. Radford, et al., Learning transferable visual models from natural language supervision, Proc. 38th Int. Conf. Mach. Learn. PMLR 139 (2021) 8748-8763, https://proceedings.mlr.press/v139/radford21a.html.
[8] G. Goh, N. Cammarata, C. Voss, S. Carter, M. Petrov, L. Schubert, A. Radford, C. Olah, Multimodal neurons in artificial neural networks, Distill 6(3) (2021) e30, https://doi.org/10.23915/distill.00030. DOI: https://doi.org/10.23915/distill.00030
[9] U. Stilla, Y. Xu, Change detection of urban objects using 3D point clouds: A review, ISPRS J. Photogramm. Remote Sens. 197 (2023) 228-255, https://doi.org/10.1016/j.isprsjprs.2023.01.010. DOI: https://doi.org/10.1016/j.isprsjprs.2023.01.010
[10] P. Harvey, ExifTool by Phil Harvey [Software] (2022), https://exiftool.org/.
[11] M. F. Goodchild, Citizens as sensors: The world of volunteered geography, GeoJournal 69(4) (2007) 211-221, https://doi.org/10.1007/s10708-007-9111-y. DOI: https://doi.org/10.1007/s10708-007-9111-y
[12] R. Kitchin, The real-time city? Big data and smart urbanism, GeoJournal 79(1) (2013) 1-14, https://doi.org/10.1007/s10708-013-9516-8. DOI: https://doi.org/10.1007/s10708-013-9516-8
[13] Z. Fan, T. Pei, T. Ma, Y. Du, C. Song, Z. Liu, C. Zhou, Estimation of urban crowd flux based on mobile phone location data: A case study of Beijing, China, Comput. Environ. Urban Syst. 69 (2018) 114-123, https://doi.org/10.1016/j.compenvurbsys.2018.01.005. DOI: https://doi.org/10.1016/j.compenvurbsys.2018.01.005
[14] C. Walls, Spring Boot in Action, Manning Publications, 2015, ISBN 9781617292545.
[15] V. Agafonkin, Leaflet.js: An open-source JavaScript library for interactive maps (2010), https://leafletjs.com/.
[16] R. O. Obe, L. S. Hsu, PostGIS in Action, 2nd ed., Manning Publications, 2015, ISBN 9781617291395.
[17] S. Li, et al., Geospatial big data handling theory and methods: A review and research challenges, ISPRS J. Photogramm. Remote Sens. 115 (2016) 119-133, https://doi.org/10.1016/j.isprsjprs.2015.10.012. DOI: https://doi.org/10.1016/j.isprsjprs.2015.10.012
[18] A. Dosovitskiy, et al., An image is worth 16×16 words: Transformers for image recognition at scale, arXiv preprint arXiv:2010.11929 (2020), https://arxiv.org/abs/2010.11929.
[19] A. Çöltekin, et al., Geospatial information visualization and extended reality displays, In: H. Guo, M. F. Goodchild, A. Annoni (eds.) Manual of Digital Earth, Springer (2020) 229–277, https://doi.org/10.1007/978-981-32-9915-3_7. DOI: https://doi.org/10.1007/978-981-32-9915-3_7
[20] A. Vali, S. Comai, M. Matteucci, Deep learning for land use and land cover classification based on hyperspectral and multispectral Earth observation data: A review, Remote Sens. 12(15) (2020) 2495, https://doi.org/10.3390/rs12152495. DOI: https://doi.org/10.3390/rs12152495
[21] T. Rattenbury, N. Good, M. Naaman, Towards automatic extraction of event and place semantics from Flickr tags, Proc. 30th Annu. Int. ACM SIGIR Conf. Res. Dev. Inf. Retr. ACM (2007) 103-110, https://doi.org/10.1145/1277741.1277762. DOI: https://doi.org/10.1145/1277741.1277762
[22] Z. Wang, J. Liu, Y. Wang, A survey of vision-language pretrained models, arXiv preprint arXiv:2202.10936 (2022), https://arxiv.org/abs/2202.10936.
Article Details
Abstract views: 3

