GeoTool-GPT: a trainable method for facilitating Large Language Models to master GIS tools
Cheng Wei, Yifan Zhang, Xinru Zhao, Ziyi Zeng, Zhiyun Wang, Jianfeng Lin, Qingfeng Guan, Wenhao Yu
a School of Geography and Information Engineering, China University of Geosciences, Wuhan, China
b Meituan, Beijing, China
c National Engineering Research Center for Geographic Information System, China University of Geosciences, Wuhan, China
Cheng Wei is a master's student in the School of Geography and Information Engineering, China University of Geosciences, Wuhan, China (CUG). His research interests include GeoAI and natural language processing techniques. His contribution includes conceptualization, methodology, data curation, software, visualization, writing – original draft, writing – review and editing, validation and supervision.
Yifan Zhang is a Ph.D. student in the School of Geography and Information Engineering, China University of Geosciences, Wuhan, China (CUG). His research interests include GeoAI and map generalization. His contribution includes methodology, data curation, software, writing – original draft, writing – review and editing and supervision.
Xinru Zhao is a master's student in the School of Geography and Information Engineering, China University of Geosciences, Wuhan, China (CUG). Her research interests include GeoAI and deep learning. Her contribution includes data curation and validation.
Ziyi Zeng is a Ph.D. student in the School of Geography and Information Engineering, China University of Geosciences, Wuhan, China (CUG). Her research interests include GeoAI and LLMs. Her contribution includes data curation and validation.
Zhiyun Wang is an undergraduate student in the School of Geography and Information Engineering, China University of Geosciences, Wuhan, China (CUG). Her research interests include GeoAI and deep learning. Her contribution includes data curation and validation.
Jianfeng Lin is the leader of the cycling data mining team at Meituan, Beijing, China. His research interests include artificial intelligence and multimodal learning. His contribution includes supervision and project administration.
Qingfeng Guan is a professor at China University of Geosciences, Wuhan, China (CUG). His research interests include high-performance spatial computing, spatial computational intelligence, and spatiotemporal big data. His contribution includes supervision and project administration.
Wenhao Yu received the B.S. and Ph.D. degrees in Geoinformatics from Wuhan University, Wuhan, China, in 2010 and 2015, respectively. He is a professor at China University of Geosciences, Wuhan, China (CUG). His research interests include spatial data mining, map generalization, and LLMs. His contribution includes conceptualization, writing – original draft, writing – review and editing, supervision, validation, project administration and funding acquisition.
DOI: https://doi.org/10.1080/13658816.2024.2438937
2024-12-13
International Journal of Geographical Information Science
Abstract: Large Language Models (LLMs) excel at natural language tasks such as text generation and question answering (Q&A). To further expand their application, recent efforts focus on enabling LLMs to use real-world tools. However, their tool-use ability in professional GIS remains underexplored because of two main challenges. First, LLMs are usually trained on general-domain corpora and lack sufficient, comprehensive GIS-specific data to align them with professional knowledge, including an understanding of what GIS tools do. Second, solving geospatial tasks often requires combining multiple GIS tools. To address these challenges, we propose a trainable method that enables LLMs to master GIS tools. We curated a comprehensive set of resources: instruction-response data (GeoTool, 1950 instructions) to improve LLMs' understanding of GIS tools, instruction-solution data (GeoSolution, 3645 instructions) to improve their ability to generate tool-use solutions for geospatial tasks, and annotated instruction-solution evaluation data (GeoTask, 300 instructions) to evaluate LLMs' GIS tool-use proficiency. Using the collected training data (GeoTool and GeoSolution), we fine-tuned a professional-domain LLM, GeoTool-GPT, from an open-source general-domain LLM, the LLaMA-2-7b model. Experiments on the evaluation data validate the effectiveness of our method in enhancing the tool-use ability of general-domain LLMs in the professional GIS domain, with the performance of our model closely approaching that of GPT-4.
Geography, Physical; Computer Science, Information Systems; Information Science & Library Science