Abstract: Large language models (LLMs) have demonstrated remarkable capabilities on a variety of tasks, and integrating them into Internet of Things (IoT) applications has recently drawn much research attention. Due to security concerns, many institutions avoid state-of-the-art commercial LLM services and instead need to deploy and use open-source LLMs in a local network setting. However, open-source LLMs are usually more limited in performance, for example in arithmetic calculation and reasoning, and practical systems that apply LLMs to IoT have yet to be well explored. Therefore, in this study we propose an LLM-based Generative IoT (GIoT) system deployed in a local network setting. To alleviate the limitations of LLMs and provide service with competitive performance, we apply prompt engineering methods to enhance the capabilities of open-source LLMs, and we design a Prompt Management Module and a Post-processing Module to manage the tailored prompts for different tasks and to process the results generated by the LLMs. To demonstrate the effectiveness of the proposed system, we discuss a challenging Table Question Answering (Table-QA) task as a case study, since tabular data is usually more challenging than plain text because of its complex structure, heterogeneous data types, and sometimes huge size. We conduct comprehensive experiments on two popular Table-QA datasets, and the results show that our proposal achieves competitive performance compared with state-of-the-art LLMs, demonstrating that the proposed LLM-based GIoT system can provide competitive performance with tailored prompting methods and is easily extensible to new tasks without training.

TELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex Tasks

Tasks People Prompt: A Taxonomy of LLM Downstream Tasks in Software Verification and Falsification Approaches

Comparative Analysis of Prompt Strategies for Large Language Models: Single-Task vs. Multitask Prompts

The language of prompting: What linguistic properties make a prompt successful?

State of What Art? A Call for Multi-Prompt LLM Evaluation

Intuitive or Dependent? Investigating LLMs' Behavior Style to Conflicting Prompts

A Survey of Prompt Engineering Methods in Large Language Models for Different NLP Tasks

Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models Aligned with Human Cognitive Principles

Which is better? Exploring Prompting Strategy For LLM-based Metrics

Benchmarking LLMs on the Semantic Overlap Summarization Task

A Better LLM Evaluator for Text Generation: The Impact of Prompt Output Sequencing and Optimization

Efficient Prompting for LLM-based Generative Internet of Things

Prompt Design Matters for Computational Social Science Tasks but in Unpredictable Ways

SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types

On The Role of Prompt Construction In Enhancing Efficacy and Efficiency of LLM-Based Tabular Data Generation

PromptBench: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts

Prompting LLMs to Compose Meta-Review Drafts from Peer-Review Narratives of Scholarly Manuscripts

StudentEval: A Benchmark of Student-Written Prompts for Large Language Models of Code

Sweeping Heterogeneity with Smart MoPs: Mixture of Prompts for LLM Task Adaptation

An Empirical Categorization of Prompting Techniques for Large Language Models: A Practitioner's Guide