AHCI RESEARCH GROUP
Publications
Papers published in international journals,
proceedings of conferences, workshops and books.
OUR RESEARCH
Scientific Publications
How to
Here you can find the complete list of our publications.
You can use the tag cloud to select only the papers dealing with specific research topics.
You can expand the Abstract, Links and BibTex record for each paper.
You can use the tag cloud to select only the papers dealing with specific research topics.
You can expand the Abstract, Links and BibTex record for each paper.
2025
Yang, G.; Sun, Z.; Wang, Y.
ShellBox: Adversarially Enhanced LLM-Interactive Honeypot Framework Journal Article
In: IEEE Access, vol. 13, pp. 143618–143630, 2025, ISSN: 21693536 (ISSN), (Publisher: Institute of Electrical and Electronics Engineers Inc.).
Abstract | Links | BibTeX | Tags: Complex networks, Dynamic error, Dynamic error simulation, Dynamics, Error simulation, Errors, honeypot, Honeypots, Interaction history, Interaction history pruning algorithm, Language Model, Large language model, Multi-turn, Network attack, Network attack and multi-turn, Network Security, Pruning algorithms, Systems analysis, Virtual Reality
@article{yang_shellbox_2025,
title = {ShellBox: Adversarially Enhanced LLM-Interactive Honeypot Framework},
author = {G. Yang and Z. Sun and Y. Wang},
url = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-105013277136&doi=10.1109%2FACCESS.2025.3598779&partnerID=40&md5=f776206ec2788fc77e3766f209bc82f1},
doi = {10.1109/ACCESS.2025.3598779},
issn = {21693536 (ISSN)},
year = {2025},
date = {2025-01-01},
journal = {IEEE Access},
volume = {13},
pages = {143618–143630},
abstract = {Honeypot technology is an active defence strategy designed to mitigate the asymmetry inherent in network attacks and defence dynamics. In recent years, honeypot systems powered by large language models (LLMs) have become a focal point of research owing to their ability to simulate complex network environments and generate highly deceptive virtual assets. However, response inconsistency in multi-turn dialogues and prompt injection vulnerabilities inherent to LLMs significantly reduce the deceptive capability of honeypots. This study first defines the threat model, and then introduces a relevance-based interaction history pruning algorithm and dynamic error simulation strategy to mitigate these challenges. Considering practical issues such as response latency and network instability, our experiments were conducted using multiple locally deployed open-source LLMs. The experimental results demonstrated that the proposed dynamic error simulation mechanism achieved a maximum accuracy of 81.63% for the DeepSeek-R1 model. Furthermore, applying the interaction history pruning algorithm improved the turn-level coherence score (TCS) by 34.5% compared with the baseline. Finally, this paper outlines potential future research directions for LLM-based honeypot technologies in active multi-turn mechanisms. © 2025 Elsevier B.V., All rights reserved.},
note = {Publisher: Institute of Electrical and Electronics Engineers Inc.},
keywords = {Complex networks, Dynamic error, Dynamic error simulation, Dynamics, Error simulation, Errors, honeypot, Honeypots, Interaction history, Interaction history pruning algorithm, Language Model, Large language model, Multi-turn, Network attack, Network attack and multi-turn, Network Security, Pruning algorithms, Systems analysis, Virtual Reality},
pubstate = {published},
tppubtype = {article}
}
Honeypot technology is an active defence strategy designed to mitigate the asymmetry inherent in network attacks and defence dynamics. In recent years, honeypot systems powered by large language models (LLMs) have become a focal point of research owing to their ability to simulate complex network environments and generate highly deceptive virtual assets. However, response inconsistency in multi-turn dialogues and prompt injection vulnerabilities inherent to LLMs significantly reduce the deceptive capability of honeypots. This study first defines the threat model, and then introduces a relevance-based interaction history pruning algorithm and dynamic error simulation strategy to mitigate these challenges. Considering practical issues such as response latency and network instability, our experiments were conducted using multiple locally deployed open-source LLMs. The experimental results demonstrated that the proposed dynamic error simulation mechanism achieved a maximum accuracy of 81.63% for the DeepSeek-R1 model. Furthermore, applying the interaction history pruning algorithm improved the turn-level coherence score (TCS) by 34.5% compared with the baseline. Finally, this paper outlines potential future research directions for LLM-based honeypot technologies in active multi-turn mechanisms. © 2025 Elsevier B.V., All rights reserved.