Journal of Multi Disciplinary Engineering Technologies
Volume 19 • Issue 01 • Published: Dec 2025 • ISSN (Print): 0974-1771 • ISSN (Online): 2581-9372

Realistic Log Generation through Large Language Models: Bridging Data Gaps in Log Analysis

Athira Gopal1, Supriya Bajpai2*

1 Former student, Computational and Data Science Department, Indian Institute of Science (IISC), Bengaluru, Karnataka, India.
2 Former PhD student, IIT Bombay & Monash University, IIT Bombay, Mumbai, Maharashtra, India.
*Corresponding author(s): supriyamishra39@gmail.com
Contributing authors: athira.anudarshan@gmail.com

Abstract

System logs are very useful for monitoring, debugging and diagnosing failures of large systems but they tend to be sensitive and thus not easy to share and make use of in a research. We propose a solution to this problem: to produce synthetic logs information of sufficiently good quality such that the structure and the statistical distribution of the real logs and privacy information are preserved. The strategy has three steps. The former part is masking the sensitive material on the logs by using a multi-pass algorithm of anonymization that generates a dictionary of sensitive words and substitutes them with generic names. Second, using the anonymized logs synthetic logs are generated using large language models (LLMs) and to ensure continuity between chunks summaries of previously generated synthetic log chunks are used and reflective verification loops also. Lastly, the produced logs are fed through a multi-agent debating model which is automated and functions using an LLM pipeline to achieve the structural and contextual correctness. With the help of this model, organizations can generate viable, interchangeable logs without revealing information that is confidential that can be used to aid in the completion of tasks such as log enquiry, failure enquiry, and root-cause research.

Keywords

Generative AI
AI agent
Synthetic log generation
Large Language Models

Article information

Journal: Journal of Multi Disciplinary Engineering Technologies
Volume / Issue: 19 / 01
Published: Dec 2025
ISSN (Print): 0974-1771
ISSN (Online): 2581-9372