A Keyword-Guided Training Approach to Large Language Models for Judicial Document Generation
Journal
CMES - Computer Modeling in Engineering and Sciences
Journal Volume
145
Journal Issue
3
Start Page
3969
End Page
3992
ISSN
1526-1506
Date Issued
2025-12-23
Author(s)
Peng, Yi-Ting
Abstract
The rapid advancement of Large Language Models (LLMs) has enabled their application across diverse professional domains, including law. However, research on automatic judicial document generation remains limited, particularly for Taiwanese courts. This study proposes a keyword-guided training framework that enhances LLMs’ ability to generate structured and semantically coherent judicial decisions in Chinese. The proposed method first employs LLMs to extract representative legal keywords from actual court judgments, and then integrates these keywords into Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback with Proximal Policy Optimization (RLHF-PPO). Experimental evaluations on models such as Chinese Alpaca 7B and TAIDE-LX-7B demonstrate that keyword-guided training significantly improves generation quality, achieving ROUGE-1, ROUGE-2, and ROUGE-L gains of up to 17%, 16%, and 20%, respectively. The results confirm that the proposed framework effectively aligns generated judgments with the logic and structural conventions of human-written legal texts. By providing empirical evidence that domain-adaptive fine-tuning can significantly improve performance on complex, structured legal text generation, this research establishes a technical foundation for AI-assisted judicial document generation in the Taiwanese legal context.
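The abstract reports gains in ROUGE-1, ROUGE-2, and ROUGE-L, the standard n-gram overlap metrics for comparing generated text against a human-written reference. As a minimal illustration of what such scores measure, the sketch below implements ROUGE-N F1 in pure Python; the example sentences and function name are illustrative only, and actual evaluations would use an established library implementation.

```python
# Minimal sketch of the ROUGE-N F-measure: n-gram overlap between a
# candidate (model-generated) text and a human-written reference.
from collections import Counter


def rouge_n(reference: list[str], candidate: list[str], n: int) -> float:
    """ROUGE-N F1 score over token lists."""
    def ngrams(tokens: list[str], n: int) -> Counter:
        # Multiset of all contiguous n-grams in the token sequence.
        return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

    ref, cand = ngrams(reference, n), ngrams(candidate, n)
    if not ref or not cand:
        return 0.0
    overlap = sum((ref & cand).values())  # clipped n-gram matches
    recall = overlap / sum(ref.values())
    precision = overlap / sum(cand.values())
    if recall + precision == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Illustrative English stand-ins for judgment text:
ref = "the court dismisses the appeal".split()
gen = "the court dismisses this appeal".split()
print(round(rouge_n(ref, gen, 1), 3))  # prints 0.8 (4 of 5 unigrams match)
```

ROUGE-L, also reported in the abstract, replaces fixed-length n-grams with the longest common subsequence, rewarding matches in the same relative order even when not contiguous.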
Subjects
generative AI
large language models
legal AI
legal document generation
natural language processing
Publisher
Tech Science Press
Type
journal article
