While Large Language Models (LLMs) have shown great promise in transforming credit risk assessment, existing evaluation frameworks focus almost exclusively on general financial NLP tasks and neglect the specific reasoning required by practitioners. To address this gap, we develop the Credit Context Log Understanding and Prediction Evaluation (CCLUPE) benchmark. Unlike previous benchmarks, CCLUPE aims to capture and evaluate the intricate reasoning unique to each constituent of the Chinese credit market, where evaluation hinges on the integration and synthesis of complex transaction logs and the prediction of hidden financial behaviors. CCLUPE comprises more than 4,000 high-quality samples, segmented into individual and micro-enterprise customers and distributed across 7 principal log types and 16 sub-log types. A comprehensive review process involving more than 20 professional annotators ensures the quality of the dataset. Moreover, we introduce Log-Score, a novel evaluation metric designed to incorporate penalties for log misunderstanding and to assess multifaceted competencies. Even state-of-the-art models underperform on these high-stakes tasks. CCLUPE thus serves as a rigorous testbed for the next generation of financial LLMs, supporting their robust deployment in complex real-world credit scenarios.