This repository contains the benchmark datasets for evaluating convergent and divergent thinking capabilities in large language models, as described in our paper accepted at the KDD 2025 Workshop on Generative AI for Evaluation.
Reasoning Beyond the Obvious: Evaluating Divergent and Convergent Thinking in LLMs for Financial Scenarios
Zhuang Qiang Bok, Watson Wei Khong Chua
convergent_thinking_dataset/- Dataset and evaluation framework for convergent thinking tasksdivergent_thinking_dataset/- Dataset and evaluation framework for divergent thinking tasks
Each dataset folder contains its own README with detailed information about the specific benchmark, evaluation metrics, and usage instructions.
If you use this dataset in your research, please cite our paper:
@misc{bokandchua2025reasoningobvious,
title={Reasoning Beyond the Obvious: Evaluating Divergent and Convergent Thinking in LLMs for Financial Scenarios},
author={Zhuang Qiang Bok and Watson Wei Khong Chua},
year={2025},
eprint={2507.18368},
archivePrefix={arXiv},
primaryClass={cs.AI},
url={https://arxiv.org/abs/2507.18368},
}This work is licensed under Creative Commons.