CLA-QA: An Expert-Annotated Benchmark of Patient Questions and LLM Responses for Complex Lymphatic Anomalies

Keywords: Large Language Models, Automated Evaluation, Natural Language Processing, Rare Diseases, Vascular Anomaly

Zhao, Min

doi:10.7936/6rxs-108301

2025

Description

The CLA-QA dataset contains 25 common patient questions about Complex Lymphatic Anomalies (CLAs), 175 responses to those questions generated by seven large language models (LLMs), and accuracy scores assigned to the responses by three board-certified clinical experts on a 5-point Likert scale. The dataset was developed to support research on automated methods for evaluating LLM-generated free-text responses about rare diseases. It serves as a benchmark for comparing traditional NLP similarity metrics and LLM-based evaluation against expert physician judgment.
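One way such a comparison might be set up is to average the three expert scores for each response and rank-correlate that average with an automated metric. The sketch below is illustrative only: the record does not describe the file layout, so the field names `expert_scores` and `metric` are hypothetical, the rows are made-up placeholders rather than CLA-QA values, and Spearman correlation is an assumed choice, not necessarily the authors' method.

```python
from statistics import mean


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j across the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        tied_rank = (i + j) / 2 + 1  # mean of the 1-based positions i+1..j+1
        for k in range(i, j + 1):
            ranks[order[k]] = tied_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation; constant inputs (zero variance) are not handled."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Made-up placeholder rows, NOT values from CLA-QA. Each row stands for one
# LLM response: three expert Likert scores (1-5) and one automated metric.
toy_rows = [
    {"expert_scores": [5, 4, 5], "metric": 0.81},
    {"expert_scores": [2, 3, 2], "metric": 0.40},
    {"expert_scores": [4, 4, 3], "metric": 0.66},
    {"expert_scores": [1, 2, 1], "metric": 0.35},
]

expert_mean = [mean(row["expert_scores"]) for row in toy_rows]
metric = [row["metric"] for row in toy_rows]
rho = spearman(expert_mean, metric)
print(f"Spearman rho on toy data: {rho:.3f}")
```

A rank-based correlation is used here because Likert scores are ordinal; with the real files, the toy rows would be replaced by the 175 scored responses.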

Details

Title

CLA-QA: An Expert-Annotated Benchmark of Patient Questions and LLM Responses for Complex Lymphatic Anomalies

Creator

Zhao, Min (Washington University in St. Louis)

Corresponding Author

Sisk, Bryan, siskb@wustl.edu

Contributor

Zhao, Min, Data Curator (Washington University in St. Louis)
Oh, Inez, Researcher (Washington University in St. Louis)
Gupta, Aditi, Researcher (Washington University in St. Louis)
Cohen-Cutler, Sally, Producer (Children's Hospital of Philadelphia)
Harmoney, Kathryn, Producer (University of New Mexico)
Lai, Albert, Researcher (Washington University in St. Louis)
Sisk, Bryan, Producer (Washington University in St. Louis)

Subject

Computer and information sciences
Health sciences

Keywords

Large Language Models, Automated Evaluation, Natural Language Processing, Rare Diseases, Vascular Anomaly

Resource Type

Dataset

Data Type

Tabular

Published Date

2025-10-15

Publisher

Washington University in St. Louis

DOI

https://doi.org/10.7936/6rxs-108301

License

Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0), https://creativecommons.org/licenses/by-nc/4.0/

Coverage Dates

Collected: 2025-03-05/2025-06-24

Funding Source

Orphan Disease Center, University of Pennsylvania, Million Dollar Bike Ride program
Alvin J. Siteman Cancer Center, Pedal the Cause
St. Louis Children's Hospital, Siteman Kids

Collection

WashU Researcher Data
All RDM Records

Language

English

Record ID

108301
