Leveraging Large Language Models to Effectively Generate Visual Data for Canine Musculoskeletal Diagnoses

dc.contributor.authorThißen, Martin
dc.contributor.authorTran, Thi Ngoc Diep
dc.contributor.authorEsteve Ratsch, Barbara
dc.contributor.authorSchönbein, Ben Joel
dc.contributor.authorTrapp, Ute
dc.contributor.authorEgner, Beate
dc.contributor.authorPiat, Romana
dc.contributor.authorHergenröther
dc.contributor.editorSkala, Václav
dc.date.accessioned2025-07-30T08:47:31Z
dc.date.available2025-07-30T08:47:31Z
dc.date.issued2025
dc.description.abstract-translatedIt is well-established that more data generally improves AI model performance. However, data collection can be challenging for certain tasks due to the rarity of occurrences or high costs. These challenges are evident in our use case, where we apply AI models to a novel approach for visually documenting the musculoskeletal condition of dogs. Here, abnormalities are marked as colored strokes on a body map of the dog. Since these strokes correspond to distinct muscles or joints, they can be mapped to the textual domain in which large language models (LLMs) operate. LLMs have recently demonstrated impressive capabilities across a wide range of tasks, including medical applications, offering promising potential for generating synthetic training data. In this work, we investigate whether LLMs can effectively generate synthetic visual training data for canine musculoskeletal diagnoses. For this, we developed a mapping that segments visual documentations into over 200 labeled regions representing muscles or joints. Using techniques like guided decoding, chain-of-thought reasoning, and few-shot prompting, we generated 1,000 synthetic visual documentations for patellar luxation (kneecap dislocation) diagnosis, the diagnosis for which we have the most real-world data. Our analysis shows that the generated documentations are sensitive to location and severity of the diagnosis while remaining independent of the dog’s sex. However, they were also independent of the patient’s weight and age, highlighting some limitations. We further generated 1,000 visual documentations for various other diagnoses to create a binary classification dataset. A model trained solely on this synthetic data achieved an F1 score of 88% on 70 real-world documentations. These results demonstrate the potential of LLM-generated synthetic data, which is particularly valuable for addressing data scarcity in rare diseases or rare instances in general. While our methodology is tailored to the medical domain, the insights and techniques can be adapted to other fields.en
dc.format12 s.cs
dc.format.mimetypeapplication/pdf
dc.identifier.doihttp://www.doi.org/10.24132/CSRN.2025-3
dc.identifier.issn2464-4617 (Print)
dc.identifier.issn2464-4625 (online)
dc.identifier.urihttp://hdl.handle.net/11025/62209
dc.language.isoenen
dc.publisherVaclav Skala - UNION Agencyen
dc.rights© Vaclav Skala - UNION Agencyen
dc.rights.accessopenAccessen
dc.subjectgenerativní umělá inteligencecs
dc.subjectmodely velkých jazyků (LLM)cs
dc.subjectsyntetická data pro veterinární medicínucs
dc.subject.translatedgenerative AIen
dc.subject.translatedLarge Language Models (LLMs)en
dc.subject.translatedsynthetic data for veterinary medicineen
dc.titleLeveraging Large Language Models to Effectively Generate Visual Data for Canine Musculoskeletal Diagnosesen
dc.typekonferenční příspěvekcs
dc.typeconferenceObjecten
dc.type.statusPeer revieweden
dc.type.versionpublishedVersionen
local.files.count1*
local.files.size8553624*
local.has.filesyes*

Files

Original bundle
Showing 1 - 1 out of 1 results
No Thumbnail Available
Name:
A53.pdf
Size:
8.16 MB
Format:
Adobe Portable Document Format
License bundle
Showing 1 - 1 out of 1 results
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: