Adapting LLMs for the Medical Domain in Portuguese: A Study on Fine-Tuning and Model Evaluation

Paiola, Paiola

Adapting LLMs for the Medical Domain in Portuguese: A Study on Fine-Tuning and Model Evaluation

dc.contributor.author	Paiola, Paiola
dc.contributor.author	Garcia, Gabriel Lino
dc.contributor.author	Manesco, João Renato Ribeiro
dc.contributor.author	Roder, Mateus
dc.contributor.author	Rodrigues, Douglas
dc.contributor.author	Papa, João Paulo
dc.contributor.editor	Skala, Václav
dc.date.accessioned	2025-07-30T10:46:47Z
dc.date.available	2025-07-30T10:46:47Z
dc.date.issued	2025
dc.description.abstract-translated	This study evaluates the performance of large language models (LLMs) as medical agents in Portuguese, aiming to develop a reliable and relevant virtual assistant for healthcare professionals. The HealthCareMagic-100k-en and MedQuAD datasets, translated from English using GPT-3.5, were used to fine-tune the ChatBode-7B model using the PEFT-QLoRA method. The InternLM2 model, with initial training on medical data, presented the best overall performance, with high precision and adequacy in metrics such as accuracy, completeness, and safety. However, DrBode models, derived from ChatBode, exhibited a phenomenon of catastrophic forgetting of acquired medical knowledge. Despite this, these models performed frequently or even better in grammaticality and coherence. A significant challenge was low inter-rater agreement, highlighting the need for more robust assessment protocols. This work paves the way for future research, such as evaluating multilingual models specific to the medical field, improving the quality of training data, and developing more consistent evaluation methodologies for the medical field.	en
dc.format	4 s.	cs
dc.format.mimetype	application/pdf
dc.identifier.doi	http://www.doi.org/10.24132/CSRN.2025-37
dc.identifier.issn	2464-4617 (Print)
dc.identifier.issn	2464-4625 (online)
dc.identifier.uri	http://hdl.handle.net/11025/62246
dc.language.iso	en	en
dc.publisher	Vaclav Skala - UNION Agency	en
dc.rights	© Vaclav Skala - UNION Agency	en
dc.rights.access	openAccess	en
dc.subject	rozsáhlé jazykové modely	cs
dc.subject	jemné doladění	cs
dc.subject	virtuální lékařský asistent	cs
dc.subject	brazilská portugalština	cs
dc.subject.translated	large language models	en
dc.subject.translated	fine-tuning	en
dc.subject.translated	virtual medical assistant	en
dc.subject.translated	Brazilian Portuguese	en
dc.title	Adapting LLMs for the Medical Domain in Portuguese: A Study on Fine-Tuning and Model Evaluation	en
dc.type	konferenční příspěvek	cs
dc.type	conferenceObject	en
dc.type.status	Peer reviewed	en
dc.type.version	publishedVersion	en
local.files.count	1	*
local.files.size	747911	*
local.has.files	yes	*

Files

Original bundle

Showing 1 - 1 out of 1 results

Name:: A13.pdf
Size:: 730.38 KB
Format:: Adobe Portable Document Format

Download

License bundle

Showing 1 - 1 out of 1 results

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

WSCG 2025: Full Papers Proceedings