Adapting LLMs for the Medical Domain in Portuguese: A Study on Fine-Tuning and Model Evaluation
| dc.contributor.author | Paiola, Pedro Henrique | |
| dc.contributor.author | Garcia, Gabriel Lino | |
| dc.contributor.author | Manesco, João Renato Ribeiro | |
| dc.contributor.author | Roder, Mateus | |
| dc.contributor.author | Rodrigues, Douglas | |
| dc.contributor.author | Papa, João Paulo | |
| dc.contributor.editor | Skala, Václav | |
| dc.date.accessioned | 2025-07-30T10:46:47Z | |
| dc.date.available | 2025-07-30T10:46:47Z | |
| dc.date.issued | 2025 | |
| dc.description.abstract-translated | This study evaluates the performance of large language models (LLMs) as medical agents in Portuguese, aiming to develop a reliable and relevant virtual assistant for healthcare professionals. The HealthCareMagic-100k-en and MedQuAD datasets, translated from English using GPT-3.5, were used to fine-tune the ChatBode-7B model with the PEFT-QLoRA method. The InternLM2 model, whose initial training included medical data, achieved the best overall performance, scoring highly on metrics such as accuracy, completeness, and safety. However, the DrBode models, derived from ChatBode, exhibited catastrophic forgetting of previously acquired medical knowledge. Despite this, these models performed comparably or even better in grammaticality and coherence. A significant challenge was low inter-rater agreement, highlighting the need for more robust assessment protocols. This work paves the way for future research, such as evaluating multilingual models specific to the medical field, improving the quality of training data, and developing more consistent evaluation methodologies. | en |
| dc.format | 4 s. | cs |
| dc.format.mimetype | application/pdf | |
| dc.identifier.doi | http://www.doi.org/10.24132/CSRN.2025-37 | |
| dc.identifier.issn | 2464-4617 (Print) | |
| dc.identifier.issn | 2464-4625 (online) | |
| dc.identifier.uri | http://hdl.handle.net/11025/62246 | |
| dc.language.iso | en | en |
| dc.publisher | Vaclav Skala - UNION Agency | en |
| dc.rights | © Vaclav Skala - UNION Agency | en |
| dc.rights.access | openAccess | en |
| dc.subject | rozsáhlé jazykové modely | cs |
| dc.subject | jemné doladění | cs |
| dc.subject | virtuální lékařský asistent | cs |
| dc.subject | brazilská portugalština | cs |
| dc.subject.translated | large language models | en |
| dc.subject.translated | fine-tuning | en |
| dc.subject.translated | virtual medical assistant | en |
| dc.subject.translated | Brazilian Portuguese | en |
| dc.title | Adapting LLMs for the Medical Domain in Portuguese: A Study on Fine-Tuning and Model Evaluation | en |
| dc.type | konferenční příspěvek | cs |
| dc.type | conferenceObject | en |
| dc.type.status | Peer reviewed | en |
| dc.type.version | publishedVersion | en |
| local.files.count | 1 | * |
| local.files.size | 747911 | * |
| local.has.files | yes | * |