On self-supervision in historical handwritten document segmentation
| dc.contributor.author | Baloun, Josef | |
| dc.contributor.author | Prantl, Martin | |
| dc.contributor.author | Lenc, Ladislav | |
| dc.contributor.author | Martínek, Jiří | |
| dc.contributor.author | Král, Pavel | |
| dc.date.accessioned | 2026-03-31T18:05:25Z | |
| dc.date.available | 2026-03-31T18:05:25Z | |
| dc.date.issued | 2025 | |
| dc.date.updated | 2026-03-31T18:05:25Z | |
| dc.description.abstract | Historical document analysis plays a crucial role in understanding and preserving our past. However, this task is oftenhindered by challenges such as limited annotated training data and the diverse nature of historical handwritten documents. Inthis paper,we explore the potential of self-supervised learning (SSL) in historical document analysis,with a particular focus onhistorical handwritten document segmentation, to overcome the need for extensive annotated data while enhancing efficiencyand robustness. We present an overview of SSL methods suitable for historical document analysis and discuss their potentialapplications and benefits. Furthermore, we present an approach for SSL in the document domain, considering various setups,augmentations, and resolutions. We also provide experimental results that demonstrate its feasibility and effectiveness. Ourfindings indicate that most document segmentation tasks can be effectively addressed using SSL features, highlighting thepotential of SSL to advance historical document analysis and pave the way for more efficient and robust document processingworkflows. | en |
| dc.format | 16 | |
| dc.identifier.document-number | 001520730500001 | |
| dc.identifier.doi | 10.1007/s10032-025-00538-6 | |
| dc.identifier.issn | 1433-2833 | |
| dc.identifier.obd | 43946890 | |
| dc.identifier.orcid | Baloun, Josef 0000-0003-1923-5355 | |
| dc.identifier.orcid | Prantl, Martin 0000-0002-7900-5028 | |
| dc.identifier.orcid | Lenc, Ladislav 0000-0002-1066-7269 | |
| dc.identifier.orcid | Martínek, Jiří 0000-0003-2981-1723 | |
| dc.identifier.orcid | Král, Pavel 0000-0002-3096-675X | |
| dc.identifier.uri | http://hdl.handle.net/11025/67479 | |
| dc.language.iso | en | |
| dc.project.ID | EH23_021/0008436 | |
| dc.relation.ispartofseries | International Journal on Document Analysis and Recognition | |
| dc.rights.access | A | |
| dc.subject | historical handwritten document | en |
| dc.subject | self-supervised learning | en |
| dc.subject | document digitization | en |
| dc.subject | semantic segmentation | en |
| dc.title | On self-supervision in historical handwritten document segmentation | en |
| dc.type | Článek v databázi WoS (Jimp) | |
| dc.type | ČLÁNEK | |
| dc.type.status | Published Version | |
| local.files.count | 1 | * |
| local.files.size | 13302218 | * |
| local.has.files | yes | * |
| local.identifier.eid | 2-s2.0-105009863832 |
Files
Original bundle
1 - 1 out of 1 results
No Thumbnail Available
- Name:
- s10032-025-00538-6.pdf
- Size:
- 12.69 MB
- Format:
- Adobe Portable Document Format
License bundle
1 - 1 out of 1 results
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.71 KB
- Format:
- Item-specific license agreed upon to submission
- Description: