On self-supervision in historical handwritten document segmentation
| dc.contributor.author | Baloun, Josef | |
| dc.contributor.author | Prantl, Martin | |
| dc.contributor.author | Lenc, Ladislav | |
| dc.contributor.author | Martínek, Jiří | |
| dc.contributor.author | Král, Pavel | |
| dc.date.accessioned | 2026-02-18T10:43:30Z | |
| dc.date.available | 2026-02-18T10:43:30Z | |
| dc.date.issued | 2025 | |
| dc.description.abstract-translated | Historical document analysis plays a crucial role in understanding and preserving our past. However, this task is often hindered by challenges such as limited annotated training data and the diverse nature of historical handwritten documents. In this paper, we explore the potential of self-supervised learning (SSL) in historical document analysis, with a particular focus on historical handwritten document segmentation, to overcome the need for extensive annotated data while enhancing efficiency and robustness. We present an overview of SSL methods suitable for historical document analysis and discuss their potential applications and benefits. Furthermore, we present an approach for SSL in the document domain, considering various setups, augmentations, and resolutions. We also provide experimental results that demonstrate its feasibility and effectiveness. Our findings indicate that most document segmentation tasks can be effectively addressed using SSL features, highlighting the potential of SSL to advance historical document analysis and pave the way for more efficient and robust document processing workflows. | en |
| dc.description.sponsorship | EH23_021/0008436, SGS-2025-02 | cs |
| dc.format | 16 s. | cs |
| dc.identifier.uri | http://hdl.handle.net/11025/64671 | |
| dc.language.iso | en | en |
| dc.publisher | Springer | en |
| dc.rights | © CC BY 4.0 | en |
| dc.rights.access | openAccess | en |
| dc.subject | historický ručně psaný dokument | cs |
| dc.subject | samostudium | cs |
| dc.subject | digitalizace dokumentů | cs |
| dc.subject | sémantická segmentace | cs |
| dc.subject.translated | historical handwritten document | en |
| dc.subject.translated | self-supervised learning | en |
| dc.subject.translated | document digitization | en |
| dc.subject.translated | semantic segmentation | en |
| dc.title | On self-supervision in historical handwritten document segmentation | en |
| dc.type | article | en |
| dc.type | článek | cs |
| dc.type.status | Peer reviewed | en |
| dc.type.version | publishedVersion | en |
| local.files.count | 1 | * |
| local.files.size | 13302218 | * |
| local.has.files | yes | * |
Files
Original bundle
- Name: s10032-025-00538-6.pdf
- Size: 12.69 MB
- Format: Adobe Portable Document Format
License bundle
- Name: license.txt
- Size: 1.71 KB
- Format: Item-specific license agreed upon to submission