FungiTastic: A Multi-Modal Dataset and Benchmark for Image Categorization

dc.contributor.authorPicek, Lukáš
dc.contributor.authorJanouskova, Klara
dc.contributor.authorČermák, Vojtěch
dc.contributor.authorMatas, Jiří
dc.date.accessioned2026-05-03T18:05:47Z
dc.date.available2026-05-03T18:05:47Z
dc.date.issued2025
dc.date.updated2026-05-03T18:05:47Z
dc.description.abstractWe introduce a new, challenging benchmark and a dataset, FungiTastic, based on fungal records continuously collected over a twenty-year span. The dataset is labelled and curated by experts and consists of about 350k multimodal observations of 6k fine-grained categories (species). The fungi observations include photographs and additional data, e.g., meteorological and climatic data, satellite images, and body part segmentation masks. FungiTastic is one of the few benchmarks that include a test set with DNA-sequenced ground truth of unprecedented label reliability. The benchmark is designed to support (i) standard closed-set classification, (ii) open-set classification, (iii) multi-modal classification, (iv) few-shot learning, (v) domain shift, and many more. We provide tailored baselines for many use cases, a multitude of ready-to-use pre-trained models on HuggingFace, and a framework for model training. The documentation and the baselines are available at GitHub and Kaggle.en
dc.format11
dc.identifier.doi10.1109/CVPRW67362.2025.00192
dc.identifier.isbn979-8-3315-9994-2
dc.identifier.issn2160-7508
dc.identifier.obd43947619
dc.identifier.orcidPicek, Lukáš 0000-0002-6041-9722
dc.identifier.urihttp://hdl.handle.net/11025/67974
dc.language.isoen
dc.project.IDSS73020004
dc.publisherIEEE Computer Society
dc.relation.ispartofseries2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2025
dc.subjectclassificationen
dc.subjectdataseten
dc.subjectfew-shoten
dc.subjectfungien
dc.subjectmultimodalen
dc.subjectopen-seten
dc.subjectsegmentationen
dc.titleFungiTastic: A Multi-Modal Dataset and Benchmark for Image Categorizationen
dc.typeStať ve sborníku (D)
dc.typeSTAŤ VE SBORNÍKU
dc.type.statusPublished Version
local.files.count1*
local.files.size1272962*
local.has.filesyes*
local.identifier.eid2-s2.0-105017853201

Files

Original bundle
Showing 1 - 1 out of 1 results
No Thumbnail Available
Name:
FungiTastic_A_Multi-Modal_Dataset_and_Benchmark_for_Image_Categorization.pdf
Size:
1.21 MB
Format:
Adobe Portable Document Format
License bundle
Showing 1 - 1 out of 1 results
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: