Adjusting BERT’s Pooling Layer for Large-Scale Multi-Label Text Classification
| dc.contributor.author | Lehečka, Jan | |
| dc.contributor.author | Švec, Jan | |
| dc.contributor.author | Ircing, Pavel | |
| dc.contributor.author | Šmídl, Luboš | |
| dc.date.accessioned | 2021-02-22T11:00:20Z | |
| dc.date.available | 2021-02-22T11:00:20Z | |
| dc.date.issued | 2020 | |
| dc.description.abstract-translated | In this paper, we present our experiments with BERT models in the task of Large-scale Multi-label Text Classification (LMTC). In the LMTC task, each text document can have multiple class labels, while the total number of classes is in the order of thousands. We propose a pooling layer architecture on top of BERT models, which improves the quality of classification by using information from the standard [CLS] token in combination with pooled sequence output. We demonstrate the improvements on Wikipedia datasets in three different languages using public pre-trained BERT models. | en |
| dc.format | 8 s. | cs |
| dc.format.mimetype | application/pdf | |
| dc.identifier.citation | LEHEČKA, J., ŠVEC, J., IRCING, P., ŠMÍDL, L. Adjusting BERT’s Pooling Layer for Large-Scale Multi-Label Text Classification. In: Text, Speech, and Dialogue 23rd International Conference, TSD 2020, Brno, Czech Republic, September 8-11, 2020, Proceedings. Cham: Springer, 2020. s. 214-221. ISBN 978-3-030-58322-4, ISSN 0302-9743. | cs |
| dc.identifier.doi | 10.1007/978-3-030-58323-1_23 | |
| dc.identifier.isbn | 978-3-030-58322-4 | |
| dc.identifier.issn | 0302-9743 | |
| dc.identifier.obd | 43930358 | |
| dc.identifier.uri | 2-s2.0-85091136861 | |
| dc.identifier.uri | http://hdl.handle.net/11025/42716 | |
| dc.language.iso | en | en |
| dc.project.ID | DG18P02OVV016/Vývoj centralizovaného rozhraní pro vytěžování velkých dat z webových archivů | cs |
| dc.publisher | Springer | en |
| dc.relation.ispartofseries | Text, Speech, and Dialogue 23rd International Conference, TSD 2020, Brno, Czech Republic, September 8-11, 2020, Proceedings | en |
| dc.rights | Plný text není přístupný. | cs |
| dc.rights | © Springer | en |
| dc.rights.access | closedAccess | en |
| dc.subject.translated | Text classification, BERT model | en |
| dc.title | Adjusting BERT’s Pooling Layer for Large-Scale Multi-Label Text Classification | en |
| dc.type | konferenční příspěvek | cs |
| dc.type | conferenceObject | en |
| dc.type.status | Peer-reviewed | en |
| dc.type.version | publishedVersion | en |