Finding similar movies: dataset, tools, and methods
| dc.contributor.author | Leng, Hongkun | |
| dc.contributor.author | De La Cruz Paulino, Caleb | |
| dc.contributor.author | Haider, Momina | |
| dc.contributor.author | Lu, Rui | |
| dc.contributor.author | Zhou, Zhehui | |
| dc.contributor.author | Mengshoel, Ole | |
| dc.contributor.author | Brodin, Per-Erik | |
| dc.contributor.author | Forgeat, Julien | |
| dc.contributor.author | Jude, Alvin | |
| dc.contributor.editor | Skala, Václav | |
| dc.date.accessioned | 2019-05-14T06:27:59Z | |
| dc.date.available | 2019-05-14T06:27:59Z | |
| dc.date.issued | 2018 | |
| dc.description.abstract | Recommender systems are becoming ubiquitous in online commerce as well as in video-on-demand (VOD) and music streaming services. A popular form of giving recommendations is to base them on a currently selected product (or items), and provide “More Like This,” “Items Similar to This,” or “People Who Bought This also Bought” functionality. These recommendations are based on similarity computations, also known as item-item similarity computations. Such computations are typically implemented by heuristic algorithms, which may not match the perceived item-item similarity of users. In contrast, we study in this paper a data-driven approach to similarity for movies using labels crowdsourced from a previous work. Specifically, we develop four similarity methods and investigate how user-contributed labels can be used to improve similarity computations to better match user perceptions in movie recommendations. These four methods were tested against the best known method with a user experiment (n = 114) using the MovieLens 20M dataset. Our experiment showed that all our supervised methods beat the unsupervised benchmark and the differences were both statistically and practically significant. This paper’s main contributions include user evaluation of similarity methods for movies, user-contributed labels indicating movie similarities, and code for the annotation tool which can be found at http://MovieSim.org. | en |
| dc.format | 10 s. | cs |
| dc.format.mimetype | application/pdf | |
| dc.identifier.citation | WSCG '2018: short communications proceedings: The 26th International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision 2016 in co-operation with EUROGRAPHICS: University of West Bohemia, Plzen, Czech Republic May 28 - June 1 2018, p. 115-124. | en |
| dc.identifier.doi | https://doi.org/10.24132/CSRN.2018.2802.15 | |
| dc.identifier.isbn | 978-80-86943-41-1 | |
| dc.identifier.issn | 2464-4617 | |
| dc.identifier.uri | wscg.zcu.cz/WSCG2018/!!_CSRN-2802.pdf | |
| dc.identifier.uri | http://hdl.handle.net/11025/34663 | |
| dc.language.iso | en | en |
| dc.publisher | Václav Skala - UNION Agency | en |
| dc.relation.ispartofseries | WSCG '2018: short communications proceedings | en |
| dc.rights | © Václav Skala - UNION Agency | cs |
| dc.rights.access | openAccess | en |
| dc.subject | doporučující systémy | cs |
| dc.subject | podobnost položek | cs |
| dc.subject | crowdsourcing | cs |
| dc.subject | učení pod dohledem | cs |
| dc.subject | MovieLens | cs |
| dc.subject.translated | recommender systems | en |
| dc.subject.translated | item-item similarity | en |
| dc.subject.translated | crowdsourcing | en |
| dc.subject.translated | supervised learning | en |
| dc.subject.translated | MovieLens | en |
| dc.title | Finding similar movies: dataset, tools, and methods | en |
| dc.type | konferenční příspěvek | cs |
| dc.type | conferenceObject | en |
| dc.type.status | Peer-reviewed | en |
| dc.type.version | publishedVersion | en |
Files
Original bundle
1 - 1 out of 1 results
No Thumbnail Available
- Name:
- Leng.pdf
- Size:
- 1.55 MB
- Format:
- Adobe Portable Document Format
- Description:
- Plný text
License bundle
1 - 1 out of 1 results
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.71 KB
- Format:
- Item-specific license agreed upon to submission
- Description: