CUDA expression templates
| dc.contributor.author | Wiemann, Paul | |
| dc.contributor.author | Wenger, Stephan | |
| dc.contributor.author | Magnor, Marcus | |
| dc.contributor.editor | Baranoski, Gladimir | |
| dc.contributor.editor | Skala, Václav | |
| dc.date.accessioned | 2014-03-20T11:03:58Z | |
| dc.date.available | 2014-03-20T11:03:58Z | |
| dc.date.issued | 2011 | |
| dc.description.abstract | Many algorithms require vector algebra operations such as the dot product, vector norms or component-wise manipulations. Especially for large-scale vectors, the efficiency of algorithms depends on an efficient implementation of those calculations. The calculation of vector operations benefits from the continually increasing chip level parallelism on graphics hardware. Very efficient basic linear algebra libraries like CUBLAS make use of the parallelism provided by CUDA-enabled GPUs. However, existing libraries are often not intuitively to use and programmers may shy away from working with cumbersome and errorprone interfaces. In this paper we introduce an approach to simplify the usage of parallel graphics hardware for vector calculus. Our approach is based on expression templates that make it possible to obtain the performance of a hand-coded implementation while providing an intuitive and math-like syntax. We use this technique to automatically generate CUDA kernels for various vector calculations. In several performance tests our implementation shows a superior performance compared to CPU-based libraries and comparable results to a GPU-based library. | en |
| dc.format | 8 s. | cs |
| dc.format.mimetype | application/pdf | |
| dc.identifier.citation | WSCG '2011: Communication Papers Proceedings: The 19th International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision, p. 185-192. | en |
| dc.identifier.isbn | 978-80-86943-82-4 | |
| dc.identifier.uri | http://wscg.zcu.cz/WSCG2011/!_2011_WSCG-Short_Papers.pdf | |
| dc.identifier.uri | http://hdl.handle.net/11025/10836 | |
| dc.language.iso | en | en |
| dc.publisher | Václav Skala - UNION Agency | cs |
| dc.relation.ispartofseries | WSCG '2011: Communication Papers Proceedings | en |
| dc.rights | © Václav Skala - UNION Agency | cs |
| dc.rights.access | openAccess | en |
| dc.subject | grafické procesory | cs |
| dc.subject | paralelní výpočty | cs |
| dc.subject | CUDA | cs |
| dc.subject | lineární algebra | cs |
| dc.subject.translated | graphic processing units | en |
| dc.subject.translated | parallel computing | en |
| dc.subject.translated | CUDA | en |
| dc.subject.translated | linear algebra | en |
| dc.title | CUDA expression templates | en |
| dc.type | konferenční příspěvek | cs |
| dc.type | conferenceObject | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | en |
| dc.type.driver | info:eu-repo/semantics/publishedVersion | en |
| dc.type.status | Peer-reviewed | en |
| dc.type.version | publishedVersion | en |
Files
Original bundle
1 - 1 out of 1 results
No Thumbnail Available
- Name:
- Wiemann.pdf
- Size:
- 220.95 KB
- Format:
- Adobe Portable Document Format
- Description:
- Plný text
License bundle
1 - 1 out of 1 results
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.71 KB
- Format:
- Item-specific license agreed upon to submission
- Description: