Please use this identifier to cite or link to this item:
http://hdl.handle.net/1893/30035
Appears in Collections: | Computing Science and Mathematics Conference Papers and Proceedings |
Author(s): | Maier, Patrick Livesey, Daria Loidl, Hans-Wolfgang Trinder, Phil |
Contact Email: | patrick.maier@stir.ac.uk |
Title: | High-Performance Computer Algebra: A Hecke Algebra Case Study |
Editor(s): | Silva, F Dutra, I Costa Santos, V |
Citation: | Maier P, Livesey D, Loidl H & Trinder P (2014) High-Performance Computer Algebra: A Hecke Algebra Case Study. In: Silva F, Dutra I & Costa Santos V (eds.) Euro-Par 2014 Parallel Processing. Lecture Notes on Computer Science, 8632. Euro-Par 2014 Parallel Processing: 20th International Conference, Porto, Portugal, 25.08.2014-29.08.2014. Cham, Switzerland: Springer, pp. 415-426. https://doi.org/10.1007/978-3-319-09873-9_35 |
Issue Date: | 2014 |
Date Deposited: | 16-Aug-2019 |
Series/Report no.: | Lecture Notes on Computer Science, 8632 |
Conference Name: | Euro-Par 2014 Parallel Processing: 20th International Conference |
Conference Dates: | 2014-08-25 - 2014-08-29 |
Conference Location: | Porto, Portugal |
Abstract: | We describe the first ever parallelisation of an algebraic computation at modern HPC scale. Our case study poses challenges typical of the domain: it is a multi-phase application with dynamic task creation and irregular parallelism over complex control and data structures. Our starting point is a sequential algorithm for finding invariant bilinear forms in the representation theory of Hecke algebras, implemented in the GAP computational group theory system. After optimising the sequential code we develop a parallel algorithm that exploits the new skeleton-based SGP2 framework to parallelise the three most computationally-intensive phases. To this end we develop a new domain-specific skeleton, parBufferTryReduce. We report good parallel performance both on a commodity cluster and on a national HPC, delivering speedups up to 548 over the optimised sequential implementation on 1024 cores. |
Status: | AM - Accepted Manuscript |
Rights: | This is a post-peer-review, pre-copyedit version of a paper published in Silva F, Dutra I & Costa Santos V (eds.) Euro-Par 2014 Parallel Processing. Lecture Notes on Computer Science, 8632. Euro-Par 2014 Parallel Processing: 20th International Conference, Porto, Portugal, 25.08.2014-29.08.2014. Cham, Switzerland: Springer, pp. 415-426. The final authenticated version is available online at: https://doi.org/10.1007/978-3-319-09873-9_35 |
Licence URL(s): | https://storre.stir.ac.uk/STORREEndUserLicence.pdf |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Maier_Livesey_Loidl_Trinder_EuroPar2014_preprint.pdf | Fulltext - Accepted Version | 358.11 kB | Adobe PDF | View/Open |
This item is protected by original copyright |
Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.
The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/
If you believe that any material held in STORRE infringes copyright, please contact library@stir.ac.uk providing details and we will remove the Work from public display in STORRE and investigate your claim.