High-Performance Computer Algebra: A Hecke Algebra Case Study

Maier, Patrick; Livesey, Daria; Loidl, Hans-Wolfgang; Trinder, Phil

doi:10.1007/978-3-319-09873-9_35

Please use this identifier to cite or link to this item: http://hdl.handle.net/1893/30035

Appears in Collections:	Computing Science and Mathematics Conference Papers and Proceedings
Author(s):	Maier, Patrick Livesey, Daria Loidl, Hans-Wolfgang Trinder, Phil
Contact Email:	patrick.maier@stir.ac.uk
Title:	High-Performance Computer Algebra: A Hecke Algebra Case Study
Editor(s):	Silva, F Dutra, I Costa Santos, V
Citation:	Maier P, Livesey D, Loidl H & Trinder P (2014) High-Performance Computer Algebra: A Hecke Algebra Case Study. In: Silva F, Dutra I & Costa Santos V (eds.) Euro-Par 2014 Parallel Processing. Lecture Notes on Computer Science, 8632. Euro-Par 2014 Parallel Processing: 20th International Conference, Porto, Portugal, 25.08.2014-29.08.2014. Cham, Switzerland: Springer, pp. 415-426. https://doi.org/10.1007/978-3-319-09873-9_35
Issue Date:	2014
Date Deposited:	16-Aug-2019
Series/Report no.:	Lecture Notes on Computer Science, 8632
Conference Name:	Euro-Par 2014 Parallel Processing: 20th International Conference
Conference Dates:	2014-08-25 - 2014-08-29
Conference Location:	Porto, Portugal
Abstract:	We describe the first ever parallelisation of an algebraic computation at modern HPC scale. Our case study poses challenges typical of the domain: it is a multi-phase application with dynamic task creation and irregular parallelism over complex control and data structures. Our starting point is a sequential algorithm for finding invariant bilinear forms in the representation theory of Hecke algebras, implemented in the GAP computational group theory system. After optimising the sequential code we develop a parallel algorithm that exploits the new skeleton-based SGP2 framework to parallelise the three most computationally-intensive phases. To this end we develop a new domain-specific skeleton, parBufferTryReduce. We report good parallel performance both on a commodity cluster and on a national HPC, delivering speedups up to 548 over the optimised sequential implementation on 1024 cores.
Status:	AM - Accepted Manuscript
Rights:	This is a post-peer-review, pre-copyedit version of a paper published in Silva F, Dutra I & Costa Santos V (eds.) Euro-Par 2014 Parallel Processing. Lecture Notes on Computer Science, 8632. Euro-Par 2014 Parallel Processing: 20th International Conference, Porto, Portugal, 25.08.2014-29.08.2014. Cham, Switzerland: Springer, pp. 415-426. The final authenticated version is available online at: https://doi.org/10.1007/978-3-319-09873-9_35
Licence URL(s):	https://storre.stir.ac.uk/STORREEndUserLicence.pdf

Files in This Item:

File	Description	Size	Format
Maier_Livesey_Loidl_Trinder_EuroPar2014_preprint.pdf	Fulltext - Accepted Version	358.11 kB	Adobe PDF	View/Open

This item is protected by original copyright

View License

Show full item record

Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/

If you believe that any material held in STORRE infringes copyright, please contact library@stir.ac.uk providing details and we will remove the Work from public display in STORRE and investigate your claim.

STORRE

STORRE: Stirling Online Research Repository