Design and Analysis of Two Highly Scalable Sparse Grid Combination Algorithms

dc.contributor.authorStrazdins, Peter E.
dc.contributor.authorAli, Md Mohsin
dc.contributor.authorHarding, Brendan
dc.date.accessioned2016-01-20T04:56:39Z
dc.date.available2016-01-20T04:56:39Z
dc.date.issued2016
dc.description.abstractMany petascale and exascale scientific simulations involve the time evolution of systems modelled as Partial Differential Equations (PDEs). The sparse grid combination technique (SGCT) is a cost-effective method for solve time-evolving PDEs. It consists of evolving PDE over a set of grids of differing resolution in each dimension, and then combining the results to approximate the solution of the PDE on a grid of high resolution in all dimensions. We present two new parallel algorithms for the SGCT that support the full distributed memory parallelization over the dimensions of the component grids, as well as across the grids as well. They also support algorithmic-based fault tolerance. The direct algorithm is so called because it directly implements a SGCT combination formula. We give details of the design and implementation of a `partial' sparse grid data structure, which is needed for its efficient implementation. The second algorithm converts each component grid into their hierarchical surpluses, and then uses the direct algorithm on each of the hierarchical surpluses. The conversion to/from the hierarchical surpluses is also an important algorithm in its own right. It requires a technique called sub-griding in order to correctly deal with the combination of very small surpluses. An analysis of both indicates the direct algorithm minimizes the number of messages, whereas the hierarchical surplus minimizes memory consumption and offers a modest reduction in bandwidth. The direct algorithm also scales better with respect to SGCT level and grid dimensionality. Experimental results indicates that, for scenarios of practical interest, both are sufficiently scalable with respect to core count but algorithm has generally better performance, at least by a factor of 2 in most cases. It also scales better with the SGCT level. Hierarchical surplus formation is much less communication intensive, but shows less scalability with increasing core counts. Altering the layout of processes in the process grids and the mapping of processes affects the performance of the 2D SGCT by less than 10%, and affects even less the application part of an SGCT advection application even less.en_AU
dc.identifier.citationJournal of Computational Scienceen_AU
dc.identifier.issn1877-7503en_AU
dc.identifier.urihttp://hdl.handle.net/1885/95531
dc.publisherElsevieren_AU
dc.relationhttp://purl.org/au-research/grants/arc/LP110200410en_AU
dc.rights© Elsevier. http://www.sherpa.ac.uk/romeo/issn/1877-7503/..."Authors pre-print on any website, including arXiv and RePEC" from SHERPA/RoMEO site (as at 20/01/16).en_AU
dc.subjecthigh performance computingen_AU
dc.subjectparallel computingen_AU
dc.subjectPDE solversen_AU
dc.subjectsparse grid combination techniqueen_AU
dc.subjectalgorithm-based fault toleranceen_AU
dc.titleDesign and Analysis of Two Highly Scalable Sparse Grid Combination Algorithmsen_AU
dc.typeJournal articleen_AU
dcterms.accessRightsOpen Accessen_AU
local.contributor.affiliationStrazdins, P. E., The Australian National Universityen_AU
local.contributor.authoruidu8914893en_AU
local.description.notesThe article will be published around July 2016.en_AU
local.publisher.urlhttp://www.elsevier.com/en_AU
local.type.statusSubmitted Versionen_AU

Downloads

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
main-techrep.pdf
Size:
952.84 KB
Format:
Adobe Portable Document Format
Description:
main article

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
884 B
Format:
Item-specific license agreed upon to submission
Description: