"Automatically Optimized GPU Acceleration of Element Subroutines in Finite Element Method" . "The element subroutines in finite element method (FEM) provides enough parallelism to be successfully accelerated by contemporary GPUs. However, their efficient implementation is not straightforward and requires time-consuming exploration of numerous implementation variants. In this paper, we present optimization by kernel fusion for element subroutines. Moreover, we show how the optimization is automated using our source-to-source compiler. We demonstrate the optimization of the element subroutines for FEM model using St.\\,Venant-Kirchhoff material. The performance of code generated by our compiler outperforms our previously published hand-tuned implementation by factor of 1.32 -- 1.54 depending on used GPU architecture. Although the optimization technique is demonstrated on element subroutines for using St.\\,Venant-Kirchhoff material, it is generally usable for wider area of computationally-demanding problems." . . . "Argonne, IL, USA" . "The element subroutines in finite element method (FEM) provides enough parallelism to be successfully accelerated by contemporary GPUs. However, their efficient implementation is not straightforward and requires time-consuming exploration of numerous implementation variants. In this paper, we present optimization by kernel fusion for element subroutines. Moreover, we show how the optimization is automated using our source-to-source compiler. We demonstrate the optimization of the element subroutines for FEM model using St.\\,Venant-Kirchhoff material. The performance of code generated by our compiler outperforms our previously published hand-tuned implementation by factor of 1.32 -- 1.54 depending on used GPU architecture. Although the optimization technique is demonstrated on element subroutines for using St.\\,Venant-Kirchhoff material, it is generally usable for wider area of computationally-demanding problems."@en . "Filipovi\u010D, Ji\u0159\u00ED" . "4"^^ . "14330" . . . "4"^^ . . . . . "Lakom\u00FD, Bed\u0159ich" . "GPGPU; code optimization; kernel fusion; FEM"@en . "IEEE" . . . . . "Madzin, Mat\u00FA\u0161" . "2012-01-01+01:00"^^ . "Symposium on Application Accelerators in High Performance Computing" . "[B81FC1457E12]" . . . "Automatically Optimized GPU Acceleration of Element Subroutines in Finite Element Method"@en . "RIV/00216224:14330/12:00057469" . . . "LOS ALAMITOS, CA, USA" . . "124206" . "RIV/00216224:14330/12:00057469!RIV13-GA0-14330___" . . . "4"^^ . "Automatically Optimized GPU Acceleration of Element Subroutines in Finite Element Method"@en . . "Automatically Optimized GPU Acceleration of Element Subroutines in Finite Element Method" . . "Fousek, Jan" . "9781467328821" . "P(GD102/09/H042), S" . "000309942800019" . .