Keuper, Janis, and Franz-Josef Pfreundt. “Asynchronous Parallel Stochastic Gradient Descent-A Numeric Core for Scalable Distributed Machine Learning Algorithms.” arXiv preprint arXiv:1505.04956 (2015).


Stoyanov, Dimitar, and Franz-Josef Pfreundt. Hybrid-Parallel Sparse Matrix–Vector Multiplication and Iterative Linear Solvers with the communication library GPI.”.

Oden, Lena. “GPI2 for GPUs: A PGAS framework for efficient communication in hybrid clusters.” Parallel Computing: Accelerating Computational Science and Engineering (CSE) 25 (2014): 461.

Gruenewald, D., Ettrich, N., Rahn, M.,  and Pfreundt, F. J. (2014, September). FRTM-A Productive Framework for Reverse Time Migration. In EAGE Workshop on High Performance Computing for Upstream.

Breitbart, Jens, Mareike Schmidtobreick, and Vincent Heuveline. “Evaluation of the Global Address Space Programming Interface (GASPI).” Parallel & Distributed Processing Symposium Workshops (IPDPSW), 2014 IEEE International. IEEE, 2014.

Simmendinger, Christian, Mirko Rahn, and Daniel Gruenewald. “The GASPI API: A failure tolerant PGAS API for asynchronous dataflow on heterogeneous architectures.” Sustained Simulation Performance 2014. Springer International Publishing, 2015. 17-32.


Shahzad, Faisal, et al. “PGAS implementation of SpMVM and LBM using GPI.” 7th International Conference on PGAS Programming Models.

Grünewald, Daniel, and Christian Simmendinger. “The GASPI API specification and its implementation GPI 2.0.” 7th International Conference on PGAS Programming Models. Vol. 243. 2013.

Machado, Rui, Salvador Abreu, and Daniel Diaz. “Parallel Local Search: Experiments with a PGAS-based programming model.” arXiv preprint arXiv:1301.7699 (2013).

Machado, Rui, Salvador Abreu, and Daniel Diaz. “Parallel Performance of Declarative Programming Using a PGAS Model.” Practical Aspects of Declarative Languages. Springer Berlin Heidelberg, 2013. 244-260.


Simmendinger, Christian, et al. “A PGAS-based implementation for the unstructured CFD solver TAU.” Proceedings of the 5th Conference on Partitioned Global Address Space Programming Models, PGAS. Vol. 11. 2011.

Machado, Rui, et al. “Unbalanced tree search on a manycore system using the GPI programming model.” Computer Science-Research and Development 26.3-4 (2011): 229-236.

Machado, Rui, and Carsten Lojewski. “The Fraunhofer virtual machine: a communication library and runtime system based on the RDMA model.” Computer Science-Research and Development 23.3-4 (2009): 125-132.