Assistant Professor, Department of Computer Science, University of Warwick
Member of ACM
Dr. Gihan Mudalige is an Assistant Professor in the Department of Computer Science. His research work focuses on the development of next-generation High Performance Computing numerical simulation software libraries through the utilization of domain–specific high-level abstraction (HLA) frameworks. The key motivation of this research is to develop techniques to automatically parallelize an application while at the same time maintain near-optimal performance on diverse multi-core and many-core parallel systems. As part of this work Dr. Mudalige acts as one of the main developers of the OP2 and OPS embedded domain specific languages (EDSLs), two of the earliest high-level frameworks to demonstrate the significant utility of this novel technique for developing production-grade HPC applications. His current and recent research work includes projects with Rolls-Royce Plc, UK MOD, Imperial College London, University of Bristol, University of Southampton and STFC.
G.D. Balogh, I.Z. Reguly, G.R. Mudalige. (2018) Comparison of Parallelisation Approaches, Languages, and Compilers for Unstructured Mesh Algorithms on GPUs. In: Jarvis S., Wright S., Hammond S. (eds) High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation. PMBS 2017. Lecture Notes in Computer Science, vol 10724. Springer, Cham. DOI: 10.1007/978-3-319-72971-8_2 (arXiv preprint / PDF)
I.Z. Reguly, G.R. Mudalige and M.B. Giles. (2017). Beyond 16GB: Out-of-Core Stencil Computations. In Proceedings of the Workshop on Memory Centric Programming for HPC (MCHPC'17). Held in conjunction with IEEE/ACM Supercomputing 2017(SC'17) Denver, CO, USA, November 12, 2017. ACM, New York, NY, USA, 20-29. DOI: 10.1145/3145617.3145619 (preprint PDF)
R.O. Kirk, G.R. Mudalige, I.Z. Reguly, S.A. Wright, M.J. Martineau and S.A. Jarvis, Achieving Performance Portability for a Heat Conduction Solver Mini-Application on Modern Multi-core Systems, 2017 IEEE International Conference on Cluster Computing (CLUSTER), Honolulu, HI, 2017, pp. 834-841. doi: 10.1109/CLUSTER.2017.122 (preprint PDF)
S.P. Jammy, G.R. Mudalige, I.Z. Reguly, N.D. Sandham, M.B. Giles. Block-structured compressible Navier–Stokes solution using the OPS high-level abstraction.(2016) International Journal of Computational Fluid Dynamics 30 (6), 450-454, doi:10.1080/10618562.2016.1243663
G.R. Mudalige , I.Z. Reguly and M.B. Giles. Auto-Vectorizing a Large-scale Production Unstructured-mesh CFD Application. In proceedings of the 3rd Workshop on Programming Models for SIMD/Vector Processing (WPMVP 2016) held as part of ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP16), Barcelona, Spain, March 2016 (PDF)
I. Z. Reguly , E. László , G.R. Mudalige , and M.B. Giles. (2015) Vectorizing unstructured mesh computations for many-core architectures . Concurrency Computat.: Pract. Exper., doi: 10.1002/cpe.3621 .
I. Z. Reguly, G. R. Mudalige , C. Bertolli, M. B. Giles, A. Betts, P. H. J. Kelly, and D. Radford. Acceleration of a Full-scale Industrial CFD Application with OP2 . (2015) IEEE Transactions on Parallel and Distributed Systems, doi:10.1109/TPDS.2015.2453972 (arXiv / data / pre-print PDF)
I.Z. Reguly, G.R. Mudalige and M.B. Giles, Design and Development of Domain Specific Active Libraries with Proxy Applications, In proceeding of the Workshop on Representative Applications (WRAp) , 2015 IEEE International Conference on Cluster Computing, Chicago, IL, 2015, pp. 738-745. doi: 10.1109/CLUSTER.2015.128 (PDF)
S.P. Jammy, N.D. Sandham, G.R. Mudalige , I.Z. Reguly, M.B. Giles. Block Structured Compressible Navier Stokes Solution Using the OPs High-Level abstraction , 27th International Conference on Parallel CFD, May 2015, Montreal, Canada (PDF)
G.R. Mudalige , I. Reguly, M. Giles, A. Mallinson, W. Gaudin, and J. Herdman, In proceedings of the 5th International Workshop on Performance Modeling, Benchmarking, and Simulation (PMBS 2014), Held in conjunction with IEEE/ACM Supercomputing 2014(SC'14) New Orleans, LA, USA, November 16, 2014. Revised Selected Papers. Springer, 2015, vol. 8966, no. 1, ch. Performance Analysis of a High-level Abstractions-based Hydrocode on Future Computing Systems , pp. 85–104. ISBN 978-3-319-17248-4 (pre-print PDF)
I.Z. Reguly, G.R. Mudalige , M.B. Giles, D. Curran and S. McIntosh-Smith, The OPS Domain Specific Abstraction for Multi-Block Structured Grid Computations . In proceedings of the 4th international workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing(WOLFHPC '14).Held in conjunction with IEEE/ACM Supercomputing 2014(SC'14). (pre-print PDF)
I.Z Reguly, E. Laszlo, G.R. Mudalige , M.B Giles, Vectorizing Unstructured Mesh Computations for Many-core Architectures , PMAM’14, Proceedings of Programming Models and Applications on Multicores and Manycores, February 2014 ( online ) (pre-print PDF)
G.R. Mudalige , M.B. Giles, J. Thiyagalingam, I. Reguly, C. Bertolli, P.H.J. Kelly and A.E. Trefethen, Design and Initial Performance of a High-level Unstructured Mesh Framework on Heterogeneous Parallel Systems. Parallel Comput.(2013) . http://dx.doi.org/10.1016/j.parco.2013.09.004 (online) (PDF)
C. Bertolli, A. Betts, N. Loriant, G.R. Mudalige , D. Radford, D.A. Ham, M.B. Giles, and P.H.J. Kelly. Compiler Optimizations for Industrial Unstructured Mesh CFD Applications on GPUs , Languages and Compilers for Parallel Computing, pp.112-126, Springer, 2013. (PDF)
M.B. Giles, G.R. Mudalige , C. Bertolli, P.H.J. Kelly, E. Laszlo, and I. Reguly, An Analytical Study of Loop Tiling for a Large-Scale Unstructured Mesh Application . SC Companion 2012: 477-482. (PDF)
M.B. Giles, G.R. Mudalige , B. Spencer, C. Bertolli, I. Reguly, Designing OP2 for GPU architectures , Journal of Parallel and Distributed Computing, Volume 73, Issue 11, November 2013, Pages 1451-1460, ISSN 0743-7315. ( PDF) (pre-print PDF)
G.R. Mudalige , I. Reguly, M.B. Giles, C. Bertolli and P.H.J. Kelly. OP2: An Active Library Framework for Solving Unstructured Mesh-based Applications on Multi-Core and Many-Core Architectures. In Proceedings of Innovative Parallel Computing (InPar), 2012, pp.1-12, 13-14 May 2012. (PDF)
G.R. Mudalige , M.B. Giles, C. Bertolli, and P.H.J. Kelly. Predictive Modeling and Analysis of OP2 on Distributed Memory GPU Clusters . SIGMETRICS Perform. Eval. Rev. 40, 2 :61-67 (2012). (PDF)
G.R. Mudalige , M.B. Giles, C. Bertolli, and P.H.J. Kelly. 2011. Predictive Modeling and analysis of OP2 on distributed memory GPU clusters . In Proceedings of the second international workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computing Systems (PMBS '11). ACM, New York, NY, USA, 3-4. Held in conjunction with IEEE/ACM Supercomputing 2011 (SC'11), Seattle, WA, USA
M.B. Giles, G.R. Mudalige , Z. Sharif, G. Markall, P.H.J. Kelly. Performance Analysis and Optimization of the OP2 Framework on Many-core Architectures (2012) Computer Journal, 55 (2), pp. 168-180. ISSN 0010-4620.(PDF) (pre-print PDF)
S.J. Pennycook, S.D. Hammond, G.R. Mudalige , S.A. Wright, and S.A. Jarvis, On the Acceleration of Wavefront Applications Using Distributed Many-core Architectures (2012) Computer Journal, 55 (2), pp. 138-153. ISSN 0010-4620
C. Bertolli, A. Betts, G.R. Mudalige , M.B. Giles, P.H.J. Kelly. Design and Performance of the OP2 Library for Unstructured Mesh Applications , Euro-Par 2011 Parallel Processing Workshops, Lecture Notes in Computer Science, Springer, 2012. (PDF)
C. Bertolli, A. Betts, P.H.J. Kelly, G.R. Mudalige , and M.B. Giles. Mesh Independent Loop Fusion for Unstructured Mesh Applications in Proceedings of the 9th Conference on Computing Frontiers (CF ’12). pp.43-52 ACM, 2012. (PDF)
J.A. Davis, G.R. Mudalige , S.D. Hammond, J.A. Herdman, I. Miller, S.A. Jarvis. (2011) Predictive Analysis of a Hydrodynamics Application on Large-Scale CMP Clusters . In: International Supercomputing Conference (ISC11). Lecture Notes in Computer Science (R&D), 26 (3-4). Springer, Hamburg, Germany, pp. 175-185.
M.B. Giles, G.R. Mudalige , Z. Sharif, G. Markall, P.H.J Kelly. Performance Analysis of the OP2 Framework on Many-core Architectures . (2011). ACM SIGMETRICS Perform. Eval. Rev. 38 (4) ISSN 0163-5999. (PDF)
S.J. Pennycook, S.D. Hammond, S.A. Jarvis and G.R. Mudalige. (2011) Performance Analysis of a Hybrid MPI/CUDA Implementation of the NAS-LU Benchmark. ACM SIGMETRICS Perform. Eval. Rev. 38 (4). ISSN 0163-5999 (PDF)
S.D. Hammond, G.R. Mudalige , J.A. Smith, S.A. Jarvis, J. Holt, I. Miller, J.A Herdman and A. Vadgama, To Upgrade or not to Upgrade? Catamount vs. Cray Linux Environment , Large Scale Parallel Processing (LSPP) held in conjunction with IPDPS10, April, 2010, Atlanta, USA (PDF)
G.R. Mudalige , S.D. Hammond, J.A. Smith, and S.A. Jarvis. Predictive Analysis and Optimisation of Pipelined Wavefront Computations . In Proc. 11th Workshop on Advances in Parallel and Distributed Computational Models (APDCM 2009), 23rd IEEE International Parallel and Distributed Processing Symposium (IPDPS 2009), Rome, Italy, May 2009. IEEE Computer Society. (PDF)
S.D. Hammond, G.R. Mudalige , J.A. Smith, A.B. Mills, S.A. Jarvis, J. Holt, I. Miller, J.A. Herdman and A. Vadgama. Performance Prediction and Procurement in Practise: Assessing the Suitability of Commodity Cluster Components for Wavefront Codes, IET Softw (3)6:509-521. 2009. (PDF)
S.D. Hammond, G.R. Mudalige , J.A. Smith, S.A. Jarvis, J.A. Herdman and A. Vadgama, WARPP: A Toolkit for Simulating High Performance Parallel Scientific Codes , 2nd International Conference on Simulation Tools and Techniques (SIMUTools 2009 (ACM SIGSIM, ICST, SCS)), March, 2009, Rome, Italy. (PDF)
G.R.Mudalige , M. K. Vernon and S. A. Jarvis. A Plug-and-Play Model for Evaluating Wavefront Computations on Parallel Architectures, IEEE International Parallel and Distributed Processing Symposium (IPDPS 2008), Miami, Florida USA, April 2008. IEEE Computer Society. (PDF)
G.R.Mudalige , S. A. Jarvis, D.P. Spooner and G.R. Nudd Predictive Performance Analysis of a Parallel Pipelined Synchronous Wavefront Application for Commodity Processor Cluster Systems, IEEE International Conference on Cluster Computing, Barcelona Sept 2006. IEEE Computer Society. (PDF)
S. A. Jarvis, D.P. Spooner, G.R.Mudalige , B.P. Foley, J. Cao and G.R. Nudd. Performance Evaluation of Parallel and Distributed Systems. chapter in Performance Prediction Techniques for Large-scale Distributed Environments. Mohamed Ould-Khaoua and Geyong Min Eds. Nova Science, 2005
OP2 - Oxford Parallel Library for Unstructured-mesh applications
OPS - Oxford Parallel Library for Multi-block Structured-mesh applications
Preparing for Exascale with Mini-applications (Principal Investigator) - Archer Resource Allocation Panel, Funded by EPSRC . Time on the UK National Supercomputer (Archer) 9694.8 kAUs (July 2017 - June 2018)
Developing OP2 Hydra for Next Generation Systems (Principal Investigator) - Funded by Rolls-Royce Plc., with Dr.I.Z Reguly (Co-I) PPCU Hungary. £90,000 (April 2017 – March 2018)
Future-proof Massively-Parallel Execution of Multi-block Applications. (Researcher Co-investigator) - Funded by the UK EPSRC (Ref: EP/K038494/1 ) with Prof. M.B. Giles (PI), £280,147. (Nov 2013 - Feb 2017)
Warwick, Wisconsin-Madison Research Exchange Internship, Research funding for an exchange year at the University of Wisconsin-Madison. (Aug 2006 - Aug 2007) ~$12k.
Overseas Research Students (ORS) Award, (Sept 2004 - Sept 2008)
Warwick Postgraduate Research Fellowship, (Sept 2005 - Sept 2008)
Member ACM: Member of the Association for Computing Machinery (ACM).
Co-Chair - The 24th International European Conference On Parallel and Distributed Computing (Euro-Par 2018), High Performance Architectures and Compilers Track, August 27-31, 2018, Turin, Italy.
Program Committee member - Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP 2018), Special session on High Performance Computing in Modeling and Simulation, March 2018, Cambridge, UK.
Program Committee member - Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS17) held as part of ACM/IEEE Supercomputing 2017 (SC17), Denver, CO, USA.
Program Committee member - The 6th ACM SIGPLAN Workshop on Functional High-Performance Computing (FHPC17) - Co-located with the International Conference on Functional Programming (ICFP 2017), Oxford, UK.
Program Committee member - 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid) 2017 - (Applications and Big-Data track).
Program Committee member - Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS16) held as part of ACM/IEEE Supercomputing 2016 (SC16), Salt Lake City, UT, USA.
Program Committee member - 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid) 2016 - (Programming Models and Runtime Systems track).
Program Committee member - Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS15) held as part of ACM/IEEE Supercomputing 2015 (SC15), Austin, TX, USA.