Hello!
I'm working on an CFD code (fortran) which uses OpenMPI for parallel runs and works with crazy huge arrays ;).
My focus therefore is on fast running code because the usual runtime of my code is measured not in hours but in weeks (sometimes months :D)