This paper uses Template Haskell to automatically select appropriate skeleton implementations in the Eden parallel dialect of Haskell. The approach allows implementation parameters to be statically tuned according to architectural cost models based on source analyses. This permits us to target a range of parallel architecture classes from a single source specification. A major advantage of the approach is that cost models are user-definable and can be readily extended to new data or computation structures. * This work is generously supported by EPSRC grants GR/R 70545/01, GR/R 91298/01 and GR/S 15198/01 and by joint travel grants from the British Council/DAAD.
The use of distributed programming libraries is very common in the development of scientific and engineering applications. These libraries, from message passing libraries to numerical libraries, are designed in a very general way to be useful for a wide range of applications. Therefore, there are several policies that must be adapted to the particular application, system and input data to provide the expected performance. Our objective is to develop an environment for tuning the use of a distributed library on the fly according to the dynamic behavior of the applications. In this paper, we present as an example a tuning environment for PVM-based applications. We show potential bottlenecks when using PVM. We also include tuning scenarios that describe the evaluation of the application behavior and the solutions that can improve performance.
Abstract. Performance is a key issue in the development of parallel/distributed applications. The main goal of these applications is to solve the considered problem as fast as possible utilizing a certain minimum of parallel system capacities. Therefore, developers must optimize ...
Developing an efficient parallel application is not an easy task. Applications rarely achieve good performance immediately; therefore, careful performance analysis and optimization are crucial. These tasks are difficult and require a thorough understanding of the program’s behavior. In this paper, we propose an on-line performance modeling technique, which enables the automated discovery of causal execution flows, composed of communication and computational activities, in MPI parallel programs. Our model reflects an application's behavior and is made up of elements correlated with high-level program structures, such as loops and communication operations. Moreover, our approach enables an assortment of on-line diagnosis techniques which may further automate the performance understanding process.
Papers by Oleg Morajko