Skip to main content
This paper discusses the design and the implementation of the LU factorization routines included in the Heterogeneous ScaLAPACK library, which is built on top of ScaLAPACK. These routines are used in the factorization and solution of a... more
    • by 
    •   13  
      Information SystemsComputer ScienceDistributed ComputingInformatics
In this paper, we study the problem of optimal matrix partitioning for parallel dense factorization on heterogeneous processors. First, we outline existing algorithms solving the problem that use a constant performance model of... more
    • by 
    •   12  
      Cognitive ScienceParallel AlgorithmsDistributed ComputingParallel Computing
The paper presents a performance model that can be used to optimally distribute computations over heterogeneous computers. This model is application-centric representing the speed of each computer by a function of the problem size. This... more
    • by 
    •   7  
      EngineeringDistributed ComputingPerformance ModelStandard Model
In this paper, we address the problem of optimal distribution of computational tasks on a network of heterogeneous computers when one or more tasks do not fit into the main memory of the processors and when relative speeds vary with the... more
    • by 
    •   11  
      Distributed ComputingHigh Performance ComputingHigh Performance Computing Applications development for Atmosphere modelingPerformance Model
The paper presents an approach to performance analysis of heterogeneous parallel algorithms. As a typical heterogeneous parallel algorithm is just a modification of some homogeneous one, the idea is to compare the heterogeneous algorithm... more
    • by 
    •   8  
      Cognitive ScienceParallel AlgorithmsDistributed ComputingParallel Computing
In this paper, we present an efficient procedure for building a piecewise linear function approximation of the speed function of a processor with hierarchical memory structure. The procedure tries to minimize the experimental time used... more
    • by 
    •   10  
      Distributed ComputingGrid ComputingParallel ComputingTime Use
The paper presents a classification of mathematical problems encountered during partitioning of data when designing parallel algorithms on networks of heterogeneous computers. We specify problems with known efficient solutions and open... more
    • by 
    •   4  
      Parallel and Distributed ComputingParallel AlgorithmHeterogeneous ComputingHeterogeneous Network
The paper presents Heterogeneous MPI (HMPI), an extension of MPI for programming high-performance computations on heterogeneous networks of computers. It allows the application programmer to describe the performance model of the... more
    • by 
    •   24  
      Parallel AlgorithmsDistributed ComputingParallel ComputingComputer Networks
The paper presents Heterogeneous MPI (HMPI), an extension of MPI for programming high-performance computations on heterogeneous networks of computers. It allows the application programmer to describe the performance model of the... more
    • by 
    •   24  
      Parallel AlgorithmsDistributed ComputingParallel ComputingComputer Networks
The paper presents a performance model of a network of heterogeneous computers that takes account of the heterogeneity of memory structure and other architectural differences. Under this model, the speed of each processor is represented... more
    • by 
    •   26  
      Computer ScienceDistributed ComputingGrid ComputingParallel Programming
In this paper, we address the problem of optimal distribution of computational tasks on a network of heterogeneous computers when one or more tasks do not fit into the main memory of the processors and when relative speeds cannot be... more
    • by 
    •   26  
      Computer ScienceDistributed ComputingGrid ComputingParallel Programming
This paper discusses the design and the implementation of the LU factorization routines included in the Heterogeneous ScaLAPACK library, which is built on top of ScaLAPACK. These routines are used in the factorization and solution of a... more
    • by 
    •   13  
      Information SystemsComputer ScienceDistributed ComputingInformatics
In this paper, we study the problem of optimal matrix partitioning for parallel dense factorization on heterogeneous processors. First, we outline existing algorithms solving the problem that use a constant performance model of... more
    • by 
    •   12  
      Cognitive ScienceParallel AlgorithmsDistributed ComputingParallel Computing
The paper presents a performance model that can be used to optimally distribute computations over heterogeneous computers. This model is application-centric representing the speed of each computer by a function of the problem size. This... more
    • by 
    •   7  
      EngineeringDistributed ComputingPerformance ModelStandard Model
In this paper, we address the problem of optimal distribution of computational tasks on a network of heterogeneous computers when one or more tasks do not fit into the main memory of the processors and when relative speeds vary with the... more
    • by 
    •   11  
      Distributed ComputingHigh Performance ComputingHigh Performance Computing Applications development for Atmosphere modelingPerformance Model
The paper presents an approach to performance analysis of heterogeneous parallel algorithms. As a typical heterogeneous parallel algorithm is just a modification of some homogeneous one, the idea is to compare the heterogeneous algorithm... more
    • by 
    •   8  
      Cognitive ScienceParallel AlgorithmsDistributed ComputingParallel Computing
In this paper, we present an efficient procedure for building a piecewise linear function approximation of the speed function of a processor with hierarchical memory structure. The procedure tries to minimize the experimental time used... more
    • by 
    •   10  
      Distributed ComputingGrid ComputingParallel ComputingTime Use
The paper presents a classification of mathematical problems encountered during partitioning of data when designing parallel algorithms on networks of heterogeneous computers. We specify problems with known efficient solutions and open... more
    • by 
    •   4  
      Parallel and Distributed ComputingParallel AlgorithmHeterogeneous ComputingHeterogeneous Network
The paper presents Heterogeneous MPI (HMPI), an extension of MPI for programming high-performance computations on heterogeneous networks of computers. It allows the application programmer to describe the performance model of the... more
    • by 
    •   24  
      Parallel AlgorithmsDistributed ComputingParallel ComputingComputer Networks
The paper presents Heterogeneous MPI (HMPI), an extension of MPI for programming high-performance computations on heterogeneous networks of computers. It allows the application programmer to describe the performance model of the... more
    • by 
    •   24  
      Parallel AlgorithmsDistributed ComputingParallel ComputingComputer Networks