Skip to main content
    • by 
    •   6  
      ProgrammingAutomatic ParallelizationData AccessData Distribution
We present the design and implementation of an automatic polyhedral source-to-source transformation framework that can optimize regular programs (sequences of possibly imperfectly nested loops) for parallelism and locality simultaneously.... more
    • by 
    •   10  
      Automatic ParallelizationAlgorithm DesignCode GenerationCompiler Optimization
With the advent of digitization and growing abundance of graphic and image processing tools, use cases for clipping using circular windows have grown considerably. This paper presents an efficient clipping algorithm for line segments... more
    • by 
    •   14  
      Alfred the Great and the Alfredian CircleVienna CircleAutomatic ParallelizationQuality Circles
Existing theories of multiple object tracking (MOT) offer different predictions concerning the role of higher level cognitive processes, individual differences, effortful attention and parallel processing in MOT. Pylyshyn's model (1989)... more
    • by 
    •   14  
      PsychologyCognitive SciencePerceptionSpatial Memory
This paper presents an overview of the SUIF compiler, which automatically parallelizes and optimizes sequential programs for shared-memory multiprocessors. We describe new technology in this system for locating coarse-grain parallelism... more
    • by 
    •   4  
      New TechnologyAutomatic ParallelizationCoarse Grained SoilComputer
Abstract We present the architecture of nreduce, a distributed virtual machine which uses parallel graph reduction to run programs across a set of computers. It executes code written in a simple functional language which supports lazy... more
    • by  and +1
    •   4  
      Distributed ComputingFunctional ProgrammingAutomatic ParallelizationGraph Reduction
In the area of automatic parallelization of programs, analyzing and transforming loop nests with parametric a ne loop bounds requires fundamental mathematical results. The most common geometrical model of iteration spaces, called the... more
    • by 
    •   5  
      Automatic ParallelizationVLSI signal processingElectrical And Electronic EngineeringLINEAR PROGRAM
Multiprocessor computers are rapidly becoming the norm. Parallel workstations are widely available today and it is likely that most PCs in the near future will also be parallel. To accommodate these changes, some classes of applications... more
    • by  and +1
    •   6  
      Software DevelopmentFortranAutomatic ParallelizationHigh performance
Data-oriented workflows are often used in scientific applications for executing a set of dependent tasks across multiple computers. We discuss how these can be modeled using lambda calculus, and how ideas from functional programming are... more
    • by 
    •   5  
      Distributed ComputingFunctional ProgrammingLambda CalculusScientific Workflows
Previous literature in alphabetic languages suggests that the occipital-temporal region (the ventral pathway) is specialized for automatic parallel word recognition, whereas the parietal region (the dorsal pathway) is specialized for... more
    • by 
    •   25  
      PersonalityCharacter RecognitionVocabularyMagnetic Resonance Imaging
Performance optimization of stencil computations has been widely studied in the literature, since they occur in many computationally intensive scientific and engineering applications. Compiler frameworks have also been developed that can... more
    • by 
    •   3  
      Automatic ParallelizationLoad BalancePerformance Optimization
GPUs are a class of specialized parallel architectures with tremendous computational power. The new Compute Unified Device Architecture (CUDA) programming model from NVIDIA facilitates programming of general purpose applications on their... more
    • by 
    •   12  
      Program TransformationShared memoryAutomatic ParallelizationData Dependence
A robot-controlled wafer bonding machine was developed for the bonding of different sizes of wafers ranging up to 8 inches diameter. The features of this equipment are such that: (1) After the automatic parallel adjustment for 8-inch... more
    • by 
    •   6  
      ElectronicAutomatic ParallelizationWafer BondingInfrared
This paper develops and experimentally demonstrates a robust automatic parallel parking algorithm for parking in tight spaces. Novel fuzzy logic controllers are designed for each step of the maneuvering process. The controllers are first... more
    • by 
    •   8  
      Mechanical EngineeringGeneticsFuzzy LogicGenetic Algorithm
The ubiquity of multicore processors in commodity computing systems has raised a significant programming challenge for their effective use. An attractive but challenging approach is automatic parallelization of sequential codes. Although... more
    • by 
    •   7  
      Program TransformationShared memoryAutomatic ParallelizationApplication development
The widespread use of multicore processors is not a consequence of significant advances in parallel programming.
    • by 
    •   4  
      Cognitive ScienceDistributed ComputingParallel ComputingAutomatic Parallelization
The desire to simulate more and more geometrical and physical features of technical structures and the availability of parallel computers and parallel numerical solvers which can exploit the power of these machines have lead to a steady... more
    • by  and +2
    •   13  
      Applied MathematicsMathematical PhysicsMesh generationAutomatic Parallelization
The Support Vector Machine (SVM) is a supervised learning algorithm used for recognizing patterns in data. It is a very popular technique in Machine Learning and has been successfully used in applications such as image classification,... more
    • by 
    •   4  
      Distributed ComputingComputer SoftwareAutomatic ParallelizationSupport vector machine
In this paper we present JaMP, an adaptation of the OpenMP standard. JaMP is fitted to Jackal, a software-based DSM implementation for Java.
    • by  and +1
    •   10  
      Distributed ComputingDistributed Shared Memory SystemComputer SoftwareFortran
Two key steps in the compilation of strict functional languages are the conversion of higher-order functions to data structures (closures) and the transformation to tail-recursive style. We show how to perform both steps at once by... more
    • by 
    •   28  
      Object Oriented ProgrammingFunctional ProgrammingTime UseGlobal Analysis
Current Fortran optimizing compilers often include source to source transforma-tions for automatic parallelization or vectorization of loops. Lower level optimiza-tions, such as those that aim to exploit ILP, are performed at later stages... more
    • by 
    •   6  
      FortranAutomatic ParallelizationProfitabilityData Dependence
Abstract— Program parallelization becomes increasingly important when new multi-core architectures provide ways to improve performance. One of the greatest challenges of this development lies in programming parallel applications. Us-ing... more
    • by 
    •   2  
      Constraint ProgrammingAutomatic Parallelization
The desire to simulate more and more geometrical and physical features of technical structures and the availability of parallel computers and parallel numerical solvers which can exploit the power of these machines have lead to a steady... more
    • by  and +2
    •   11  
      Applied MathematicsMathematical PhysicsMesh generationAutomatic Parallelization
Speeding up sequential programs on multicores is a challenging problem that is in urgent need of a solution. Automatic parallelization of irregular pointer-intensive codes, exemplified by the SPECint codes, is a very hard problem. This... more
    • by  and +1
    •   5  
      Automatic ParallelizationSemantic InformationSemantic AnnotationSource Code
This paper depicts the development of backward automatic parallel parking system for nonholonomic mobile robot. The configuration of the system consists of ultrasonic sensor, rotary encoder, controller, and actuators. The path planning... more
    • by 
    •   12  
      RoboticsControl Systems EngineeringMobile RoboticsPath planning
Static scheduling of a program represented by a directed task graph on a multiprocessor system to minimize the program completion time is a well-known problem in parallel processing. Since finding an optimal schedule is an NPcomplete... more
    • by 
    •   12  
      Graph TheoryParallel ProcessingGenetic AlgorithmOptimal mine design and scheduling
As multicore systems become the dominant mainstream computing technology, one of the most difficult challenges the industry faces is the software. Applications with large amounts of explicit thread-level parallelism naturally scale... more
    • by 
    •   6  
      Software MaintenanceParallel ProgrammingThread Level SpeculationAutomatic Parallelization
Parallelization of image analysis tasks forms a basic key for processing huge image data in realtime. At this, suitable subtasks for parallel processing have to be extracted and mapped to components of a distributed system. Basically,... more
    • by 
    •   7  
      Distributed ComputingModelingImage AnalysisParallel Processing
The evolution of high performance computers is progressing toward increasingly heterogeneous systems. These new architectures pose new challenges, particularly in the field of programming languages. New tools and languages are needed if... more
    • by 
    •   6  
      Case StudyAutomatic ParallelizationCode GenerationHeterogeneous Systems
Porting applications to new high performance parallel and distributed computing platforms is a challenging task. Since writing parallel code by hand is time consuming and costly, porting codes would ideally be automated by using some... more
    • by 
    •   7  
      FortranParallel and Distributed ComputingShared memoryAutomatic Parallelization
The ubiquity of multicore processors in commodity computing systems has raised a significant programming challenge for their effective use. An attractive but challenging approach is automatic parallelization of sequential codes. Although... more
    • by 
    •   7  
      Program TransformationShared memoryAutomatic ParallelizationApplication development
Recent advances in polyhedral compilation technology have made it feasible to automatically transform affine sequential loop nests for tiled parallel execution on multi-core processors. However, for multi-statement input programs with... more
    • by 
    •   8  
      Linear AlgebraAutomatic ParallelizationCode GenerationMulticore processors
an NSF Graduate Research Fellowship and NSF and Darpa grants to the Fugu and Raw projects. While provided a vital support network. Most of all, I have relied on my wife, Kathleen Shannon, and my children, Karissa and Anya. Their love has... more
    • by 
    •   5  
      Flow ControlAutomatic ParallelizationSpeculative ExecutionHardware Implementation of Algorithms
We introduce StarFlow, a script-centric environment for data analysis. StarFlow has four main features: (1) extraction of control and data-flow dependencies through a novel combination of static analysis, dynamic runtime analysis, and... more
    • by 
    •   8  
      Static AnalysisData AnalysisFlow ControlAutomatic Parallelization
This article deals with automatic parallelization of static control programs.
    • by 
    •   8  
      Cognitive ScienceDistributed ComputingParallel ComputingScheduling
Most parallel databases exploit two types of parallelism: intra-query parallelism and inter-transaction concurrency. Between these two cases lies another type of parallelism: inter-query parallelism within a transaction or application.... more
    • by 
    •   8  
      Computer ScienceParallel AlgorithmsDatabase SystemsParallel Programming
Automatic parallelization in the polyhedral model is based on a ne transformations from an original computation domain (iteration space) to a target space-time domain, often with a di erent transformation for each variable. Code... more
    • by  and +1
    •   9  
      Distributed ComputingParallel ProgrammingComputer SoftwareAutomatic Parallelization
per. Software package build upon proposed algorithm is described. Several practical examples of mesh generation on multiprocessor computational systems are given. It is shown that developed parallel algorithm enables us to reduce mesh... more
    • by 
    •   9  
      Mesh generationAutomatic ParallelizationParallel AlgorithmThree Dimensional
GPUs are a class of specialized parallel architectures with tremendous computational power. The new Compute Unified Device Architecture (CUDA) programming model from NVIDIA facilitates programming of general purpose applications on their... more
    • by  and +1
    •   12  
      Program TransformationShared memoryAutomatic ParallelizationData Dependence
per. Software package build upon proposed algorithm is described. Several practical examples of mesh generation on multiprocessor computational systems are given. It is shown that developed parallel algorithm enables us to reduce mesh... more
    • by 
    •   9  
      Mesh generationAutomatic ParallelizationParallel AlgorithmThree Dimensional
We describe pHPF, an research prototype HPF compiler for the IBM SP series parallel machines. The compiler accepts as input Fortran 90 and Fortran 77 programs, augmented with HPF directives; sequential loops are automatically... more
    • by 
    •   16  
      Computer ArchitectureData AnalysisDistributed Shared Memory SystemParallel Processing
The aim of this paper is to explain the importance of polytope and polyhedra in automatic parallelization. We show that the semantics of parallel programs is best described geometrically, as properties of sets of integral points in... more
    • by 
    •   2  
      Operations ResearchAutomatic Parallelization
Increased complexity of memory systems to ameliorate the gap between the speed of processors and memory has made it increasingly harder for compilers to optimize an arbitrary code within a palatable amount of time. With the emergence of... more
    • by 
    •   8  
      Shared memoryAutomatic ParallelizationMemory SystemsDomain Specificity
The desire to simulate more and more geometrical and physical features of technical structures and the availability of parallel computers and parallel numerical solvers which can exploit the power of these machines have lead to a steady... more
    • by  and +1
    •   13  
      Applied MathematicsMathematical PhysicsMesh generationAutomatic Parallelization
Static scheduling of a program represented by a directed task graph on a multiprocessor system to minimize the program completion time is a well-known problem in parallel processing. Since finding an optimal schedule is an NPcomplete... more
    • by 
    •   12  
      Graph TheoryParallel ProcessingGenetic AlgorithmOptimal mine design and scheduling
The growing number of processing cores in a single CPU is demanding more parallelism from sequential programs. But in the past decades few work has succeeded in automatically exploiting enough parallelism, which casts a shadow over the... more
    • by 
    •   4  
      Computer ScienceComputer ArchitectureParallel ComputingAutomatic Parallelization
per. Software package build upon proposed algorithm is described. Several practical examples of mesh generation on multiprocessor computational systems are given. It is shown that developed parallel algorithm enables us to reduce mesh... more
    • by  and +1
    •   9  
      Mesh generationAutomatic ParallelizationParallel AlgorithmThree Dimensional
Distributed-memory multicomputers such as the the Intel Paragon, the IBM SP-2, and the Thinking Machines CM-5 o er signi cant advantages over shared-memory multiprocessors in terms of cost and scalability. Unfortunately, extracting all... more
    • by 
    •   19  
      BusinessComputer ArchitectureComputer GraphicsComputational Modeling