Skip to main content
Abstract. Many image processing algorithms rely on nearest neighbor (NN) or on the k nearest neighbor (kNN) search problem. Several meth-ods have been proposed to reduce the computation time, for instance using space partitionning.... more
    • by 
    •   9  
      Image ProcessingData StructureNearest NeighborGpu programming
Solving linear system with a magnitude of thousand to ten thousand of unknowns takes a very long time in serial fashion. Furthermore, linear system that is discretised from Partial Differentiation Equations (PDE) is also typically solved... more
    • by 
    •   4  
      Gpu programmingJacobiOpenACCIterative Method
In this work, we evaluate OpenCL as a programming tool for developing performance-portable applications for GPGPU. While the Khronos group developed OpenCL with programming portability in mind, performance is not necessarily portable.... more
    • by 
    •   8  
      Cognitive ScienceDistributed ComputingParallel ComputingComputer Networks
To cope with the complexity of programming GPU accelerators for medical imaging computations, we developed a framework to describe image processing kernels in a domainspecific language, which is embedded into C++. The description uses... more
    • by 
    •   10  
      Image ProcessingMedical ImagingProgrammingBiomedical Imaging
GPUs provide impressive computing power, but GPU programming can be challenging. Here, an experience in porting real-world earthquake code to Nvidia GPUs is described. Specifically, an annotation-based programming model, called Mint, and... more
    • by 
    •   14  
      Distributed ComputingAtmospheric ScienceScientific ComputingEmerging Technologies
    • by 
    •   8  
      Computer ScienceEmbedded SystemsParallel ProcessingProgramming
Cycles count in a graph is an NP-complete problem. This work minimizes the execution time to solve the problem compared to the other traditional serial, CPU based one. It reduces the hardware resources needed to a single commodity GPU. We... more
    • by 
    •   11  
      Computer ScienceApproximation AlgorithmsCompute Unified Device Architecture NVIDIA CUDAInformation
Cycles count in a graph is an NP-complete problem. This work minimizes the execution time to solve the problem compared to the other traditional serial, CPU based one. It reduces the hardware resources needed to a single commodity GPU. We... more
    • by 
    •   4  
      Approximation AlgorithmsGpu programmingGraph CyclesMulti-threaded applications
Graphical Processing Unit (GPU) programming languages are used extensively for general-purpose computations. However, GPU programming languages are at a level of abstraction suitable only for use by expert parallel programmers. This paper... more
    • by 
    •   7  
      High Performance ComputingComputer GraphicsParallel ProgrammingSoftware Architecture
A wide class of numerical methods needs to solve a linear system, where the matrix pattern of non-zero coefficients can be arbitrary. These problems can greatly benefit from highly multithreaded computational power and large memory... more
    • by 
    •   8  
      Parallel ProcessingNumerical MethodCompute Unified Device Architecture NVIDIA CUDAGpu programming
Recent advances have enabled GPUs to be used as general-purpose parallel processors on commodity hardware for little cost. However, the ability to program these devices has not kept up with their performance. The programming model for... more
    • by 
    •   3  
      Gpu programmingProgramming ModelMeta Programming
In this work, we describe a new algorithm for rendering polygons defined by cubic Bé zier curve segments in current GPUs. Unlike other approaches, our algorithm has a simple preprocessing that does not require computing tessellations, and... more
    • by 
    • Gpu programming
Denial of Service (DoS) and Distributed Denial of Service (DDoS) attacks are major threat to the internet and is a serious cyber-crime. The rapid increase in number of people using internet and the development of technology has given... more
    • by 
    •   4  
      Distributed Denial of Service AttackGpu programmingDenial of ServiceRate Limiting
This paper presents a comparison between two architectures for parallel computing: Compute Unified Device Architecture (CUDA) and Open Computing Language (OpenCL). Some works in the literature have presented a computational performance... more
    • by  and +2
    •   26  
      Parallel AlgorithmsParallel ComputingProgramming LanguagesParallel Programming
    • by 
    •   6  
      Computer ScienceLocal Area NetworksGpu programmingHigh performance
    • by 
    •   6  
      Cognitive ScienceMachine VisionGpu programmingPose Estimation
We describe how to harness the graphics processing abilities of a consumer video game console (Xbox 360) for general programming on graphics processing unit (GPGPU) purposes. In particular, we implement a linear GP (LGP) system to solve... more
    • by 
    •   11  
      Computer GraphicsEvolutionary ComputationGenetic ProgrammingGenetic Algorithms
Ant Colony Optimisation (ACO) is an effective population-based meta-heuristic for the solution of a wide variety of problems. As a population-based algorithm, its computation is intrinsically massively parallel, and it is therefore... more
    • by 
    •   8  
      Ant Colony OptimizationAnt Colony OptimisationCluster ComputingGpu programming
GPUs provide impressive computing power, but GPU programming can be challenging. Here, an experience in porting real-world earthquake code to Nvidia GPUs is described. Specifically, an annotation-based programming model, called Mint, and... more
    • by 
    •   11  
      Distributed ComputingAtmospheric ScienceScientific ComputingCode Generation
It has been experimentally proven that realizing universal quantum gates using higher-radices logic is practically and technologically possible. We developed a Parallel Genetic Algorithm that synthesizes Boolean reversible circuits... more
    • by 
    •   9  
      Genetic AlgorithmsQuantum Logicparallel Genetic algorithmGpu programming
In this paper the GPU implementation of a real-time isosurface volume-rendering system is described in detail, which aims at autostereoscopic displays. Since autostereoscopic displays provide images for many views, and thus require... more
    • by 
    •   12  
      Computer GraphicsVisualizationRendering (Computer Graphics)GPU based volume rendering
We propose a new set of OS abstractions to support GPUs and other accelerator devices as first class computing resources. These new abstractions, collectively called the PTask API, support a dataflow programming model. Because a PTask... more
    • by 
    •   7  
      OPERATING SYSTEMDataflowPerformance ImprovementGpu programming
Abstract. We introduced a real time Image Processing technique using modern programmable Graphic Processing Units (GPU) in this paper. GPU is a SIMD (Single Instruction, Multiple Data) device that is inherently data-parallel. By utilizing... more
    • by 
    •   11  
      Computer ScienceComputer VisionImage ProcessingCompute Unified Device Architecture NVIDIA CUDA
    • by 
    •   10  
      Partial Differential EquationsLinear AlgebraGeometryGpu programming
The Gyrokinetic Toroidal Code (GTC) uses the particle-in-cell method to efficiently simulate plasma microturbulence. This work presents novel analysis and optimization techniques to enhance the performance of GTC on large-scale machines.... more
    • by 
    •   4  
      Distributed ComputingHigh Performance Computing Applications development for Atmosphere modelingGpu programmingParticle in Cell
Genetic Programming is very efficient in problem solving compared to other proposals but its performance is very slow when the size of the data increases. This paper proposes a model for multi-threaded Genetic Programming classification... more
    • by 
    •   6  
      Machine LearningProblem SolvingGpu programmingEvaluation Model
In recent years, GPUs have emerged as an extremely cost-effective means for achieving high performance. Many application developers, including those with no prior parallel programming experience, are now trying to scale their applications... more
    • by 
    •   11  
      Static AnalysisConcurrencyMultithreadingGpu programming
In this paper, we describe an optimized version of a Gaussian-mixture-based acoustic model likelihood evaluation algorithm for graphical processing units (GPUs). The evaluation of these likelihoods is one of the most computationally... more
    • by 
    •   15  
      EngineeringComputer ArchitectureAutomatic Speech RecognitionSpeech Recognition
We propose a method for generating all possible rules of multidimension Boolean cellular automata (CA). Based on an original encoding method and the programming of graphical processor units (GPU), this method allows us to visualize the CA... more
    • by 
    •   16  
      Computer GraphicsVisualizationComputational ModelingInformation Visualization
Graphics processing units (GPUs) provide a low cost platform for accelerating high performance computations. The introduction of new programming languages, such as CUDA and OpenCL, makes GPU programming attractive to a wide variety of... more
    • by 
    •   7  
      Gpu programmingCompiler OptimizationHigh performance computerPerformance Optimization
    • by 
    •   11  
      Financial EngineeringGPU ComputingCluster ComputingMonetary Policy
    • by 
    •   20  
      EngineeringComputer ScienceAlgorithmsComputer Hardware
In recent years, with the development of graphics processors, graphics cards have been widely used to perform general-purpose calculations. Especially with release of CUDA C programming languages in 2007, most of the researchers have been... more
    • by 
    •   5  
      Image segmentationGpu programmingCUDAScaling
Height maps are a very efficient surface representation, initially developed for terrain modeling and visualization. They are also present in other applications, such as mesostructure rendering. However, height maps are incapable of... more
    • by 
    •   9  
      Gpu programmingComputer GraphicHigh performanceObject Representation
This paper presents a method to obtain the approximate indirect diffuse reflection on a dynamic object, caused by a diffuse or a moderately glossy environment. Instead of tracing rays to find the incoming illumination, we look up the... more
    • by 
    •   3  
      Gpu programmingGlobal IlluminationDiffuse reflectance
    • by 
    •   6  
      Computational Fluid DynamicsMemory ManagementGpu programmingFloating Point
The shift to parallel computing-including multi-core computer architectures, cloud distributed computing, and generalpurpose GPU programming-leads to fundamental changes in the design of software and systems. As a result, learning... more
    • by 
    •   10  
      Computer ArchitectureDistributed ComputingCloud ComputingParallel and Distributed Computing
We present BSGP, a new programming language for general purpose computation on the GPU. A BSGP program looks much the same as a sequential C program. Programmers only need to supply a bare minimum of extra information to describe parallel... more
    • by 
    •   5  
      Information SystemsStream ProcessingGpu programmingSource Code
The use of Graphics Processing Units (GPUs) in general purpose computing has been shown to incur significant performance benefits, for applications ranging from scientific computing to database sorting and search. The emergence of... more
    • by 
    •   9  
      System DesignScientific ComputingEnergy SavingGpu programming
Graphical Processing Unit (GPU) programming languages are used extensively for general-purpose computations. However, GPU programming languages are at a level of abstraction suitable only for use by expert parallel programmers. This paper... more
    • by 
    •   7  
      High Performance ComputingComputer GraphicsParallel ProgrammingSoftware Architecture
: Rendering successive layers, from left to right: one, two, three and four layers. The last figure identifies four successive layers by a different color. Notice that odd layers add geometric information while even layers subtract,... more
    • by 
    •   8  
      Gpu programmingComputer GraphicHigh performanceObject Representation
The tracking of features in real-time video streams forms the integral part of many important applications in human-computer interaction and computer vision. Unfortunately tracking is a computationally intensive task, since the video... more
    • by 
    •   16  
      Computer GraphicsHuman Computer InteractionComputer VisionImage Processing
In this paper, we describe an optimized version of a Gaussian-mixture-based acoustic model likelihood evaluation algorithm for graphical processing units (GPUs). The evaluation of these likelihoods is one of the most computationally... more
    • by  and +1
    •   14  
      EngineeringComputer ArchitectureAutomatic Speech RecognitionSpeech Recognition
In this paper, we consider the implementation of a thermal flow solver based on the lattice Boltzmann method (LBM) for graphics processing units (GPUs). We first describe the hybrid thermal LBM model implemented, and give a concise review... more
    • by 
    •   8  
      Mechanical EngineeringApplied MathematicsGpu programmingLattice Boltzmann
    • by 
    •   3  
      Image ProcessingGpu programmingOpenCL
In recent years, with the development of graphics processors, graphics cards have been widely used to perform general-purpose calculations. Especially with release of CUDA C programming languages in 2007, most of the researchers have been... more
    • by 
    •   5  
      Image segmentationGpu programmingCUDAScaling
Graphs are powerful data representations favored in many computational domains. Modern GPUs have recently shown promising results in accelerating computationally challenging graph problems but their performance suffered heavily when the... more
    • by 
    •   5  
      Gpu programmingGpgpuData representationGraph Algorithm
    • by 
    •   4  
      Image ProcessingInsightGpu programmingOpenCL
This paper describes a numerical method for the parallel solution of the differential measure inclusion problem posed by mechanical multibody systems containing bilateral and unilateral frictional constraints. The method proposed has been... more
    • by  and +1
    •   13  
      Numerical MethodCompute Unified Device Architecture NVIDIA CUDAComplex SystemGpu programming
The advent of general purpose graphics processing units (GPGPU's) brings about a whole new platform for running numerically intensive applications at high speeds. Their multi-core architectures enable large degrees of parallelism via a... more
    • by 
    •   18  
      Computer ArchitectureComputational ModelingParallel ProgrammingNumerical Analysis