Skip to main content
Filter by
Sorted by
Tagged with
1 vote
0 answers
34 views

OpenMP target error `alias definitions not supported in this configuration`

I want to use OpenMP target construct to run a loop on a GPU device. After many trial, I discovered that my gcc compiler fails to compile a recursive function. After many trials, I isolated my problem ...
Antonio Ragagnin's user avatar
0 votes
0 answers
37 views

where is the TLS session key stored when configuring nginx ssl engine as pkcs11?

I configure ssl_engine in nginx as pkcs11. It means that nginx server used the key stored in hsm as its identity in TLS connection. However, the TLS master key (or shared key, or session key) is ...
Latte Xu's user avatar
0 votes
0 answers
84 views

GCC 11.2.0 Installation with nvptx-tools Fails on CentOS 7: 'ACCEL_COMPILER_acc_device' Not Declared

I am trying to run OpenACC code on a GPU and need to use GCC for compiling the C++ code. Specifically, I am attempting to install GCC 11.2.0 with nvptx-tools on a cluster running CentOS 7. The compute ...
S M Shamimul Hasan's user avatar
1 vote
0 answers
43 views

Domain decomposition chunks: Verification passes on 1 GPU but fails on 2 GPUs

I am using OpenMP, C++, icpx, on Intel Developer Cloud's 4x GPU Max 1100 + Xeon CPU code: double rsd[ISIZ3][ISIZ2/2*2+1][ISIZ1/2*2+1][5]; double frct[ISIZ3][ISIZ2/2*2+1][ISIZ1/2*2+1][5]; double rho_i[...
Max's user avatar
  • 11
1 vote
0 answers
99 views

How to use CMake with offloading-enabled gcc built from source

I want to use an offloading-enabled gcc in a CMake project. To this end, I built gcc from source to use it for OpenMP offloading to an NVIDIA GPU. I could not use distro packages, because of recent ...
RL-S's user avatar
  • 927
1 vote
1 answer
183 views

Bluefield-2 NVMe-oF Target Offloading received IB Backend ctrl event: XRQ NVMF backend ctrl timeout error (22)

I'm trying to build the NVMe-oF Target offloading environment based on the Bluefield-2. I installed the MLNX OFED driver MLNX_OFED_LINUX-23.10-2.1.3.1-rhel7.9-x86_64. The OS version is Centos 7.2009 ...
Jasonxubiubiu's user avatar
0 votes
0 answers
105 views

OpenMP GPU offloading vector iterator

I have this working test code, which maps std::vector for offloading and checks if the calculation is correct: #include <iostream> #include <chrono> #include <vector> int main() { ...
user7667996's user avatar
1 vote
0 answers
124 views

HAProxy's TLS offload feature not working as I expect

I am in the process of setting up HAProxy 2.8 via Docker to leverage it's ability to provide TLS offload in my network to avoid having certificates attached to every VM instance I deploy. My problem ...
LinuxNinja83's user avatar
1 vote
0 answers
33 views

GPU offloading of triangular loop

I have triangular loop to calculate forces, which I ported to GPU via OpenMP offloading: !$omp target teams distribute parallel do simd & !$omp private(i,j,ii,jj,rinvdenom,tp,index,fix0) &...
realnewton's user avatar
0 votes
1 answer
112 views

OpenMP task reduction with target offloading segfaults when running single threaded

I was using OpenMP with target offload and found that my application segfaults when limiting OpenMP to a single thread. I could boil it down to the following snippet: #include <omp.h> int main()...
lulle2007200's user avatar
1 vote
1 answer
743 views

OpenMP Offload Error during linking with gcc with nvptx-none: unresolved symbol _fputwc_r

I am trying to compile a simple test problem using OpenMP offloading for an Nvidia GPU. I am using gcc with the nvptx-none target. I have installed the gcc+nvptx package with spack (or compiled gcc-13 ...
ZehDeckel's user avatar
0 votes
1 answer
520 views

Receive side scaling (RSS) based on just one field inside UDP payload

I have an issue with HW-offloaded balancing of network packets between CPU cores on DPDK. In my case I have only several endpoints whose number is less than the number of CPU cores I have. All the ...
budoattack's user avatar
1 vote
1 answer
797 views

OpenMP offloading target map alloc - how does it work

I have always been confused and never understood how the alloc map-type of the map clause of the target (or target data) construct works. What is my application - I would like to have a temporary ...
Jakub Homola's user avatar
0 votes
1 answer
247 views

OpenMP Vs. OpenMP Offloading

I am trying to code vector addition code using OpenMP in host and OpenMP Offloading. But time taken for OpenMP offloading is more than OpenMP in host. Why is that? openmp-host.c #include <assert.h&...
Ravindu Hirimuthugoda's user avatar
3 votes
1 answer
579 views

OpenMP offloading on GPU, 'simd' specificities

I was wondering how to interpret the following OpenMP constructs: #pragma omp target teams distribute parallel for for(int i = 0; i < N; ++i) { // compute } #pragma omp target teams distribute ...
Etienne M's user avatar
  • 656
0 votes
1 answer
243 views

Intel VTune - Estimate data offload to GPU

I'm interested in estimate the data transfer, in terms of bytes, of an algorithm or function to be executed on a GPU using Intel VTune. For example, if my algorithm computes a multiplication between ...
hpcanalisys's user avatar
2 votes
0 answers
436 views

OpenMP offload doesn't recognize my Nvidia GPU

I'm trying to test OpenMP's GPU offloading on my machine with a GTX 1660ti card. Running this code gives me a 0. #include <omp.h> #include <stdio.h> int main() { printf("%d\n&...
Stefan de Souza's user avatar
1 vote
2 answers
2k views

OpenMP offloading with Intel oneAPI DPC++ compiler to NVIDIA GPU

I'm on a mission to write a program with OpenMP offloading to a GPU. At the moment I compile my code with Intel oneAPI DPC++ compiler icpx v2022.1.0 and aim to utilise an NVIDIA Tesla V100 at the ...
mabalenk's user avatar
  • 1,033
0 votes
0 answers
82 views

Does OpenMP 5.x support classes/type bound procedures in Fortran?

It looks like OpenACC supports Fortran Type Bound Procedures. So I took that code and translated the directives to OpenMP Offloading directives and that code bombs out (memory error, using nvhpc 22.x)....
rosenbe2's user avatar
0 votes
0 answers
589 views

OpenMP: Marking functions to be included in the offloaded code

I'm trying to compile a simple vector addition example daxpy with OpenMP offloading to GPU. When I compile the code with g++ v10.2.0 I see the following error: daxpy_loop_target.cpp:37:6: error: ...
mabalenk's user avatar
  • 1,033
2 votes
1 answer
323 views

How to correctly use the update() clause in OpenMP

I have a program that was originally being executed sequentially and now I'm trying to parallelize it via OpenMP Offloading. The thing is that when I use the update clause, depending on the case, if I ...
gamersensual's user avatar
0 votes
0 answers
237 views

OpenMP target offloading matrix multiplication compilation error

I am currently trying to implement a simple matrix multiplication of 2 nxn matrices using OpenMP target offloading. The code is taken from here: template<typename T> void multiplyJIK(T *A, T *B, ...
Dogyman's user avatar
  • 31
0 votes
1 answer
599 views

Dpdk - Segmentation/fragmentation offload for application-level packet reassembly

I'm looking into the capabilities of fragment/packet reassembly hardware and wondering whether one could use them to perform application-level packet reassembly. I have to receive and reassemble a ...
Touisteur EmporteUneVache's user avatar
0 votes
1 answer
326 views

Why OpenMP Doesn't Offload Arrays to GPU?

I am currently writing some codes in C and want to utilize GPUs to do the calculation. My code has a test function like this: void test_func(int *x, int N){ // x is allocated using x = malloc(N*(...
Tianshu Wang's user avatar
1 vote
1 answer
797 views

OpenMP Target Task reduction

I'm using OpenMP target offloading do offload some nested loops to the gpu. I'm using the nowait to tun it asynchronous. This makes it a task. With the same input values the result differs from the ...
ThiloOS's user avatar
  • 92
1 vote
2 answers
863 views

DPDK19.11.10: HW offload for IPV4 with VLAN tag is not working properly

I am using DPDK19.11.10 on centos. The application is working fine with HW offloading if I send only the IPV4 packet without the VLAN header. If I add the VLAN header with IPV4, HW offloading is not ...
Adarsha Verma's user avatar
2 votes
0 answers
156 views

Python C extension compiled with nvc++ OpenMP offloading cannot run on GPU

I have a Python extension in C/C++ that I want to use OpenMP offloading with. Using NVIDIA's nvc++, compiling works out as well as using/running the extension in python. The problem ist that it's not ...
ThiloOS's user avatar
  • 92
0 votes
1 answer
221 views

Perform a triple pointer (C) offloading to NVIDIA GPU with OpenMP

I've been working with a heat transfer code. This code, basically, stablishes the initial conditions for a cube and all of its faces. The six faces start at different temperatures, and then the code ...
DrewHdz's user avatar
1 vote
0 answers
359 views

GPU number threads not showing in OpenMP runtime

I am trying out offloading an array calculation with GPU (GTX 1080Ti) using OpenMP and C++ on this dummy code that I have written: #include <omp.h> #include <iostream> using namespace std;...
OMEGOSH01's user avatar
3 votes
1 answer
375 views

GPU array addition using OpenMP

I am trying out OpenMP offloading with an nvidia GPU and I am trying to do some array calculations with it in C++. Right now my output is not desirable, as I am new with offloading calculations with ...
OMEGOSH01's user avatar
1 vote
1 answer
320 views

Errors when trying to offload to GTX-1050 with GCC9.3 and OpenMP

The build log: -------------- Clean: Release in OffloadTest (compiler: GNU GCC Compiler)--------------- Cleaned "OffloadTest - Release" -------------- Build: Release in OffloadTest (...
david-4142135's user avatar
3 votes
0 answers
121 views

OpenMP offloading data race

I'm currently working on a project to invert matrices on the GPU using OpenMP. However, when normalizing a row of a matrix I have a data race. The code looks like this: #pragma omp target data map(...
DoodleSchrank's user avatar
0 votes
0 answers
481 views

Trying to load xdp on nic (offloaded)

I am trying to load my XDP program directly on the NIC (offloaded XDP). According to this answer, I need to specify the device to the following functions: BPF() load_func() I did it like so: b = BPF(...
Lidorelias3's user avatar
6 votes
1 answer
2k views

OpenMP runtime does not sees my GPU devices

I am trying to do some OpenMP offloading to the GPU on my local machine which is employed with a GTX 1060 graphic card. All of my CUDA and Cublas examples run just fine. However, when I tried to run ...
Addman's user avatar
  • 341
5 votes
0 answers
2k views

GCC fails to compile OpenMP offloading to GPU

I'm using GCC 9.3 on Ubuntu 20.04. I want to offload the famous SAXPY example to GPU using OpenMP. I installed GCC's offloading capabilities by sudo apt install gcc-9-offload-nvptx . Then compiled the ...
zana zakaryaie nejad's user avatar
1 vote
1 answer
238 views

OpenMP offloading in a dynamic library fails to compile

I want to compile C code with OpenMP offloading and create a dynamic library libtest.so. When I use the following command: gcc -fPIC -shared -fopenmp -foffload=nvptx-none="-fPIC" test.c -o ...
Benjamin Maier's user avatar
1 vote
2 answers
1k views

DPDK - RSS Offloads (Hash Calculations): ETH_RSS_IPV6_EX - Explanation

So I am currently working on understanding how to Receive Side Scaling (RSS) Offloads work inside DPDK (Data Plane Development Kit). So basically from my understanding depending on which offload you ...
Ameer Usman's user avatar
0 votes
2 answers
2k views

Do I have to build clang-11 from source on Ubuntu 18.04 to have OpenMP GPU target offload?

I installed clang-11 from https://apt.llvm.org/ on Ubuntu 18.04 and I have OpenMP host device functionality working in my C++ test project, but omp_get_num_devices() returns 0, even though I have ...
Paul Jurczak's user avatar
  • 8,041
4 votes
0 answers
2k views

Gcc offload compilation options

I'm trying to build the simplest OpenMP or OpenACC C++ program with GPU offload using gcc-10, CUDA 11 on Ubuntu 18.04 and this CMakeLists.txt file (or OpenMP version): cmake_minimum_required(VERSION 3....
Paul Jurczak's user avatar
  • 8,041
0 votes
1 answer
109 views

Intel Advisor beta offloading analysis: No execution count

I am trying to use Intel oneAPI advisor beta to do a GPU offloading analysis (via analyze.py and collect.py). I have the problem that all non offloaded regions show Cannot be modelled: No Execution ...
lm1909's user avatar
  • 35
0 votes
1 answer
471 views

OpenMP Offloading with Private Arrays of Unknown Size at Compile Time

I am trying to offload several nested for loops in fortran using OpenMP, XL compiler suite. 90% of the routines are straight forward, but a handful of the loops involve private 1D arrays that are of ...
Kschau's user avatar
  • 156
1 vote
0 answers
524 views

linux network acceleration ,hardware offload

I am learning network hardware acceleration in Linux kernel for quiet a long time, but there still are several problems that make me very very confused. After deeply inspecting source code in kernel ,...
risonwung's user avatar
1 vote
1 answer
477 views

Declare all Fortran module variables target OpenMP 4.5+

I have a fortran90 code that use chemical species properties (i.e. molecular weight, viscosity, etc.) for calculations. To easily swap in and out groups of chemical species, we keep module files that ...
Kschau's user avatar
  • 156
0 votes
2 answers
419 views

Difference between offloading decision and task scheduling in context of fog computing

In context of fog computing, the computation offloading decision decides where to offload- on cloud or on fog or execute it locally. While task scheduling also decides where to execute task on fog or ...
Shaifali Malukani's user avatar
0 votes
0 answers
343 views

Tomcat MAX Thread VS sessions

We are using protocol="org.apache.coyote.http11.Http11NioProtocol" and I have a question as to what the max concurrent sessions Tomcat can handle? As per my understanding http11.Http11NioProtocol ...
user2713122's user avatar
0 votes
1 answer
40 views

Azure App Gateway SSL Offloading to a Datacentre server?

So I am looking at using Azure App Gateway to overcome a set of legacy servers (Win2003) that will not support TLS 1.2 and therefore come March+ 2020 the client browsers will not be able to access the ...
Rusty's user avatar
  • 113
0 votes
1 answer
236 views

Debug OpenMP Python C extension offloading

I am using the modeling toolbox Anuga and have set it up to run with parallel support. To my current knowledge the mechanism behind is that Numpy is being extended by modules in C which are exposed to ...
Sebastian's user avatar
0 votes
1 answer
312 views

Private Variables in Offloaded Fortran Parallel Loop

I am offloading code to a GPU using OpenMP 4.5. So far everything is working on the GPU, except when I try to make parallel sections with private variables that are allocated before I offload. I am ...
Jared's user avatar
  • 77
0 votes
1 answer
512 views

libcoi_device.so.0 Not found Compiling Error with Intel 19.0.4 OpenMP 5.0 Offloading

I just installed Intel® Parallel Studio XE Cluster Edition for Linux* 2019 and am trying to use OpenMP to offload to a Xeon Phi accelerator. I am using cmake, with flags CC=~/intel/bin/icc CXX=~/...
Jared's user avatar
  • 77
0 votes
1 answer
1k views

Using OpenMP target offloading in llvm-8.0.0

When trying to use openmp target offloading with llvm I get the following error $ cat offload.cpp #include <omp.h> int main() { #pragma omp target teams distribute parallel for for(int i=0; ...
Alok's user avatar
  • 33