427 questions
0
votes
1
answer
105
views
Fast Communication between Python Threads
I am working on an audio application, where all functions execution times in the "audio loop" need to be << 1ms. I am aware that Python is not the right programming language for this ...
0
votes
0
answers
31
views
Linux real-time kernel frequently kills my app
I'm developing a near real-time application, that forwards messages in a endless loop(no yield, no sleep, no wait).
The messages come from shared memory, go to another blocks of shared memory, so my ...
0
votes
2
answers
110
views
When is virtual dispatch faster than function templates in C++ runtime?
Most knows that template meta programming is in general faster than virtual dispatch in C++ due to types of templates were decided in compile time while virtual functions required runtime lookup on ...
0
votes
0
answers
36
views
How to Implement a Real-Time Screen Reader Using Neural TTS Models on CPU?
I am in the process of developing a Persian text-to-speech (TTS) engine for NVDA screen reader considering neural TTS models for real-time speech synthesis. I need advice on the following ...
0
votes
0
answers
98
views
What's the default value of TCP_NODELAY for gRPC server in C++?
Is there any way to configure this value in C++ for the gRPC server?
I found method - 20grpc_set_socket_low_latency - https://github.com/search?q=repo%3Agrpc/grpc%20grpc_set_socket_low_latency&...
-3
votes
1
answer
49
views
MySQL Performance Issue: Initial Query Slow After Period of Inactivity but Fast on Subsequent Queries
I'm experiencing a performance issue with MySQL when deploying my application to a live server. The problem occurs when creating a short link for the first time after a period of inactivity. ...
1
vote
2
answers
105
views
Numba njit making function call involving long expressions extremely slow
I am writing a finite volume code to solve the inviscid, compressible Euler equations. As a part of this, I am performing what is known as the Cauchy-Kovalevskaya process. A code snippet is given ...
0
votes
1
answer
170
views
Why is my performance so low with Aeron, 90th percentile less than 5ms throughput is only 2k
I wrote a simple test for Aeron
one client, one server, the client sends data to the server, then the server sends it back and counts the elapsed time.
The final result is much lower than I expected, ...
0
votes
0
answers
77
views
Profiling outliers in java
I use an event loop architecture in my Java application. Some cycles of the event loop are several orders of magnitude slower than the median ones. I use async profiler to profile my application. It ...
1
vote
0
answers
82
views
Stock tick-data over Web sockets from Broker to browser with a middle node.js server
We are creating a trading web app with a node.js backend and react front end.
This app will be able to view tick data and place trades using broker's APIs.
Broker's APIs give stock quotes over ...
3
votes
1
answer
118
views
Does Aeron support io_uring?
I'm using the Aeron open source for some POCs, and using C version of media driver, does it support the io_uring?
Also we are using java version of Aeron cluster and archive, what about them?
0
votes
0
answers
156
views
How do I reduce latency for Flutter Geolocator on iOS?
I've built an app using Flutter that allows users to track how far they have travelled from a starting point, a bit like any map-based or sports tracking app. With the app, a high accuracy is ...
3
votes
0
answers
204
views
Does std::steady_clock::now() make a syscall in Linux?
My project is a long time daemon service and need to busy check some conditions and take actions as soon as possible.
I want to use std::steady_clock to account the responsing time of my service and ...
0
votes
1
answer
116
views
Read multiple sockets from single thread [low-latency]
As a network client I would like to follow the input from two TCP connections. Both servers send a few packets per second. Each packet is small in size when compared to the available bandwidth.
...
0
votes
1
answer
214
views
Spring Boot spanner latency problems
I found our service has problems with latency and every single read statements is taking more than 40ms.
I tried to use native/derived queries and it always takes more than 40ms.
Reading data is ...
-1
votes
1
answer
90
views
How to detect which client pressed a button first on a webpage?
Context
I'm trying to create a trivia website that can detect which user pressed a button first; my idea is that the first user who clicks the button gets to answer the trivia question. I don't really ...
0
votes
1
answer
630
views
What's the impact of hyperthreading on low-latency development
I've read a post regarding low-latency development as below.
"We always avoid (relatively) slow language features like exceptions, memory allocation and virtual function calls on the critical ...
0
votes
0
answers
73
views
accurate measurement and reduce jitter
I am trying to understand how to measure a block of code to maximize performance.
On doing some tests I found there to be quite a few outliers that have large differences from the average. The ...
2
votes
1
answer
439
views
Long delay on reading from Kafka Spark
There is a Spark streaming application which reads messages from Kafka, processes them and then stores them to some DB.
During its latency optimization constant delay of ~2s on reading from Kafka was ...
2
votes
1
answer
521
views
How to achieve Spring boot p99 under 500 milliseconds for 1000 requests per second
Our spring boot application verion 2.5.10 has just a single api which reads from redis 6 times and does some calculations and there are no other IO operations such as database operations and is ...
0
votes
0
answers
870
views
High Latency and Thread Starvation Issues in Spring Boot App
I have a Spring Boot application with an API endpoint that pushes data to a Kafka topic. However, when the application experiences high load, it leads to significant latency issues, and many requests ...
1
vote
0
answers
986
views
How to get rid of interrupts on isolated cores (caused by simple Go app) during low-latency work?
Summary
LOC, IWI, RES and CAL interrupts are observed on isolated cores on which low-latency benchmark is performed. Interrupts are caused by simple Go application (printing "Hello world" ...
1
vote
0
answers
26
views
How to make a thread handle environment with very high machine count
We have an array consisting of 1000 elements each of which signifies a machine count of 1000 environments and there are 5 threads that need to be given environments from this array such that each ...
0
votes
0
answers
58
views
Why is my OpenMP code experiencing high latency in production despite performing well in development environment?
Below is a parallel task model using omp.
The proc1, ..., proc4 each costs below 1ms to finish. And in develop environment(2 physical CPU, 16 core in total) the job finishes perfectly.
But in ...
2
votes
1
answer
1k
views
Call openai-node in the backend or call https in the frontend?
I have a web application by ReactJS and Nodejs. This application calls OpenAI APIs.
Previously, when a user launches a request in the frontend, we send a request to the endpoint in our backend, call ...
-1
votes
1
answer
202
views
ultra low latency live video cam streaming using azure
I want to use azure for my video cam live stream. And I want to use that for a realtime object detection process.
Problem :
I used azure media services but it has approx 4 sec delay which is not ...
1
vote
0
answers
738
views
If I want to play Low-Latency HLS streaming on safari native player, must #EXT-X-TARGETDURATION and #EXTINF be equal?
I make two low-latency hls streams to play on safari native player. The only difference is EXT-X-TARGETDURATION:
"First fine stream" with EXT-X-TARGETDURATION = EXTINF = 6sec;
"Second ...
0
votes
0
answers
1k
views
Rust Tokio Async performance
I'm looking to create a high throughput, low-latency marketplace using Rust. I was doing some performance testing on the serialization and found that it was pretty slow (0.05ms). After some ...
0
votes
1
answer
134
views
How Chronicle Wire allows you to serialize and deserialize objects with higher performance
We would be developing a low-latency market data solution for which we are exploring Chronicle products.
Quoting from article
Chronicle Wire allows you to serialize and deserialize objects to and
...
2
votes
1
answer
226
views
How can I serve ML models quickly and with a low latency
Assume a user connects via a Websocket connection to a server, which serves a personalized typescript function based on a personalized JSON file
So when a user connects,
the personalized JSON file is ...
0
votes
0
answers
86
views
Strange increase in latency of ZeroMQ between c# and python
I'm trying to send images from C# running on win 11 to a python server running on WSL2.
I'm using ZeroMQ to send byte arrays, but as soon as the byte arrays cross a threshold (~8kb) the latency spikes ...
3
votes
0
answers
239
views
AMD Epyc Zen 2 slowed down by spin-loop
I wrote a multi-threaded SIMD matrix-vector multiplication that performs better than Intel MKL at my sizes of interest
(Multi-threaded fixed-size matrix-vector multiplication optimized for many-core ...
1
vote
0
answers
102
views
Streaming 1080p 30FPS from i.MX to X86 Laptop with Minimum Latency
UVC Camera is used supporting MJPG: 1080p@30 / 720p@30 or YUV:1080p@6 / 720p@6.
HW Setup-i.MX6 Board [HS-O]:
Video Source (VS-O): UVC Camera connected with i.MX6 Board USB 2.0 port.
Video Receive (VR):...
2
votes
0
answers
42
views
Why is my interval of consecutive rdtsc calls periodic?
I am profiling interval of consecutive rdtsc calls as suggested by this talk by David Gross.
My system is aws c5a.8xlarge instance, ubuntu 22.04 LTS server, no other user packages installed and there ...
-5
votes
1
answer
82
views
Which one performs better in Java? Creating object or getting from cache twice? [closed]
The object has 6,7 fields.
cache.get() is a map read.
Which one is faster?
obj temp = cache.get(key); // temporary object creation
Int id = temp != null ? temp.getId() : ...
1
vote
1
answer
282
views
ZMQ Pub-Sub latency spikes
I'm using a ZMQ pub-sub to pass messages from a C++ program to a Python script.
I'm passing around 5000 messages per second, and they vary in weight from 100 to 4500 bytes, with an average of about ...
1
vote
0
answers
104
views
Camera to computer high latency
I am connecting a Fujifilm x-t3 to my computer (usb-c to usb-c) and getting the video stream using OpenCV and C++ in Visual Studio. The issue is I am getting a latency of 116 milliseconds from the ...
0
votes
0
answers
507
views
Cloud run cold start spring boot issue
I have login API developed in spring boot and deployed on cloud run with min instance 1. But for some request it is taking 20s to process which is not desirable. When I checked logs, I found that ...
0
votes
1
answer
414
views
iOS Swift live streaming: Warning: PART-HOLD-BACK SHOULD be at least three times the Part Target Duration
We use AVPlayer in order to play live stream in our iOS app. We've recently added low-latency feature.
Only when low-latency is enabled: Sometimes player stalls / is frozen.
The following waring is ...
0
votes
1
answer
200
views
How can we provide low latency transmission of messages using jGroups?
We have implemented message broadcasting between instances of the same API using jGroups to synchronize application state.
We use jGroups5.2.4.Final on JDK 11.0.17 on Linux 5.4.0.
Currently we observe ...
0
votes
1
answer
282
views
x86 rep instructions, lock prefix, atomics and real-time
Consider the following case:
Thread A (evil code I do not control):
# Repeat some string operation for an arbitrarily long time
lock rep stosq ...
Thread 2 (my code, should have lock-free behaviour):...
0
votes
1
answer
551
views
Ultra low-latency in ubuntu 22.04
I should run a visual experimental project with Matlab or Python in which the time resolution is dramatically important! that's why I shift to Linux and use the low-latency kernel in ubuntu. but I ...
2
votes
1
answer
718
views
crypto HFT architecture
This architecture is designed to minimize latency with the help of busy-spinning and CPU affinity locks(meaning each producer/consumer thread running in only one core), preventing a thread from ...
0
votes
0
answers
206
views
How to reduce latency in a PyQT Display?
I am working on a program that takes in high-speed CanBus Data and displays it to a speedometer-style display using PYQT (A digital Dash Board Display). I have run into the common problem of the ...
3
votes
4
answers
883
views
How to return sum of two highest value in List<int> in most efficient way? C#
I'm trying to pass a pre-written test with focus on performance. What is the most efficient way to return the sum of the two highest numbers in a List of int? I have tried the following and according ...
0
votes
1
answer
1k
views
How should I approach parsing the network packet using C++ template?
Let's say I have an application that keeps receiving the byte stream from the socket. I have the documentation that describes what the packet looks like. For example, the total header size, and total ...
1
vote
0
answers
410
views
Temporal workflow hardware constraints for low latency
I have recently started reading about temporal for use in my company's project. I have built an in-house orchestrator for short lived tasks which is able to support 100 tps with 8 gb heap and 4 cores. ...
0
votes
2
answers
296
views
How to patch portaudio for low-latency iOS usage?
I am currently working on a patch that should make portaudio work for iOS. For now I successfully applied portaudio member Hans Petter's patch for iOS usage
https://www.dropbox.com/s/6hf9bjqpa6b6uv3/...
2
votes
1
answer
783
views
Large dataset, ProcessPoolExecutor issues
PROBLEM - ProcessPoolExecutor hasn't increased speed. Confirmed by tqdm
Learned enough about python to copy and/or write a program that works. each file takes ~40 seconds to load->filter->write. ...
3
votes
0
answers
220
views
Why does a function run faster when I call it continuously?
I am deveoping a low latency service use c++ on linux. I do two group performance tests:
send 1 request per second, it's average latency is 3.5 microseconds.
send 10 request per second, it's average ...