24,380 questions
0
votes
0
answers
62
views
Python: weird characters in extract_message
I use extract_message from pypi.org/project/extract-msg package to extract outlook messages. It works well until this outlook file causes the error message. I found out that the problem is that the ...
-1
votes
0
answers
29
views
How to encode item features with high number of categories for recommendation
For the recommendation problem I am working on, there are around 50000 unique brands and 3 level product categories, level_1_cat (50 categories), level_2_cat (100 categories) and level_3_cat (1000 ...
1
vote
1
answer
39
views
Encoding issue ISO-charset vs utf-8 [duplicate]
When I input data such as : Œ, œ, Ÿ, … , €, ≤, ≥, ∞, ≈, ≠, – , — , ‐ , ‑ , ‹ ›, “ ”, ‘ ’
And then try to access data from the $_REQUEST['some_value'] I get this as my output
¼, ½, ¾, … ,...
0
votes
0
answers
46
views
Telegram link with draft_text unicode issue
I trying to use telegram link to profile with draft_text
https://core.telegram.org/api/links#public-username-links
https://t.me/someuser?text=привет
But I have unicode issue.
I tried ...
0
votes
0
answers
29
views
Try enable Nvidia GPU accelaration for chromium's video encoding
I build the latest Chromium 133, and tring enable Nvidia (3060 / 3090) hardware encoding.
./chrome --enable-features=AcceleratedVideoDecodeLinuxZeroCopyGL,AcceleratedVideoDecodeLinuxGL,...
1
vote
1
answer
58
views
Detected characters in arguments that may not be executed correctly by the shell. Please escape special characters using backtick (`)
When running the below azure pipeline, I get the error:
Detected characters in arguments that may not be executed correctly by the shell. Please escape special characters using backtick (`).
This ...
0
votes
1
answer
48
views
Should I use both HtmlEncode and JavaScriptStringEncode if inside HTML <script> tag to create HtmlString from string?
My question is very similar to this one, but without the <% %> details.
Let's say I have the following code:
public static IHtmlString AddSomethingToWindow(string value) {
var output ...
0
votes
0
answers
24
views
Special characters missing in MySQL database and displayed incorrectly in the browser whether in utf8mb3 or utf8mb4 [duplicate]
I'm facing an issue with special characters (e.g., é, ç) in my web application.
Here's the situation:
My MySQL database, tables, and columns are all set to utf8mb4_unicode_ci.
UPDATE cars SET name = '...
0
votes
0
answers
34
views
Prestashop SMTP test email text with wrong utf8 encoding
I activated the SMPT email option on Prestashop 8.2.0 and tried to send a test email through the dedicated section on the same page, I entered my email and I received the email.
I noticed that ...
0
votes
0
answers
34
views
Create query on SAP 4/Hana view with SAS e.g
I try to create a query on SAS view, based on a SAP 4/Hana table.
I get the following error:
ERROR: Unable to transcode data to/from UCS-2 encoding.
How can I change the encoding?
I've tried to copy ...
0
votes
3
answers
42
views
URL encoding with semicolons (;) in Java using UriComponentsBuilder
I’m working on encoding semicolons (;) in URL query parameters using Java. The semicolons need to be encoded as %3B, but I'm running into an issue where they are either get double-encoded as %253B or ...
0
votes
0
answers
11
views
How to pre-process data correctly in order to retain positional information?
I am training a neural network for Direction of Arrival estimation. The CNN should learn the inverse mapping from the complex valued covariance matrix of array received data to the spatial
spectrum. ...
1
vote
2
answers
44
views
How to Handle Both Encoding and Decoding with Custom Error Handler in Python?
I'm working on a scenario where I need to handle encoding and decoding of strings using custom error handling in Python. Specifically, I want to create an error handler that can deal with both ...
0
votes
2
answers
56
views
Reading CSV File without manupulating encoding
I have a csv file that I can see with :
file -bi myCsv.txt
that the character-set is iso-8859-1.
Now I want to read this file using JavaScript from input tag and type 'file' without manipulating the ...
-1
votes
1
answer
87
views
How to parse Word files with Hindi text in KrutiDev font using Python in json
I'm trying to parse Word files .docx that contain Hindi text written in the KrutiDev 010 font.
When I process the text using Python and try to encode it into JSON, I get random gibberish instead of ...
0
votes
0
answers
25
views
How to download a zip file attachment from an Oracle UCM SOAP response in Microsoft Power Automate
I'm attempting to access an Oracle UCM folder using Microsoft Power Automate, that has Zip files in it. A CSV file is included with every Zip file. The objective is to copy this Zip file (or extract ...
0
votes
0
answers
10
views
Gradle sync replaceTokens: how to specify file encoding UTF-8?
Gradle sync replaceTokens for UTF-8 files:
task syncFiles(type: Sync) {
from("/path")
into "/path2"
filesMatching (['**/*.xml']) {
filter(...
0
votes
1
answer
56
views
Error reading file from Synapse Analytics - Encoding issue in azure data factory
I currently try to copy files from a SFTP to a blob storage in azure with Azure Datafactory. In my SFTP, i got ANSI and UTF-8 files.
The copy is working fine with a copy activity in ADF (due to my ...
0
votes
1
answer
31
views
Understanding byte-pair encoding tokenization for Greek characters
I am trying to train a new tokenizer with Greek text to later add the new tokens into the Llama 3.1 tokenizer using
tokenizer.add_tokens(list(new_tokens)).
However, upon training the byte-pair ...
1
vote
2
answers
73
views
How to differentiate between optimal prefix codes and Huffman codes?
Question
While studying encoding theory, I encountered a set of code tables where I need to determine whether they belong to one of the following categories:
A:Prefix codes
B:Huffman codes
C:Optimal ...
1
vote
2
answers
75
views
Passing UTF-8 arguments to commands in Perl on Windows
I am trying to build a template for Perl scripts so that they would do at least most of the basic things right with UTF-8 and would work equally well on Linux and Windows machines.
One thing in ...
1
vote
0
answers
50
views
In Qt, how do you determine a file's proper encoding, and subsequently load it into a QString?
Backstory ( You can skip )
Building a sgml parser
Testing it with every html document on my computer locate *.html > /tmp/htmldump
Coming across a lot of different type of documents:
file "...
0
votes
0
answers
33
views
How can I decode base64 and quoted-printable emails and encode as 7/8bit emails in Python?
I want to decode base64 and quoted-printable MIME emails in Python, such that I either get an email.message.Message (or email.message.EmailMessage) or string representing the entire email as though it ...
1
vote
2
answers
110
views
Why are characters like "ã" appearing encoded in UTF-8 in Git and how to fix it?
I'm using Git to version my code, but I noticed that some commits have the author names encoded in UTF-8 in a strange way. For example, the author name "João" appears like this when I .patch ...
0
votes
1
answer
53
views
Is there an encoding in which it's possible to have more characters than bytes?
I wrote code similar to this the other day (pseudo-code):
let numBytes = file.size()
let string = String::reserve(numChars=numBytes)
file.read(&string, encoding='auto')
But then I thought to ...
1
vote
2
answers
71
views
Encoding in utf-16be and decoding in utf-8 print the correct output but cannot be converted into Python representation?
If I'm encoding a string using utf-16be and decoding the encoded string using utf-8, I'm not getting any error and the output seems to be correctly getting printed on the screen as well but still I'm ...
0
votes
1
answer
54
views
Converting zlib related python code to java
I am trying to convert the below python code into java
def _convert_base10encoded_to_decompressed_array(base10encodedstring) -> None:
bytes_array = base10encodedstring.to_bytes(5000, 'big')...
-4
votes
1
answer
61
views
Convert a shape interface{} to []byte leads to "values are not fixed-sized" (binary encoder) or "no exported fields" (gob) but I can't change structs
I have a shape interface{} depicting a circle, a square, or something else.
I need to return it as binary in a []byte array.
If I use the binary encoder
import "encoding/binary"
var buffer ...
0
votes
1
answer
38
views
How can I select the default encoding for a new text resource file in visual studio 2019?
I am developing a desktop application which relies on text resource files which I create using VS 2019.
In particular I create the files by selecting "My Project" under my project's name in ...
1
vote
1
answer
72
views
AIX 5.3 system returning ? for special characters in filenames
I am trying to get filenames containing special characters on an AIX 5.3 system using a Java program but its replacing the special characters with a question mark. This is in client environment. But ...
0
votes
0
answers
33
views
esp32 and cutecom encoding
Im using esp32 to write uint16 values to CuteCom and from there I copied those values to a csv file to be use by a python file to display an image. This works great!!
Now I tried to send those same ...
0
votes
0
answers
10
views
SOAP request encoding issue with WebLogic CXF and EclipseLink MOXy
I have an EAR application running on WebLogic 12.2.1 and Java 8. My application uses CXF to call SOAP web service. It turns out that default system encoding (in my case cp1251) is actually used for ...
1
vote
0
answers
58
views
duckdb cli with utf8 from windows - what setting to use national strings?
i explore duckDb under windows 10 (winget install DuckDB.cli)
$ duckdb --version # v1.1.2 f680b7d08f
my goal is to select from xlsx (or csv with utf-8 encoding file)
and i'm faced with the fact that ...
0
votes
1
answer
74
views
Javascript Phone Number mask for azerbaijani numbers
I've been trying to implement a code snippet for the telephone input field in my WordPress FunnelKit builder checkout, but it isn't working as expected when users re-enter their numbers. I aim to ...
0
votes
0
answers
37
views
CSV upload issue with UTF-8 and special characters
I have the following function:
processFiles(fileList) {
this.loading = true; // Ladeindikator anzeigen
Array.from(fileList).forEach(file => {
const fileExtension = file.name.split(".&...
2
votes
2
answers
104
views
koi8-r text is shown incorrectly
Here is the code that tries to output text in koi8-r single-byte encoding to the browser.
I saved it in Emacs in koi8-r:
#!/usr/bin/perl
use Mojolicious::Lite;
no utf8;
get '/' => sub {
my $c =...
0
votes
0
answers
43
views
gibberish result when decoding base64 content
first of all I have a very weak experience regarding encoding/decoding, and I have the following base64 - or at least ChatGPT said it's a base64 encoded! - REST APT response ( Content-Type: ...
0
votes
0
answers
48
views
Method to encrypt text that outputs only alphabetic characters
OK so I want to be able to securely encrypt text (ideally AES?) in such a way that the encrypted text output only contains alphabetic characters. Most encoding schemes I've found include numbers which ...
0
votes
0
answers
20
views
Is there a way to transfer code from GBK to UTF8 by using Gitlab or other auto tanslation tools?
I'm working on a cross-platform program that needs to run on both Linux and Windows. However, I've encountered an encoding issue with Chinese characters after pulling some code from GitLab on Windows. ...
-1
votes
1
answer
71
views
Why does the character encoding of HTML source code need to match the one declared in the meta charset tag?
The title very much describes the question. The HTML standard says explicitly that the only value acceptable for the <meta charset> tag is UTF-8, and even the Nu Html Checker throws an error if ...
0
votes
0
answers
55
views
How can I scroll a part of the screen to see some options without the content on the sides moving? (in language C)
I am currently developing a cookie clicker-type game using Processing, and I'm facing a challenge with the user interface. In the section where I have multiple buttons for purchasing different ...
-1
votes
2
answers
162
views
Is there way to convert a 38 digit number into a String of length 18 to 20 without any special character?
I have an use case where I need to encode a 38 digit number into a string of length 18 to 20.
I'm using a ByteBuffer with capacity of 16 bytes in java to store two long values. Then I try to encode it ...
-1
votes
1
answer
64
views
Left View Method error in Binary tree using Python
I am learning DSA quite long.. During my learning, I encountered a concept 'Left View Method in Binary Tree'. I am completely scripted my code but there is a error when i tried to give a modified ...
2
votes
3
answers
236
views
How to properly limit a String for VARCHAR2(4000 BYTE) in Oracle?
We are using VARCHAR2(4000 CHAR) columns in our oracle database to store JSON data without compromising performance too much by using CLOB. I've learned that 4000 CHAR is technically limited to 4000 ...
2
votes
0
answers
78
views
Failed to fetch file gs://chromium-clang-format/025ca7c75f37ef4a40f3a67d81ddd11d7d0cdb9b - Error while executing "gclient sync"
I am working on a remote machine to set up the release build for my application. We have depot_tools dependencies, so while executing the gclient sync command, we are facing some Python-related issues....
-2
votes
4
answers
143
views
How to convert a binary string like '0xFF' to a base64 encoded string in Java? [closed]
I am working on a data migration project, which contains binary strings from SQL Server queries, which come across in a string format '0xFF...'. I need to convert these to a base64 encoded string of ...
0
votes
0
answers
100
views
How to solve no unicode mapping error from PDFBox
I have a PDF file that has been compressed by Adobe, and I used PDFTextStripper from pdfbox to extract the PDF text content as blank, however there are multiple errors similar to the following:
org....
1
vote
1
answer
192
views
Why am I getting \u0010 returned from Encoding.ASCII.GetString() when source byte array has simple bytes in it
I am trying to use C# to read/write some old datafiles which were written using Delphi and which only contain ASCII based data, some of the data represents Pascal strings which, you might know, have ...
0
votes
1
answer
41
views
python `getpass.getuser()` on Windows to UTF-8
We are working with Windows computers. User names are based on real names, and so might contain accents (like, in my case 'MichaëlHooreman').
Windows machines are using CP1252 encoding. I'm using ...
0
votes
1
answer
60
views
How can I find the used compression algorithm of a file from its input and output?
I have some text files that encoded and compressed. I have the output file, but I don’t know the compression algorithm. Could you please help me to find the used algorithm?
I found the following facts ...