All Questions
Tagged with voice-recognition python
173 questions
0
votes
0
answers
15
views
connecting a fine-tuned chatgpt model with vtt and tts via terminal in linux (raspberry pi)
Basically, I'm trying to build a way to interact with a finetuned chatgpt model via a headset. I've managed to make it work with tinylama, though for to me unknown reasons that doesn't work anymore ...
0
votes
0
answers
62
views
Issue with Data Preprocessing and Tensor Concatenation for Whisper Model Training
I am trying to train a Whisper model for Jeju dialect speech recognition. However, I am encountering several errors related to tensor concatenation during the data preprocessing phase. Below is the ...
0
votes
0
answers
33
views
Sending file from Raspberry Pi Pico to voice recognition API
I have a project of a bedside lamp with a voice assistant using special requests. I have this code on my PC :
import os
import shutil
import time
import requests
import speech_recognition as sr
from ...
0
votes
0
answers
142
views
Speechbrains SpeakerRecognition saves short cuts/links/symlinks of used audio files in working directory
I use the speaker recognition of speechbrain using the Python language:
from speechbrain.inference.speaker import SpeakerRecognition
and I load a model in the following way
model = SpeakerRecognition....
1
vote
0
answers
279
views
Using webrtcvad to capture audio when the user starts speaking and stops speaking (like Siri) and then saving to a .wav file
I'm trying to make a "Siri-like" audio capture using python. I would like the system to detect speech to begin recording, and then detect silence to stop recording. I would like for this ...
0
votes
0
answers
115
views
How to extract these acoustic features from audio files
I was making a voice gender detector, and when trying new samples I need to get the features below from the audio samples, and I wrote this code that doos that; but apparently audioFeatureExtraction ...
0
votes
0
answers
61
views
Why Python voice assistant works so slow?
I am making a voice assistant using Python 3.12 on MacOS. Now, i am at voice recognition step, but it needs like 6 seconds for just recognize my speech and say 'How can i help you' to user. I know, ...
0
votes
0
answers
391
views
Detecting Silence in Python Voice Assistant
I'm making small pet-project. It's voice assistant. Now, development is on speech recognition part, and i need to make silent detection, to react on user's phrase end. I use vosk for speech ...
0
votes
0
answers
107
views
how to add events to calendar with google calendar api
for example; I have a meeting at 2 o'clock on December 8th, I'm trying to make a python voice assistant that will add the appropriate date in the calendar as an event when I give the voice command, ...
0
votes
0
answers
617
views
not able to convert produce audio file in gradio
I am making Voice Assistant for ordering in restaurants, the VA takes voice input passes to a trained model, and gives output according to intent. For interface, I am using Gradio which takes voice ...
0
votes
1
answer
133
views
Does Google Speech-to-Text Consume Internet Data for Silent Audio Input in Python?
I am currently using Python's Speech Recognition library with Google's Speech-to-Text functionality. My concern revolves around cases where there is no speech detected during the audio processing. ...
0
votes
0
answers
565
views
Unable to tts = TextToSpeech() | Tortoise tts | Torch
Am sorry, I may not use apt words for explaining my issue. please help me out.
Running the code in VS code with python.
These are the codes screenshots with errors:
My setup commands:
!nvidia-smi
!...
0
votes
1
answer
104
views
Python voice recognition stopping when it hears a sound it does not know
I am using Python speech recognition and when it hears sounds that are not understandable (like a dog barking) it raises an error and the program has to be restarted. Is there any way to fix this?
...
0
votes
0
answers
50
views
i was trying to create a desktop assistant using python but whenever i speak something it shows an error
FOR EG (I SAY ="Hello bro")
The error-- "'hello' is not recognized as an internal or external command, operable program or batch file."
or even if i say anything else it displays ...
1
vote
2
answers
1k
views
I have trouble working with pyaudio . How do I fix the errors?
I'm having trouble with my voice commander as it's not working properly and I think it's because of some errors in the PyAudio file. Can you suggest some steps to fix this issue?
I have already ...
-1
votes
1
answer
82
views
How to change the language speech
def take_command():
try:
with sr.Microphone() as source:
print('listening...')
voice = listener.listen(source)
command = listener.recognize_google(voice)...
-2
votes
1
answer
385
views
Python "speech_recognition" module has no "recognize"?
I was trying some code i watched on a video but when i had to run the program it said the following text: "An error ocurred : module 'speech_recognition' has no attribute 'recognize'" i used ...
1
vote
0
answers
379
views
PvRecorder not reading fast enough on my Raspberry Pi Zero
I'm using picovoice (rhino speech-to-intent and pvrecorder) on my pi0.
Here's my code:
from pvrecorder import PvRecorder
import pvrhino
#my pico voice key
key='my_access_key' #removed the access key
...
2
votes
1
answer
239
views
Why the command listener.listen from lib pyttsx3 printing this extra text?
basicly trying to follow the steps of a vid, everything is working great exapt that those lines of code:
with sr.Microphone() as source:
print('listening..')
voice = listener.listen(source)
command = ...
0
votes
1
answer
755
views
Voice Recognition (converting voice to text)
I have code to convert voice to written text, I want to save the written text after it's converted to files that can be accessed later, how do I do it in the following code?
import speech_recognition ...
1
vote
0
answers
214
views
I am making a voice assistant in python and I need a code or maybe a module for a wake up system
I am using the speech_recognition module for taking voice commands. The problem is that I want a system like Alexa and Google voice assistant has. When we say "Alexa" or "Ok Google"...
1
vote
0
answers
183
views
Python smart voice assistant with using a JSON file
I was trying to create a voice assistant using python... but my code is showing some weird texts when I run it... down below is my code
from email.mime import audio
from logging.config import listen
...
3
votes
0
answers
2k
views
Speaker recognition using Python (recognize who a voice belongs)
I would like to know if there is a library or tool for python that allows me to retrieve any identification for a recorded voice based on prior data.
For example:
import voice_recorder
import ...
0
votes
1
answer
161
views
How to make voice detection in python faster?
I have some voice detection code and it works! but, it runs really slowly. Can I do anything to make it faster?
import speech_recognition
import pyttsx3
recognizer = speech_recognition.Recognizer()
...
-1
votes
3
answers
279
views
Voice to text in python
Lets say I want to give input in Hindi or any other language in voice
And it will give an output in English (text). How can I do that?
What I know:
I know how to transfer voice to text.
E.g.
...
0
votes
1
answer
59
views
TypeError: speechtxt() takes 0 positional arguments but 1 was given
I am writing code for voice recognition I am run the program, I am getting TypeError problem.
This is my code:
import pyttsx3
import pyaudio
import speech_recognition as sr
import webbrowser
import ...
0
votes
0
answers
49
views
Can I use the mic with a voice record ? Python
im working in a selenium project and i need use the mic into a website. But I need run an specific voice record into the mic. So when the project is running i need this, 1-Open the website 2-Use the ...
0
votes
1
answer
25
views
Loading a test set when extracting a d-vector - model wants the training set size
So I have to extract a d-vector (d-vector is the averaged activation from the last hidden layer of this DNN) of the taught model. I pass in a checkpoint from the last epoch and want to create a list ...
1
vote
2
answers
1k
views
how to do speaker identification using voice?
I was wondering if you can detect s using their voice. For example, we give feed some data in our program like this is the voice of a cat and when it detects it, it says hello cat or something.
0
votes
1
answer
245
views
I am making voice assistant in pycharm. It was working fine week ago but now it is showing assertion erorr from __init__.py file
import pyttsx3
import speech_recognition as sr
import datetime
import wikipedia
import webbrowser
engine = pyttsx3.init('sapi5')
voices = engine.getProperty('voices')
engine.setProperty('voice', ...
1
vote
0
answers
890
views
How to display text on the screen it is said over the audio
As a personal project, I decided to create one of the reddit text-to-speech bot.
I pulled all the data from reddit with praw
import praw, random
def scrapeData(subredditName):
# Instantiate praw
...
1
vote
0
answers
92
views
I tried to make a virtual assistant using python, earlier it was only printing listening. Now it is browsing myntra's website for every command given
'''
on executing the following code, when I am giving any instruction to this voice assistant :
it is not printing whatever i am saying
it is opening myntra's website for every instruction I am ...
0
votes
0
answers
161
views
Training Keras Model : Graph Execution Error (callbacks = callbacks)
Following the Deep Learning Cookbook, for voice recognition. On the last step, when the epochs start, the following code is breaking at callbacks = callbacks:
history = model.fit_generator(
generator=...
1
vote
0
answers
151
views
Python Speechrecognizer
I'm currently working on my own voice assistant, but I often times have the problem that it's taking very long to answer/recognize that I'm not sayin anything.
I already tried using:
listener....
2
votes
1
answer
2k
views
Exception: Failed to process waveform
Error:
Traceback (most recent call last):
File "c:\Programming\New_assistant\speech_to_text.py", line 18, in <module>
if rec.AcceptWaveform(data):
File "C:\Users\david\...
0
votes
0
answers
208
views
Creating voice assistant in python and storing user's input
I have created a very simple voice assistant that listens to the user's input (more specifically, the user's story). Now, I want to create a function that stores the user's input - hereby, I have ...
1
vote
1
answer
165
views
Never ending batch in PyTorch
In short: training using the mutual information with a positive pair (two samples of same person) and one negative pair. Dont think the "how" is important here.
I need pointers on how to get ...
0
votes
1
answer
144
views
TypeError: argument of type 'int' is not iterable upon incorrect speech recognition
I'm working on a TTS/STT system for a school project and it was going well until I started getting this specific error whenever the code ran through the "Google Speech Recognition Does not ...
1
vote
0
answers
358
views
Kivy Voice recognition is not converting speech to text
my code is recording voice, but for some reason google recognizer or sphinx not converting the voice to text. If i print the audio there is voice. the output is "Sorry, I did not get that" ...
0
votes
1
answer
182
views
Build a voice assistant to open the application with a path in the curly bracket
I wondered if I could add a lot of paths to open the application, but my code only opens the calculator when I ask to open chrome. It shows me like this The file /System/Applications/Google Chrome.app ...
0
votes
1
answer
617
views
Cannot Train Wav2vec XLSR Model With Common Voice Data
I am trying to train a transformer ASR model with wav2vec XLSR in the danish language, but whenever I try to pull the danish dataset with datasets library it's giving me an error..
Notebook link
error ...
0
votes
1
answer
955
views
AttributeError: 'NoneType' object has no attribute 'lower' when saying nothing in voice assistant
I am making a voice assistant in python but when i dont speak in it i get a error like this:
Sorry, I could not Understand You!
Traceback (most recent call last):
File "g:\Python\app.py", ...
0
votes
2
answers
2k
views
Loading data using hugging_face load_dataset from Common Voice is giving an error
I am working on a voice dataset using the Facebook Hugging Face_ transformer, but I am unable to load data from the Common Voice forum:
from datasets import load_dataset, load_metric
...
0
votes
2
answers
100
views
Python return statement never reached but prints immediately before it
I am trying to call a take_command() function from main.py.
The system prints the value right before the return call, but doesn't actually return the value to main.py. The take_command() function was ...
1
vote
1
answer
2k
views
Python wakeup word for AI assistant
I have a voice assistant in python and can't add a wake word.
When I execute the code it just keeps listening until I stop talkin then it shuts off I need to make it keep listening for a wake word and ...
1
vote
0
answers
84
views
Python Voice Transcription Missing Words / Phrases
I'm using a Google Voice Transcription Function from this package
def find_speech_regions(filename,frame_width=10, min_region_size=1, max_region_size=5): # pylint: disable=too-many-locals
"&...
-2
votes
1
answer
76
views
How do i get him to stay awake until I say goodbye?
Building my own voice assistant. After each command is given and response returned, I have to wake him again. I want him to be awake for all queries. stuck, please help. sorry if it seems vague.
WAKE =...
0
votes
0
answers
105
views
How to add timer in speech input in python?
I want to add the timer for the 10sec audio input
import speech_recognition as
r= Recognizer()
with Microphone() as source:
while(i==0):
print('Say Something')
audio = r....
0
votes
1
answer
268
views
How to Create a new language model NLP? - Python
i use Google Api to transcript some audio files to text with Recognizer class. I found out there are limited numbers of languages available, and the most commonly and internationally used are part of ...
1
vote
0
answers
177
views
What does tf.squeeze does to the audio and how can I load an mp3?
I'm using TensorFlow and I would like to be able to load audio and generate a spectrogram from it. I have little knowledge of how audio internally works.
Currently, this is the code I'm using:
import ...