Newest 'voice-recognition+python' Questions

0 votes

0 answers

15 views

connecting a fine-tuned chatgpt model with vtt and tts via terminal in linux (raspberry pi)

Basically, I'm trying to build a way to interact with a finetuned chatgpt model via a headset. I've managed to make it work with tinylama, though for to me unknown reasons that doesn't work anymore ...

marsika

1

asked Nov 21 at 19:31

0 votes

0 answers

62 views

Issue with Data Preprocessing and Tensor Concatenation for Whisper Model Training

I am trying to train a Whisper model for Jeju dialect speech recognition. However, I am encountering several errors related to tensor concatenation during the data preprocessing phase. Below is the ...

dw26

1

asked Jul 17 at 1:45

0 votes

0 answers

33 views

Sending file from Raspberry Pi Pico to voice recognition API

I have a project of a bedside lamp with a voice assistant using special requests. I have this code on my PC : import os import shutil import time import requests import speech_recognition as sr from ...

Melki Youssef

1

asked May 16 at 19:11

0 votes

0 answers

142 views

Speechbrains SpeakerRecognition saves short cuts/links/symlinks of used audio files in working directory

I use the speaker recognition of speechbrain using the Python language: from speechbrain.inference.speaker import SpeakerRecognition and I load a model in the following way model = SpeakerRecognition....

Tütü

3

asked Apr 12 at 11:26

1 vote

0 answers

279 views

Using webrtcvad to capture audio when the user starts speaking and stops speaking (like Siri) and then saving to a .wav file

I'm trying to make a "Siri-like" audio capture using python. I would like the system to detect speech to begin recording, and then detect silence to stop recording. I would like for this ...

helpme

95

asked Feb 18 at 2:03

0 votes

0 answers

115 views

How to extract these acoustic features from audio files

I was making a voice gender detector, and when trying new samples I need to get the features below from the audio samples, and I wrote this code that doos that; but apparently audioFeatureExtraction ...

yacer meftah

15

asked Feb 15 at 17:47

0 votes

0 answers

61 views

Why Python voice assistant works so slow?

I am making a voice assistant using Python 3.12 on MacOS. Now, i am at voice recognition step, but it needs like 6 seconds for just recognize my speech and say 'How can i help you' to user. I know, ...

Dima Paharski

1

asked Jan 4 at 18:49

0 votes

0 answers

391 views

Detecting Silence in Python Voice Assistant

I'm making small pet-project. It's voice assistant. Now, development is on speech recognition part, and i need to make silent detection, to react on user's phrase end. I use vosk for speech ...

Dima Paharski

1

asked Dec 22, 2023 at 14:41

0 votes

0 answers

107 views

how to add events to calendar with google calendar api

for example; I have a meeting at 2 o'clock on December 8th, I'm trying to make a python voice assistant that will add the appropriate date in the calendar as an event when I give the voice command, ...

zeyzey

31

asked Dec 6, 2023 at 10:14

0 votes

0 answers

617 views

not able to convert produce audio file in gradio

I am making Voice Assistant for ordering in restaurants, the VA takes voice input passes to a trained model, and gives output according to intent. For interface, I am using Gradio which takes voice ...

Isha

11

asked Nov 6, 2023 at 19:03

0 votes

1 answer

133 views

Does Google Speech-to-Text Consume Internet Data for Silent Audio Input in Python?

I am currently using Python's Speech Recognition library with Google's Speech-to-Text functionality. My concern revolves around cases where there is no speech detected during the audio processing. ...

mrithul e

91

asked Oct 16, 2023 at 15:36

0 votes

0 answers

565 views

Unable to tts = TextToSpeech() | Tortoise tts | Torch

Am sorry, I may not use apt words for explaining my issue. please help me out. Running the code in VS code with python. These are the codes screenshots with errors: My setup commands: !nvidia-smi !...

Daaris Ameen

11

asked Aug 4, 2023 at 10:20

0 votes

1 answer

104 views

Python voice recognition stopping when it hears a sound it does not know

I am using Python speech recognition and when it hears sounds that are not understandable (like a dog barking) it raises an error and the program has to be restarted. Is there any way to fix this? ...

Asher Ellet

3

asked Jun 30, 2023 at 16:40

0 votes

0 answers

50 views

i was trying to create a desktop assistant using python but whenever i speak something it shows an error

FOR EG (I SAY ="Hello bro") The error-- "'hello' is not recognized as an internal or external command, operable program or batch file." or even if i say anything else it displays ...

MANAN SINGH

1

asked May 16, 2023 at 15:20

1 vote

2 answers

1k views

I have trouble working with pyaudio . How do I fix the errors?

I'm having trouble with my voice commander as it's not working properly and I think it's because of some errors in the PyAudio file. Can you suggest some steps to fix this issue? I have already ...

Muskaan Sharma

25

asked Apr 30, 2023 at 11:19

-1 votes

1 answer

82 views

How to change the language speech

def take_command(): try: with sr.Microphone() as source: print('listening...') voice = listener.listen(source) command = listener.recognize_google(voice)...

Windows Master

1

asked Apr 24, 2023 at 18:00

-2 votes

1 answer

385 views

Python "speech_recognition" module has no "recognize"?

I was trying some code i watched on a video but when i had to run the program it said the following text: "An error ocurred : module 'speech_recognition' has no attribute 'recognize'" i used ...

axel cardona vasquez

1

asked Apr 6, 2023 at 20:01

1 vote

0 answers

379 views

PvRecorder not reading fast enough on my Raspberry Pi Zero

I'm using picovoice (rhino speech-to-intent and pvrecorder) on my pi0. Here's my code: from pvrecorder import PvRecorder import pvrhino #my pico voice key key='my_access_key' #removed the access key ...

aashvikt

172

asked Feb 13, 2023 at 2:55

2 votes

1 answer

239 views

Why the command listener.listen from lib pyttsx3 printing this extra text?

basicly trying to follow the steps of a vid, everything is working great exapt that those lines of code: with sr.Microphone() as source: print('listening..') voice = listener.listen(source) command = ...

Nir Malka

35

asked Jan 11, 2023 at 19:13

0 votes

1 answer

755 views

Voice Recognition (converting voice to text)

I have code to convert voice to written text, I want to save the written text after it's converted to files that can be accessed later, how do I do it in the following code? import speech_recognition ...

Fatimah

9

asked Jan 9, 2023 at 8:35

1 vote

0 answers

214 views

I am making a voice assistant in python and I need a code or maybe a module for a wake up system

I am using the speech_recognition module for taking voice commands. The problem is that I want a system like Alexa and Google voice assistant has. When we say "Alexa" or "Ok Google"...

Aditya Chandra

21

asked Jan 7, 2023 at 4:07

1 vote

0 answers

183 views

Python smart voice assistant with using a JSON file

I was trying to create a voice assistant using python... but my code is showing some weird texts when I run it... down below is my code from email.mime import audio from logging.config import listen ...

Ankur Pathak

11

asked Nov 28, 2022 at 11:53

3 votes

0 answers

2k views

Speaker recognition using Python (recognize who a voice belongs)

I would like to know if there is a library or tool for python that allows me to retrieve any identification for a recorded voice based on prior data. For example: import voice_recorder import ...

plshelpmeout

129

asked Nov 26, 2022 at 20:55

0 votes

1 answer

161 views

How to make voice detection in python faster?

I have some voice detection code and it works! but, it runs really slowly. Can I do anything to make it faster? import speech_recognition import pyttsx3 recognizer = speech_recognition.Recognizer() ...

user20292485

asked Nov 24, 2022 at 12:12

-1 votes

3 answers

279 views

Voice to text in python

Lets say I want to give input in Hindi or any other language in voice And it will give an output in English (text). How can I do that? What I know: I know how to transfer voice to text. E.g. ...

Subhajit

1

asked Oct 15, 2022 at 15:00

0 votes

1 answer

59 views

TypeError: speechtxt() takes 0 positional arguments but 1 was given

I am writing code for voice recognition I am run the program, I am getting TypeError problem. This is my code: import pyttsx3 import pyaudio import speech_recognition as sr import webbrowser import ...

user20017494

asked Sep 23, 2022 at 12:46

0 votes

0 answers

49 views

Can I use the mic with a voice record ? Python

im working in a selenium project and i need use the mic into a website. But I need run an specific voice record into the mic. So when the project is running i need this, 1-Open the website 2-Use the ...

Martin Lackington

19

asked Sep 20, 2022 at 12:02

0 votes

1 answer

25 views

Loading a test set when extracting a d-vector - model wants the training set size

So I have to extract a d-vector (d-vector is the averaged activation from the last hidden layer of this DNN) of the taught model. I pass in a checkpoint from the last epoch and want to create a list ...

Moriyama Aiko

21

asked Sep 8, 2022 at 8:51

1 vote

2 answers

1k views

how to do speaker identification using voice?

I was wondering if you can detect s using their voice. For example, we give feed some data in our program like this is the voice of a cat and when it detects it, it says hello cat or something.

Kamran Aslam

21

asked Aug 20, 2022 at 11:27

0 votes

1 answer

245 views

I am making voice assistant in pycharm. It was working fine week ago but now it is showing assertion erorr from init.py file

import pyttsx3 import speech_recognition as sr import datetime import wikipedia import webbrowser engine = pyttsx3.init('sapi5') voices = engine.getProperty('voices') engine.setProperty('voice', ...

Anuj Goel

1

asked Aug 18, 2022 at 14:29

1 vote

0 answers

890 views

How to display text on the screen it is said over the audio

As a personal project, I decided to create one of the reddit text-to-speech bot. I pulled all the data from reddit with praw import praw, random def scrapeData(subredditName): # Instantiate praw ...

jshel

21

asked Aug 11, 2022 at 23:46

1 vote

0 answers

92 views

I tried to make a virtual assistant using python, earlier it was only printing listening. Now it is browsing myntra's website for every command given

''' on executing the following code, when I am giving any instruction to this voice assistant : it is not printing whatever i am saying it is opening myntra's website for every instruction I am ...

AKSHIMA

11

asked Aug 8, 2022 at 11:37

0 votes

0 answers

161 views

Training Keras Model : Graph Execution Error (callbacks = callbacks)

Following the Deep Learning Cookbook, for voice recognition. On the last step, when the epochs start, the following code is breaking at callbacks = callbacks: history = model.fit_generator( generator=...

thelamp

89

asked Jul 19, 2022 at 18:34

1 vote

0 answers

151 views

Python Speechrecognizer

I'm currently working on my own voice assistant, but I often times have the problem that it's taking very long to answer/recognize that I'm not sayin anything. I already tried using: listener....

apokalipo

11

asked Jun 4, 2022 at 23:30

2 votes

1 answer

2k views

Exception: Failed to process waveform

Error: Traceback (most recent call last): File "c:\Programming\New_assistant\speech_to_text.py", line 18, in <module> if rec.AcceptWaveform(data): File "C:\Users\david\...

DarMaster

95

asked May 30, 2022 at 9:42

0 votes

0 answers

208 views

Creating voice assistant in python and storing user's input

I have created a very simple voice assistant that listens to the user's input (more specifically, the user's story). Now, I want to create a function that stores the user's input - hereby, I have ...

Lea

7

asked May 19, 2022 at 14:39

1 vote

1 answer

165 views

Never ending batch in PyTorch

In short: training using the mutual information with a positive pair (two samples of same person) and one negative pair. Dont think the "how" is important here. I need pointers on how to get ...

Moriyama Aiko

21

asked May 5, 2022 at 11:51

0 votes

1 answer

144 views

TypeError: argument of type 'int' is not iterable upon incorrect speech recognition

I'm working on a TTS/STT system for a school project and it was going well until I started getting this specific error whenever the code ran through the "Google Speech Recognition Does not ...

Daximus

1

asked Apr 17, 2022 at 5:12

1 vote

0 answers

358 views

Kivy Voice recognition is not converting speech to text

my code is recording voice, but for some reason google recognizer or sphinx not converting the voice to text. If i print the audio there is voice. the output is "Sorry, I did not get that" ...

ea hubs

31

asked Apr 2, 2022 at 1:15

0 votes

1 answer

182 views

Build a voice assistant to open the application with a path in the curly bracket

I wondered if I could add a lot of paths to open the application, but my code only opens the calculator when I ask to open chrome. It shows me like this The file /System/Applications/Google Chrome.app ...

Azfar Danish

3

asked Mar 6, 2022 at 15:29

0 votes

1 answer

617 views

Cannot Train Wav2vec XLSR Model With Common Voice Data

I am trying to train a transformer ASR model with wav2vec XLSR in the danish language, but whenever I try to pull the danish dataset with datasets library it's giving me an error.. Notebook link error ...

Siyam Fahad

61

asked Feb 27, 2022 at 12:00

0 votes

1 answer

955 views

AttributeError: 'NoneType' object has no attribute 'lower' when saying nothing in voice assistant

I am making a voice assistant in python but when i dont speak in it i get a error like this: Sorry, I could not Understand You! Traceback (most recent call last): File "g:\Python\app.py", ...

Harsh

1

asked Feb 18, 2022 at 13:02

0 votes

2 answers

2k views

Loading data using hugging_face load_dataset from Common Voice is giving an error

I am working on a voice dataset using the Facebook Hugging Face_ transformer, but I am unable to load data from the Common Voice forum: from datasets import load_dataset, load_metric ...

amad durrani

62

asked Feb 11, 2022 at 17:50

0 votes

2 answers

100 views

Python return statement never reached but prints immediately before it

I am trying to call a take_command() function from main.py. The system prints the value right before the return call, but doesn't actually return the value to main.py. The take_command() function was ...

abristow

3

asked Jan 27, 2022 at 21:13

1 vote

1 answer

2k views

Python wakeup word for AI assistant

I have a voice assistant in python and can't add a wake word. When I execute the code it just keeps listening until I stop talkin then it shuts off I need to make it keep listening for a wake word and ...

Filip GG

31

asked Jan 2, 2022 at 18:56

1 vote

0 answers

84 views

Python Voice Transcription Missing Words / Phrases

I'm using a Google Voice Transcription Function from this package def find_speech_regions(filename,frame_width=10, min_region_size=1, max_region_size=5): # pylint: disable=too-many-locals "&...

MDorcelian

11

asked Dec 16, 2021 at 7:26

-2 votes

1 answer

76 views

How do i get him to stay awake until I say goodbye?

Building my own voice assistant. After each command is given and response returned, I have to wake him again. I want him to be awake for all queries. stuck, please help. sorry if it seems vague. WAKE =...

Colin Peek

1

asked Dec 15, 2021 at 2:16

0 votes

0 answers

105 views

How to add timer in speech input in python?

I want to add the timer for the 10sec audio input import speech_recognition as r= Recognizer() with Microphone() as source: while(i==0): print('Say Something') audio = r....

Shubham Kumar

1

asked Nov 24, 2021 at 16:41

0 votes

1 answer

268 views

How to Create a new language model NLP? - Python

i use Google Api to transcript some audio files to text with Recognizer class. I found out there are limited numbers of languages available, and the most commonly and internationally used are part of ...

Nathanael Kazaka

3

asked Nov 6, 2021 at 10:19

1 vote

0 answers

177 views

What does tf.squeeze does to the audio and how can I load an mp3?

I'm using TensorFlow and I would like to be able to load audio and generate a spectrogram from it. I have little knowledge of how audio internally works. Currently, this is the code I'm using: import ...

João Marques

141

asked Oct 6, 2021 at 0:12

Collectives™ on Stack Overflow

All Questions

Related Tags