Official repository for RawNet, RawNet2, and RawNet3
Updated Mar 21, 2024 - Python
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
Implementation of the research paper "Neural Voice Cloning with Few Samples" by Baidu
Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO
Angular triplet center loss implementation in PyTorch.
This repository contains the code for the main part of my master's thesis in Data Science & Engineering at Politecnico di Torino
I. Thoidis, C. Gaultier, and T. Goehring, "Perceptual Analysis of Speaker Embeddings for Voice Discrimination between Machine And Human Listening," ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, Greece, 2023, pp. 1-5
Research on how to design a robot sound using voice conversion and speaker embeddings