Computational Audio Processing Research Papers

In the field of human speech capturing systems, a fundamental role is played by the source localization algorithms. In this paper a Speaker Localization algorithm (SLOC) based on Deep Neural Networks (DNN) is evaluated and compared with... more

—This paper focuses on Voice Activity Detectors (VAD) for multi-room domestic scenarios based on deep neural network architectures. Interesting advancements are observed with respect to a previous work. A comparative and extensive... more

In sound reproduction systems the audio crossover plays a fundamental role. Nowadays, digital crossover based on IIR filters are commonly employed, of which non-linear phase is a relevant topic. For this reason, solutions aiming to IIR... more

Bookmark
Download
- by stefano squartini and +2
  Ferdinando Foresi
  Diego Zallocco
- •
- 3
  Machine Learning, Digital Audio, Computational Audio Processing

In the emerging field of acoustic novelty detection, most research efforts are devoted to probabilistic approaches such as mixture models or state-space models. Only recent studies introduced (pseudo-)generative models for acoustic... more

—Novelty detection is the task of recognising events the differ from a model of normality. This paper proposes an acoustic novelty detector based on neural networks trained with an ad-versarial training strategy. The proposed approach is... more

Bookmark
Download
- by stefano squartini and +2
  Emanuele Principi
  Fabio Vesperini
- •
- 5
  Deep Learning, Novelty Detection, Autoencoder, Computational Audio Processing

The task of Speaker LOCalization (SLOC) has been the focus of numerous works in the research field, where SLOC is performed on pure speech data, requiring the presence of an Oracle Voice Activity Detection (VAD) algorithm. Nevertheless,... more

Bookmark
Download
- by stefano squartini and +1
  Emanuele Principi
- •
- 4
  Machine Learning, Speech Communication, Deep Neural Networks, Computational Audio Processing

In the past years, several hybridization techniques have been proposed to synthesize novel audio content owing its properties from two audio sources. These algorithms, however, usually provide no feature learning, leaving the user, often... more

Bookmark
Download
- by stefano squartini and +2
  Diego Droghini
  Emanuele Principi
- •
- 3
  Deep Learning, Digital Audio, Computational Audio Processing

Artificial sound event detection (SED) has the aim to mimic the human ability to perceive and understand what is happening in the surroundings. Nowadays, deep learning offers valuable techniques for this goal such as convolutional neural... more

Bookmark
Download
- by stefano squartini and +1
  Leonardo Gabrielli
- •
- 5
  Machine Learning, Artificial Neural Networks, Digital Audio, Machine Listening

—This paper presents a novel application of convo-lutional neural networks (CNNs) for the task of acoustic scene classification (ASC). We here propose the use of a CNN trained to classify short sequences of audio, represented by their... more

Cry detection is an important facility in both residential and public environments, which can answer to different needs of both private and professional users. In this paper, we investigate the problem of cry detection in professional... more

Bookmark
Download
- by stefano squartini and +1
  Emanuele Principi
- •
- 4
  Machine Learning, Audio Signal Processing, Neonatology, Computational Audio Processing

—In this paper, we propose a system for rare sound event detection using a hierarchical and multi-scaled approach based on Convolutional Neural Networks (CNN). The task consists on detection of event onsets from artificially generated... more

The amount of time an infant cries in a day helps the medical staff in the evaluation of his/her health conditions. Ex- tracting this information requires a cry detection algorithm able to operate in environments with challenging acoustic... more

Nowadays, the detection of human fall is a problem recognized by the entire scientific community. Methods that have good performance use human falls samples in the train set, while methods that do not use it, can only work well under... more

—Detecting the presence of speakers and suitably localize them in indoor environments undoubtedly represent two important tasks in the speech processing community. Several algorithms have been proposed for Voice Activity Detection (VAD)... more

Supporting people in their homes is an important issue both for ethical and practical reasons. Indeed, in the recent years, the scientific community devoted particular attention to detecting human falls, since the first cause of death for... more

This paper presents and compares two algorithms based on artificial neural networks (ANNs) for sound event detection in real life audio. Both systems have been developed and evaluated with the material provided for the third task of the... more

This paper focuses on employing Convolutional Neural Networks (CNN) with 3-D kernels for Voice Activity Detectors in multi-room domestic scenarios (mVAD). This technology is compared with the Multi Layer Perceptron (MLP) and interesting... more

Bookmark
Download
- by stefano squartini and +2
  Fabio Vesperini
  Emanuele Principi
- •
- 7
  Machine Learning, Computational Intelligence, Artificial Neural Networks, Smart Home

A Speaker Localization algorithm based on Neural Networks for multi-room domestic scenarios is proposed in this paper. The approach is fully data-driven and employs a Neural Network fed by GCC-PHAT (Generalized Cross Correlation Phase... more

Bookmark
Download
- by stefano squartini and +1
  Emanuele Principi
- •
- 3
  Artificial Neural Networks, Computational Audio Processing, Speaker Localization

Vehicle noise emissions are highly dependent on the road surface roughness and materials. A classification of the road surface conditions may be useful in several regards, from driving assistance to in-car audio equalization. With the... more

Bookmark
Download
- by stefano squartini and +1
  Livio Ambrosini
- •
- 3
  Automotive Engineering, Deep Learning, Computational Audio Processing

The primary cause of injury-related death for the elders is represented by falls. The scientific community devoted them particular attention, since injuries can be limited by an early detection of the event. The solution proposed in this... more

Bookmark
Download
- by stefano squartini and +2
  Diego Droghini
  Francesco Piazza
- •
- 5
  Computational Intelligence, Ambient Assisted Living, Digital Audio, Fall detection

Vehicle noise emissions are highly dependent on the road surface roughness and materials. A classification of the road surface conditions may be useful in several regards, from driving assistance to in-car audio equalization. With the... more

Bookmark

Supporting people in their homes is an important issue both for ethical and practical reasons. Indeed, in the recent years, the scientific community devoted particular attention to detecting human falls, since the first cause of death for... more

Bookmark

The primary cause of injury-related death for the elders is represented by falls. The scientific community devoted them particular attention, since injuries can be limited by an early detection of the event. The solution proposed in this... more

Computational Audio Processing

Log In