Spectral Modeling Synthesis: Past and Present

Spectral Analysis/Synthesis
Spectral Modeling Synthesis:

Past and Present Input
FFT Sine Detection
sound
Partial Tracking Residual Analysis

Xavier Serra partials residual
MTG-IUA, Universitat Pompeu Fabra Feature Analysis
Barcelona, Spain
http://www.iua.upf.es Transformations
Output
Sines/Noise Synthesis
sound
Xavier Serra - London 2003 2
… some history Relevant Research Topics (I)

X. Serra. 1989. A system for sound analysis / transformation / Detection/Estimation of Sinusoids
synthesis based on a deterministic plus stochastic decomposition.
Ph.D. Thesis. Stanford University.
window
Partial Tracking
generation pitch
Transient Modeling
magnitude frequency
smoothing spectrum
window sine frequencies
peak pitch peak
sound * FFT detection peak detection continuation sine magnitudes
phase peak sine phases
spectrum data
Multiresolution
data
additive
synthesis
sinusoidal
component
Residual Analysis/Modeling
-
residual
smoothing component
window window
*
generation
“…The objective is the development of an

amplitude
correction Feature-based Analysis/Synthesis
analysis/synthesis system that allows the largest FFT
possible number of transformations on the analysis

magnitude
spectrum
spectral
phase
spectrum
residual
Synthesis of Sinusoids/Noise
data before resynthesis …” approximation spectral data
Xavier Serra - London 2003 3 Xavier Serra - London 2003 4
1
Relevant Research Topics (II) Detection/Estimation of Sinusoids (I)
Morphing George, E. B. (thesis, 1991)
Analysis by synthesis where each
Time Scaling sinusoid is subtracted one at a
time.
Compression/Transmission
Source Separation/Transcription Depalle, P.; Hélie, T. (WASPAA,
1997)
Music Content Analysis
parametric modeling of the STFT.
Instrument/Voice Models
Expanded Models Goodwin, M. (thesis, 1997)
Matching Pursuit.
Software Environments
Detection/Estimation of Sinusoids (II) Partial Tracking

Ding, Y.; Qian, X. (ICMC, 1997) Depalle, Ph. et alt. (ICMC,
Global optimization of phase using 1993)
B-spline quadratic polynomials.
Hidden Markov Models.
Masri, P. (DAFX, 1998)
Measurement of distortion. Wang, A. (thesis, 1994)
Frequency-Lock Loop.
Marchand, S. (DAFX, 1998)
n signal derivatives.
Lagrange, M. et alt. (DAFX,
Hainsworth, S.; Macleod, M. 2003)
(DAFX, 2003) Linear prediction of the
Frequency reassignment. frequency evolutions.
2
Transient Modeling Residual Analysis/Modeling
Masri, P. (thesis 1996) Hamdy, K. N. et alt. (ICASSP, 1996)
Analysis of transients to Wavelet coding of residual.
position analysis window.
Goodwin, M. (thesis, 1996)
Ali, M. (thesis, 1996) Filter-bank auditory model.
Wavelet Analysis for
transients. Ding Y.; Qian, X.
(ICMC, 1997)
Verma, T. et alt. (ICMC, LPC modeling.
1997)
Sinusoids+Transients+ Desainte-Catherine, M; Hanna,P.
Noise Model. (DAFX, 2000)
Parameterization of noise-like sounds.
Multiresolution Feature-based Analysis/Synthesis

Input Output
Ellis, D.; Vercoe, B. Arcos, J. et alt. (ICMC, 1997) Inexpressive
Expression analysis and Score phrase Expressive phrase
(ICMC, 1990) transformation. .mid .snd .snd
constant-Q bandpass filters

Serra, X.; Bonada, J. (DAFX, 1998)
analysis synthesis
before sinusoidal modeling. Sms
High-level features for
transformations.
Levine, S. et alt.
(WASPAA, 1997) Rossignol S. et alt. (DAFX, 1999) .sms .sco
Vibrato processing.
Bounded-Q transform.
Gómez, E. et alt. (DAFX, 2003) Noos
Polotti P.; Evangelista G. Melodic transformations. Cases
methods
model
(DAFX, 2001) Gouyon, F. et alt. ((DAFX, 2003)

Harmonic-Band Wavelets. Rhythm transformations.
3
Synthesis of Sinusoids/Noise Morphing
Rodet, X.; Depalle, Ph. (AES, 1992) Serra, X. (ICMC, 1994) User Input
IFFT synthesis for sinusoids. sine- Featured-based
wave interpolation. Morph & Synthesis
Goodwin, M.; Rodet, X.(ICMC, 1994) SMS-Analysis
IFFT synthesis for nonstationary
sines. FFT with Tellman, E. et alt. SMS-
Morph
Blackman-Harris 92dB
Fitz, K.; Haken, L. (ICMC 1995)
(ICMC, 1994) Synthesis
Voice
Bandwidth Enhanced Sinusoidal Sinusoidal interpolation. Alignment

output
Modeling. based on
magnitude phoneme HMMs Target Information
Goodwin, M. (thesis, 1996)

spectrum Cano, P. et alt.
Song Information
Noise synthesis using Equivalent (ICMC, 2000)
Rectangular Bands. Voice morphing.
phase Analysis & Alignment
spectrum
Time Scaling Coding/Transmission

Verma, T.; Meng, H. Phase
Hamdy, K. N. et alt. (ICASSP,
(DAFX, 1998) Z-1 1996)
Spectrum
Phase Harmonics plus wavelets.
Using Sines +Transients Phase
Generation
+ Noise Signal Model.
Levine, S. N. (thesis, 1998)
Transformations in transform
Laroche, J.; Dolson, M. domain.
(IEEE, 1999) Peak
Spectral Peaks. Z-1 Detection Purnhagen, H.; Meine N.
Peak (ISCAS, 2000)
Bonada, J. (ICMC, 2000) Amplitude Peak Continuation MPEG4: Harmonic and Individual
Detection Lines plus Noise (HILN).
Spectral Peak
Processing. Amplitude Amatriain, X. et alt. (AES, 2002)
Content-based transmission.
4
Source Separation / Transcription Music Content Analysis
Maher, R. (thesis, 1989) Herrera, P. et alt. (CBMI, 1989)
Descriptors for MPEG-7.
Partial collision and Two
Way Mismatch algorithm Heittola, T.; Klapuri, A.
for F0 detection. (ISMIR, 2002)
Identification of drums.
Virtanen, T. et alt.
Gómez, E. et alt. (JNMR, 2003)
(ICASSP, 2000) Melodic description.
Multipitch analysis and
iterative parameter Wang, A. (Shazam, 2003)
estimation. Audio identification.
Instrument/Voice Models Expanded Models

Macon, M. W. et alt. (AES,
1997) EpR spectral EpR spectral
Laroche, J. et alt. (ICASSP, 1993)
Singing voice (Lyricos) Amplitude Phase
Fitz, K. et alt. (ICMC, 1995)

Haas, J. (MOSART, 2001) Flat harmonic
excitation sines
Sax synthesizer Filtering Ding Y. et alt. (ICMC, 1997)
Bonada et. alt. (ICMC, 2001)

SMS Goodwin, M. (thesis, 1997)
Flat residual synthesis
Singing voice (Vocaloid) excitation
Filtering
residual
Verma, et alt. (ICMC, 1997)
Laroche, J. (DAFX, 2003)
Spectral processing plus
1/F
Peeters, G. et alt. (ICMC, 1999)
TD-PSOLA SMS residual
5
Software Environments Conclusions
From speech to audio to music.
Serra, X. (LMJ, 1991)
SANSY: Lisp environment From analysis/synthesis to content
based on SPIRE
processing.
Fitz, K. et alt. (ICMC, 1995) Beyond signal processing techniques.
Lemur
Techniques are ready for many practical
Loscos, A. et alt.(DAFX, 1998)
SMSPerformer
applications.
Need to combine bottom-up with top-
Amatriain, X. et alt. (ACM, 2002)
CLAM down approaches.
6
Sinusoidal plus Residual Modeling of Musical Sounds: Relevant
References
compiled by Xavier Serra, September 2003
1. Chamberlin, H. 1980. “Using the FFT for Synthesis.” In Music Applications

of Microprocessors, Hayden Book Co., pp. 424-431.
2. Almeida, L. B. and F. M. Silva. 1983. “Harmonic Coding with Variable-

Frequency Synthesis”, Proceedings of the 1983 Spain Workshop on Signal
Processing and its Applications (WSPA'83), Sitges, Spain, September 1983.
3. Smith, J.O. and B. Friedlander. 1984. “High Resolution Spectrum Analysis

Programs.” TM no. 5466-05, Systems Control Technology, Palo Alto CA,
April 1984.
4. Almeida, L. B. and F. M. Silva. 1984. “Variable-Frequency Synthesis: An
Improved Harmonic Coding Scheme”, Proceedings of the 1984 IEEE
International Conference on Acoustics, Speech and Signal Processing
(ICASSP'84), S. Diego, California, March 1984.
5. Griffin, D. W.; J. S. Lim. 1985. “A New Model-Based Speech Analysis /

Synthesis System”, IEEE-ICASSP, 1985, pp. 513-516.
6. McAulay, R. J. and T. F. Quatieri. 1986. “Speech Analysis/Synthesis based

on a Sinusoidal Representation.” IEEE Transactions on Acoustics, Speech and
Signal Processing 34(4):744--754.
7. McAulay R. J; Thomas F. Quatieri. 1986. “Phase Modeling and its
Application to Sinusoidal Transform Coding”, IEEE Int. Conf. on Acoustics,
Speech and Signal Processing, pp. 1713-1715, April 1986.
8. Quatieri, T. F.; R. J. McAulay. 1986. “Speech Transformations Based on a
Sinusoidal Representation”, IEEE Transactions on Acoustics, Speech and
Signal Processing, Vol. 34, No. 6, December 1986.
9. Serra, X. 1986. “A Computer Model for Bar Percussion Instruments”
Proceedings of International Computer Music Conference 1986. La Haya, The
Netherlands
10. Smith, J.O.; Serra, X. 1987. “PARSHL: an analysis/synthesis program for

non-harmonic sounds based on a sinusoidal representation”. International
Computer Music Conference, 1987.
11. McAulay, R. J.; T. F. Quatieri. 1988. “Computationally efficient sine-wave

synthesis and its application to sinusoidal transform coding.” Proc. IEEE
ICASSP-88, pp. 370-373, 1988.
12. Maher, Robert C. 1989. An Approach for the Separation of Voices in

Composite Musical Signals. Ph.D. Thesis, University of Illinois at Urbana-
Champaign.
13. McAulay, R. J.; Thomas F. Quatieri. 1989. “Phase Coherence in Speech
Reconstruction for Enhancement and Coding Applications”, IEEE Int. Conf.
on Acoustics, Speech and Signal Processing, Glasgow, pp. 207-209 (May
1989).
14. Serra, X. Smith, J. 1989. “Spectral Modeling Synthesis”. Proceedings of
International Computer Music Conference 1989. Ohio, USA
15. Serra, X. 1989. A system for sound analysis/transformation/synthesis based
on a deterministic plus stochastic decomposition. Ph.D. thesis, Stanford
University.
16. Ellis, Daniel P., Barry L. Vercoe. 1990. “A wavelet based sinusoid model of
sound for auditory signal separation.” ICMC90
17. Maher, Robert and James Beauchamp. 1990. “An Investigation of Vocal
Vibrato for Synthesis.” Applied Acoustics 30 pp. 219-245
18. McAulay, R. J.; T. F. Quatieri. 1990. “Pitch Estimation and Voicing
Detection Based on a Sinusoidal Speech Model.” Proceedings IEEE ICASSP
1990.
19. Schumacher, R. T., and C. Chafe. 1990. “Detection of Aperiodicity in Nearly
Periodic Signals.” Proceedings of the IEEE Int. Conf on Acoustics, Speech,
and Signal Processing, Alburquerque, NM, 1990.
20. George, E. B. 1991. An Analysis-by-Synthesis Approach to Sinusoidal

Modeling Applied to Speech and Musical Signal Processing. Ph.D.
dissertation, Georgia Institute of Technology.
21. George, E. B. and M. J. T. Smith. 1991. “An Analysis-by-Synthesis
Approach to Sinusoidal Modeling Applied to the Analysis and Synthesis of
Musical Tones,” in Proc. 1991 CMA International Computer Music
Conference, October 1991, pp. 356-359.
22. Serra, X. 1991. “SANSY: An Environment for the transformation of musical
sounds”, Leonardo Music Journal Vol. Fall.
23. Xie, X.; R. J. Evans. 1991. “Multiple Target Tracking and Multiple
Frequency Line Tracking Using Hidden Markov Models.” IEEE Transactions
on Signal Processing, vol. 39, pp. 2659-2676, December 1991.
24. Fitz, K; W. Walker; L. Haken. 1992. “Extending the McAulay-Quatieri

Analysis for Synthesis with a Limited Number of Oscillators”. ICMC92.
25. Freed, Adrian; Xavier Rodet, Philippe Depalle. 1992. “Synthesis and Control
of Hundreds of Sinusoidal Partials on a Desktop Computer without Custom
Hardware”, ICSPAT 92, San José (USA), 1992
26. Garcia G. 1992. “Analyse des Signaux Sonores en Termes de Partiels et de
Bruit. Extraction Automatique des Trajets Frèquentiels par des Modèles de
Markov Cachès.” Mèmoire de DEA en Automatique et Traitement du Signal,
Orsay, 1992.
27. George, E. B.; M. J.T.Smith. 1992. “Analysis-by-Synthesis/Overlap-Add
Sinusoidal Modeling Applied to the Analysis and Synthesis of Musical
Tones”. J. Audio Eng. Soc., Vol. 40, No. 6, June 1992.
28. Holloway, Bryan and Lippold Haken. 1992. “A Sinusoidal Synthesis
Algorithm for Generating Transitions Between Notes”, ICMC92
29. McIntyre, C. M.; D. A. Dermott. 1992. “A New Fine-Frequency Estimation
Algorithm Based on Parabolic Regression.” IEEE-ICASSP 1992, pp. 541-544.
30. Rodet, X. and P. Depalle. 1992. “Spectral Envelopes and Inverse FFT
Synthesis.” 93rd Convention of the Audio Engineering Society. San Francisco,
October 1992.
31. Barrett, R.F.; Holdsworth, D.A. 1993. “Frequency tracking using hidden
Markov models with amplitude and phase information”, IEEE Transactions on
Signal Processing, Volume: 41, Issue: 10, Year: Oct 1993 Page(s): 2965-
2976
32. Depalle, Ph., G. Garcia and X. Rodet. 1993. “Analysis of Sound for Additive
Synthesis: Tracking of Partials Using Hidden Markov Models.” Proceedings
of the 1993 International Computer Music Conference. San Francisco:
Computer Music Association.
33. Doval, B., and X. Rodet. 1993. “Fundamental frequency estimation and
tracking using maximum likelihood harmonic matching and HMMs.”
Proceedings of the ICASSP ‘93, 221--224.
34. Laroche, J.; Y Stylianou; E. Moulines. 1993. “HNS: Speech Modification
based on a Harmonic+Noise Model”. Proc. IEEE-ICASSP-93, Vol. II. pp.
550-553, April 1993.
35. Macon, Michael W. 1993. Applications of Sinusoidal Modeling to Speech
and Audio Signal Processing. Ph.D. dissertation, Georgia Institute of
Technology.
36. Adams, G.J.; Evans, R.J. 1994. “Neural networks for frequency line tracking
“ IEEE Transactions on Signal Processing, Volume: 42 Issue: 4 , April 1994
Page(s): 936 -941
37. Doval, B. 1994. Estimation de la Fréquence Fondamentale des signaux
sonores. PhD. Thesis, Université Paris-6, Paris, 1994.
38. Goodwin, M. and X. Rodet. 1994. “Efficient Fourier Synthesis of
Nonstationary Sinusoids.” Proceedings of the 1994 International Computer
Music Conference. San Francisco: Computer Music Association.
39. Serra, Xavier. 1994. “Residual Minimization in a Musical Signal Model
based on a Deterministic plus Stochastic Decomposition.” Journal of the
Acoustical Society of America 95(5-2):2958--2959.
40. Serra, Xavier. 1994. “Sound Hybridization Techniques based on a
Deterministic plus Stochastic Decomposition Model.” Proceedings of the
1994 International Computer Music Conference. San Francisco: Computer
Music Association.
41. Tellman, E.; L. Haken; B. Holloway. 1994.”Timbre Morphing Using the
Lemur Representation.” Proceedings of the International Computer Music
Conference, Aarhus, Denmark, October 1994.
42. Wang, A. 1994. Instantaneous and Frequency-Warped Signal Processing
Techniques for Audio Source Separation. Ph.D. Thesis, Stanford University.
43. Dutoit, T. and B. Gosselin. 1995. “On the Use of a Hybrid

Harmonic/Stochastic Model for TTS synthesis-by-Concatenation.” Speech
Communication 19 pp. 119-143.
44. Fitz, Nelly; Lippold Haken, and Bryan Holloway. 1995. “Lemur - A Tool for
Timbre Manipulation.” International Computer Music Conference, September
1995, Banff Centre, Alberta, Canada
45. Fitz, K; and L. Haken. 1995. “Bandwidth Enhanced Sinusoidal Modeling in
Lemur.” Proc. International Computer Music Conference, Banff, 1995.
46. Goodwin, M.; A. Kogon. 1995. “Overlap-add synthesis of non-stationary
sinusoids.” Proc. International Computer Music Conference, Banff, 1995.
47. Masri, P., Bateman, A. 1995. “Identification of nonstationary audio signals
using the FFT, with application to analysis-based synthesis of sound.” Proc.
IEE Colloquium on Audio Engineering. pp. 11.1-6.
48. McAulay, R. J.; T. F. Quatieri. 1995. “Sinusoidal coding.” In Speech Coding
and Synthesis, Chapter 4, W.B. Kleijn, and K.K. Paliwal Eds., Elsevier, 1995.
49. Osaka, N. 1995. “Timbre Interpolation of Sounds Using a Sinusoidal
Model.” ICMC 95.
50. Quatieri, T. F. and T. E. Hanna. 1995. “Time-scale modification with
inconsistent constraints”, in Proceedings of the IEEE Workshop on
Applications of Signal Processing to Audio and Acoustics, New Paltz, NY,
New York, Oct. 18, 1995, pp. Session 10, Paper 2, IEEE Press.
51. Stylianou, Y.; J. Laroche; E. Moulines. 1995. “High Quality Speech
Modification based on a Harmonic + Noise Model.” Eurospeech-95.
52. Tellman, E.; L. Haken; B. Holloway. 1995. “Timbre Morphing of Sounds
with Unequal Number of Features.” J. Audio Eng. Soc., Vol. 43, No 9. 1995.
53. Wang, A. 1995. “Instantaneous and frequency-warped techniques for source
separation and signal parametrization.” in Proceedings of the IEEE Workshop
on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY,
New York, Oct. 1995, IEEE Press.
54. Ali, M. 1996. Adaptive Signal Representation with Applications in Audio

Coding. Ph.D. thesis, University of Minnesota.
55. Depalle, P.; L. Tromp. 1996. “An Improved Additive Analysis Method
Using Parametric Modeling of the Short-Time Fourier Transform.”
Proceedings of the ICMC 96.
56. Dutoit, T.; B. Gosselin. 1996. “On the use of a hybrid harmonic/stochastic
model for TTS synthesis-by-concatenation.” Speech Communacation 19, pp.
119-143.
57. Fitz, Kelly and Lippold Haken. 1996. “Sinusoidal Modeling and
Manipulation Using Lemur.” Computer Music Journal, vol. 20.4, 1996, pp.
44-59.
58. Goodwin, M. ; M. Vetterli.1996. “Time-Frequency Signal Models for Music
Analysis, Transformation, and Synthesis.” Time-Frequency Time-Scale
Symposium, Multidimensional Systems and Signal Processing , Paris, Aug.
1996.
59. Goodwin, M. 1996. “Residual modeling in music analysis-synthesis.” Proc
IEEE-ICASSP, Atlanta, GA, pp. 1005-1008, May 1996.
60. Gribonval, R.; E. Bacry, S. Mallat, Ph. Depalle, X. Rodet. 1996. “Analysis of
sound signal with high resolution matching pursuit.” Proceedings of the IEEE
Conference on Time-Frequency and Time-Scale Analysis (TFTS'96), Paris,
France, June 1996.
61. Hamdy, K. N.; M. Ali and A. H. Tewfik. 1996. “Low bit rate high quality
audio coding with combined harmonic and wavelet representations.”
Proceedings of ICASSP96
62. Lomax, K. 1996. “The development of a singing synthesizer.” in Speech and
Computers (SPECOM), 1996.
63. Macon, M. W. 1996. Speech Synthesis Based on Sinusoidal Modeling. PhD
thesis, Georgia Institute of Technology, October 1996.
64. Macon, M. W. and M. A. Clements. 1996. “Speech concatenation and
synthesis using an overlap-add sinusoidal model.” in Proceedings of the
International Conference on Acoustics, Speech, and Signal Processing, vol. 1,
pp. 361-364, May 1996.
65. Masri, P. 1996. Computer Modeling of Sound for Transformation and
Synthesis of Musical Signal. PhD thesis, University of Bristol, Dec. 1996.
66. Masri, P.; A. Bateman. 1996. “Improved Modelling of Attack Transients in
Music Analysis-Resynthesis.” ICMC-96.
67. Phillips, D.; A. Purvis; S. Johnson. 1996. “Multirate Additive Synthesis.”
ICMC 96.
68. Pielemeier, W. J.; G.H. Wakefield. 1996. “A high-resolution time-frequency
representation for musical instrument signals.” J. Acoust. Soc. Amer., 99(4),
1996.
69. Stainsby, Thomas. 1996. “A System for the Separation of Simultaneous
Musical Audio Signals.” ICMC96
70. Arcos, J. Lopez de Mantaras, R. Serra, X. 1997. “Generating expressive

musical performances with SaxEx.” Proceedings of AIMI International
Workshop. KANSEI - The Technology of Emotion. Genova, Italy
71. Arcos, J. Lopez de Mantaras, R. Serra, X. 1997. “Saxex: a Case-Based
Reasoning System for Generating Expressive Musical Performances”.
Proceedings of International Computer Music Conference 1997. Thessaloniki,
Greece
72. Bonada, J. 1997. “Desenvolupament d`un entorn gráfic per a l`análisi,
transformació i síntesi de sons mitjanant models espectrals”.
UPC. Barcelona
73. Depalle, P.; T. Hélie. 1997. “Extraction of Spectral Peak Parameters Using a
Short-Time Fourier Transform Modeling and No Sidelobe Windows.”
Proceedings of IEEE Workshop on Audio, Mohonk 1997.
74. Ding Y.; X. Qian. 1997. “Sinusoidal and Residual Decomposition and
Residual Modeling of Musical Tones Using the QUASAR Signal Model.”
Proceedings of the ICMC 97.
75. Ding, Y. and Qian, X., 1997. “Processing of Musical Tones Using a
Combined Quadratic Polynomial-Phase Sinusoid and Residual (QUASAR)
Signal Model.” J. Audio Eng. Soc., Vol. 45, No. 7/8, pp. 571-584.
76. Ding, Y. and Qian, X., 1997. “Estimating Sinusoidal Parameters of Musical
Tones based on Global Waveform Fitting”, Proceedings of the IEEE
Workshop on Multimedia Signal Processing, pp. 95-100, June 1997.
77. Dubnov, S.; X. Rodet. 1997. “Statistical Modeling of Sound Aperiodicities.”
ICMC-97.
78. Fitz, K.; L. Haken. 1997. “Sinusoidal Modeling and Manipulation Using
Lemur.” Computer Music Journal, vol. 20, n 4. [direct implementation of the
McAulay and Quatieri sinusoidal modeling approach]
79. George, E. B.; M. J.T.Smith. 1997. “Speech Analysis/Synthesis and
Modification Using and Analysis-by-Synthesis/Overlap-Add Sinusoidal
Model.” IEEE Transactions on Speech and Audio Processing, vol. 5, No. 5.
80. Goodwin, M. 1997. “Matching pursuit with damped sinusoids,” in
Proceedings ICASSP’97, Munich, Germany, May 1997, vol. 3, pp. 2037–
2040.
81. Goodwin, M., 1997. Adaptive Signal Models: Theory, Algorithms, and
Audio Applications. Ph.D. Thesis, University of California, Berkeley
82. Laroche, J. and M. Dolson, “About this phasiness business.” in Proceedings
of the IEEE Workshop on Applications of Signal Processing to Audio and
Acoustics, New Paltz, NY, New York, Oct. 1997, IEEE Press.
83. Levine, Scott, Tony Verma, Julius O. Smith III. 1997. “Alias-Free,
Multiresolution Sinusoidal Modeling for Polyphonic, Wideband Audio.” IEEE
Workshop on Applications of Signal Processing to Audio and Acoustics,
Mohnonk, NY, 1997.
84. Loureiro, R. Serra, X. 1997. “A Web Interface for a Sound Database and
Processing System”. Proceedings of International Computer Music
Conference 1997. Thessaloniki,Greece
85. Lomax, Ken. 1997. The Analysis and Synthesis of the Singing Voice. Ph.D:
Thesis, Oxford University.
86. Macon, M. W.; L. Jensen-Link, J. Oliverio, M. Clements, and E. B. George.
1997. “Concatenation-based MIDI-to-singing voice synthesis.” 103rd Meeting
of the Audio Engineering Society, New York, 1997.
87. Macon, M. W.; L. Jensen-Link, J. Oliverio, M. Clements, and E. B. George.
1997. “A system for singing voice synthesis based on sinusoidal modeling,”
Proc. of International Conference on Acoustics, Speech, and Signal
Processing, Vol. 1, pp. 435-438, 1997.
88. Prandoni, P.; M. Goodwin, M. Vetterli. “Optimal time segmentation for
signal modeling and compression.” Proc ICASSP97, vol 3, pp. 2029-2032,
Munich, Germany, April 1997.
89. Qian, Ding. 1997. “A phase interpolation algorithm for sinusoidal model
based music synthesis.” Proceedings of the International Conference on
Acoustics, Speech and Signal Processing, 1997, pp. 451-454.
90. Rodet, X. 1997. “Musical Sound Signals Analysis/Synthesis:
Sinusoidal+Residual and Elementary Waveform Models”, in Proceedings of
the IEEE Time-Frequency and Time-Scale Workshop (TFTS'97), University
of Warwick, Coventry, UK, 27th-29th August 1997.
91. Serra, X. Bonada, J. Herrera, P. Loureiro, R. 1997. “Integrating
Complementary Spectral Models in the Design of a Musical Synthesizer.”
Proceedings of International Computer Music Conference 1997. Thessaloniki,
Greece
92. Serra, Xavier. 1997. “Musical Sound Modeling With Sinusoids Plus Noise.”
In Roads, Pope, Poli (eds.). Musical Signal Processing. Swets & Zeitlinger
Publishers.
93. Sullivan, D. L 1997. “Accurate frequency tracking of timpani spectral lines.”
JASA, 101 (1), 1997.
94. Verma, T. S.; S. N. Levine; T. H.Y. Meng. 1997. “Transient Modeling
Synthesis: a flexible analysis/synthesis tool for transient signals”, Proceedings
of the ICMC 1997.
95. Amatriain, X. Bonada, J. Serra, X. 1998. “METRIX: A Musical Data

Definition Language and Data Structure for a Spectral Modeling Based
Synthesizer”. Proceedings of COST G6 Conference on Digital Audio Effects
1998. Barcelona
96. Arcos, J. Lopez de Mantaras, R. Serra, X. 1998. “Saxex: a Case-Based
Reasoning System for Generating Expressive Musical Performances”.
Journal of New Music Research Vol.27 .3
97. Campedel, Marine. 1998. Etude du modèle “sinusoids et bruit” pour le
traitement des signaux de parole, Estimation Robuste de l’envelope spectrale.
Ph.D. Thesis, TELECOM Paris.
98. Cano, P. 1998. “Fundamental Frequency Estimation in the SMS analysis.”
Proceedings of COST G6 Conference on Digital Audio Effects 1998.
Barcelona
99. Di Federico, Riccardo. 1998. “Waveform Preserving Time Stretching and
Pitch Shifting for Sinusoidal Models of Sound”. Proceedings of COST G6
Conference on Digital Audio Effects 1998. Barcelona
100. Fernandez-Cid, Pablo. 1998. Transcripción Automática de Señales Musicales
Polifónicas. PhD Thesis, Universidad Politécnica de Madrid.
101. George, E. B. 1998. “Practical High-Quality Speech and Voice Synthesis
Using Fixed Frame Rate ABS/OLA Sinusoidal Modeling.” in Proc. 1998
IEEE Int’l Conf. On Acoust., Speech, and Signal Processing, May 1998.
102. Guerra, E. 1998. “VowSynth: A Synthesizer of Vowel Sounds Based on
Additive Synthesis.” Proceedings of COST G6 Conference on Digital Audio
Effects 1998. Barcelona
103. Herrera, P. Bonada, J. 1998. “Vibrato Extraction and Parameterization in the
Spectral Modeling Synthesis framework.” Proceedings of COST G6
104. Irizarry, R. A. 1998. Statistics and Music: Fitting a Local Harmonic Model to
Musical Sound Signals. Ph.D. thesis, University of California, Berkeley.
105. Klapuri, A. 1998. “Automatic Transcription of Music.” MSc thesis, Tampere
University of Technology, 1998.
106. Klapuri, A. 1998. “Number Theoretical Means of Resolving a Mixture of
Several Harmonic Sounds.” Proceedings of the European Signal Processing
Conference, 1998.
107. Laroche, Jean. 1998. “Using Resonant Filters for the Synthesis of Time-
Varying Sinusoids.” 105th AES Convention, San Francisco, CA. 1998.
Preprint 4782 (F-6).
108. Levine, Scott. 1998. Audio Representation for Data Compression and
Compressed Domain Processing. Ph.D. thesis. Stanford University.
109. Levine, S. N. and J. O. Smith. 1998. “A sines+transients+noise audio
representation for data compression and time/pitch-scale modi.cations.” Audio
Engineering Society Convention, no. 4781, 1998.
110. Loscos, A.; Resina, E. 1998. “SMSPerformer: A real-time synthesis interface
for SMS”. Proceedings of COST G6 Conference on Digital Audio Effects
1998. Barcelona
111. Macias, B. 1998. “SMS3d: An application for the visualization of SMS
data.” Proceedings of COST G6 Conference on Digital Audio Effects 1998.
Barcelona
112. Marchand, Sylvain. 1998. “Improving Spectral Analysis Precision with an
Enhanced Phase Vocoder using Signal Derivatives.” Proceedings of COST G6
113. Masri, Paul. 1998. “Extracting more Detail from the Spectrum with Phase
Distortion Analysis.” DAFX98-Workshop, Barcelona (Spain), November
1998 .
114. Peeters, G.; X. Rodet. 1998. “Sinusoidal Characterization in terms of
Sinusoidal and Non-Sinusoidal Components.” DAFX98-Workshop, Barcelona
(Spain), november 1998 .
115. Resina, E. 1998. “SMS Composer and SMS Conductor: Applications for
Spectral Modeling Synthesis Composition and Performance.” Proceedings of
COST G6 Conference on Digital Audio Effects 1998. Barcelona.
116. Serra, X. Bonada, J. 1998. “Sound Transformations Based on the SMS High
Level Attributes.” Proceedings of COST G6 Conference on Digital Audio
Effects 1998. Barcelona
117. Verma, T. S.; T. H. Y. Meng. 1998. “An Analysis/Synthesis Tool for
Transient Signals.” ASA98.
118. Verma, T. S.; T. H. Y. Meng. 1998. “An Analysis/Synthesis Tool for
Transient Signals that Allows a Flexible Sines#Transient#Noise Model for
Audio.” ICASSP98.
119. Verma, T. S.; T. H. Y. Meng. 1998. “Sinusoidal Modeling Using Frame-
Based Perceptually Weighted matching Pursuits.” ICASSP99.
120. Verma, T. S.; T. H. Y. Meng. 1998. “Time Scale Modification Using a
Sines+Transients+Noise Signal Model.” Proceedings of the Digital Audio
Effects Workshop (DAFX98), Barcelona, November 1998.
121. Wright, M. Chaudhary, A. Freed, A. Wessel, D. Rodet, X. Woehrmann, R.
Serra, X. 1998. “New Applications of the Sound Description Interchange
Format.” Proceedings of International Computer Music Conference 1998.
Michigan, USA
122. Althoff, Rasmus; Florian Keiler; Udo Zölzer. 1999. “Extracting Sinusoids
from Harmonic Signals.” DAFX99.
123. Fitz, Kelly. 1999. The Reassigned Bandwidth-Enhanced Method of Additive
Synthesis. Ph. D. dissertation, Dept. of Electrical and Computer Engineering,
University of Illinois at Urbana-Champaign.
124. Freed, Adrian. 1999. “Spectral Line Broadening with Transform Domain
Additive Synthesis.” ICMC99.
125. Herrera, P., X. Serra, G. Peeters. 1999. "A proposal for the description of
audio in the context of MPEG-7", Proceedings of the CBMI'99 European
Workshop on Content-Based Multimedia Indexing.
126. Irizarry, Rafael. 1999. “Weighted Estimation of Harmonic Components in a
Musical Sound Signal.” JTSA
127. Koenen, R. 1999. Overview of the MPEG-4 Standard. ISO/IEC
JTC1/SC29/WG11 N3156, Dec. 1999.
128. Laroche, J. and M. Dolson. 1999. “New phase-vocoder techniques for real-
time pitch shifting, chorusing, harmonizing, and other exotic audio
modifications.” Journal of the Audio Engineering Society, vol. 47, no. 11, pp.
928–936, November 1999.
129. Laroche, J. and M. Dolson. 1999. “New phase-vocoder techniques for pitch-
shifting, harmonizing, and other exotic effects.” in Proceedings of the IEEE
Workshop on Applications of Signal Processing to Audio and Acoustics, New
Paltz, NY, New York, Oct. 17–20, 1999, pp. 91–94, IEEE Press.
130. Laroche, Jean and Mark Dolson. 1999. “Improved Phase Vocoder Time-
Scale Modification of Audio.” IEEE Transactions on Speech and Audio
processing. Vol. 7, No. 3, May 1999.
131. Levine, S. N. 1999. Audio Representations for Data Compression and
Compressed Domain Processing. Ph.D. Thesis, Stanford University
132. Levine, S. N. and Julius O. Smith III. 1999. “A Switched Parametric &
Transform Audio Coder.” ICASSP-99
133. Levine, S. N. and Julius O. Smith III. 1999. “Improvement to the Switched
Parametric & Transform Audio Coder.” Proc. IEEE Workshop on
Applications of Signal Processing to Audio and Acoustics.
134. Peeters, G.; X. Rodet. 1999. “SINOLA: A New Analysis/Synthesis using
Spectrum Peak Shape Distortion, Phase and Reassigned Spectrum.” ICMC99,
Beijing (China).
135. Rossignol, S.; P. Depalle, J. Soumagne, X. Rodet, J.-L. Collette. 1999.
“Vibrato: detection, estimation, extraction, modification.” DAFX99
136. Schwarz, D.; X. Rodet. 1999. “Spectral Envelope Estimation and
Representation for Sound Analysis-Synthesis.” Proceedings of the
International Computer Music Conference (ICMC'99), Beijing, October 1999.
137. Tolonen, Tero. 1999. “Methods for Separation of Harmonic Sound Sources
using Sinusoidal Modeling.” Preprint Number: 4958 AES Convention 106.
138. Troughton, Paul T. 1999. “Bayesian Restoration of Quantised Audio Signals
using a Sinusoidal Model with Autoregressive Residuals”. Proceedings of the
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.
Mohonk, 1999.
139. Verma, T.S. and T.H.Y. Meng. 1999. “Sinusoidal modeling using frame-
based perceptually weighted matching pursuits,” in Proceedings ICASSP’99,
Phoenix, Arizona, USA, May 1999, vol. 2, pp. 981–984.
140. Verma, Tony S. “A Perceptually Based Audio Signal Model with
Application to Scalable Audio Compression”. Ph.D. thesis. Stanford
University, October 1999.
141. Vos, K.; R. Vafin, R. Heusdens, and W.B. Kleijn. 1999. “High-quality
consistent analysis-synthesis in sinusoidal coding,” in Proceedings of the AES
17th International Conference, Florence, Italy, September 1999, pp. 244–250.
142. Bonada, J. 2000. “Automatic Technique in Frequency Domain for Near-

Lossless Time-Scale Modification of Audio.” Proceedings of International
Computer Music Conference 2000. Berlin, Germany
143. Cano, P., A. Loscos, J. Bonada, M. de Boer, X. Serra. 2000. “Voice
Morphing System for Impersonating in Karaoke Applications.” Proceedings
of the International Computer Music Conference 2000.
144. De Boer, M., J. Bonada, X. Serra. 2000. “Using the Sound Descripton
Interchange Format within the SMS Applications.” Proceedings of the
International Computer Music Conference 2000.
145. De Boer, M., J. Bonada, Cano, P., A. Loscos, X. Serra. 2000. “Singing Voice
Impersonator Application for PC.” Proceedings of the International Computer
Music Conference 2000.
146. Desainte-Catherine, M.; S. Marchand. 2000. “High-Precision Fourier
Analysis of Sounds Using Signal Derivatives.” JAES, vol. 48, no. 7/8.
147. Desainte-Catherine, Myriam; Pierre Hanna. 2000. “Statistical Approach for
Sound Modeling.” Proc. of the COST G-6 Conference on Digital Audio
Effects (DAFX-00), Verona, Italy, December 7-9, 2000.
148. Edler, Bernd; Heiko Purnhagen. 2000. “Parametric Audio Coding.”
149. Fitz, K.; L. Haken; P. Christensen. 2000. “A new algorithm for Bandwidth
Association in Bandwidth-Enhanced Additive Sound Modeling.” Proceedings
of the ICMC 2000.
150. Fitz, K.; L. Haken; P. Christensen. 2000. “Transient Preservation under
Transformation in an Additive Sound Model”. Proceedings of the ICMC 2000.
151. Herrera, P., X. Amatriain , E. Batlle, X. Serra. 2000. “Towards Instrument
Segmentation for Music Content Description: a Critical Review of Instrument
Classification Techniques.” Proceedings of the International Symposium on
Music Information Retrieval 2000.
152. Izmirli, Ozgur. 2000. “Non-harmonic Sinusoidal Modeling Synthesis Using
Short-time High-resolution Parameter Analysis.” Conference on Digital Audio
Effects (DAFx), 2000.
153. Klapuri, A., T. Virtanen, J.-M. Holm. 2000. “Robust multipitch estimation
for the analysis and manipulation of polyphonic musical signals.” In Proc.
COST-G6 Conference on Digital Audio Effects, Verona, Italy, 2000.
154. Laroche, J. 2000. “Synthesis sinusoids via non-overlapping inverse fourier
transform.” IEEE Transactions on Speech and Audio Processing, vol. 8, no. 4,
pp. 471–477, July2000.
155. Laurenti, Nicola; Giovanni De Poli. 2000. “A Method for Spectrum
Separation and Envelope Estimation of the Residual in Spectrum Modeling of
Musical Sound”. Proc. of the COST G-6 Conference on Digital Audio Effects
(DAFX-00), Verona, Italy, December 7-9, 2000.
156. Painter, T. 2000. Scalable Perceptual Audio Coding with a Hybrid Adaptive
Sinusoidal Signal Model. Ph.D. Thesis, Arizona State University, June 2000.
157. Purnhagen, H. and N. Meine. 2000. “HILN – the MPEG-4 parametric audio
coding tools,” in Proc. IEEE Int. Symposium on Circuits and Systems (ISCAS),
Geneva, CH, May 2000, pp. III–201 – III–204.
158. Tolonen, T. 2000. “Object-based sound source modeling for musical
signals.” in AES 109th Convention, Preprint 5174, (Los Angeles, USA), Sept.
2000.
159. Verma, T. S.; T. H. Y. Meng. 2000. “Extending Spectral Modeling Synthesis
wth Transient Modeling Synthesis”, Computer Music Journal 24:2, pp.47-59.
160. Virtanen, T. 2000. Audio signal modeling with sinusoids plus noise. Master’s
thesis, Department of Information Technology, Tampere University of
Technology, 2000
161. Virtanen, Tuomas; Anssi Klapuri. 2000. “Separation of Harmonic Sound
Sources using Sinusoidal Modeling.” ICASSP 2000.
162. Wright, M., J. Beauchamp, K. Fitz, X. Rodet, A. Röbel, X. Serra, G.
Wakefield. 2000. “Analysis/synthesis comparison.” Organized Sound, 5(3),
pp 173-189. 2000.
163. Amatriain, X. Bonada, J. Loscos, A. Serra, X. 2001. “Spectral Modeling for

Higher-level Sound Transformation.” Proceedings of MOSART Workshop on
Current Research Directions in Computer Music. Barcelona
164. Amatriain, X. Herrera, P. 2001. “Audio Content Transmission.” Proceedings
of COST G6 Conference on Digital Audio Effects 2001. Limerik, Ireland
165. Anal J. S. Ferreira. 2001. “Perceptual Coding using Sinusoidal Modeling in
the MDCT Domain.” Preprint Number: 5569 AES Convention: 112 2002-
05
166. Bonada, J. Celma, O. Loscos, A. Ortolà, J. Serra, X. 2001.”Singing Voice
Synthesis Combining Excitation plus Resonance and Sinusoidal plus Residual
Models.” Proceedings of International Computer Music Conference 2001.
Havana, Cuba
167. Bonada, J. Loscos, A. Cano, P. Serra, X. 2001. “Spectral Approach to the
Modeling of the Singing Voice.” Proceedings of 111th AES Convention. New
York, USA
168. Duxbury C., Davies M., Sandler M. 2001. “Separation of Transient
Information in Musical Audio Using Multiresolution Techniques”. DAFX01
169. Ferreira, A.J.S. 2001. “Accurate Estimation in the ODFT Domain of the
Frequency, Phase and Magnitude of Stationary Sinusoids”. WASPAA01
170. Ferreira, A.J.S. 2001. “Combined Spectral Envelope Normalization and
Subtraction of Sinusoidal Components in the OFDT and MDCT Frequency
Domains”. WASPAA01
171. Florian. 2001. Time-scale Modification using the Phase Vocoder. Diploma
Thesis. Graz University of Music and Dramatic Arts.
172. Garcia, G. 2001. “Estimation of Sinusoids in Audio Signals using an
Analysis-by-Synthesis Neural Network.” IEEE International Conference on
Acoustics, Speech and Signal Processing (ICASSP), 2001, Salt Lake City, UT.
173. Goodwin, M. M. 2001. “Multiscale Overlap-Add Sinusoidal Modeling Using
Matching Pursuit and Refinements.” WASPAA01
174. Haas, J. 2001. “SALTO - A Spectral Domain Saxophone Synthesizer”
Proceedings of MOSART Workshop on Current Research Directions in
Computer Music. Barcelona
175. Hanna, Pierre and Myriam Desainte-Catherine. 2001. “Influence of
frequency distribution on intensity fluctuations of noise.” DAFX01.
176. Hammer, Florian. 2001. Time-scale Modification using the Phase Vocoder.
Diploma Thesis. Institute for Electronic Music and Acoustics (IEM), Graz
University of Music and Dramatic Arts.
177. Haste, Tue; Andersen and Kristoffer Jensen. 2001. “On the importance of
phase information in additive analysis/synthesis of binaural sounds.”
Proceedings of International Computer Music Conference 2001. Havana,
Cuba
178. Hermus, Kris, Werner Verhelst, Patrick Wambacq. 2001. “Perceptual Audio
Modeling Based on Total Least Squares Algorithms.” Preprint Number: 5571.
Journal of the AES
179. Heusdens, Richard; Renat Vafin, Bastiaan Kleijn. 2001. “Sinusoidal
Modeling of Audio and Speech Using Psychoacoustic-Adaptive Matching
Pursuits.” ICASSP01.
180. Jehan, Tristan and Bernd Schoner. 2001. “An Audio-Driven Perceptually
Meaningful Timbre Synthesizer.” ICMC2001
181. Jensen J., Heusdens R. Veenman, C.J. 2001. “Optimal Time-Differential
Encoding of Sinusoidal Model Parameters.” 22nd Symposium on Information
Theory in the BENELUX, Enschede (NL), May 2001
182. Kauppinen I., Roth K. 2001. “An Adaptive Technique for Modeling Audio
Signals.” Conference on Digital Audio Effects DAFX 2001.
183. Keiler, F., Zölzer U. 2001. “Extracting Sinusoids from Harmonic Signals.”
JNMR 30 (3)
184. Lagrange, M, Marchand, Sylvain. 2001. “Real-time Additive Synthesis of
Sound by Taking Advantage of Psychoacoustics”. DAFX01
185. Master, Aaron. 2001. “Physical Modeling and Sinusoidal Modeling for Noise
and Artifact Elimination.” CCRMA class report.
186. Painter, Ted; Andreas Spanias. 2001. “Perceptual Segmentation and
Component Selection in Compact Sinusoidal Representations of Audio.”
ICASSP01.
187. Parra L., Jain U. 2001. “Approximate Kalman Filtering for the Harmonic
plus Noise Model”. WASPAA01
188. Peeters, Geoffroy. 2001. Modèles et modélisation du signal sonore adaptés à
ses caractéristiques locales. PHD thesis Université, Paris VI July 2001
189. Polotti P., Evangelista G. 2001. “Multiresolution Sinusoidal/Stochastic
Model fr Voiced-Sounds”. DAFX01
190. Vafin R., Heusdens R., van de Par, S. & Bastiaan Kleijn, W. 2001.
“Improving modeling of audio signals by modifying transient locations.”
WASPAA01
191. Verfaille V., Duhamel P., Charbit M. 2001. “Lift: Liklihood-Frequency-Time
Analysis for Partial Tracking and Automatic Transcription of Music”.
DAFX01.
192. Virtanen, T., Klapuri A. 2001. “Separation of Harmonic Sounds Using
Multipitch Analysis and Iterative Parameter Estimation.” Proc. IEEE
Workshop on Applications of Signal Processing to Audio and Acoustics, New
Paltz, New York, 2001.
193. Virtanen, Tuomas. 2001. “Accurate Sinusoidal Model Analysis and
Parameter Reduction by Fusion of Components”, AES Convention 110.
194. Wang, Kun; Hongya Ge, Yinong Ding. 2001. “Adaptive Parametric Schemes
for Analysis and Synthesis of Musical Signals.” JAES vol. 49 (5)
195. Amatriain, X. Herrera, P. 2002. “Transmitting Audio Content as Sound

Objects.” Proceedings of AES22 International Conference on Virtual,
Synthetic and Entertainment Audio. Espoo, Finland
196. Amatriain, X. de Boer, M. Robledo, E. Garcia, D. 2002.
”CLAM: An OO Framework for Developing Audio and Music Applications”
Proceedings of 17th Annual ACM Conference on Object-Oriented
Programming, Systems, Languages and Applications. Seattle, WA, USA
197. Amatriain, X. Arumi, P. Ramírez, M. 2002.
”CLAM, Yet Another Library for Audio and Music Processing?”
Proceedings of 17th Annual ACM Conference on Object-Oriented
Programming, Systems, Languages and Applications. Seattle, WA, USA
198. Bonada, J. 2002. “Audio Time-Scale Modification in the Context of
Professional Post-Production.” Doctoral Pre-Thesis Work. UPF. Barcelona
199. Boyer R.; Abed-Meraim K. 2002. “Efficient Parametric Modeling for Audio
Transients.” Proceedings of the 5th International Conference on Digital Audio
Effects.
200. Fitz, Kelly; Lippold Haken. 2002. “On the Use of Time-Frequency
Reassignment in Additive Sound Modeling.” JAES, vol 50 (11).
201. Hanna, P., Desainte-Catherine M. 2002. “Adapting the overlap-add method
to the synthesis of noise”. DAFX02.
202. Hanna, P., Myriam Desainte C. 2002. “Detection of sinusoidal components
in sounds using statistical analysis of intensity fluctuations”. ICMC02
203. Heittola, Toni; Anssi Klapuri. 2002. “Locating Segments with Drums in
Music Signals”, ISMIR2002.
204. Irizarry, R. A. 2002. “Weighted estimation of harmonic components in a
musical sound signal.” Journal of Time Series Analysis. 23: 29-48
205. Keiler, Florian; Sylvain Marchand. 2002. “Survey on Extraction of Sinusoids
in Stationary Sounds.” Proceedings of the 5th International Conference on
Digital Audio Effects
206. Lagrange, M; Marchand, S. and Rault, J.-B. 2002. “Sinusoidal Parameter
Extraction and Component Selection in a Non Stationary Model.” Proceedings
of the 5th International Conference on Digital Audio Effects.
207. Lee, M.; and M. J. T. Smith, “Digital singing voice synthesis using a new
alternating refection model”, in ISCAS, May 2002, vol. 2, pp. 341-344.
208. Marentakis G., Jensen K.2002. “Sinusoidal Synthesis Optimization.”
ICMC02
209. Master A. 2002. “Sinusoidal Modeling Parameter Estimation via a Dynamic
Channel Vocoder Model”. ICASSP02
210. Meine N. & Purnhagen P. 2002. “Fast sinusoid synthesis for MPEG-4 HILN
parametric audio decoding”. DAFX02.
211. Morris, R.W. and M.A. Clements. 2002. “Modification of formants in the
line spectrum domain.” Signal Procesing Letters, vol. 9, pp. 19-21, Jan. 2002.
212. Polotti, Pietro. 2002. “Fractal Additive Synthesis: A Pitch-Sinchronous
Extension of the Method for the Analysis and Synthesis of Natural Voiced-
Sounds”, ICMC02
213. Purnhagen, Heiko. 2002. “Parameter Estimation and Tracking for Time-
varying Sinusoids.” IEEE-MPCA-2002.
214. Röbel A. 2002. “Estimating partial frequency and frequency slope using
reassignment operators”. ICMC02
215. Timoney, Joseph; Victor Lazzarini, Thomas Lysaght. 2002. “New SndObj
Library Classes for Sinusoidal Modeling”. DAFX02
216. Tohyama, Mikio. 2002. “Sinusoidal and Envelope-Modulation-Modeling-of-
Signals-A Signal Theoretic Approach to Acoustics Events Rendering-.
Proceedings of the 2002 International Conference on Auditory Display, Kyoto,
Japan.
217. Virtanen, T.; Anssi Klapuri. 2002. “Separation of Harmonic Sounds Using
Linear Models for the Overtone Series.” ICASSP 2002
218. Wells J. J., Murphy D.T. 2002. “Real-time partial Tracking in an Augmented
Additive Synthesis System.” DAFX02.
219. Amatriain, X. Bonada, J. Loscos, A. Arcos, J. Verfaille, V. 2003. “Content-

based Transformations.” Journal of New Music Research Vol.32 .1
220. Beltrán, José R. and Fernando Beltrán. 2003. “Additive synthesis based on
the continuous wavelet transform: A sinusoidal plus transient model.”
DAFX03
221. Bonada, J. Loscos, A. 2003. “Sample-based singing voice synthesizer by
spectral concatenation.” Proceedings of Stockholm Music Acoustics
Conference 2003. Stockholm, Sweden
222. Bonada, J. Loscos, A. Mayor, O. Kenmochi, H. 2003. “Sample-based singing
voice synthesizer using spectral models and source-filter decomposition.”
Proceedings of 3rd International Workshop on Models and Analysis of Vocal
Emissions for Biomedical Applications. Firenze, Italy
223. Gómez, E. Gouyon, F. Herrera, P. Amatriain, X. 2003. “Using and enhancing
the current MPEG-7 standard for a music content processing tool.”
Proceedings of Audio Engineering Society, 114th Convention. Amsterdam,
The Netherlands
224. Gómez, E. Grachten, M. Amatriain, X. Arcos, J. 2003. “Melodic
characterization of monophonic recordings for expressive tempo
transformations.” Proceedings of Stockholm Music Acoustics Conference
2003. Stockholm, Sweden
225. Gómez, E. Klapuri, A. Meudic, B. 2003. “Melody Description and
Extraction in the Context of Music Content Processing.” Journal of New
Music Research Vol.32 .1
226. Gómez, E. Peterschmitt, G. Herrera, P. 2003. “Content-based melodic
transformations of audio for a music processing application”. Proceedings of
6th International Conference on Digital Audio Effects. London, UK
227. Gouyon, Fabien; Lars Fabig and Jordi Bonada. 2003. Rhythmic
expressiveness transformations of audio recordings: swing modifications”.
DAFX03
228. Hainsworth, Stephen and Malcolm Macleod. 2003. “On sinusoidal parameter
estimation.” DAFX03
229. Hanna, Pierre and Myriam Desainte-Catherine. 2003. “Analysis method to
approximate the spectral density of noises.” Proceedings of the 2003 IEEE
Workshop on Applications of Signal Processing to Audio and Acoustics -
October 19-22, 2003 - New Platz - New Cork
230. Hanna, Pierre and Myriam Desainte-Catherine. 2003. “Time Scale
modification of noises using a Spectral and Statistical Model.” Proceedings of
the International Conference on Acoustics, Speech, and Signal Processing -
April 6-10, 2003 - Hong Kong (China).
231. Helen, Marko and Tuomas Virtanen. 2003. “Perceptually motivated
parametric representation for harmonic sounds for data compression
purposes.” DAFX03
232. Lagrange, Mathieu; Sylvain Marchand, Martin Raspaud and Jean-Bernard
Rault. 2003. “Enhanced partial tracking using linear prediction.” DAFX03
233. Laroche, Jean. 2003. “Frequency-domain techniques for high-quality voice
modification.” DAFX03
234. Lee, Matthew E.; Mark J. T. Smith. 2003. “Spectral Modification for Digital
Singing Voice Synthesis Using Asymmetric Generalized Gaussians.”
ICASSP03.

Spectral Modeling Synthesis: Past and Present

Uploaded by

Copyright:

Available Formats

Spectral Modeling Synthesis: Past and Present

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Spectral Modeling Synthesis: Past and Present

Uploaded by

Copyright:

Available Formats

Spectral Analysis/Synthesis

Spectral Modeling Synthesis:

Partial Tracking Residual Analysis

Xavier Serra - London 2003 2

… some history Relevant Research Topics (I)

“…The objective is the development of an

possible number of transformations on the analysis

Xavier Serra - London 2003 3 Xavier Serra - London 2003 4

Detection/Estimation of Sinusoids (II) Partial Tracking

Xavier Serra - London 2003 7 Xavier Serra - London 2003 8

Xavier Serra - London 2003 9 Xavier Serra - London 2003 10

Multiresolution Feature-based Analysis/Synthesis

(ICMC, 1990) transformation. .mid .snd .snd

 constant-Q bandpass filters

(DAFX, 2001)  Gouyon, F. et alt. ((DAFX, 2003)

Xavier Serra - London 2003 11 Xavier Serra - London 2003 12

 Bandwidth Enhanced Sinusoidal  Sinusoidal interpolation. Alignment

 Goodwin, M. (thesis, 1996)

Xavier Serra - London 2003 13 Xavier Serra - London 2003 14

Time Scaling Coding/Transmission

Xavier Serra - London 2003 17 Xavier Serra - London 2003 18

Instrument/Voice Models Expanded Models

 Fitz, K. et alt. (ICMC, 1995)

 Bonada et. alt. (ICMC, 2001)

Xavier Serra - London 2003 19 Xavier Serra - London 2003 20

Xavier Serra - London 2003 21 Xavier Serra - London 2003 22

1. Chamberlin, H. 1980. “Using the FFT for Synthesis.” In Music Applications

2. Almeida, L. B. and F. M. Silva. 1983. “Harmonic Coding with Variable-

3. Smith, J.O. and B. Friedlander. 1984. “High Resolution Spectrum Analysis

5. Griffin, D. W.; J. S. Lim. 1985. “A New Model-Based Speech Analysis /

6. McAulay, R. J. and T. F. Quatieri. 1986. “Speech Analysis/Synthesis based

10. Smith, J.O.; Serra, X. 1987. “PARSHL: an analysis/synthesis program for

11. McAulay, R. J.; T. F. Quatieri. 1988. “Computationally efficient sine-wave

12. Maher, Robert C. 1989. An Approach for the Separation of Voices in

20. George, E. B. 1991. An Analysis-by-Synthesis Approach to Sinusoidal

24. Fitz, K; W. Walker; L. Haken. 1992. “Extending the McAulay-Quatieri

43. Dutoit, T. and B. Gosselin. 1995. “On the Use of a Hybrid

54. Ali, M. 1996. Adaptive Signal Representation with Applications in Audio

70. Arcos, J. Lopez de Mantaras, R. Serra, X. 1997. “Generating expressive

95. Amatriain, X. Bonada, J. Serra, X. 1998. “METRIX: A Musical Data

142. Bonada, J. 2000. “Automatic Technique in Frequency Domain for Near-

163. Amatriain, X. Bonada, J. Loscos, A. Serra, X. 2001. “Spectral Modeling for

195. Amatriain, X. Herrera, P. 2002. “Transmitting Audio Content as Sound

219. Amatriain, X. Bonada, J. Loscos, A. Arcos, J. Verfaille, V. 2003. “Content-

You might also like

constant-Q bandpass filters

(DAFX, 2001) Gouyon, F. et alt. ((DAFX, 2003)

Bandwidth Enhanced Sinusoidal Sinusoidal interpolation. Alignment

Goodwin, M. (thesis, 1996)

Fitz, K. et alt. (ICMC, 1995)

Bonada et. alt. (ICMC, 2001)