Internal Mixing Rev2 en
Internal Mixing Rev2 en
Internal Mixing Rev2 en
Internal Mixing
How to create a professional mix on your computer – a systematic approach
Tischmeyer Publishing
FOREWORD
Dear Reader,
W
Second, revised edition 2008
Publisher and Author: Friedemann Tischmeyer ELCOME ABOARD! We are about to embark on a voyage
Translator: Brian Smith through the entire mixing process. This book will give you easy-
Cover Design: aim Werbeagentur Hamburg, Michael Prahl to-understand suggestions on how to systematically carry out a
Layout: Mott Jordan www.mottjordan.com successful mix.
Graphics: Friedemann Tischmeyer and Gregoire Vanoli
Very often people ask if computer-based production can provide the same punch
Copyeditor: Leina Gonzalez Baird
and especially the same degree of space that a production made with classical
Proofreaders: Omid Bürgin, Namin Nooman
tools and the best outboard equipment. Step by step, I will explain the necessary
Production: Media Print GmbH, printed in Germany
tools and techniques for professional mixing with computers. We will look at
DVD-Production: optimal media production GmbH, manufactured in Germany
how mixing was done with tape-based recording technology, and will examine
the supposed advantages and disadvantages of these production methods while
©2008 by Tischmeyer Publishing GmbH Germany
applying this to the world of digital production.
www.tischmeyer-publishing.de
www.proworkshops.de On the way, we will discover a treasure of inspiration for improving our current
working methods.
All rights reserved.
No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by Beginners, DJs, audio engineers, musicians, producers, and audio engineering
any means, mechanical, electronic, photocopying, recording, or otherwise, without the prior written students will all be able to use the ideas and suggestions in this book for their
permission of the publisher. productions.
ISBN: 978-3-9811217-1-1 Here’s a suggestion for all those whose highest priority is the best possible con-
servation of their ideas and songs: In order to make progress in mixing, please
The contents of this book and the accompanying DVD have been written and produced with the plan on finishing your projects within a predetermined period of time. This is the
greatest possible care. However, nobody is perfect and it is possible errors have occurred. The only way that you will be able to look back and see how your work methods and
publisher cannot accept any responsibility for damages resulting from erroneous information, nor your hearing have developed. On the other hand, if you let yourself be a slave to
from the use of the software included on the DVD.
the possibilities of total recall incorporated in today’s digital audio workstations
Tischmeyer Publishing cannot provide software support for included demo software. Please contact (DAW) by never really finishing your projects, then it will be much more difficult
the respective software manufacturer.
to see any progress in your mixing techniques.
The Internet addresses in this book are given for informative purposes only – the publisher is not
responsible for their content.
The software and hardware names in this book are property of the companies that own them and are
protected under local and international copyright laws.
The Sound Examples contained on the DVD are only intended for use in exercises or as examples.
Any other use is expressly forbidden.
3
Please do not use any excerpts as samples for productions as their use is not al-
CONCERNING MY BACKGROUND AND lowed without express permission from the copyright owners.
THE CREATION OF THIS BOOK
I
The knowledge contained in my books therefore stems from pure practical user
AM A TRAINED guitar and bass player and have learned everything I
experience, from one user to another. Wherever appropriate, it is supported by
know about audio engineering on my own. As both a musician and as an
extensive research and studies.
engineer, I have been very lucky to have worked with a variety of experi-
enced engineers; I have used these experiences and opportunities to further in- Enjoy yourself and have fun applying the following techniques to your work!
crease my knowledge. Later I had a large 48-track studio with a Studer 2” tape
recorder and a Trident analog mixing desk/console, along with a large quantity of
outboard equipment. During this time, I gained quite a bit of experience working Friedemann Tischmeyer
with tape – quite a bit different from the predominately individual or mini-team
working methods that are characteristic of today’s production methods using
home computers. Before the digital revolution could wipe out my studio, I was
still able to sell off my equipment to concentrate fully on the new computer-based
techniques in a smaller studio, spending more time working on my own projects.
I constantly tried to achieve results of a quality equal to that attained using the
familiar analog techniques. In the beginning, it was not easy to attain the same
quality with digital workstations as analog technology. Therefore, I began to work
with manufacturers and developers of software-based workstations and plug-ins,
and always went to the limits of what was technically feasible.
Like my mastering book “Audio Mastering with PC Workstations,” this book has
“organically” grown from the many workshops I have given in my studio and at
various educational institutions. Workshops given to small groups are ideal for
the mutual exchange of information and experiences. They helped me to fine-
tune pedagogical concepts and to tailor to the participants’ needs – to your needs.
This was the foundation of both the book and the DVD series I created based
on the books. Theoretical issues, which were inappropriate for the DVDs, along
with all information about quickly evolving matters such as plug-in descriptions,
are reserved for the book, which will be revised in a cycle of approximately two
to three years. On the other hand, the tutorial DVDs focus on practical work-
ing methods illustrated by numerous audio examples. The book’s accompanying
DVD-ROM contains audio examples, exercises, and demo versions of plug-ins
from many different manufacturers.
I would like to thank the owners of copyrighted material who have allowed me to
reproduce parts of their work for the audio examples. There are also single-track
excerpts that are suitable for exercises concerning compressors and EQs.
4 5
WORKING WITH THIS BOOK information in this book, you will be able to correctly navigate yourself through
the entire mixing process.
N
EARLY ALL OF THE INFORMATION and suggestions in this
After exploring the individual areas in depth, the section “Workflow Overview”
book are cross-platform and are therefore equally valid for both
represents the mixing process as a timeline in order to give you an overview of
PC and Mac users; exceptions will be indicated. The same is the
the “absolute” process workflow.
case for mixing strategies with analog mixing consoles and computer-based
systems. Nevertheless, the focus of the book is clearly based on computer-
based mixing.
For the sake of clarity, screenshots are taken from only one sequencer program.
Because I personally work with Steinberg’s Nuendo and 99% of the features that
concern us also are included in Cubase, I have used screenshots from Nuendo.
To avoid unnecessary complication, I will not discuss other sequencers. Every
professional sequencer or hard disk recording application has the necessary ba-
sic functions for our work and it will not be difficult for you to apply the work-
flows that we describe to your own software environment. My choice is not a
recommendation concerning the very similar qualities of the various sequenc-
ers or DAWs available today.
In this book we will concentrate almost entirely on systematically organized
techniques for interpreting and answering artistic requirements. The right
sound and how to obtain it is our main task, provided you already have a clear
idea of the sound aesthetic and the artistic flow you want.
The system involving the complex, intuitive, and creative process of music mix-
ing can be divided into two large categories: absolute processes and relative pro-
cesses.
Absolute processes refer to recurring steps or rules, which should be carried
out in a specific manner and order. For example it makes sense to begin with
automation after finishing the static mix.
Relative processes also follow rules, but are not dependent on being done in a
strict order. Working with EQs, while independent of the mixing process or-
der, still follows the same rules. Relative process steps can therefore occur at
different points in the mixing process or even more often than once. It will be
self-evident which steps are relative and which are absolute processes. With the
6 7
CONTENTS
Foreword . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .3
Concerning my Background and the Creation of this Book . . . . . . . . . . . . . . 4
Working with this Book . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .6
Contents . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .9
Chapter 1: The Three Phases of Classical Production –
A Retrospective . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
Phase 1 – The Recording Session . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
Improve your Decision-making Capabilities . . . . . . . . . . . . . . . . . . . . . . . 20
Building Confidence in Rhythmic Hearing . . . . . . . . . . . . . . . . . . . . . . . . . 21
A Few Good Tracks are Better than Many Mediocre Ones! . . . . . . . . . . . 21
Phase 2 - The Mixdown . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
How to Achieve a Quality Mix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
Mixing Goals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
The Three Dimensions: Aspects/Sub-Aspects . . . . . . . . . . . . . . . . . . . . . . . 24
Delivering the Mix Master to the Mastering Studio . . . . . . . . . . . . . . . . . . 25
Phase 3 - Mastering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
8 TABLE OF CONTENTS 9
Bit Resolution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46 Hard Choices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76
What is Truncation? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48 UAD-1 Card . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
Why is High Bit Depth So Important for Sound? . . . . . . . . . . . . . . . . . . . . 48 PowerCore . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78
What Happens When a 16-bit File is Saved as a 32-bit File? . . . . . . . . . . . 49 Incompatibility Between UAD-1 and PowerCore? . . . . . . . . . . . . . . . . . 79
Why Record in 32-bit Format When the A/D Converter Only Supports Waves. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
24-bit Resolution? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50 SSL Duende . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
Summary of a Few Basic Rules Relating to Bit Depth . . . . . . . . . . . . . . . . 50 Offline Processing as an Alternative to DSP-supported Plug-Ins . . . . 82
When Can We Leave the 32-bit Domain? . . . . . . . . . . . . . . . . . . . . . . . . . . 51 Mixing Console Architecture . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
Tip for Mastering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51 In-line Consoles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85
Dithering. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52 Split Consoles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86
Studio Acoustics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52 The Cubase and Nuendo Split Console . . . . . . . . . . . . . . . . . . . . . . . . . . 86
How to Live With Imperfect Acoustics. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54 Description of the Audio Channel Diagram . . . . . . . . . . . . . . . . . . . . . . 90
Studio Equipment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
Choosing Speakers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55 Chapter 3: The Systematic Approach –
The Basics of Setting up Speakers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56 Clarity and Workflow . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95
Front-end . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57 Defining the Start of the Mixing Process . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95
Monitor Controller, Monitor Matrix, and Mixing Console . . . . . . . . . 61 Working with Groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99
Power Amplifiers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62 Using Compression in Groups: Save CPU and Increase Punch . . . . . . . . . 101
Digital Monitors – the Purist’s Back-end . . . . . . . . . . . . . . . . . . . . . . . . . 62 EQing Groups to Save Resources and Create Space . . . . . . . . . . . . . . . . . . . 102
Remotes and Controllers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63 Handling Groups: What to Control via Groups, and what via the Tracks? . 103
Using Metering for Monitoring the Dimensions . . . . . . . . . . . . . . . . . . . . 64 Widening Stereo Basis within Groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103
16 TABLE OF CONTENTS 17
CHAPTER 1: THE THREE
PHASES OF CLASSICAL
PRODUCTION –
A RETROSPECTIVE
U
SE THE ADVANTAGES OF BOTH analog and digital work meth-
ods to achieve better results.This chapter adapts the experiences of
classical mixing production in a tape-based studio to the predominat-
ing work methods in today’s modern computer-based studios. It helps to take
a brief look into the past to see what has changed and if some of the traditional
methods can be applied today for improving quality.
Traditionally, music production can be divided into three basic work phases
that overlap when working with digital audio workstations (DAWs): Recording
– Mixing – Mastering.
20 CHAPTER 1: THE THREE PHASES OF CLASSICAL PRODUCTION — A RETROSPECTIVE CHAPTER 1: THE THREE PHASES OF CLASSICAL PRODUCTION — A RETROSPECTIVE 21
Clearly there are many reasons why working in analog studios led to more fo- Being deeply involved in the production can hamper a fresh and systematic ap-
cused, disciplined work that involved spontaneous and quick decision-making, proach to the mixing process. Very few people would go to the trouble of taking
along with a great deal of imagination. The convenience – inherent in DAWs out all plug-ins and setting all levels, pans, and EQs to zero and start everything
– resulting from the lack of clearly-defined production phases has both advan- from the beginning. A further disadvantage in being both the recording and mix
tages and disadvantages. Using Cubase, Logic, Pro Tools, DP, or Sonar becomes a engineer has to do with something we will mention later: the mix concept. Often,
blessing only if we work more systematically. when creating a mix concept, the mute button is a very important tool. The pro-
ducer, who has been involved since the very start, has a difficult time muting
The “analog” working methods had the advantage that in the second phase – the
tracks that were created as the result of hard recording and editing work. This is
mixdown – a good basic sound had already been supplied; the mixing engineer
why I recommend creating teams with friends and colleagues, and occasionally
had a number of fundamental musical aspects already laid out for him. One point
changing the roles of mixing engineer and producer with that of the client; taking
is as valid today as it was in the days of analog recording: the better the recording
on the role of accepting or rejecting final results while delegating the actual mix-
and the “front-end” – mics, pre-amps, compressors, and other equipment used
ing process. Doing this will give you new ideas and teach you a great deal.
– the more easily a signal integrates into the mix later on. The popular saying
“we’ll fix it in the mix” should not be synonymous with musically or technically The mix is done when the mixing engineer creates a stereo master. In earlier days,
inferior recording, even when we are tempted by the endless correction possibili- this involved a DAT recorder fed by an Apogee converter with onboard limiting.
ties of modern DAWs. Often, an analog limiter – like the Urei 1178 LN – was used to cut out peaks in
order to make the best use out of the 16 bits of the DAT recorder. Now that we
PHASE 2 - THE MIXDOWN have the luxury of 24 or 32-bit mixdowns (bounce/export/render/apply), you do
not have to dynamically limit the master with a bad native converter. The 32-bit
In the past, a 2-inch tape was frequently brought to another studio or sent to
floating-point files provide enough headroom to eliminate a limiter completely,
a mixing engineer. The mixing studio was usually outfitted with high-end out-
since even overs can be handled cleanly. The 24-bit files can be protected from
board equipment and mixer automation. Phase 2 began with all tracks being laid
overs using a brick wall limiter, because the limiter only kicks in when a peak is
down next to each other without panning and all faders set according to taste, in
detected. Generally, dynamic processing and other level processing steps should
order to get a first impression. Masking tape was used to label the tracks and a
be left to the mastering engineer, who takes over in phase 3.
long tape roll from each song was hung on walls and doors until the production
was finished. Today, the equivalent process would typically be done by import-
ing an OMF (Open Media Framework) file, where individual .wav or .aif files are
How to Achieve a Quality Mix
brought into the arrangement without EQs, insert effects, or level and panning An additional artistic aspect is dramatic form and can be especially achieved by
information. muting or using special effects.
The advantages of division of labor here are important: because the sound “grows” During the recording stage of the production, many tracks are filled with content
throughout the entire computer-based music production process, by the time the that is to be later sorted during the mixing process. Nothing is more boring than
mixdown begins, a production is often already halfway finished. a song where all instruments are audible from beginning to end. By intelligently
muting individual tracks a song can become interesting. For example, if the vo-
That is not necessarily an advantage! cals are good enough you can even create a cappella passages with this process.
The last fine-tuning can turn into a rocky path because of awkwardness or a lack
of systematic working methods. It is important to keep questions in mind such Mixing Goals
as: what is the reason for this send effect? Which strategy is used for panning? The goal of a good mix is a warm, clear, deep, and punchy sound, where all events
What is my mixing concept?
22 CHAPTER 1: THE THREE PHASES OF CLASSICAL PRODUCTION — A RETROSPECTIVE CHAPTER 1: THE THREE PHASES OF CLASSICAL PRODUCTION — A RETROSPECTIVE 23
The secret – or u Ask a like-minded friend or colleague to help by switching roles (client/engi-
neer).
fundamental
u Give yourself only a limited amount of time. In the past, it was often only possible
skill – necessary to have one night to “sleep on” mix decisions – maybe listening on a different set
of monitors – with the sole possibility of coming in the next morn-
to obtain a ing, before the next production day begins, in order to make a few The mixing
good mix lies minor changes. Afterwards the patch bay and mixer settings would process is a
be irreversibly changed. Do not be led to collect a pile of unfinished
in intelligently projects just because of the existence of the total recall functions in continuous
distributing all modern DAWs. It is much better to decide to close mixing projects chain of
which can be saved and wiped from the hard disk, so that you can
events in the realize that in a year’s time you have gotten better! Then, if required decisions!
three spatial you can mix another version. A mix should be 90% done after four hours of
work. The rest is fine-tuning and takes the largest amount of time (1 to 2 days),
dimensions: but of course this can vary largely and depends on the person.
width, height, u The mixing process consists of a continuous chain of decisions. The ability to
quickly make decisions is crucial for maintaining an efficient work pace.
and depth! The Three Dimensions
u When you mix your own song and you are not happy with the results, make The BLER value is a statistical error value of digital media. The Red Book specifi-
a copy of your project, remove all insert and send effects, and put all panning cation stipulates the highest acceptable value.
to the center. Start right from the beginning with a clear mixing strategy (see For comparative listening, you can create a second file, dithered to 16 bits in the
following chapters). last position of the master insert of the virtual DAW mixer.
24 CHAPTER 1: THE THREE PHASES OF CLASSICAL PRODUCTION — A RETROSPECTIVE CHAPTER 1: THE THREE PHASES OF CLASSICAL PRODUCTION — A RETROSPECTIVE 25
If you are working with a digital mixer, it is a good idea to create the greatest Here is a summary of the most important points for delivering a mix master to a
possible bit depth – in this case 24 bits – when creating the mix master. Because mastering studio:
the S/P-DIF and AES/EBU digital transfer formats are limited to 24-bit integer bit
u Highest possible bit depth; Cubase & Nuendo: 32-bit floating point files; oth-
depth, it is impossible to process files on a 32-bit basis with external equipment.
erwise use 24-bit files.
The 16-bit DAT recorders are to be avoided nowadays; the DAT format is no
longer suited as a mastering medium. In addition, very few project studios have u In situations where the computer uses internal 32-bit floating point process-
high-resolution tape machines. For mixing down using an external digital mixer, ing, but you must go to 24-bit files for exporting, use either a brickwall limiter
it is best to copy back over to the DAW. Here it is important to make sure that the without dithering or a simple limiter and dither down to 24 bits.
synchronization (wordclock/houseclock) settings are correct.
u Do not dither 32-bit files.
For good sound in the digital studio using a houseclock, the rule is: converters
clock themselves internally, are clock masters, and drive the clock distributor. u Use CD-ROMs, DVD-ROMs (slow writing speeds), hard disks, or memory
(If you own several different converters, then the most important converter – that sticks as transport media.
the mic and line inputs are fed through – should be internally clocked.)
u For CD productions, use a sampling rate of 44.1 kHz. If conversion is neces-
If they are in the chain, devices like the TC Finalizer should be set to bypass or sary, use a high-quality sample rate converter (SRC) with internal oversam-
used only as “technical” limiters for eliminating occasional peaks. This does not pling. If this is not an option, leave the conversion to the mastering studio.
contradict the high quality of the Finalizer, but avoids unnecessary compression, Sample rate conversion is a complex and CPU-intensive process, which can
which can be difficult to correct in the mastering process. only be done with oversampling (multiplication of the sampling rate) in order
Be careful! When you are using S/P-DIF with your digital mixer, make sure that to prevent rounding errors.
the processing chain supports 24-bit word depth. Some audio interfaces – espe- u Before mastering, avoid fades at all costs unless they are musically arranged
cially low-cost models – are limited to 16 bits. Such devices should be replaced into the song. If fades are run through dynamic processing during mastering,
with 24-bit audio interfaces if they are to be used for mixdowns. WaveLab pro- pumping and digital artifacts can result.
vides a bit-depth metering function that shows the actual allowed bit depth.
u Do not cut beginnings and endings. Many mastering plug-ins require short
If you are working with an analog mixer, you can make a ½-inch master parallel lead-ins so that the “predict” function can work. Without the lead-ins, arti-
to the digital master. These formats are still accepted in some mastering studios facts might be created during the processing phase. When marking areas to be
in North America. In Europe large mastering studios also support such formats. bounced or exported in the arrangement window of your DAW, it is smart to
Beware of the country-specific measuring standards of analog machines (USA: leave small lead-in times instead of selecting the area start point exactly at the
IEC/Germany: NAB) and be gentle with tape saturation. With a digital mix mas- beginning of the song.
ter, high-quality 24-bit converters should be used. These also serve as clock mas-
ters for the DAW being used for recording. 24-bit or 32-bit floating point files u When using analog mixers, a lead-in can be used as a fingerprint for possible
should be delivered on CD-ROMs. If reliable digital metering is available, I rec- de-noising during mastering.
ommend keeping pop and radio music at an average loudness of not more than
u Label track files according to their numerical order on the album. (For ex-
-14dB/RMS during loud passages. This ensures that the mastering engineer’s job
ample, 01-32Bit my song.wav.)
can remain enjoyable. Over-compressed masters are very difficult to work on and
are difficult to shape. When mixdowns are louder than -14dB/RMS, it is difficult u Give the mastering studio a processing wishlist along with reference tracks, if
to correct mixing mistakes. desired.
Please see Chapter 2: Using Metering for Monitoring the Three Dimensions.
26 CHAPTER 1: THE THREE PHASES OF CLASSICAL PRODUCTION — A RETROSPECTIVE CHAPTER 1: THE THREE PHASES OF CLASSICAL PRODUCTION — A RETROSPECTIVE 27
PHASE 3 - MASTERING
The mastering engineer brings various mixdowns together into a unified sound
aesthetic. These mixdowns were often made weeks, months, or years apart and
were maybe even mixed in different studios by different engineers. The mastering
engineer carefully increases loudness and shapes the material according to the
Fletcher-Munson curve and makes adjustments so that the tracks sound equally
good on a car radio, in the kitchen, and on a living room stereo.
A personal bias can arise if one is too involved in the mixing process and should
be avoided. When mastering your own production, you should allow some time
to pass between the two phases. If possible, avoid mastering in the same listening
room that you mixed in; after all, one of the roles of the mastering studio is to
find potential mistakes that result in faulty acoustics in the mixing studio! If the
mixing studio has certain room modes and you master a track in the same room,
then the mistake will remain undetected.
You will find more information on the subject of mastering in my book “Audio
Mastering with PC Workstations.”
T
O GET THE BEST RESULTS with software-based mixing, there are a
number of technical issues to keep in mind. This applies to both the se-
lection and use of equipment. First of all, I would like to look into the
advantages and disadvantages of analog summing. Recently, my readers have of-
ten asked me whether or not analog summing is useful or even necessary. This
is why I have decided to take up this issue here in the form of a thorough test.
The audiophile results are available on the accompanying DVD (in the folder
“Summing Unit Test”).
32 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 33
summing amp or an analog console. My feeling is that on the digital level, you device varies significantly. Please refer to the details in the device descriptions.
have to work harder to create the same result. Although I realize that this view-
Note that there is another area of application that goes beyond the scope of our
point is controversial, I have had it confirmed many times through my own ex-
test. This kind of equipment serves very well as a front-end for external keyboards.
perience and by many other sound engineers. On the other hand, you cannot
For users who would like to stop using their consoles, but cannot because of their
expect to automatically obtain a good mix just because you are using analog sum-
arsenal of keyboards and expanders, high quality analog summing devices are a
ming. For users with good skills in software-based mixing, analog summing can
good alternative with which to improve sound.
add that extra bit of spice to their mixes.
Another advantage of analog summing is the possibility of taking advantage of The Test
transistor or tube saturation and the resulting uneven harmonics, which add a bit
How can a kind of device be fairly compared, when there are such big differences
of “warmth” to the end result.
in terms of functions? How important is the use of analog summing for achieving
This effect does not occur during software-based mixdowns, as long as it is not truly professional results? What the difference in working methods? As an advo-
simulated with devices such as the “PSP VintageWarmer” or Steinberg’s “Magneto” cate of software-based working methods, I have decided to clarify the mystical
in group tracks. These units also offer strong saturation behavior when the in- all-rounder. Here I have tried to find the common denominator between the six
coming signal is increased. To do justice to the saturation aspect, a few examples
in the test are provided with high levels.
Another potential advantage is connecting external analog outboard equipment
in the sum or even in individual audio channels. With summing units without
master inserts, high-quality EQs, and compressors can be connected directly to
the outputs of the unit. Here, unnecessary critical reconversion (A/D) is avoided,
as opposed to inserting into the sum of the software mixer. This can make a dif-
ference when a large number of external gems are inserted into the internal mix.
Only the SPL MixDream provides a type of bridgehead into the analog world.
Each stereo channel has a switchable insert path. Accordingly, the MixDream
can be used with 8 stereo devices or even device chains in the 8 busses. If you
use this feature, you are saving 16 A/D conversions, which you would otherwise
have accepted with analog devices placed directly on the outputs and inputs of
the converters. This cannot be fairly compared, since the SPL device is the only
candidate with these features. For this reason, we will not take this aspect into
consideration. The MixDream is thoroughly well designed, with direct outputs
and a “no mix” button for every stereo channel; when activated, this reroutes
the signal of the inserted outboard device back into the DAW when you want to
insert the device just once in the mix. This saves you from having to crawl behind
your studio rack in order to re-cable.
Another potentially positive aspect are extra functions in the master section such as
compressor/limiter, stereo expander, or M/S matrix of some devices. In this area, each
The six candidates
34 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 35
test candidates, and have developed a test involving an unadulterated and fair
comparison where your judgment is also taken into consideration. The partly Block Diagram Internal Mixdown
block diagram internal mixdown
very subtle variations in sound, having the most varying attributes can quickly mono mono mono mono mono mono mono mono mono mono mono mono St St St St St mono mono
Aux 1
St St St St St
Snare (solo)
Overhead R
Overhead L
Lead Vox 1
Lead Vox 2
Guitar solo
Rim gated
Bassdrum
Rim open
Reverb 1
Reverb 2
Reverb 3
Reverb 4
Reverb 5
wear out our powers of imagination. With the help of the platform-indepen-
Aux 2
E-Piano
Strings
Tom 1
Tom 2
Tom 3
Guitar
Horns
Piano
Bass
Bell
Aux 3
Aux 4
dent OMF file, you should form your own opinion as to whether or not analog
Aux 5
Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins
Stereogroup Overhead
comparison between a software-based mix and analog summed mixes. But first
Stereogroup Guitar
Stereogroup Toms
Stereogroup Keyb
Monogroup Snare
Stereogroup Vox
read on.
Stereogroup Drums
filling the role of the common denominator: “16 in – 2 out.”
The equipment: Neve 8816, SPL MixDream, TSM from Tegeler Audiomanufaktur,
!! No Inserts !! No Send FX !! Fader 0dB !!
Dangerous 2-Bus, Audient Sumo, and the Danish SSA 2B from Tube-Tech. I
would like to thank the manufacturers and distributors for quickly and easily
Masterbus
providing these devices! Insert: only technical limiting for over protection
RAM connected to two 8-Channel RME ADI-8 QS converters. The excellent mono mono mono mono mono mono mono mono mono mono mono mono St St St St St mono mono St St St St St
Snare (solo)
Overhead R
Overhead L
Lead Vox 1
Lead Vox 2
Guitar solo
Rim gated
Bassdrum
Rim open
Reverb 1
Reverb 2
Reverb 3
Reverb 4
Reverb 5
Aux 2
E-Piano
Strings
Tom 1
Tom 2
Tom 3
Guitar
Horns
Piano
Bass
Bell
Aux 3
the conversion level. Good converters are a basic prerequisite for using analog Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins Ins
Aux 4
Aux 5
Ins Ins Ins Ins Ins
summing units.
Stereogroup Overhead
In Nuendo I like to use group channels for insert effects such as EQs, compres-
Stereogroup Guitar
Stereogroup Toms
Stereogroup Keys
Monogroup Snare
Stereogroup Vox
sors, and stereo expanders (which I will explain later in detail). In order to
internally and externally sum the same signal, an internal mix is created with
a drum group without insert or send effects. It is set at 0 dB. This way, the sub-
Inserts Inserts Inserts Inserts Inserts Inserts
groups “Snare,” “Toms,” and “Overheads” – as well as bass drum and bass – can
of the RME audio interface, which in turn go directly into the inputs of the sum- DAW Input 1&2 via RME ADI-8 QS
ming units. The cable between the RME converters and the summing unit inputs
The same levels are used in the summing of both mixdown variations in the block diagrams.
36 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 37
has gold-plated contacts and are made by Mogami. For the returns, the stereo
outputs of the summing units are routed into the stereo inputs of the RME con-
verter with VoVox cables. For reasons of sound neutrality, I have avoided using
the Universal Audio 2-610 for line inputs.
The same channel configuration was used for all units:
Input 1: bass drum
Input 2: snare
Input 3: bass
Input 4: free
Input 5&6: toms
Input 7&8: overhead
Input 9&10: vocals
Input 11&12: keyboards
Input 13&14: guitar
Input 15&16: effect returns
I then created two different mixdowns with each summing unit in order to
fairly show the different possibilities of the devices. When the devices were not
equipped with their own level control, I used the RME audio interface mixer
“Total Mix” in a few of the summings to increase the levels of all channels from 0
to 6 dB, in order to test the sound saturation behavior of the units when provided
with a stronger signal. These mixes are indicated with “+6 IP”.
All results have – at least theoretically – identical level and panorama behavior.
Only potential level tolerances of the summing units could have any affect on the
mix behavior.
Here you can see the device settings, the basis for the twelve different summing
results (six devices, each with two mixes):
Total Mix with +6 dB empha-
sis starting in channel 9 up to
channel 24 which are connect-
Neve 8816 Mix 1: Channel level: 3’o’clock, Master: 1 line under full Mix 2: Channel level: Right position; 1 line under full
ed to the analog summing unit
SPL Mix 1: Limiter: off; St-exp.: off; Transformer: on, M:0 dB Mix 2: Limiter: on (18); St-exp.: 3; Transformer: on;
Tegeler TSM1 Mix 1: Output: 0 dB Mix 2: +6 IP; Output: -6 dB
Dangerous 2-Bus Mix 1: + 6 dB-switch off; Output: +6 dB Mix 2: + 6 dB-switch on; Output: 0 dB
Audient Sumo Mix 1: Mix Master: 0 dB, Master: 0 dB; no sum. comp. Mix 2: MixMaster: 0 dB, Master: 0 dB; with sum. comp.
Tube-Tech SSA 2B Mix 1: Output: 0 dB Mix 2: +6 IP; Output: -6 dB
38 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 39
Finally, in order to create fair conditions for an A/B comparison, all the files were Level adjustment is fortunately very easy with all the devices: set everything to 0
brought to the same loudness level without any dynamic-altering intervention. dB and at the end the right level arrives in Nuendo. Extremely simple!
The order of the A/B comparisons in the test file (OMF) is purposely not the
Another requirement for this test is the possibility of switching at least the first 4
same as the order presented here, to prevent bias. Also, sometimes the software-
channels to mono, so that snare, bass drum, and bass will also arrive in the center.
based mixdown is on the upper channel and sometimes it is on the lower one
All manufacturers have understood this need and have answered it in various
of the OMF project. In order to neutralize your ears, please listen entirely to the
ways.
short “distraction sequence” provided between each A/B comparison. This serves
as a way of “resetting” your ears for the next comparison. Also, it is best to set the The Dangerous 2-Bus provides mono switches for all 8 stereo busses, SPL’s
track display in your DAW so that you cannot see the actual waveforms – this MixDream for the first three channel pairs (i.e. for inputs 1 to 6), Audient Sumo
way you can really concentrate 100% on what you are hearing. Now, if you follow and Tube-Tech have mono switches for the first 2 stereo pairs, the Neve 8816
the rules, your results will be very meaningful! Do not jump back and forth be- has panorama pots on all 16 inputs, and the Tegeler TSM provides for automatic
tween the examples. You can do this when choosing your favorite after you have mono switching whenever a mono input occupies only the odd-numbered input
filled out column 12 in the evaluation list. jack of a stereo pair (i.e. channel 1, 3, 5, etc.). This can be a bit inconvenient if
you want to permanently cable the device and be able to use different configura-
You will find your personal evaluation list as a PDF file on the DVD in the folder
tions.
“Summing Unit Test.”
All manufacturers should be praised for adhering to level and pin assignment
The Test Procedure standards. All sub-D connectors are assigned according to the Tascam standard,
After connecting the first summing unit and with high expectations, switch on so that error-free cabling was possible right away.
the input list button of the Nuendo input track, this should deliver the enhanced
master output of the summing unit. If you hear cracking or popping sounds: is it Device Particularities
jitter? Sync-error?
After quickly checking the sync settings, you will see the solution: la-
Tegeler Audiomanufaktur TSM1
Fact or fiction: tency crackling! The mix project, which can easily internally deal with The TSM has a chic vintage design and is a simple and basic version of an ana-
log summing amplifier with 16 stereo inputs (via 32 balanced mono jacks) and
Analog a(12latency of 256 samples (6 ms), must be increased to 512 samples
ms) for a crackle-free mixdown. Just to be sure, after restarting tube-refined summing output. The only particularity, made apparent by the test,
summing units Nuendo, I repeated the latency test. The internal mixdown requires was that mono-cabled channels appeared louder in the mix than in Nuendo. In
37% CPU load (Nuendo indication) at 6 ms for crackle free operation, Nuendo I use the -3 dB panning law, which lowers mono signals in the center
save CPU. compared with 42% CPU with the same latency in external mixdown position by 3 dB (see chapter on panning). This is the most neutral way of work-
mode. With 12 ms, the external mixdown requires only 30% CPU load and is ing and is also used by high-end analog consoles. It is assumed that in the TSM1,
completely crackle-free. a different panning law is used. On a practical level it is simply annoying, when
you are trying to get the same results internally and externally. If the TSM is nor-
Fact or fiction: Analog summing units save CPU mally integrated in a setup, then the respective channels would simply be mixed
As far as Nuendo – and therefore also Cubase – this often heard argument does 3 dB quieter. The 3 dB level steps of the damping output control works quite ac-
not hold up. It can be assumed that other hosts react in the same way. Routing curately. Two ECC88 tubes are used in the TSM1.
to individual outputs requires more computing power and therefore higher la-
tency.
40 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 41
be adjusted from minus infinity to +6 dB with the Mixmaster potentiometer. I
Tube-Tech SSA 2B
miss a reliable incremental pot here. While working, I noticed a left-right imbal-
The Danish Tube-Tech SSA is very similar to the Tegeler summing device. The ance of around 0.2 dB. A nice feature is the separate monitor output, complete
16 balanced XLR inputs are summed together. The 2 U rack height is a purely with mono switch, gain, and switchable source (EXT IN). Finally, an A/D con-
physical dimension; the powerhouse with 8 tubes (two ECC82s, two ECC 83s, verter with a sample rate of 192 kHz is available as an option. The converter was
and four ECC 88s) should have a least 1 U headroom clearance to allow for heat. not used in the testing.
My impression is that the tubes not only warm up the control room, but also the
mix. The gold-plated incremental output controller has a range of +/- 10 dB in SPL MixDream
steps of 0.5 dB (0 to 3 dB), 1 dB (3 to 6 dB), and 2 dB (6 to 10 dB). Inputs 1-4 can The particularity of the MixDream is the very clever “bridgehead” concept; in
be set to mono with a toggle switch. addition to the simple 16 in - 2 out principle, the MixDream also provides direct
out channel inserts and can be connected to up to 8 analog stereo devices. An
Dangerous 2-Bus
insert switch in each stereo channel turns them on or off or – if you choose to use
The American 2-Bus summing device is tubeless and is a bit more lavishly an external device individually – can be taken out of the summing. This position
equipped. For every stereo pair there is a +6 dB gain switch as well as a mono is labeled “No Mix.” As opposed to direct connection to external devices through
switch. Next to the mix output is a parallel monitor output for connecting the D/A-A/D converters when using a summing unit, this prevents sound-degrad-
summing device directly to the monitor matrix. In addition, an input for cascad- ing reconversion since conversion only occurs once with the summing signal.
ing with another 2-Bus is included. The external power supply gives a nice, solid
impression with a robust 7-pole plug to connect with the device. Output level is The “Mix” and “Monitor” outputs are XLR connectors, all other connectors are
adjusted incrementally in 0.5 dB steps with a potentiometer with a range of – 4 Sub-D.
dB to + 6 dB. The device’s levels are also very precise.
Channels 1 to 6 can be switched to mono with toggles. The MixDream is expand-
Audient Sumo able and with three units, you can even use it in a surround setup. The “Expansion”
switch routes the additional units into the summing signal before the inserts.
The Sumo summing amplifier by Audient is conceptually a little different from
The master insert is switchable and can be adjusted from -20 dB to +7 dB. The
the others. It is just 1 U high, which is made possible with a mini fan, and is for-
summed signal can be treated with a stereo expander, but for me, this would be
tunately so quiet that it poses no problem when mounted in a rack. Sumo also al-
part of a mastering process. (Very detailed results can be obtained in M/S pro-
lows for switching the first 4 channels to mono. “Mix Gain” allows you to reduce
cessing when controlling stereo expansion with multi-band compression.) The
gain before the compressor and limiter. Unfortunately, the pot is not incremental.
last element in the chain is a switchable limiter for keeping peaks under control.
The Sumo is the first test candidate with an insert in the master section, outfitted
The Lundahl transformers at the output – which are also switchable – add typical
with an on/off switch. It can also be switched pre or post dynamic section. These
warmth and silkiness to the sound.
features were not included in the test, since they could not be compared with
the others. The single-band compressor boasts all typical compressor parameters: Neve 8816
threshold, ratio, attack, release, and make-up gain. Next in the signal chain is a
Neve’s summing unit has the most structure among the devices that were tested
peak limiter, which is controlled by a threshold potentiometer. If I now put myself
– it is actually a real mixer with a mastering/monitoring section, which can also
into the shoes of a mastering engineer, I realize that personally, I use only techni-
be used as an analog summing unit. A complete description of all possible uses
cal limiting in the sum channels and deal with dynamic processing in a separate
is beyond our scope here, so we will just list the features briefly: a 16 in / 2 out
stage of the work. Ideally, a specialist should carry this out. If you do not share my
mixer with panning and level potentiometers as well as cue and cut/solo switches
opinion, then you can always use the compressor for soft “pre-compression.” In
for every channel. The cue function provides an independent headphone mix for
one of the examples, that is exactly what I have done. The output level “Mix” can
overdub sessions. “Cut” allows you to turn off individual tracks. You can create a
42 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 43
solo group by switching the “Cut/Solo” switch in the Master Section to solo. The Test Results
device remembers the “Cut” and “Solo” settings of the individual channels, allow-
Please only look at this list after you have done your own listening tests, so that
ing you to create complex A/B comparisons between complete different channel
you are not biased. Please print out the “Evaluation List.pdf ” before starting with
combinations. When used uniquely as a summing device, the potentiometers
the comparision test.
should not be moved. With the correct input and output levels, the channel po-
tentiometers can be left at the position all the way to the right, without clipping Here is the sequence arrangement in the OMF file:
in the mix. This is certainly the fastest total recall method. Caution: In some cases
with high-level input signals this can lead to a slight analog compression in the Sequence: 1 2 3 4 5 6 7 8 9 10 11 12
stereo sum of the summing unit. The panorama law is fixed to a 3 dB attenua- Ch. A int 8816-2 SPL-2 int Tube-T.-1 int int int int 8816-1 int int
tion at the center position by the manufacturer. Because of this the mix relation- Ch. B 2-Bus 1 int int Sumo-2 int 2-Bus 2 SPL-1 Tegeler-2 Tube-T.2 int Sumo-1 Tegeler1
ships are not changed when a mono instrument is placed in the center. Without
this attenuation, the mono signal, which is equally distributed between L and R,
would result in a 3 dB gain increase when placed in the center compared to side Summary
placement. A particularity of the 8816 is its automation capability, inspired by
Without going into the details concerning the differences of each summing unit,
big mixing consoles. This is controlled with a USB-connected computer. In the
there is a small acoustic advantage with analog summing. Characterizations such
classic total recall fashion, the potentiometers can be set to the values displayed
as “unity” or “the mix feels tight” come to mind. The question is whether this
on the computer monitor. These virtual pots then disappear as soon as the values
improvement comes from analog summing itself or from the sound-coloring na-
between the physical pots and the real pots match. Most users probably do not
ture of the analog circuitry. In my experience, proper mastering allows for much
have the patience to deal with this when they are only dealing with summing. The
greater sound optimization in terms of these sound characterizations.
panning and gain automation can be very helpful if the 8816 is used as a front-
end for a collection of synthesizers. The ability to solve phasing problems by ste- It is up to you to determine if the investment in 16 converters, good cable, and a
reo reduction of widely spread layer sounds, or the ability to place any keyboard summing unit really makes sense for you. The previously mentioned aspects con-
more in the center is a very useful feature for this use. cerning working faster and avoiding unnecessary A/D conversions using analog
outboard devices is also to be taken into consideration.
To summarize the functions of the master and monitor section: headphone jack
with potentiometer and switchable summing signal; speaker toggle (A/B); talk- In conclusion, using analog summing devices can enrich the results of a mix both
back level; talkback mic; master insert; insert mix return; insert can be switched as a result of the individual character of the device, and through avoiding cumu-
to M/S thanks to the internal M/S encoding and decoding circuitry; 2 track re- lative truncation. A lack of room ambience during recording cannot be com-
turn as well as iPod jack on the front panel; high-quality A/D converter with DSD pensated with analog summing. Consistent and continuous use of 32-bit float-
outputs and sampling rates of up to 192 kHz available as an option. With the ing point files along with good reverb devices make professional software-based
additional fader pack the volume potentiometers can be used for aux-send level mixes possible.
control. The 8816 is also expandable.
MORE BITS FOR MORE SOUND!
To examine this topic – which is essential for both production and mastering
– we will take a brief look into the basics of digital techniques.
44 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 45
PCM – The Principles of Digital Audio
PCM stands for Pulse Code Modulation and is the most common way that au-
dio is digitalized. The .wav and .aif formats are based on PCM. PCM technique
involves cutting an analog signal into slices, called samples. The audio CD format
uses 44,100 samples for every second of audio material, while audio for DVD
video employs a sample rate of 48 kHz. Higher sampling rates are used for for-
mats like DVD-Audio. The highest reproducible audio frequency is half of the
sampling rate. This means that with a sampling rate of 44.1 kHz, audio informa-
tion of up to about 22 kHz can be recorded. Every sample consists of a 32-bit
subframe, while usually at least 16 bits are used for representing amplitude.
Bit Resolution
The audio CD standard calls for a so-called 16-bit word length. Each of these
audio bits can be either a 1 or a 0. Therefore there are 216 possibilities. 216 equals
65536 steps (2x2x2x2x2x2x2x2x2x2x2x2x2x2x2x2), half of which are discrete
positive amplitude values and the other half negative values. A series of these
values can be represented as ascending and descending steps.
More bits for The reason audio frequencies can only be reproduced at up to “Stairs” in a digitalized file in WaveLab
more sound half the sampling rate is that in order to be represented, a fre-
quency (= oscillations per second) needs both a positive and
For the purposes of determining which bit resolution to use, here are a few basic
characteristics of 32-bit floating point processing:
– what you a negative value. At 44,100 samples per second that results in
an upper frequency limit of 22,050 Hz. u As demonstrated with the calculations above, 32-bit resolution provides a great
should know deal more processing precision and dynamic range than 24-bit resolution.
To make the bit pattern clearly visible, visually enlarge a .wav
about bit depth file in your DAW until the individual steps are recognizable. u As long as audio data remains in the 32-bit floating point format, it is almost
and truncation With 24-bit resolution there are 16,777,216 steps (224) – in-
impossible to produce overs. Even signals that exceed 0 dB can be precisely
processed and stored in the exponents of the 32-bit floating point data. With
creased bit depth creates a logarithmic improvement in dynamics and precision.
24-bit integer data (fixed point), levels above 0 dB are represented as 0 dB
Comparing 16-bit and 24-bit resolution, there is 256 times as much information
chains, leading to artifacts.
for a relatively small increase in data storage!
u If you work entirely with 32-bit floating point resolution, you do not have to
By multiplying the bit depth (also called “word length”) by 6, you get the maxi-
worry about potential overs. You only have to avoid signals above 0 dB at the
mum dynamic range that can be represented. Using this formula, 16 bits gives a
very end of the processing chain, when bit-reduction is necessary to convert
range of 96 dB and 24 bits increases the dynamic range to 144 dB.
the data to the target format (24, 20, or 16 bits).
With 32-bit floating point processing, 24 bits plus 8 so-called exponents are used.
u Most of the well-known PC and Mac-based DAWs (Logic, Cubase, Nuendo,
A detailed description is beyond the scope of this book.
WaveLab, DP, ProTools, etc.) as well as nearly all good VST and DirectX plug-
ins work in the 32-bit floating point resolution.
46 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 47
u All current Intel and AMD CPUs also calculate with 32-bit floating point. quality analog equipment before the material was digitized at the very end of the
Because of the uneven number of bytes with 24-bit files, it actually takes less editing chain and processed onto the CD master. If you carry out this last step
processing power to operate on the 32-bit basis. By not using 32-bit floating very carefully – using input dithering and paying attention to every detail – you
point, you are often actually using more computing power. With mixing proj- can get excellent results with 16 bits. However, back then, 20 different digital cal-
ects having more than 80 tracks, it is simply the increased data flow caused culations were not made along the way. Nowadays, it is frequently only the mi-
by hard disk read and write operations – along with PCI bus data capacity crophone, the pre-amplifier, and the monitoring path that are analog. Everything
– which limits performance. else happens exclusively in the digital domain and requires dozens of calculations.
Almost every digital calculation entails additional decimal places that should be
u As long as the amplification remains ‘normal’, that is, the sum of the signals
represented as precisely as possible. For example, a simple 1-decibel level change
does not exceed 0 dB, the 32-bit float format behaves like a 24-bit integer cal-
can require an extra 7 decimal places, making the 32-bit format absolutely neces-
culation.
sary for accurate processing. The problem with short word lengths is therefore
That adds up to many advantages. But in order to understand the fundamental the inaccuracy inherent in numerous successive processing. Just think of every
acoustic advantage we need to understand truncation and the consequences of little change in a mixing project which requires calculation: fades, cross fades,
cumulative truncation. level changes, sends, EQing, insert effects, and many more.
Who cares about the seventh place after the decimal point?
What is Truncation?
Our hearing has the ability to unconsciously perceive the subtlest sounds that
Truncation is the removal of lower value bits through the reduction of word
carry information relating to the three-dimensionality of a recording. When
length. When we have a 32-bit calculation result, which can only be saved in the
3D information components occur at minus 95 dB, we cannot consciously hear
24-bit format, then the mantissa is simply deleted. Opinions differ on whether or
them. However, we clearly miss them when they are no longer present. If you
not the sound actually suffers during this process. In many situations, it would
imagine that the lower value bits, which keep being discarded due to the usage of
be difficult to clearly distinguish between a dithered and a truncated file in an
low bit depth, always carry these subtle intricacies it is no wonder that the record-
A/B comparison if there is a decrease from 32 to 24 bits and this decrease only
ing sounds flat.
takes place in a single generation. But problems begin to arise if you repeatedly
decrease the word length in a computer-based mixdown. Try to create a deep,
transparent mix from 40 tracks with a program that does not consistently operate
What Happens When a 16-bit File is Saved as a 32-bit
on the 32-bit level. It is simply not possible because the many bit-depth decreases File?
remove increasingly more subtleties necessary for such a quality mix. When you The 16 empty bits will simply be tacked on to the bit word. At first, these empty
have mixed together the first 8 tracks, it still sounds good, but the quality will bits have no acoustic impact. I only do this when I receive 16-bit material for a
deteriorate with each further track. This is the consequence of cumulative trun- mix, so that I do not need to give any more thought to bit depth if any further
cation. editing and offline processing is necessary. Once I save my work after the first
processing (for example: normalizing, changing levels on individual clips, fades,
Why is High Bit Depth So Important for Sound? offline processing, etc.) a “real” 32-bit file has been created that uses the full bit
I have already provided some explanation on the importance of high bit depth depth when necessary.
for sound quality in the previous section on truncation. When CDs first came
out, music production was still somewhat of an elite sovereign territory – at least
nobody could produce a CD at home. Production generally took place on high-
48 49
Why Record in 32-bit Format When the A/D
Converter Only Supports 24-bit Resolution? O pen a 32-bit audio file in WaveLab and insert the plug-
in you want to test into the WaveLab Master Rack. The
plug-in has to be working, of course. In other words, the
This is the same situation as with 16-bit files. Eight “empty” bits are simply tacked
on. The advantage for the sound quality first becomes clear when the first pro- settings should not be in a neutral position. If all of the 24
cessing is undertaken and when you want to save the file after offline processing. bits light up (green), as well as the blue “Internal” light, then
Even if you are using the 24-bit format, the internal calculations are still being the plug-in is operating on the 32-bit floating point level.
If this is not so, you should check to see if there is a similar
done in 32 bits, but are saved and therefore truncated to 24 bits. Normalization is
or better 32-bit plug-in which will do the same job. Using a
certainly one of the most commonly used offline processes. When you normalize
16-bit file, the Bit meter lets you easily see that only 16 bits
a 24-bit file, the calculations are done in 32 bits and the least significant bits are
are being used, as long as the file is not being processed in
therefore thrown out since the 24-bit format you are using cannot use this higher
any way. Even if you just barely change the level, all 32 bits
resolution. And – as previously pointed out – today’s computers actually work
will be used.
more efficiently with 32 bits, therefore giving you the further advantage of saving
The WaveLab Bit meter lets you check the bit depth
CPU resources. being used. If you play a 16-bit file without any
processing, the meter will show a depth of 16 bits.
The technical foundation for obtaining satisfying results requires correctly As soon as you change the level – even by a very
dealing with bit depth and understanding concepts such as dithering and trun- small amount – the meter will show a 32-bit depth.
cation.
When Can We Leave the 32-bit Domain?
Summary of a Few Basic Rules Relating to Bit
u If you want to connect an external device via AES/EBU, you should dither the
Depth: internal 32-bit floating point signal to 24 bits, before routing it to the outputs.
u Always use the best possible resolution: 32-bit floating point wave files or even
better 64-bit if already available in your DAW! u If you want to burn a Red Book test CD as part of a music production, you can
export it for this purpose by dithering it to 16-bit resolution. If you are satisfied
u If you need to periodically save between processing steps (offline processing with the result, export the master for further processing in 32-bit format.
such as level changes, fades, normalization, DC offset removal, and any other
processing) in order to compensate for less computer power, you should con- u If you want to use a plug-in which you know only works in 16 or 24 bits, find
vert the original source material to 32 bits wherever possible. the corresponding resolution through dithering. I would prefer not using such
plug-ins.
u Always dither when converting to lower bit depths.
u Generally, dither as seldom as possible. Tip for Mastering
In the last stage of mastering, aim for the target bit depth by dithering. No more
u Only use 32-bit-floating point plug-ins. This can be checked with the WaveLab
changes may be made to the master thereafter as these will immediately occur in
bit meter.
PC-based 32-bit resolution and the optimal usage of 16 bits through dithering
u Don't truncate (shorten word length) without dithering. will be ruined. In addition, you must re-dither or you will truncate to 16 bits as
the internal calculation takes place at 32 bits.
u Deliver high-resolution material to the mastering studio.
50 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 51
Dithering
In this book, the emphasis is on practical approaches. For more information on
dithering and other basic concepts, please refer to the book “Audio Mastering
with PC Workstations.”
Nowadays dithering refers to adding very quiet noise to the signal in order to
modulate the least significant bits combined with bit mapping. This increases
the representation precision. Both techniques are used in the process of shorten-
ing the word length of files. The main purpose of dithering is to ensure a good
representation of the sound when an audio file is converted to a lower bit depth.
Simultaneous bitmapping ensures the optimal use of the available bits. This is why
in mastering, once the last dithering has been carried out, the audio file should
not be manipulated in any way.
Contrary to sample rate reduction, where all the advantages of higher sampling
rates are lost when the rate is reduced, good dithering can maintain most of the
sound quality when bit depth is reduced.
STUDIO ACOUSTICS
Acoustics can be divided into building acoustics and room acoustics.
With the Signal Generator in WaveLab you can generate sinus tones corresponding to modes in your
Building acoustics protects us from “unmusical” neighbors, traffic sounds, and listening room
other nuisances from our surroundings. Room acoustics deals with what hap-
pens to sound inside your studio; it is about the acoustic characteristics of your problem frequencies (room modes). A three-dimensional graphic representation
studio rooms. When I talk about studio acoustics here, I mean this latter aspect, shows any selected mode in your room.
having to do with the interior acoustics of listening or recording studio rooms. Open up WaveLab (demo version provided on DVD). With the Signal Generator
There is always a lot of discussion concerning studio acoustics. Although it is not (Tools), create a sinus wave at each of the frequencies that you noted.
the aim of this book to go deeply into this subject, I would like to touch upon the Loop these tones and play them back moderately loudly (above 70 dB/SPL) and
matter here so that you are aware of the importance of studio acoustics and how move around your usual listening position within a radius of about one and a half
it effects your work. Many engineers underestimate the significance of acoustics meters (about 5 ft).
with regards to their ability to hear and judge their own work. to help calculate
If your listening room has a resonance frequency at 82 Hz
Here’s a little experiment you can do; take a measuring stick or a tape measure
between frequency and
(room mode) and the bass drum in the track you are working
and measure your listening room (W x L x H). Assuming that most rooms are on has this same characteristic core frequency, the following wavelength, try the
quadrilateral, go to the following website: will occur: in your listening position, the bass drum might following website:
http://www.hunecke.de/en/calculators/room-eigenmodes.html sound much too loud, but if you move one meter (3´3´´)
and calculate the room modes of your listening room. Please make a note of the away, the same sound will be much too quiet; or the other
http://www.sengpielaudio.
com/calculator-
wavelength.htm
52 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 53
way around. How can you possibly judge your work under such circumstances? u Also use reference headphones – just like the acoustics of your studio, you
should be as familiar as possible to how they sound.
If the acoustics are not optimal, then you should know the defects as well as pos-
sible. For example, you can use a bass-sweep to record the hills and valleys in your u A good graphic analyzer is not only good for finding problems in the bass
listening position. A simple SPL meter is fully adequate for measuring. On the ac- frequencies – it is also simply a valuable tool for obtaining good results. In ad-
companying DVD you will find a level-calibrated bass sound that goes through dition, when used properly it will train your ears.
the bass range in semitones. The sound consists of the combination of a short
tone and a long tone in order to make it possible to get a feel for each frequency’s You will find further information concerning metering in the section “Using
decay time. Print out the pdf “Frequency Chart” and use it as the basis for your Metering for Monitoring the Dimensions” on page 64.
notes. Set a standard SPL meter to “C” weighting and make sure it is in “slow”
mode. Hold it vertically in your listening position and play the bass sounds with STUDIO EQUIPMENT
a uniform loudness of at least 78 dB/SPL. Make a note of any deviations. Even though the cost of investing in a modern music production facility has gone
The results could look something like this: down thanks to computer-based systems, there are a few things to consider in
order to make good mixes. Even with a great deal of knowledge and experience,
37 Hz +5 dB D it is very rare to make a good mix with just a laptop and a pair of headphones.
45 Hz -2 dB F Choosing good studio equipment will definitely bring you much closer to your
55 Hz +6dB A goal.
and so on.
Choosing Speakers
If an A in a bass line appears too loud, you can glance at the list and see right away
The same rules for studio acoustics apply to choosing speakers: the “truer” the
that your room (in the listening position) makes that particular tone twice as
signal that reaches our ears – and in this case, over the entire signal path (D/A
loud as others. This way, you can learn to work with any acoustic weak spots you
converters, cable, mixer or monitoring matrix, cable, amps, cable, speakers) – the
might have in your listening room. If you have any other listening positions in
better the conditions for judging and acting on the sound.
your room, for example for guest listeners like a sofa, then I recommend carrying
out a separate set of measurements for this listening position as well. This can be compared to a sieve; the coarser the sieve, the grainier the result and
the more difficult it is to shape it.
Typical SPL meter that The widely used foam insulation – produced in a near-infinite number of variet-
can be purchased for ies – is effective against slap echoes and overly long decay times in the middle In your listening room, speakers act like a box in a box. The small (loudspeaker)
around $50
and high frequency ranges. However, if overused, the result can be an irregular or box interacts with the bigger (room) box. That makes the choice difficult. The
dead sound in these frequencies. The above-mentioned bass and mid-range fre- typical demo room in a studio equipment department
quencies are little affected by this material. But that is exactly where many prob- rarely has good acoustics. Testing speakers in your own The finer the resolution
lems arise while recording and mixing. room also can adulterate the results, depending on of the chain of
speaker position and especially on the room’s charac-
How to Live With Imperfect Acoustics: teristics.
reproduction, the finer
u If you cannot invest much in the acoustic construction of your studio, the next
the possibilities for
best thing is to be perfectly familiar with the acoustics of your studio. Listen judging and acting on
to many reference recordings to have a clear, confident feeling for what your
the sound.
speakers and studio acoustics sound like.
54 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 55
The important characteristics for choosing speakers are: u Whenever possible, use large, heavy speaker stands which do not resonate
with the speakers.
Spatialization (Dimension 1 = Creating a stereo field)
u Avoid “loudness” circuits (as is common in hi-fi amps) in the monitor signal
Resolution (Dimension 2 = Frequencies)
path
Spatial depth (Dimension 3 = Depth)
u EQs are the last resort for correcting when all else fails. Only the best compo-
These factors depend on such things as speaker box design, crossover network nents should be used. It is better to make precise corrections with paramet-
circuit design, and much more. Another important criterion is good ‘’listenabil- ric EQs than with graphic equalizers, which have a greater tendency to create
ity’’ making it possible to listen without rapidly becoming fatigued. At the same phase errors and muddy sound.
time, a studio speaker should never have the pleasant-sounding and well-round-
One of the most frequent speaker setup mistakes I have seen is the use of “practi-
ed characteristics of a hi-fi speaker.
cal” workstation tables with side flaps for speaker placement. In most cases, the
If you do not work in a listening room which has been professionally measured, I speakers are much too close to the ears and too far apart.
recommend using products which have already proven themselves in profession-
The same rules apply to both your room and your speakers: in the end, the most
al environments. If good speakers sound bad in your room, the reason is prob-
important factor is your familiarity with your listening environment. I once met
ably the quality of your room and not that of the speakers. In this case, change the
a producer – now successful internationally – who achieved excellent results in
environment, not the speakers.
the basement studio he had at the time. He mixed with plastic active speakers
that cost less than $100 (used nowadays as multimedia speakers for comput-
The Basics of Setting up Speakers: ers). Combined with headphones, he could perfectly compensate for the lacking
u Set up the speakers within a 60° angle (30° left of the middle and 30° right of sound quality of the speakers. Of course, this is not an ideal situation, but it was
the middle). certainly very expedient.
u Set up the speakers symmetrically in the room.
Front-end
u Keep enough distance between the speakers and the walls (at least 50 cm By front-end, we mean all the devices that are placed along the signal path before
(20´´), refer to the set-up instructions of your speakers). the actual recording process. These principal elements are microphones, mic pre-
u Ensure that the listening distance is sufficient (with near-field monitors, at least amps, A/D converters and cables. But it also includes the dynamic processing of
1 meter (3´3´´)). compressors and EQs. Here we will only address the principal devices.
u The tweeters of both speakers should be the same height as your ears when Unfortunately, a bad choice of front-end devices often is a major source of prob-
sitting in your monitor chair (ca. 120 cm (4´)). lems. The quality of every single component – including cables – is so impor-
tant that I recommend investing in ONE top-quality signal chain rather than
u Avoid unnecessary objects (such as CRT monitors) in the sound field between several mediocre. For example, if you can invest $2,000 in a mic preamp, then
the speakers and the listening position. choose one that only lets you adjust quiet/loud – but does this perfectly – rather
than a machine with numerous extras like a compressor, an EQ, DeEsser, which
u Use spikes under near-field monitors to insulate against structure-borne sound
are good, but not excellent. All of the fineness, quality, depth, and resolution of
– this will tighten up the bass sounds.
a good recording chain will make every further processing step in your DAW
easier. A finely-recorded lead vocal recording is obviously much easier to place
56 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 57
in the mix than one badly recorded with a $150 dollar microphone, a third-class If you plan to purchase a new interface, you should first create a needs-profile,
preamp, and the computer’s built-in 16-bit converter. outlining the number and format of inputs and outputs necessary; then see what
is on the market that fulfills these criteria. Your budget will further narrow the
You (or your clients, if you have a mix gig) determine the level of quality of the
choices so that you will only have a few devices to choose from in the end. In any
entire production with the front-end chain! A top-quality computer-mixed pro-
case, look for the appropriate products in specialized music shops, not computer
duction can only be achieved with the best-recorded signal quality possible.
stores.
Cable is often neglected in discussions about the front-end. Recently, I had a rev-
Minimum requirements for PC or Mac audio interfaces:
elation when recording vocals with completely new cable bought from VOVOX.
The front-end – already excellent quality – got a clear boost with improved depth u ASIO driver for Windows systems, Core Audio driver for Mac. ASIO stands for
resolution and finer, cleaner high frequency transmission. Please do not cut “Audio Signal Input/Output” and is an open standard developed by Steinberg
corners when cabling your front-end chain – good cable is a worthwhile invest- that allows for low latency operation.
ment.
u Reliable operation down to 3 ms latency. (Low latencies are only important in
Since we are focusing on a mixing procedure that should also be able to make the recording and production phase. To free up CPU resources, you can set
a lower quality recording sound good, we should also look into the back-end, latency as high as 46 ms while mixing. With higher values, you will feel a slight
which is also important for the mixing process. delay when working with your DAW.)
u Audio interface drivers are crucial for reliable operation. You can usually see
Back-end on the support page of the manufacturer’s Internet site if updates are posted
Just as the name implies, the back-end describes the signal chain from the me- regularly. Forums often can quickly give you a good idea of both quality and of
dium (computer hard disk) through the D/A converters, the monitor matrix or customer satisfaction.
mixing console, the power amp, the speakers, and of course the cable paths.
u 24-bit S/P-DIF connections. Cheap interfaces often support only 16 bits. Be
sure to check this!
D/A Converters
u Wordclock input and output, in order to be able to synchronize other digital
The D/A converters are the first step along the back-end signal chain and play a devices. Audio interfaces without wordclock can only be used alone, without
significant role in terms of representing depth and space. Built-in converters are other devices.
unacceptable for any ambitious mixing project. When choosing a high-end au-
dio interface, do not shy away from comparisons, even though it may not be easy
or practical to install and remove different devices in your computer. The 24-bit,
96 kHz audio interfaces are the absolute standard nowadays. Because there are
only a few different converter chips on the market, the differences in quality lie
mainly in the analog circuitry of the converters and the quality of the clock.
58 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 59
Monitor Controller, Monitor Matrix,
What is latency? and Mixing Console
L atency, or input latency, refers to the
amount of time the computer takes to
record and reproduce signals with a given
Cubase and Nuendo offer the function Constrain
Delay Compensation, this allows you to simply
switch off plug-ins, which in turn increase the
Most often, mixing consoles serve as switching and adjustment units for speak-
ers, conveniently bringing together features like talkback, machine controller,
audio interface. Low latencies are necessary for latency. This is helpful when you want to lay and multi-channel recording in one relatively inexpensive package. A modern
sending the recording input signal to monitors down another track in a song where the mix computer-based studio can also function without a mixing console and is often
or headphones, without giving the musician a is nearly finished. Some DAWs like Cubase and even better without one.
feeling of delay. Sound needs 3 ms to travel one Nuendo add the so-called output latency with
When you imagine how little technical cost and effort goes into a single pre-amp
meter (3´3´´) through a room. A bass player who the input latency. Output latency is only present
or monitor controller circuit in a 24-in-8 inline mixing desk costing little over
is 3 meters (10´) from the drummer – or the side when CPU-intensive plug-ins are in the master
$1000, then doing without this equipment is comprehensible. It is obvious that a
fill monitors providing the drum sound – has to bus, group channels, or VSTi channels. The
monitor controller with just a small number of choice components dedicated to
deal with a latency of 9 ms. Depending on the output latency is brought into play in order to
the few tasks that are done outside of the computer is better for the job.
musician’s level of training, latencies starting at apply automatic latency compensation over
6 ms will be perceived as unpleasant. the entire mixing console. The Constrain Delay Another significant point against using a large mixing console is the extremely
If your DAW’s CPU meter is over 50% because Compensation button only applies to the CPU- large, acoustically unfavorable surface, which can add a comb filter effect to the
you are using many tracks or have plug-ins that intensive plug-ins in the above-mentioned speaker signal.
pull a lot of CPU, then low latency levels can channels, since the processing time (for automatic
result in crackling in the monitor path. In this latency compensation) cannot be calculated in In addition, using two mixing consoles makes for a confusing workflow: what
case, either remove the plug-ins or increase the advance, as is the case with audio channels. should I manage with the external console and what should I manage with the
latency in order to ensure flawless operation. computer-internal DAW console? These thought processes become burdensome
while recording an entire band live. The existence of two
Cable
mixers is especially irritating for novice studio users. Important Information!
Since the internal mixing console is a complete represen-
• Do not control monitor levels digitally within
The cable you use is as important in determining sound quality as the speakers. tation of its analog counterpart, concentrating on ONE
the computer! Reducing level on the digital
It is well worth it to purchase high quality cable. Bad cable can downgrade the console greatly simplifies concentration. For multi-track
domain reduces bit depth – the number of bits
best devices to mediocrity. To limit costs, concentrate on the monitor signal path. recordings with bands, you just need to make sure there
used – and inevitably leads to a flatter sound.
With active monitors, this consists of only two pairs of cables from the convert- are enough inputs and outputs, and to set up mic preamps
as well as headphone amps outside of the computer. • Always send a full-level signal out of your
ers to the monitor controller and from the monitor controller to the speakers. In DAW into the converters and reduce volume to
addition, I recommend having high quality instrument and microphone cables From a purist point of view, it is better to use a really the desired level only with high quality analog
as well as good preamp-to-converter cables on hand. I have included some useful good-sounding, high quality monitor matrix or monitor equipment.
Internet links to a few manufacturers in the index. controller that has integrated talkback. The SPL Monitor
• In monitor systems like the TC/Dynaudio Air
series or the JBL LSR4328 & 4326P, remote level
control is analog – after the D/A conversion – in
order to take advantage of full bit depth on the
digital level.
60 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 61
Talkback Controller (MTC) is one of the pioneer products in this field. Nowadays
Remotes and Controllers
there is a large choice of products on the market.
Remote controllers are a relatively new kind of device that arrived along with
If you need several different headphone signal paths, then simply set this up in- the development of audio computers. They vary in price between about $50 and
ternally in the computer and send the signals over separate outputs, going directly $100,000. The basic idea is a replacement of mixing console panels, in order to get
into the headphone preamps. You can replace the preamps from an obsolete mix- away from the mouse-oriented operation of DAWs. It might also have a lot to do
ing console with high quality individual units. This way, you will surely achieve with outward appeal, though. Beyond the comfort of using well-built encoders,
better results than by using inexpensive mixers. these devices have only two real advantages for me:
62 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 63
this device is a forerunner on the market.
My personal experience from observations of many project studios has led me
to conclude that the combination of a minimal controller and systematic use of
working methods using keyboard shortcuts is more effective than an inefficiently
used controller and the constant, time-consuming switch between mouse and
controller. In the high-end range, there are a few interesting solutions, such as
the Icon console from Digidesign, the Euphonix 5MC-EuCon or the Smart AV
console. For a professional-sounding mix, a good mouse is sufficient, but obvi-
ously a large mixing console or controller is more fun to use. In a professional
studio situation, where an engineer works 10 hours a day and 25 days per month,
the advantages in terms of ergonomics and protection of the eyes and the hand
(which holds the mouse) are clear. Detaching ourselves from the visually-laden
environment of the computer monitor helps to obtain more balance between the
senses, to the benefit of our sense of hearing. This can be a factor in the quality of
the production. In addition, the purchase of large controllers can be determined
by rather odd economic factors such as being able to justify studio fees. The show
must go on!
64 65
Routing from the audio computer to the metering computer
16-bit or – even better – a 24-bit S/P-DIF input. This would be enough for using
the Pinguin or the PAS-Metering smoothly and without latency problems.
Since I use an RME Hammerfall audio interface in my computer which includes
a DSP mixer (“Total Mix”), I can route the signal for the monitor controller in
the summing buss digitally at 0 dB to the S/P-DIF output. This output is then
connected to the metering computer with a 75-Ohm S/P-DIF cable. In my studio
I also use an RME Multiface for the metering computer. That way, 6 parallel sur-
round channels can be routed in via ADAT optical cable for Pinguin surround
metering. The RME Multiface can also be used very well with laptop computers
with the Cardbus interface using the PCMCIA slot, making it possible to use the
Pinguin as a mobile metering solution for jobs outside the studio or even for live
concerts.
66 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 67
Here are some practical uses of a peak meter:
Notes on surround metering:
u Checking the peak headroom of the sum throughout the entire production
Y ou can see on the chart [preceding page] that I
have lowered the levels of surround channels 3
process, providing information for limiter settings in the master bus.
to 8 by 4.8 dB each and have routed them to the S/P- u Measurement of the operation, reliability, and precision of new plug-ins. (Are
DIF output. The reason for this is that for surround the level indications on the knobs correct? Is the limiter really keeping overs
productions, I like to hear the mix of the 6 surround from coming through?)
channels in the listening position while seeing the
results on a spectrum analyzer. That is why L & Ls is u Stereo level adjustment.
routed to the left and R & Rs to the right, while C & LFE
u Indication of level relationships between different elements; for example, be-
are in the center of the panorama. The level reduction is
tween bass drum and snare. (When the bass drum’s amplitude is greater than
necessary because the sum of the 6 individual channels
the snare’s, then it is probably because of an error in judging the levels or per-
in the stereo display should only exceed 0 dBFS in
haps a problem with the bass frequency reproduction of the speaker.)
exceptional cases.
Most devices that measure the amplitude peaks also provide loudness measure-
It is important that the signal is digitally routed into the metering and that the ment. Generally, the display shows a second bar which is colored differently or
level is exactly the same as the master level which goes into the D/A converters otherwise differentiated and which runs parallel to the peak value display. For ex-
before the studio monitors. If your S/P-DIF output is already in use, you can use ample, the WaveLab peak display is green on the outside and blue on the inside.
a simple Y adapter, which is available for both coaxial and for optical S/P-DIF. Loudness should indicate the “thickness” and therefore the perceived loudness
of the audio material. Loudness is always lower than the peaks. The important
Peak and Loudness Measurement factor in loudness measurement is the readability, which corresponds strongly to
What is the difference between peak and RMS levels, or peak and loudness? The the ballistic of the display as well as the ability to objectively be able to read and
maximum level of a digital recording, measured in dB = decibels (peak) is repre- interpret the information displayed.
sented by the greatest amplitude (wave size). For example, when the waveforms of Loudness measurement can be used:
two concert guitars perfectly match at a particular point, then amplitudes resulting
from the addition of both individual amplitudes can be very strong without us per- u To check the average loudness of tracks. (Is the mix too compressed?)
ceiving this sound event as being twice as loud as the other parts of the song. The
u To check the loudness consistency of different takes of a track.
peak value does not tell us much about perceived loudness.
u To check energy distribution between L & R in stereo.
The difference is therefore that loudness is measured in dB/RMS (root mean
square), which represents the density of a recording and therefore tells us some- u To prevent mixing and mastering too loud.
thing about its loudness. Looking at the waveform of a “loud” song is reminiscent
of a thick sausage. The thicker and more “sausage-like” the recording looks (with
the same waveform magnification), the louder it is.
68 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 69
Goniometer
There is a lot less to say about goniometers. In Pinguin it is called stereo meter,
which is self-explanatory. Goniometers provide an intuitive way of visualizing the
amount of stereo in the sound. If you send a mono signal into the left channel, it
will show a line at a 45-degree angle on the left side; a line in the middle indicates
a mono signal in the middle and a mono signal sent into the right channel pro-
duces a line at a 45-degree angle on the right side. If you send a dry mono signal
into the center (for example, the side click of the snare) along with a long stereo
reverb, you will see a line in the middle followed by a rounded shape. It is best
to examine this within your DAW. The goniometer is used to quickly and intui-
tively get an idea of the stereo width in different mixes, which should be unified
and consistent. Goniometers also provide information about the differing stereo
widths of keyboard sounds and all stereo signals in general. In addition, phase
problems are clearly represented. If the signal doesn‘t look like an elongated egg
along the vertical axis, but instead is horizontally oriented, then there is a phasing
problem. The Pinguin meter also provides a numerical readout that indicates the
left-right weighting in degrees.
Spectrum Analyzers
Spectrum analyzers are certainly the most important tools for visualizing sound.
A good spectrum graph reliably shows the frequency distribution of a record-
ing. The most common frequency resolutions are one band per octave and three The Pinguin Goniometer shows Stereo Width.
bands per octave. The frequency spectrum is split up – like a graphic EQ – and
shown in separate bands. In my opinion, reliable and intuitive work with such ends of the entire frequency range. From a scientific analysis point of view, the
resolutions is not possible. The spectrum analyzer in the Pinguin PG-AM-Pro “logarithmic pink noise” setting is questionable, since it does not provide any
displays in half-step resolution (12 bands per octave) and offers a number of precise measurement information. But what we need in this case is an intuitive
different settings and display possibilities. The PAS software can even be set to and readable display that gives us a sure way to recognize if specific frequency
quarter-tone steps (24 bands per octave). A particularly intuitive mode is the regions are over or under emphasized – this is why this display mode is very use-
“logarithmic pink noise flat” setting, which displays pink noise on a logarithmic ful. A good graphic analyzer uncovers problems in the infra-bass frequencies that
frequency resolution as a flat horizontal line. In this mode, the frequency dis- would escape even the best subwoofer.
tribution of recordings are particularly easy to interpret, since a good-sounding
recording has a nearly flat frequency spectrum with roll-offs on the low and high
70 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 71
The quick and intuitive graphic representation of information is particularly help-
ful, making it possible to easily recognize parts of the signal that are part of the
music’s rhythm or not. Working with analyzers taught me to put lowcut filters on
all tracks to rid them of unnecessary bass artifacts and thereby create more space
for the bass. With a glance, an analyzer will show you if a mix was properly done. Sine wave, fully correlated (0 degrees)
The peak hold function allows you to gather very precise frequency information
by using the mouse to highlight specific areas, making accurate use of EQs possi-
ble. If you listen to an instrument group spot solo, it is quickly evident if the lower
mid-range is overemphasized with respect to the lead vocals, making it possible
for you to effectively remedy the situation. On the other hand, you can also see
where frequency ranges are not being used, so that you can use these for events
that had been previously buried. All of this teaches you about frequency/pitch
relationships and the sweet spots of each instrument. When used live, the loudest
frequency is automatically displayed, which is very useful if there is feedback and
you need to know which frequencies need to be attenuated. Good metering is
highly valuable for the motivated sound engineer.
Sine wave, right channel shifted by 90 degrees;
Correlation Meter Meter in the middle
72 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 73
Correlation meters used to be a must in every studio and were usually either
mounted on the console meter bridge or simply built in. Because mono play-
ers are less common than they once were, mono compatibility is mostly only
an issue in television production. Even when the target medium is a vinyl LP,
you should still pay attention to correlation! Since LPs are mono in the bass
frequencies, recordings with phase cancellation cannot be used for records
because needles cannot follow such movement. While working in the studio,
glancing at the correlation meter should always be accompanied by switching
the monitors to mono. Only when your listen to your stereo mix “collapsed” to
mono can you really hear if potential phase displacement (in mono) truly re-
sults in acoustically relevant phase cancellation. If the mix falls apart when you
switch to mono, then maybe you have panned too much low frequency sound
or you have too much phase cancellation. Using the mono switch is referred
to as physical phase cancellation, since both channels are mixed together on a
physical level. Acoustic phase cancellation – in other words, the cancellation
that you hear in a room – is always weaker because the signal in the room is
naturally de-correlated.
Pinguin Spectrometer
For more information on the potential causes of phase cancellation, please see the
section “Dimension 1: Horizontal = Panorama’’ on page 107 Pinguin Spectrometer
For people with an affinity towards all things nautical, this device might re-
mind them of a “fish finder.” Once you get used to it, it is a good alternative to a
spectrum analyzer, with the last 20 seconds of audio being spectrally analyzed.
If the playback is stopped, the spectrum analyzer also stops, so that you have
time to check over problem areas with the computer mouse. In the line at the
lower edge of the window, information on frequency and level in 0.2 dB steps
is provided. The horizontal axis is the time axis from 0 to 20 seconds and the
vertical axis represent frequencies from 14.57 Hz to 22.354 kHz. Levels are color
scaled. Yellowish-red lines along the horizontal axis could for example indicate
continuous overemphasis in a particular frequency band, inciting the engineer
to find a solution to the problem. Vertical lines indicate short sound events. The
spectrometer is an effective tool for helping you see particularities in the audio
material along the time axis, making it possible for you to immediately see level
inconsistencies and ensure a good frequency distribution.
74 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 75
Extra DSP Power for Software-based Mixing
If your goal is to mix complex productions with many tracks on your computer
without having to constantly resort to rending your files as offline processes, then
you will definitely need extra DSP (digital signal processing) power. I see today’s
music computer as a tape machine, MIDI sequencer, and fully automated mix-
ing console. The sound production stage is not a part of the mix process since
the MIDI files have already all been converted to audio. For all intensive signal
processing that goes from a rudimentary lowcut filter on a single track to the
use of various send effects, using extra DSP power is unavoidable. A native (i.e.
computer-internal) plug-in is by nature a type of compromise between its per-
formance and the CPU power needed for this performance. It is relatively easy UAD-1 PCI Card from Universal Audio
to program an excellent native compressor, which is so performance-hungry that
you can only use a few on your computer. On the other hand, it is difficult to cre-
ate a good-sounding compressor that doesn’t bring your computer to a crunch- UAD-1 Card
ing stop when you use 25 instances of it. To compare with standard studios, I
consider extra DSP as outboard equipment. Regardless of which host application The UAD-1 card is a rather small card for PCI or PCIe slots. It relies on a very
you use – all midrange consoles costing around $100,000 have one thing in com- powerful DSP processor that was originally designed for video rendering. The
mon: they all have EQs and dynamic effects for basic work. But even here, the card operates completely with 32-bit floating point resolution and therefore per-
most important tracks are processed with the best outboard equipment. mits seamless integration into most host environments. Only in-house Universal
Audio plug-ins work on this card, which is available in several different varia-
Hard Choices tions. The only fundamental difference between the different UAD-1 card prod-
ucts is the software that is delivered along with the hardware. The UAD-1 varies
The following section provides an overview of four established, cross-platform in price between around $500 and $2000 and can be expandable to up to four
DSP systems: the UAD-1 by Universal Audio; PowerCore by TC; the Waves UAD-1 cards in one computer. Unlocking and updating additional plug-ins is
APA; and the SSL-Duende. To be complete, I should mention Liquid Mix from easy. The plug-ins that you own on one card will automatically be available on all
Focusrite. But I have simply not had the chance to thoroughly test this device and further cards ($500 version).
develop my opinion about it. If you are interested, you can surely find information
and test reports on Liquid Mix in the Internet and in specialized magazines. The latency which results from sending the signal to the card and then bringing
it back to the computer‘s CPU are equivalent to the system latency x 2 and are au-
tomatically compensated in Cubase, Nuendo, and most other hosts. When auto-
matic latency compensation is not available, then a plug-in tool (delay compensa-
tion) can help out. The more power required of the card, the more it also requires
in terms of CPU resource management power. The main focus of the product
is the emulation of legendary classic equipment such as those from Universal
Audio, including, for example, the 1176-LN compressor and the LA-2A Tube
Limiter. These are still little treasures that can be found in well-equipped ana-
log studios. Recently, the plug-in choice was broadened with a series of modeled
76 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 77
Neve legends (1073-EQ and 1081-EQ, as well as the Neve 33609 Compressor).
I will go into further details in the chapter regarding plug-ins (page 203).
The UAD-Xpander system is a UAD-1 card built into a small housing for laptops
with the new ExpressCard 34 connection (successor to the PCMCIA cardbus
slot). With a PCIe adapter, you can also connect the UAD-Xpander to a desktop
computer.
PowerCore FireWire by TC
Included in this category are the mastering tools, such as the MD3 compressor,
taken from the TC System 6000 (with a price in the five-figure range), or the
reverb algorithms of the System 6000 VSS3-Reverb. PowerCore is aligned more
with the high-end professional market with technically modern tools.
UAD Xpander with ExpressCard 34 connection
The PowerCore series behaves similarly to the UAD-1 DSP unit when it comes
to latency compensation. Plug-ins can be unlocked with the serial number of the
PowerCore PowerCore being used and is valid for one device. Automatic unlocking occurs
only when two or more PowerCores are connected to a single computer, in which
Shortly after the release of the UAD-1, TC brought out the PowerCore PCI card, all the licensed plug-ins are unlocked for all of the PowerCore DSP units.
followed by the PCI-Express version, PowerCore Compact, PowerCore FireWire,
The FireWire PowerCore is happiest when it is the only device connected to your
and PowerCore X8. The PowerCore system is also expandable and is based on
FireWire port. Combining the SSL-Duende or a FireWire audio interface should
150 MHz Motorola chips that operate in the 24-bit integer format. The Compact
be tested individually. One way of avoiding conflict would be to add additional
version has two Motorola DSPs, the PCI-Cards and the FireWire versions four
PCI FireWire cards for additional FireWire devices, as long as slots are still free.
DSPs and the X8 – as its name implies – is based on 8 DSPs. Depending on the
plug-in, the resolution goes up to 48-bit integer. The 24-bit integer format is the
only negative point; the PowerCore does not operate on the basis of 32-bit float- Incompatibility Between UAD-1 and PowerCore?
ing point format like the UAD-1 card does. Both cards generally get along absolutely perfectly together.
See page 79: “Incompatibility between UAD-1 and PowerCore?” Nevertheless, from a practical point of view, a few things should be taken into
Many very high-quality plug-ins from third-party manufacturers such as the consideration, which stem from the architecture of the cards. The vintage plug-
Oxford series from Sonnox are available. The base price begins at about $500- ins from the UAD-1, which operate on a 32-bit floating point basis, sometimes
$800 for the PCI and Compact versions. The flagship X8 costs around $1500. do not provide a way of controlling output levels (for example, the Pultec EQ)
All versions are delivered with a basic assortment of plug-ins. You should plan or output ceilings in order to reliably control levels and peaks such as what you
to budget more for the more interesting plug-ins, which are optional. You will be would usually find in modern, digital devices. In itself, this is no problem, since
rewarded with top-professional tools, which will turn your computer into a seri- the 32-bit floating point resolution can easily deal with overs well above 0 dBFS.
ous audio workstation. But if you put some emphasis on the UAD-1 card’s Pultec EQ and then route
78 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 79
an “overdriven” 32-bit signal into a TC PowerCore plug-in, then you may have Waves
problems. PowerCore’s inputs can only handle 24-bit signals, even when they
are internally calculated at 48-bit integer, as is the case with the MD3 software. With the APA DSP series, the well-known plug-in creator Waves has created an
This is why you should always make sure that PowerCore’s inputs are not being innovative DSP system. APA stands for audio processing accelerator. There are
overdriven (check the input over display). I recommend that you work with some two models available: The APA32 is a 19” 1RU device which – strangely enough –
headroom or use a reliable limiter with activated 24-bit dithering in front of the has 30% less power than the ½ rack sized APA44-M. The latter costs about twice
PowerCore along the signal path. You can also use a CPU-saving leveler in front as much as the APA32 and retails for around $1,500; two plug-ins are included.
of the PowerCore plug-in and reduce the level enough that the PowerCore plug-in The ingenious networking concept allows you to not only cascade multiple units
no longer shows any input overs. (Try out the FreeG plug-in by Sonalksis included to one computer, but to also share one or more APAs between multiple computer
on the DVD in the software folder.) The input level adjustment of the PowerCore workstations, giving good reason to create fights between workstation users for
plug-ins should not be used for level reduction in this case, since the level reduc- DSP power! When using more than one APA, an Ethernet switch is necessary.
tion occurs on the 24-bit level and cannot cleanly process the 32-bit overs. Calculation occurs in 32-bit floating resolution, meaning that this format can be
continuously used within most DAW environments. Only the additional Waves
I do not wish to stir up dithering hysteria at this point. If at one point, truncation
plug-ins are processed by the APAs. At present, all Waves plug-ins are not yet
is carried out (because of a lack of dithering), the mix won’t immediately fall flat.
compatible with the APA (Waveshell capable). An APA44-M is capable of han-
Truncation only becomes a problem when it is carried out repeatedly. There is
dling 26 Renaissance channels or 6 IR1 Reverb units. Installation is easy, as long
always a little room for interpretation in the debate concerning truncation vs.
as a 1000 Mbps Ethernet port is available. Latency can be set in the control field
too-frequent dithering. In the end, your ears should decide!
and is at least twice as high as the system latency you are using. Higher latency
The PowerCore should be considered as an external device. If you route an exter- provides more reserve power and better dropout protection, but make for more
nal digital effect device into your setup, you are also dealing with just the 24-bits sluggish handling with values of over 2048 samples (46 ms).
provided by the AES/EBU standard.
SSL Duende
The legendary British high-end mixing console manufacturer Solid State Logic
recently discovered the computer market and has brought out the Duende 32-
80 81
bit floating point DSP unit. The Duende is not a DSP
Firewire and unit that emulates the SSL character; it is the real SSL
System Requirements sound.
for this function (bouncing, export audio, or rendering), which actually can be
thought of as a mixdown with automatic re-import of the processed file. This can This way of working is not as ideal as when using all plug-ins in real time, but
be laborious and time-consuming. Offline processing is easier, since you can do does provide a reasonable and very good alternative to external DSP units or
the processing right from the main window of your DAW where the file to be limited CPU power. For example, you can assign your favorite EQs to a keyboard
processed is located. Nuendo and Cubase provide a special function in that one shortcut and then process a complete 16 track vocal group offline. Other DAWs
or more audio files in the project can be simultaneously processed. This can be a provide similarly structured functions for offline calculation.
track, an individual event, or a complete block of tracks. These offline processes
are saved in each calculated event and can be undone – even out of order – with
the “undo” function. MIXING CONSOLE ARCHITECTURE
Most readers are familiar with typical mixing console functions. Since modern
DAWs now offer very complex routing possibilities, I would like to go into the
basics of mixing consoles so that “mixing console novices” will be able to under-
stand the terminology we are using.
The world of studio mixing consoles can be divided into two categories: in-line
and split-consoles. Both mixer concepts are designed to record up to 48 tracks si-
82 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 83
to “line input” on both kinds of mixing consoles so that they can be used as
full-fledged audio channels for the mix. Most consoles provide some kind of
possibility for switching between recording and mix modes. The mix mode
routes the tape returns to the input channels and frees the tape return channels,
so that they can be used for additional inputs, keyboards, or effect returns. A
24 in 24 console can manage 24 channels during recording and 48 for mixing,
since a separate control room mix – independent from the input channels and
recording levels – is not necessary during the mixing process.
In-line Consoles
As far as I know, there is no counterpart to the in-line mixer in the virtual world,
The Nuendo/Cubase – Process List handles event-related offline processing history.
so I will simply point out the more obvious differences. Tape return channels are
optically and physically integrated into the input channels on an in-line console.
The most apparent characteristic of an in-line console is the fact that there are
multaneously with good sound and appropriate levels, and to channel them into two channel faders and two pan pots per input channel. Generally the main fader
a recording medium as well as providing independent control room and head- is slightly larger. During recording, it allows you to adjust the level of the recorded
phone mixes. In most project studios, people don’t usually record more than one signal and in mix mode, it controls the tape return signal. In recording mode,
track at a time. This is why many self-taught audio engineers have a difficult time the in-line channel handles the control room mix; in mix mode, it can be used as
understanding the entire concept on which complex mixing consoles are based. an extra channel for FX return, or additional inputs, for example. Most consoles
In addition, the simultaneous co-existence of several mixing consoles can further are designed so that the input or main channels are better equipped than in-
add to the complexity (the computer-internal software-based mixer, an analog line channels. For example, a main channel might have a four-band parametric
or digital hardware mixing console, and often an audio interface mixer, such as equalizer and eight aux-sends. Most in-line consoles are very well designed and
the RME Total Mix). If there are no clear work procedures, then even seasoned offer many “flip” possibilities, in that functions can be assigned in various ways.
professionals can become filled with doubt and confusion. This is why I believe If you are working with such a console, you should study the user manual until
that two good microphone preamps and a good volume control are worth more you have understood every function and you have the block diagram in your
than a mediocre mixer, and advocate getting rid of unnecessary hardware mixing head. That will improve your ability to spontaneously react to all situations that
consoles in favor of a clearer, more logical, and more effective setup. Nowadays, an engineer might encounter in the course of a working day.
a good DAW can handle complex live studio recordings with 4 or more separate
headphone mixes and all other necessary functions.
Each console is split into a channel area for the input channels and an area for
tape return channels. This is necessary for adjusting the sound and levels to be
recorded with the input section, as well as providing an independent control
room mix based on the tape return. The headphone mix is always captured
pre-fader from the tape return signal path, so that the musicians can hear what
the band has already recorded when they need to overdub. Tapping the head-
phone signal pre-fader makes it possible for the engineer to change the control
mix without affecting the headphone mix. The input channels can be switched
84 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 85
Before we can start working, the application must know what audio hardware we
Split Consoles
will be using and how we want to use the available inputs and outputs. With F4
Split consoles are somewhat easier to understand, since all channels are nicely you can open a dialog box concerning VST connections, input and output con-
laid out next to each other and there are fewer flip functions that tax our under- figurations, as well as connections to outboard equipment. The number of tracks
standing. In recording mode, a classic split console uses the input channels to ad- you can record to is the same as the number of inputs on your audio interface. If
just the signal being recorded and the tape return channels – generally located on you are using two stereo monitors and want to have two stereo headphone paths,
the right side of the console – are used for control room and headphone mixes. then the outputs of an eight-channel audio interface might be configured like
Like in-line consoles, the configuration can be switched to mix mode, which this:
routes the tape returns to the main input channels and frees up the split section
– previously used as tape return – for other purposes. Stereo monitor 1 Device-Port 1 & 2
Stereo monitor 2 Device-Port 3 & 4
Since the surface representations of the virtual mixers in Nuendo and Cubase Headphone Amp 1 Device-Port 5 & 6
are almost identical to hardware consoles, it makes sense to use Nuendo as an Headphone Amp 2 Device-Port 7 & 8
example to take a closer look at split console design.
The inputs might be configured like this:
The only thing that is missing on the virtual level is the switching between re-
cording mode and mix mode. The reason for this is that switching is no longer Device-Port 1 SPL Channel-One 1
necessary – unlike their hardware counterparts, software mixers can have practi- Device-Port 2 SPL Channel-One 2
cally an unlimited number of channels available. The input channel section can Device-Port 3 SPL Gainstation 1
just be clicked out of sight with the mouse. That would be somewhat wasteful for Device-Port 4 SPL Gainstation 2
a hardware mixer. Device-Ports 5 & 6 Line In guitar preamp
Device-Ports 7 & 8 Line In turntable
The Cubase and Nuendo Split Console The Input Channel Section now shows the number of channels you have con-
figured. These input channels are statically configured. This means that each in-
The mixer is split into the following kinds of channels:
put channel corresponds to a single physical input. Beyond general system con-
u Input channels figuration, there are no additional assignments necessary. The purpose of input
u Audio channels channels is simply to control the level and sound of the input signals that are
being recorded. This is why these channels can be compared to the recording
u ReWire channels channels of a split console in recording mode. We cannot hear anything because
u MIDI channels the fader determines the recording level and not the control room mix level. One
advantage of this concept is the option of inserting effects on the input side, or
u VST instrument channels
dropping in an EQ during recording, which thus processes the recording signal
u Effect channels in real time. These channels are therefore located before the hard disk recording
u Group channels medium. In the input channels, aux-sends are intentionally left out because they
would make no sense for effects or headphone sound. For example, if reverb were
u Output or master bus
placed on the vocals being recorded via aux-send here, then the singer would
u Audition bus indeed hear the reverb while recording, but could not hear it when the recorded
material is played back for a punch-in take. Or if the headphone mix were routed
86 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 87
limited in number, there is no longer the necessity of switching between record-
ing mode and mixing mode, as would be the case using a hardware console with
a limited number of channels. Audio channels allow us to create control room
and headphone mixes.
To create a headphone mix, choose the headphone output in the appropriate
channels in the send dialog box, then switch the send path on, and switch to
Headphone signal
path in pre-fader pre-fader. On the left side of the mixing console, the channel view switch lets you
mode show the sends of all channels, making it possible to quickly and easily create an
independent headphone mix.
The audio channel has a particular function that refers to a tape machine func-
tion; monitor mode. Here you can choose whether to hear the input signal or the
already recorded signal. When recording, it is very important to understand how
this switch works.
In the Cubase /Nuendo Preferences dialog (File Menu) you will find several set-
tings for Auto Monitoring in the VST submenu. I recommend using the “Tape
Machine Style.” On tape machines, this mode is also called “Auto Input,” which
better describes its function. Since your DAW is simultaneously serving as a mix-
er and a virtual tape recorder, this function has been introduced into the mixer
GUI. If Tape Machine Mode is selected, then you don’t really have to worry about
the monitor switch any more. It automatically puts a record-ready track into in-
put mode. While the “tape” is in record-ready mode – the track is “hot” – you
On the left side of the console is the hear the singer’s voice as the input signal so that you can talk about the next take.
channel view switch. In the above As soon as the “tape” is running and you are not recording, you (and the singer in
send-view of the console you can
see the selected headphone signal the headphone mix) hear the previously recorded material. As soon as you switch
paths, which are switched to „Pre- the tape machine that is already rolling (in other words, your sequencer is run-
fader.” The input assignments can
be seen at the top of the channels. ning) to record, then the channel switches the singer’s headphone mix to “input”
so that she can hear the recording at the input. The Auto Input or Tape Machine
out at this point, then the previously recorded take would not be audible, since Mode function is therefore a prerequisite to smooth transitions between moni-
here we only have the input signal. toring routing when punching in and out of a previously recorded take. The loop
function automatically switches between the two monitor modes, so that you can
So that the signals coming through recording channels can be heard in the con- concentrate on the performance.
trol room, the necessary number of audio channels must be created and assigned.
In the upper part of the audio channels is a field where a pull-down menu can be
opened with the mouse. Here you can choose the input channels that you have
already configured. To make another parallel to split consoles, the audio channels
are full-fledged tape return channels. Since these channels are equipped and un-
88 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 89
use the VU meter as an input monitor display for the recording level. In the Pre
position, the unaltered level is displayed.
At the right you can see the signal send points for PFL (Pre-Fader Listening) and
AFL (After Fader Listening) switch. This function is only available in Nuendo and
is for uninterrupted monitoring of tracks over a special “listen bus.” This makes it
possible to check a particular signal, while the master bus reproduces the entire
mix (useful for television broadcasts, for example). Differentiating between PFL
and AFL allows you to monitor signals and levels before and after insert effects
and channel level changes.
The middle of the diagram schematically shows the typical order of the channel
functions and various send points. After the phase switch – which changes the
phase of the input signal by 180° (phase reversal) if necessary – comes input trim.
Then come the first six inserts: EQs, compressors, and other insert effects. These
are followed by the channel EQ, which is good for rudimentary sound processing
needs such as a lowcut filter, then the aux-send points (when in Pre-fader mode).
Since the aux signal comes before the channel fader, the level is independent of
the channel fader position and therefore is a good send point for headphones.
The signal is even present when the channel is muted. This way, you can switch
off a track in the control room mix without affecting the headphone mix (this
function can be changed globally in the Cubase/Nuendo Preferences). Inserts 7
and 8 come after level and mute and – because of their position in the signal path
– are good for limiting and dithering. After the pan pot, the signal is sent to the
selected group or output channel.
ReWire Channels are full-featured audio channels whose signal comes from
ReWire applications such as Reason.
MIDI Channels do not have an equivalent in the hardware world. Since all of the
MIDI data is already converted to audio in the mix, it makes sense to hide these
Here is a typical structure of an audio channel. The Tape Machine channels.
Mode automatically switches back and forth between the chan-
nel input and HD drive during a recording.
VST Instrument Channels also behave like full-featured audio channels. They
receive signals from VST instruments. The VST channels do not need a monitor
Description of the Audio Channel Diagram function.
The level display on the VU meter can be taken from three different points in the
Effect channels are full-featured audio channels that serve as effect returns for
signal path. At the input, before the channel trim pot (Pre), before the pan (Post
effect signals. Up to eight effects can be cascaded together. The send path of the
1), and after the pan (Post 2). I recommend Post 2 as the basic setting because you
effect channels are only available to be sent to output busses for adding effects
see what is actually in the mix. The Pre position makes sense when you want to
90 CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX CHAPTER 2: THE TECHNICAL REQUIREMENTS FOR A GOOD SOUNDING COMPUTER-BASED MIX 91
into headphone mixes.
Group channels are like funnels in which a number of audio, ReWire, and VST
Instrument channels can be brought together. Effect channels cannot be routed
into group channels because of latency issues, but group channels can be routed
into other group channels, if necessary. To set up routing, use the routing win-
dow, which can be found in the second pull-down menu on the upper edge of
the mixer window.
The Output or Master bus is the final “funnel” in which all signals come together
to form the end result; the stereo mixdown.
The Audition Bus provides the monitoring possibility of solo-listening to a track
either PFL or AFL, without interrupting the signal path to the output bus. This
function is only important for live recording. Pan position is taken into account
with AFL.
In Nuendo 4, there are more routing possibilities so that group and effect return
tracks can be routed – for example – back to audio channels for re-recording.
This makes sense in film postproduction, where so-called stems (group chan-
nels) are brought together in the final mix to keep up to 500 different channels
organized.
I
NDEPENDENT OF GENRE, sound aesthetics and design structure, each
mixing process follows the same set rules. With this toolbox in your lug-
gage, you will reach your target safely.
After you have converted all audio files into 32-bit floating point format (or into As soon as the actual mixing process starts, I place the main harmony instru-
24-bit integer files if this is not possible), you can then begin work. ments after the drums and bass, and finally give the lead vocals their spot, so that
I can put everything else (backing vocals, percussion, strings, and whatever else)
in the gaps.
LISTEN TO AND CLEAN UP TRACKS
One thing has not changed. If you received files for a mixing job in the form of Basically, I create a bed in which to place the vocals. After successfully putting the
a collection of continuous audio files, the mixing process begins – as it always vocals in, I deal with the “decoration.”
has – with systematic listening to each track in order to make sure there are no
editing mistakes. Also, this is when sound quality and loudness consistency is
checked. In the past, the process of erasing mistakes and unwanted passages in
each track was a source of adrenaline, since the luxury of automation muting did
not exist. Each track would be cleaned up manually with the record button. Even
96 CHAPTER 3: THE SYSTEMATIC APPROACH — CLARITY AND WORKFLOW CHAPTER 3: THE SYSTEMATIC APPROACH — CLARITY AND WORKFLOW 97
WORKING WITH GROUPS
A very important step is the creation of group tracks (traditionally called “bus”).
Group tracks are a kind of audio funnel, in which individual audio tracks – or
even other groups, if necessary – are combined for being able to control them
together.
To do this, you have to assign a group to each folder (see screenshot) and then
route the audio tracks to the appropriate group. Naturally this only makes sense
if one or more tracks are assigned to each folder. For the sake of clarity, place the
group track underneath its associated audio tracks within the folder track.
Here is an example of track/group organization in Nuendo/Cubase. The number of groups was pre-
viously determined. On the upper edge of the mixer window, you can see the pop-up window that
is used to choose the input and output routing. In the illustration, output BUS 1 plus 6 different
groups can be selected.
The track structure is clear in the Nuendo arrange window: In the folder track “Per- Using this group constellation within the folder track has a number of advantages
cussion,” the audio tracks are routed in a group track (14). Folders that are not being
worked on are closed. In the percussion group track (14), two marker tracks are visible. during the practical mixing process:
The upper track is typical in Nuendo/Cubase, the lower track consists of empty MIDI
regions. The regions are useful for quickly selecting parts of the song for looping, while u You can just open the folder track or tracks that you are currently working on
the markers have count-in bar and can be controlled directly through the computer
keyboard or remote. You can select the MIDI regions and loop them with the keyboard in order to keep the screen layout as clear as possible, particularly in larger
command “P”. Shift-G activates the commands “loop, cycle, play.” projects.
u When dealing with mute and solo switches, you need to be efficient. Does this
sound familiar: you start off with one solo track, you work on and soon have
10 tracks in solo mode, and then you want to listen to two of these 10 tracks in
98 CHAPTER 3: THE SYSTEMATIC APPROACH — CLARITY AND WORKFLOW CHAPTER 3: THE SYSTEMATIC APPROACH — CLARITY AND WORKFLOW 99
solo mode? You can make life a lot simpler for yourself if you consistently use 20 individual string tracks – would be quite impossible without using sub-mixes
the solo switch only within a particular group and make sure that you’ve previ- in groups.
ously muted those folders and channels that you do not want to listen to. For
The loudness control of the group tracks is also an advantage in controlling of
example, if the drum group is the first one I work on, all other folders and all
the master bus level: a simple combination of the group fader allows you a quick
the tracks within the drum group apart from the first track that I am working
correction of the master bus level in case of overdrive or underdrive. The level of
on are switched to mute. Then I work my way through track by track, by de-
the master bus fader should remain at 0 dB; the level is controlled with the limiter
muting the individual channels; with this process I can use the solo function
only. To do this, use the level and gain reduction display of a good limiter that
for what it is designed for; a solo function. Using the solo switch in this way al-
only displays a slight activity of the limiter at significant peaks with a working
lows you to easily do A/B comparisons between different audio tracks: Track A
range of 1 to 2 dB, counting down from 0 dB full scale. This ensures that only
is un-muted and track B is selected, but muted. Now you can use the keyboard
the peaks of the loudest passages are slightly limited, thus protecting the output
shortcut “S” to switch to track B, because selected track B will be switched to
from overdrive, while also making sure at the same time that the gentle use of the
solo, while at the same time track A is muted.
limiter does not cause any damage to the transients.
u If you consistently use the audio channel send paths rather than the group
It also allows you to reduce all group levels evenly in order to create headroom
send paths to control send effects such as reverb and delay, you can use the solo
for a group requiring an even greater level within the overall mix despite having
function of the folder track to listen to “wet” solos and the solo function of the
reached the upper stop of its fader setting.
group track to listen to “dry” solos. This makes it very quick and easy to com-
pare the wet and the dry sound of a group. The logic behind this is as follows;
when you switch to the group solo function, all associated audio channels are USING COMPRESSION IN GROUPS:
also switched to solo mode, but only the send-return paths of the send paths SAVE CPU AND INCREASE PUNCH
actually used within the group. Because the send paths of the group track are In order to save valuable CPU or DSP resources in view of the limited availability
not in use, you can now listen to the dry sound. However, if you switch the of good outboard equipment, group tracks are popularly used to insert EQs and
folder track to solo, all tracks within that folder (audio tracks and their associ- compressors. If 40 chorus tracks with a good EQ or compressor are inserted into
ated groups) together with all the used send-effects are switched to solo – as each, they are likely to be too much for some computers. As a result, some of
a result, you hear the group with added effects-returns. This function is very the best little treasures are found in the inserts to important tracks such as lead
helpful in assessing the use of reverb creating stage depth. By contrast, adding vocals, bass drums, snares, and to the group channels. The use of latent dynamic
the effect amount by switching the FX return channels to solo only works sat-
isfactorily as long as there are no other audio tracks to control that particular
effect. Otherwise, you may well hear the drums with reverb, but also the reverb
part of the chorus and of other tracks using this send effect.
Working with groups has many other advantages. As in a jigsaw puzzle, the in-
dividual pieces fit together. First, the drums have a nice fat and spatial sound, The UAD-1 Compressors 1176 LN
then the puzzle piece with the harmonic instruments is nicely “rounded” off by and LA-2A are particularly appreci-
ated for group tracks and important
distributing the instruments appropriately within the panorama and frequency individual tracks.
bands. Then you use the group fader to rejoin the pieces. The subsequent level
control is much easier because of the coherence in the overall setting of the group
blocks. Defining a good balance in loudness, panorama, and sound – for example
100 CHAPTER 3: THE SYSTEMATIC APPROACH — CLARITY AND WORKFLOW CHAPTER 3: THE SYSTEMATIC APPROACH — CLARITY AND WORKFLOW 101
processing welds the individual events into a unified sound. Particularly popular 350 Hz. This range is predestined for overlaps and a fuzzy sound, as almost all
are plug-ins that are full of character such the UAD-1176 compressor or the LA- instruments are represented in this area. It is also possible to use group EQs to
2A. increase the trebles of a vocal group in order to increase speech recognition and
to convey the impression that the group in question occupies more of a front
Specific settings for dynamic compressors will be discussed in the chapter about
position within the mix.
“dimension 2.” (See page 130.)
It is better to compress frequently and gently rather than rarely and hard. Working HANDLING GROUPS:
with compressors in groups has a lot of similarities to working with large analog WHAT TO CONTROL VIA GROUPS,
consoles. Even in the best studios, the number of good compressors is limited. AND WHAT VIA THE TRACKS?
If there are a lot of individual drum tracks, a good compressor is applied to the
most important tracks - snare and bass drum - and a group compression is then To avoid losing track of things with panning, you need to make clear strategic
used to weld the other individual tracks together with the snare and bass drum to decisions. My decision is; I never control panning via groups, but only through
a single entity. The same applies to lead vocals, doubled tracks and adlibs. individual channels. Otherwise you run the risk of getting “into a tangle.” The
same applies to automation. Just imagine the following: the individual channel
and panorama settings are all right, and you decide in the final stage of the mix to
EQING GROUPS TO SAVE RESOURCES automate the group panning for 4 beats.
AND CREATE SPACE
It is much easier to create space by using EQs on the spectral level – i.e. the verti- The next day you notice that a particular event in this range is placed too far to
cal dimension or Dimension 2 – rather than EQs on lots of individual channels. the left within the panorama. You counteract this in the audio channel and in that
For example, if the lead vocals require more space in the frequency range for instant the group automation sets it back to the opposite direction. My recom-
warmth – i.e. the lower mid-range between 350 and 400 Hz, this can be easily and mendation is: “Hands off group panorama”!
effectively accomplished by using group EQs.
WIDENING STEREO BASIS
A good graphic analyzer in group-solo mode allows you to quickly unmask the WITHIN GROUPS
keyboard pads, which create a fuzzy sound in the vocal range through overlaps,
and counteract this with EQ. A graphic analyzer is a valuable tool to supplement It is much better to apply basis widening effects to groups than to the whole mix.
your ears and to “educate” our most precious sensory organ. Correction occurs If the number of instruments in a particular frequency range is too great and if it
particularly frequently in the lower “mid-range misery” range between 120 and is difficult to differentiate them clearly, stereo expan-
sion offers an opportunity to create a new dimension What is Stereo Basis
or at the very least widen the horizontal dimension Widening?
(Careful, check for mono compatibility!).
The StereoExpander allows you, for example, to swap S tereo basis widening uses frequency-
dependent phase-shifting to create more
wideness on the panorama level than would be
high chorus sections and move them to the extreme
left or right. For guitar walls-of-sound like Rammstein normally possible with speakers placed 60° apart.
you can widen the wall by transferring some of the Every DAW has plug-ins that does this, usually
layered guitars to a separate group in order to insert a called something like “stereo spread” or “stereo
stereo spread. expander.” Overuse of this effect results in phase
cancellation when the mix is switched to mono.
102 CHAPTER 3: THE SYSTEMATIC APPROACH — CLARITY AND WORKFLOW CHAPTER 3: THE SYSTEMATIC APPROACH — CLARITY AND WORKFLOW 103
u Balance the group levels of all groups edited so far.
WORKFLOW OVERVIEW
Within the framework of a creative and spontaneous mixing process, the fol- u Distribute decorations and additions in a spectrally sensible manner around
lowing overview acts as a guideline and reminder, which also takes into account the existing basis.
suggestions made in the later chapters: u If a particular event sounds fuzzy, look for a spot within the three dimensions
u Convert files to 32-bit floating point format whenever necessary and pos- where it can be clearly heard. If you cannot find the spot either with a good
sible. panning strategy or with EQing or layering, reconsider the reason for having
this particular event where it is, and possibly mute it or look for a better spot
u Convert to audio the MIDI tracks of external expanders and VST instruments. elsewhere in the song.
Use 32-bit floating point format.
u Fine-tuning of volumes at extremely loud or quiet listening levels.
u Check each track individually for breathers, editing mistakes, and clicks; clean
them. u With a little bit of routine and experience, after 3 to 4 hours the mix is 90 per-
cent ready. The static mix provides the starting point for the hard part of the
u Label, sort and color-code tracks, assign them to group tracks, folders, and work that follows.
route them.
u In order to shape the final 10 percent, you need a healthy level of perfection-
u Use the group-solo function to control the routing. ism. But don’t do this by yourself, be honest with yourself and team up with a
colleague in order to generate the dynamics required to reach higher targets.
u Define your mixing strategy with a panorama sketch (see chapter on dimen-
sion 1 – Panorama). u Volume Automation for introducing new events.
u Mute all folders and tracks, with the exception of the drum folder and the u Volume Automation for song structure dynamics.
individual track you are starting with.
u Panorama and stereo expander automation for clearing up the last remaining
u Start building up the rhythmic backbone, starting with the bass drum, fol- fuzzy spots.
lowed by the snare drum, making use of panning, EQing, compression, gates
and reverb, until the folder track “Drums” represents a powerful and rounded u Carry out further automation.
sound. u Creative fine-tuning to refine details.
u If the bass is part of the drum group, add it to the mix after editing as re- u Constantly experiment in order to improve individual events that do not yet
quired. sound right.
u Your next step is to build up the instruments that provide harmony and u Set the brickwall limiter in the master section to - 0.3 dB in order to ensure
warmth. Distribute the elements according to their complimentary spectral slight and consistent peak limiting.
properties to the left and to the right in the panorama.
u Export the master with a bit of lead-in and lead-out and without fade-out in
u You can perform the final step either in group solo mode or in conjunction 32-bit floating point resolution.
with the “open” drum group.
u Create a good lead vocal sound and add it to the center.
104 CHAPTER 3: THE SYSTEMATIC APPROACH — CLARITY AND WORKFLOW CHAPTER 3: THE SYSTEMATIC APPROACH — CLARITY AND WORKFLOW 105
CHAPTER 4:
DIMENSION 1:
HORIZONTAL = PANORAMA
T
HE MASTERY OF THE FIRST DIMENSION is much underrated.
“That’s easy, it’s just a matter of left-right distribution.” How wrong can you
be! If you do not do your homework in this area, you will soon encounter
problems on the way to a tidy mix, because a good plan for the panorama is the
basis of a good mix.
Start the mix by drafting a mixing strategy in which you set out the placement of
108 CHAPTER 4: DIMENSION 1: HORIZONTAL = PANORAMA CHAPTER 4: DIMENSION 1: HORIZONTAL = PANORAMA 109
that is more suitable for the overall picture. This way, there is sufficient space on The art of a skillful panorama distribution is to pay due attention to the second
the right for the guitar and it now no longer overlaps with the frequency domain dimension (the spectrum), which is not part of the outline - and that at least
of the Rhodes. three times as much! The spectral distribution across the center is a slightly vari-
able standard, and, as far as left and right are concerned, it makes sense also to
Take the same approach to all other similar matters. If the hi-hats are slightly to
be aware of the spectral distribution. Distribute the events in such a way that
the right in the panorama, place the shaker as a counterweight slightly to the left.
the frequency distribution is also balanced in the outer sectors. If a mix is very
Mixes without an overall panorama strategy frequently suffer from an imbal-
complex, you can approach this spectral distribution in a series of “slices” such as
ance in the energy between one channel and the other. The distribution strategy
“outer left - 9:00 to 10:30 – center – 1:30 to 3:00 – outer right” in order to find a
becomes more complicated, the more instruments and events there are to be dis-
place for each event. In this instance, using a stereo expander can create an extra
tributed and the more inconsistent their appearance. In this situation, I recom-
dimension.
mend separate panorama outlines for the various parts; if necessary, the guitar
switches sides in the subsequent part, if this is what is required by the instru-
mentation. In complex pop music, there is no detrimental effect associated with
instruments switching sides within the panorama. However, any unnecessary EXERCISE: WHAT IS PHASE
panorama switches should be avoided in more intimate music with a consistent CANCELLATION?
instrumentation.
To underpin your panorama strategy, follow these rules:
P lease take a mono bass track and duplicate this at a 1:1 ratio on a different
track without moving in the timeline. The easiest way to do this is by using
the “Duplicate track” function. Now rotate the phase of the copied track by 180
The acoustic rule that deep sounds spread in circular form and can hardly be de- degrees. Now pan one track on the far left and the other on the far right. Listen
tected below 100 Hz, whereas high frequencies spread directionally and are easy to the result while switching the mono switch of your control room matrix or the
to detect gives rise to the following panning rule: control room mixer on and off. Bingo! If your DAW does not have a mono switch in
the master section, you can use a plug-in that has one or you can put both channel
Choruses, orchestrations and general instruments in pop productions are gener- panning knobs in the center position. When the mono switch is not pressed, you
ally placed in such a way that deeper sounding instruments are always placed will hear the acoustic phase interference in the room, which is not as strong due
more towards the center; the higher the sound event, the further out they are to natural spatial de-correlation. When the mono switch is pressed, you’ll hear the
likely to be placed. Within a chorus group, you would thus place the high voices physical phase interference, when both channels are mixed together.
further to the sides in the panorama, and deep voices more towards the center. Repeat this exercise in WaveLab with stereo files and finished mixes by selecting
The same applies to string ensembles, orchestrations and almost all many-voiced one channel and reversing the phase with the command Invert phase in the
(thick chorus) sound events. Process menu. Here, too, the difference between acoustic and physical phase
interference is very obvious, with the latter occurring whenever your stereo mix
Of course there are exceptions to this rule, depending on your own preferences is played on a mono device. Once you have become familiar with what phase
and artistic freedom. displacements sound like, the regular use of the mono switch is a simple, but
effective device to monitor for unintentional phase interferences.
You can visually check the phase with the correlation meter, which very clearly
shows that the signals with the same frequency on the right and left stereo
channels add up in their amplitudes as long as the phase of both signals is within
the 0-to-90 degrees sector – generally shown as green.
Please refer to the section “Correlation Meter.”
110 CHAPTER 4: DIMENSION 1: HORIZONTAL = PANORAMA CHAPTER 4: DIMENSION 1: HORIZONTAL = PANORAMA 111
SUMMARY OF PANNING RULES Phase Interference Mostly Occurs in the
u Before mixing, make a sketch of your panning strategy. Following Cases:
u Anything that is not bass, bass drum, snare or lead vocals should not be in the u Bad stereophonic recordings: when you record a signal using two micro-
center. phones, you have to position them in such a way that no phase interference
occurs. For that reason, they are positioned either with the capsules at equal
u Instruments present in the same or overlaying frequency sectors should be
distance and on the same axis to the source of the sound, or at such a distance
placed at opposite ends - complimenting each other - within the panorama.
that the distance of the second microphone to the source is at least three times
u Once you have established the basic static panning, but still find parts within that of the first. Here, too, during the recording you should monitor the phas-
the song that are unsatisfactory in their transparency, it is always worth an at- ing with a correlation meter in combination with a mono switch.
tempt to remedy this by using panning automation.
u Simultaneous multi-channel recordings with correlation errors (mostly in the
u Well planned and carefully automated panning often creates greater clarity case of drum or live recordings). For example: a snare recorded with a top mi-
in the mix than the use of EQ and is much better than unnecessary EQing. crophone and a snare strainer microphone generally has phase interference,
If an event is drowned in a sound mush, your first step to solve the problem if the phase reversal switch on the snare strainer microphone channel is not
should involve the panorama knob before resorting to EQ. activated. Phase problems between the snare top microphone and overhead
u Used in a controlled fashion, widening the sound basis can create extra space microphones can also lead to interferences. Controlled repositioning of the
in the horizontal dimension and thus ensure a clearer sound. Check by using microphones and/or the use of the phase reversal switch on the console or
the correlation meter and the mono switch on your monitoring board. Please the microphone preamplifier can remedy this. If the recording along with its
refer to the Sound Examples #06 and #07 to illustrate this. errors is already “on tape,” using the phase-shift switch in the individual chan-
nels can remedy the situation at the mixing stage. Thanks to HD systems with
u Be courageous! Try extreme panorama settings, making the center free for
Sound Example sample-exact zoom functions you can also use extreme enlargement of an au-
lead vocals.
#06 & #07 dio file and move it by just a few samples – and maybe solve the problem that
way. This simulates the change in the distance between microphone and sound
PREVENTING PHASE CANCELLATION source.
The mono-switch in the master section is a way to control mono-compatibil-
u Keyboard pads and layered sounds often contain phase effects that are not
ity that is much underused in the native world. Even today, there are still many
mono-compatible. Thanks to the three different panning functions available
mono radios and TVs. If your goal is to get airplay for your production or to
in Nuendo & Cubase (Stereo Dual Panner, Stereo Combined Panner, Stereo
publish the mix on vinyl, please activate the mono switch regularly for checking
Balance Panner), you can replace the old panning slider in a stereo track with
for compatibility. If the mix collapses as soon as the mono switch is pressed, then
separate panners and, use them to limit the stereo width from 9:00 to 3:00 - in
there are probably too many deep sounds distributed across the panorama or
the same way as with classic knobs. The resulting signal is generally still broad
there are simply too many phase interferences.
enough, but mono-compatible. The same trick can be used for problematic
overhead tracks recorded with microphones not positioned on an axis. If your
DAW only has a panning slider in the stereo channels, I suggest using small
plug-ins. For example, the PSP Stereo Controller from the PSP Stereo Pack
lets you easily adjust stereo width. Logic has separate pan controls on each
individual L-R channel.
112 CHAPTER 4: DIMENSION 1: HORIZONTAL = PANORAMA CHAPTER 4: DIMENSION 1: HORIZONTAL = PANORAMA 113
u Reverb returns; the same applies. The 9:00/3:00 setting - or the 80 left/80 right to create more space at certain points, insert a stereo basis widening effect into
setting, to talk in Nuendo readings - generally remedies the situation. But be the group in question, which remains in a neutral position throughout the song
careful: limiting the width is not always necessary and can create undesirable and, using automation, can be targeted at these specific fuzzy locations to create
masking. (The masking effect will be explained later.) more space.
114 CHAPTER 4: DIMENSION 1: HORIZONTAL = PANORAMA CHAPTER 4: DIMENSION 1: HORIZONTAL = PANORAMA 115
in order to achieve a happy medium between transparency and mono-compat-
ibility.
STEREO PAN MODE
As far as I know, Nuendo, Cubase, and Cakewalk SONAR 6.0 are the only se-
Signals that are placed on the left hand side within the static panorama can be
quencers that have a stereo pan mode with an adjustable panning behavior.
used with reverb that is weighted to the right, and vice versa. In that way, the lis-
tener is provided with the spatial information without watering down the mix. In Sonar, there are six different panorama characteristics available, whereas with
the Steinberg products, there are five modes available in the Project Setup dialog
In surround sound mixes, you pan the stereo reverb return for signals that are
“Stereo Pan Law” that provide automatic level adjustment for mono tracks. This
oriented towards the front to surround left and surround right, and the reverb
is based on the observation that a mono signal which is reproduced simultane-
return for signals placed at SL and SR to left and right.
ously and at equal volume on the left and on the right, i.e. at the center, sounds
Don’t worry – decoupling direct signals from reverb won’t irritate the listener. louder than it would if was only reproduced at the far right or at the far left. The
Imagine the following real-life situation; you stand in the front third of a large default setting for the automatic level adjuster is minus 3 dB and results in a level
church, looking towards the altar, and you are addressed loudly from the altar reduction of about 3 dB in the center position. At this setting, a mono signal
from a few meters away. In addition to the direct signal, the full reverb tail spreads always retains the same physical sound level regardless of its position within the
forward from the back of the church to the front. Would that make you think that panorama. The settings minus 6 dB and minus 4.5 dB reduce the mono signal at
you were being addressed from the back? the center correspondingly.
Let’s summarize the salient points with respect to masking: At the 0 dB setting, there is no level compensation, and right-left channels brought
together in the center are twice as loud as they would be if they were positioned
u Because of directional masking, mixes that are supposed to be mono-compat- fully left and right. These different so-called “panning laws” correspond to the
ible – for example for TV – can handle less reverb than mixes that are pre- behavior of analog mixing desks.
dominantly intended for stereo or even surround sound productions. Mono
mixes thus tend to become “mushy” or over saturated much earlier, because This function is very useful for eliminating the need for level adjustments after
the reverb increasingly dilutes the perception of the direct sound. panning changes. Experiment to find the right settings for your needs. In my
experience, the settings between 3 and 6 dB are particularly useful when you mix
u The greater the number of available loudspeakers for the reproduction of a orchestral music for mono-compatible TV productions.
recording, the greater the amount of reverb that can be used. In order to put
into place a clear spatial structure without over saturating the mix with spatial
information, you should keep the reverb away from the direction that the dry
sound originates from. This avoids masking.
u Masking also applies to different sounds that occupy similar frequency ranges.
That is why a transparent mix requires that instruments and events with over-
lapping frequencies be placed in complimentary positions across the panora-
ma, according to the panorama strategy made at the beginning of your mixing
project.
116 CHAPTER 4: DIMENSION 1: HORIZONTAL = PANORAMA CHAPTER 4: DIMENSION 1: HORIZONTAL = PANORAMA 117
CHAPTER 5
DIMENSION 2:
THE VERTICAL DIMENSION =
FREQUENCY DISTRIBUTION
T
HE SECOND DIMENSION concerns skillfully distributing events in the
frequency spectrum. We have already accomplished the very important
step of distributing events in the panorama (as described in the previous
chapter) by taking the frequency characteristics of individual instruments within
the left-center-right range into account. This now raises the question of purism.
Purism is only appropriate when no corners have been cut at the front-end and
the recording engineer or producer knew exactly which sound he or she wanted
to hear. If the recorded audio files are delivered in perfect shape, it suffices to use
EQs sparingly, especially in the lowcut range, in order to remove DC offsets and
low-frequency artifacts such as breathers, pops, and so on. For our purposes, we
will assume a “worst case scenario” to cover all situations.
C#/Db 34.6 69 139 277 554 1109 2217 4435 8870 17740 u The bass sector proper comprises the range from 25 Hz to 120 Hz: the
bass sector is first and foremost the sector for the bass. That might sound
A good mix
D 36.7 73 147 294 587 1175 2349 4699 9397 18795 trite, but it is important! The frequency list shows you that the deepest requires at
note B of a 5-string bass has a frequency of 30.94 Hz. If your bass drum
D#/Eb 38.9 78 156 311 622 1245 2489 5978 9956 19912
has a characteristic core frequency of 90 Hz, this means that there are least the same
E 41.2 82 165 330 659 1319 2637 5274 10548 21096 almost 1.5 octaves available for the bass alone. If the bass drum is not number of
F 43.7 87 175 349 698 1397 2794 5588 11175 22351 too low and if a lowcut filter is used only sparingly, there is hardly any
interference, and the range provides enough scope to manipulate the lowcut filters as
F#/Gb 46.2 92 185 370 740 1480 2960 5920 11840 23680 bass independently – even in the finished mix. Sometimes you can hear
bass dead spots. These are individual notes played on imperfectly made
there are tracks.
G 49 98 196 392 784 1568 3136 6272 12544 25088
instruments, which sound quieter than other notes because of instrument con-
G#/Ab 51.9 104 208 415 831 1661 3322 6645 13290 26580 struction errors. With the aid of the frequency list, you can equalize these dead
A 55 110 220 440 880 1760 3520 7040 14080 28160 spots accurately by accessing them precisely using a parametric EQ with steep
correction in order to make sure that surrounding notes are unaffected.
A#/Bb 58 117 233 466 932 1865 3729 7459 14917 29834
The bass sector includes the frequency range of the bass drum and ends at
B 61.7 123 247 494 988 1976 3951 7902 15804 31608 about 120 Hz. Together with the next higher frequency sector, the bass sector
is crucial for the perception of “warmth” in a particular recording.
The bass drum should be somewhere between 75 and 100 Hz. For club music,
The low B at 30.94 Hz equates to the open B string of a 5-string bass.
you should consider that depth is not the same as pressure. In clubs, pres-
In order to organize the spectral distribution of our sound events, we first need an sure develops at around 90 Hz, because the system cannot powerfully transmit
overview of the relevant frequency range from 1 Hz up to about 22 kHz. bass drums that are much lower. The lower a bass drum, the more difficult
it is to edit. Bass drums that extend to 60 Hz or deeper have to be carefully
120 CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION 121
adjusted to the key of the song. Deep bass drums tend to have their own tonal-
ity, which runs the risk of interfering with the bass, and causing the sound to
appear washed out. Another rule for dealing with very deep sound events is:
“the deeper the shorter!” Cut deep events short so that they do not take up too
much space within the mix.
u The lower mid-range “misery” or low mid range sector between 120 and about
350 Hz is the second pillar for the “warmth” in a song, but also a potential
source of unpleasant misery. You should pay particular attention to this fre-
quency band during mixing, because almost all instruments are present here.
Stick consistently to the mixing strategy that you have outlined at the start. The Artifacts & Bass & Bass Drum Mid-Range “Nasal” Speech Comprehensibility Air Band
(Particularly between 2,5 and 4kHz)
DC Offset Fundamental Thumpiness Region
central part of this section belongs to the lead vocals, and the left and the right (Particularly Sibilance (8kHz and higher)
Bottom Warmth
to the harmonic instruments providing the warmth. It is better to thin out all around 1 kHz)
other events in this area rather than risk overlaps. This is the section where one Upper Trebles
finds most of the acoustic recording errors, for example when recording lead 12kHz–22kHz
vocals, too little attention is given to the comb filter effects of room acoustics. Sub Bass Bass Lower Mid-Range Mid-Range Upper Mid-Range Highs/Treble
Room modes in particular affect this recording range and add up to produce > 25 Hz 25 Hz – 120 Hz 120 Hz – 350 Hz 350 Hz – 2kHz 2 kHz – 8kHz 8kHz–12kHz
tedious misery. Should you find this to be the case, use notch or parametric
filters to look for the problem frequencies in the affected tracks, and lower
The various bands of the frequency spectrum
them steeply. However, you need to ensure a good balance between warmth
and muddiness. If you deliver a mix where these frequencies were imprecisely u The upper trebles between 12 kHz and 22 kHz are often referred to as
and therefore unsuccessfully eliminated – taking away much warmth while air band. A broadband lift in this range can aid the airiness of a record-
still being muddy – the mastering studio can do little to remedy the situation, ing. However, an overemphasis can result in a digital or harsh perception.
because the mastering engineer cannot improve one feature without worsen- For a natural sound in the upper spectrum, the trebles from about 12 kHz on-
ing another in this case. wards gradually decrease in loudness, called roll-off. The following illustration
u It is very difficult to draw up a set of rules for the nasal mid range of between shows the roll-off in the high end of the frequency spectrum.
350 and 2000 Hz. Any rectifications in this area have to be done very much on Before we concentrate on the actual use of EQs, we will look more closely at the
a case-by-case basis. If too much emphasis is placed on this range, the resulting various filter types.
sound is nasal, woody, and piercing.
u The upper mid range between 2 kHz and 8 kHz is responsible for speech com- The Most Important Filter Types:
prehensibility! As a basic principle, please check whether a broadband lift on u The parametric filter is generally a bell filter with three adjustable parameters;
the lead vocal track between 2.5 and 4 kHz would make sense in order to aid Gain increase or reduction, characteristic core frequency (F) and quality (Q).
comprehensibility. The drum skins, which are important for the rhythmic lo- Quality – also referred to as Q-factor- is given as a mathematically calculated
calization of bass drums, are also present in this range. factor; higher values up to 20 denote greater edge steepness, whereas lower
values up to 0.1 a greater filter width. In traditional parametric filters in the an-
u The treble range covers the frequencies from 8 kHz to 12 kHz. This is home for
alog world, for example, the ones built into Neve consoles, there is interdepen-
cymbals, high percussion instruments, S-sounds, chimes, and the high range
dence between gain and Q due to the circuitry, with Q decreasing – in other
of many instruments.
122 CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION 123
words getting broader – when gain increases. This feature is often referred to
Working with EQs
as “musical,” whereas the decoupling of these parameters is perceived as cool
and clinical. SSL consoles of the 4000 series are typical representatives of this Parametric EQs are the type most commonly used in mixing. My favorites are the
EQ characteristic. The Sonnox Oxford EQ and the SSL Duende Channel Strip Cambridge and Precision EQ for the UAD-1, the SSL Duende Channel Strip EQ,
EQ offer a choice of both filter characteristics. SPL’s PQ-Mastering is the first and the Sonnox Oxford-EQ. Unfortunately, the Precision EQ only has grid posi-
and only hardware EQ that can be switched between these two filter types. tions for the frequency selection, which can be somewhat imprecise for certain
cases, but it excels in particularly soft high frequencies thanks to fourfold overs-
u The shelving filter lowers or respectively amplifies the entire frequency band ampling. The EQ shows how the plug-in’s discrete oversampling – in other words,
above or below frequency F. Depending on the type of shelving, deflections multiplication of the sample rate – can produce an outstandingly soft sound in
occur in the lower or upper transitional area - also referred to as overshoots, the upper frequency bands, without the hassle of dealing with high sample rates
the characteristic feature of this kind of filter. The UAD-1 Cambridge EQ for throughout your project.
example reproduces three standard shelving filter versions.
I have often observed that a suitable EQ is picked out and inserted while a loop is
u The notch filter is very steep and is particularly suitable for the radical removal playing non-stop. This numbs our hearing, because our ears quickly become ac-
of interfering frequencies. The notch filter has to be applied with great preci- customed to acoustic defects. This can result in the engineer to becoming accus-
sion, because it is not adjustable as to the level of the reduction; inaccurate set- tomed to the imperfections before the right plug-in is found and adjusted proper-
tings can damage the sound due to the high level of reduction. The adjustable ly. My tip for the use of EQs is to keep a mental image of your spontaneous sound
version is a parametric filter with a high Q-factor. impression: stop the playback, select the plug-in, and make the settings quasi
u The highpass filter allows frequencies above the cut-off frequency to pass and blindly according to your ideas. Then you can check on the results with “fresh
thus equates to a lowcut filter blocking frequencies below F depending on its ears” and make any finer adjustments that may be necessary. For the targeted
steepness. removal of annoying frequency ranges, it is quite legitimate to use a parametric
filter with a high Q-factor and strong amplification, and to sweep forwards to-
u Conversely, a lowpass filter, which lets frequencies below F pass, equates to a wards the problem area. That way, you can target problem frequencies exactly
highcut filter. and dampen them accurately.
u The bandpass filter only lets a certain bandwidth of frequencies pass and is To achieve the ultimate target of a balanced frequency distribution, equalizers are
used in graphic EQs, de-noisers, and crossovers. It consists of a series of high- used in mixing to equalize and distort individual audio, group, and master bus
pass and lowpass filters. tracks. The basic prerequisites for appropriately using EQs in mixing are:
u The Bessel filter is used as a lowcut or a highcut filter; its effect extends far into u A clear and logical panorama mixing concept, taking into account a balanced
the passband. For that reason, it is unsuitable for hard cuts; one example is the frequency distribution across the Left-Center-Right range.
effective removal of sub-bass artifacts.
u A good understanding of the frequency ranges, where each instrument can
u The Butterworth filter is better suited as a lowcut filter because of its charac- fulfill its role. Many instruments are effective in two crucial frequency areas
teristic feature of a steep slope at the cutoff frequency. while other instruments only operate within a single frequency band.
I will go more deeply into this in Chapter 9, “Working with Individual
Instruments.”
Keeping these two conditions in mind, use EQs above all to thin out and remove
interferences and artifacts – lowcut for the sub-bass sector, for example – or to
124 CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION 125
thin out and create space in frequency areas that are of little importance for the
A Good Mix Starts with the Arrangement!
instrument in question and, only as a second step, use them to emphasize fre-
quency areas that have been underemphasized at the recording stage. Depending on the complexity of the mix, it is important to split the frequency
range three-ways into left - center - right to achieve transparency.
First Remove and Then Add!
Of course, you should only lower those frequencies of an instrument that are of
lesser significance for its sound character and conversely emphasize only those
that are essential.
There are some basic practices in the use of Q-factors; decreases can be done
steeply with a high Q-factor in order to lower problem frequency areas without
affecting neighboring areas more than absolutely necessary, whereas increases
are done rather more broadband, i.e. with a low Q-factor. This is because high
Q-factors have a tendency to produce filter sounds when raised. Such manipu-
lations are easily detected and perceived as artificial. Exceptions to this include
weak-sounding toms, which can be brought to life with a good filter.
It is impossible for each soloed instrument or group to always have an outstand-
ing sound on its own. However, with a skillful mixing concept combined with
tight instrumentation and a good recording, it is definitely possible for each solo-
switched instrument and the total mix to sound very good.
This does not necessarily apply to pop productions. An acoustic guitar that beau-
tifully blends into a pop mix often sounds pitifully thin when switched to solo. Within the panorama dimension, the vertical frequency dimension should be
The same can be true with chorus groups. In those cases, it is a good idea to split into at least three different ranges (L-C-R).
start off by mixing a harmonious overall sound for the group in ques-
A good tion and then apply EQing to the group within the framework of your I will illustrate this approach using a typical and quite common example. As al-
ready explained in the earlier chapter on panorama, the center section is the home
arranger works overall mixing strategy. If there are too many harmonic instruments ground of the bass, the bass drum, the snare, and the lead vocals. Any overlaps
competing for space in the lower midrange on the left and the right
with frequency of the panorama, less important instruments can survive significant in the lower mid-range in which all events are present could already at this stage
lead to overlapping and fighting for space. With the knowledge that the lower
distribution reductions in sound and still find an adequate place in the overall mix mid-range is of no importance for the bass or the bass drums, it is very simple
without resulting in an overlapping or fuzzy sound. This could apply,
in mind. for example, to an organ, which might lose its lower mid-range, but to create space in this area for the lead vocals. The bass gains its warmth from
places additional emphasis on the upper mid range with its tinny distortions. the lower one and a half to two octaves, and its tonal comprehensibility between
Remember, during the recording and production stage, each additional instru- 400 and 800 Hz. The range between 800 and 1200 Hz makes the bass sound
ment raises the demands made on the mix. A good arranger works with fre- more woody and “planky.” A Musicman-Stingray only works to about 800 Hz,
quency distribution in mind. whereas a jazz bass can go as far as 1500 Hz. The clicking sounds of the frets as
well as slapping are of course higher. The bottom of the bass drum is at about 90
Hz and its rhythmic localization in the upper mid-range. Correspondingly, the
126 CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION 127
lower mid-range is rather unimportant for the sound and can be reduced. This of your song. If you really want to use deep and long bass drums, pitch the bass
results in a balanced sound in the distribution of the instruments’ characteristic drum to the respective chord roots of the song. Listen for example to the CD
core frequencies. Apply the same principle to the frequency distribution in the Futuresex / Lovesounds by Justin Timberlake.
left and in the right section of the panorama. For very complex mixes, imagine
On the DVD there is an exercise called “Editing Loops,” a practical example of deal-
the frequency distribution as split into five sections in order to find space for each
ing with low bass drums.
event.
128 CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION 129
and master bus, are hardly possible, provided you work consistently with 32-bit time line – release – is a further reason to assign compression to the second di-
resolution. As previously discussed in the section on bit depth, even levels above mension. Through the attack parameter, the editing of transients with compres-
0 dB can be processed perfectly well in 32-bit floating point resolution. sors affects the perception and therefore the level. These are the most important
sound editing features of compressors.
The level adjustment itself depends of course on the genre and personal taste.
Should the vocals be in front as in radio-compatible pop productions? Is it neces- Since I have repeatedly seen that even audio professionals with a great deal of
sary to understand every word? Should the final result be a dance floor mix with experience in working with digital or analog signals have problems with the
greater emphasis on the bass sector? Different chefs use different recipes, but all proper application and operation of compressors, I will discuss compression in
with good results. For this reason, my personal approach is just a suggestion; I particular detail. There are a number of approaches; in my workshops, I have
start with the rhythmic backbone (drums and bass), than adding the harmonic found that carefully listening to various ways of using compressors is absolutely
instruments, which provide the foundation to the rhythm group. Then I set the crucial for understanding them. This is why I suggest you load and work through
level for the lead vocal group. After that I add the accessories (percussion, effects, the sound examples in the Compression Exercises on the DVD into your DAW.
fillers, and less important instruments). A good trick is to listen to the mix at an The examples are also provided with compression already added, so that you can
extremely low volume. If you can still recognize the melody, bass drum, snare compare your results with mine. Of course, your results will depend quite a bit on
and bass, you have achieved initial cohesion. what plug-in you are using. You might take the opportunity to reorganize your
plug-in folder. Instructions for the exercises can be found in the “Compression
To determine the relation between snares and bass drums, an RTW peak meter
Exercises” section below.
or its software equivalent is a useful tool. Depending on the style, kick and snare
should usually have the same level of energy. The graphic analyzer can also be At the heart of each compressor is the VCA – the voltage controlled amplifier.
Don´t startwith very useful in setting the levels. At the very least, it can save you from
serious misjudgments.
The VCA is controlled with a number of different parameters and readjusts the
amplitude of the input signal – in other words, its volume. In a simplified view,
automation If there are uncertainties regarding the levels for certain tracks, it helps to
imagine a little person who controls the volume setting and makes quick up and
down adjustments in correspondence with the input signal in order to shape the
before getting pull the faders in question down and then to bring them slowly up again.
amplitude of the signal.
The volume level of the tracks should be so consistent that it is possible
ready with a to achieve a good transparent result without the use of fader automation.
Common Basic Parameters of Compressors and
coherent The static mix is the starting point for fine tuning with fader automation.
Their Use
If you start with automation, before the static mix is coherent, you risk be-
static mix! ing inefficient and having to rework the automation repeatedly, leading to A compressor is a “thickener” which reduces the dynamics of a sound event and
confusion (see section on Automation). Inconsistencies are better fixed directly on therefore initially also its volume. The actual gain in volume is achieved by re-
the audio takes, events, or clips (or however they are called in your DAW). Many turning the level of the compressed quieter signal after compression back to the
programs have keyboard shortcuts which allow you to process level changes in initial level or even increasing it. This parameter for automatic level adjustment is
individual files with off-line processing. generally called “Auto-Makeup-Gain.” If that is not available, there is an output
level control “Volume,” “Output” or “Makeup-Gain,” allowing you to increase
DIMENSION 2: 3RD ASPECT – the level manually.
COMPRESSION
I associate compression with the vertical dimension, that of frequency distribu-
tion. This is because compression has an effect on the density – and therefore on
the perceived loudness of a sound event. The modulation of basic material on the
130 CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION 131
The simplest graphic illustration showing how a compressor works is an X/Y Threshold: The threshold parameter represents the outset or the lower limit
graph, where the vertical axis represents the output level and the horizontal one where the compressor begins working. This value is set in decibels; starting from
the input level. If the output level is the same as the input level – in this case the that value, the compressor begins to process the input signal. Since all level mea-
compressor is inactive – the resulting graph is a diagonal. surements in the digital domain are negative (0 dB = full scale), the threshold
value is given as a negative, in other words minus X dB. The higher this negative
value or respectively the lower the level, the sooner the compressor will start to
work. Lower levels or respectively greater negative values represent greater com-
pression. In the illustrations at left, the threshold levels are set at minus 6 and
minus 9 dB.
If there is no threshold parameter – for example as is the case with the Urei-1176
– use the input level to calibrate the threshold. The higher the input level, the
greater the compression.
The threshold value corresponds to the value of the input signal level. If you want
to compress a snare whose highest peaks extend to 0 dB and select a threshold
value of - 6 dB, you significantly affect the dynamics. At the same setting, and
with an input signal that is 6 dB quieter, the compressor range does not even
Compressor with a threshold of -6 dB Compressor with a threshold of -9 dB With a 0 dB peak on the snare and a threshold of -6 dB, the compressor affects the dynamics starting
from the level indicated by the red line.
132 CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION 133
come close to the range that you actually work in, because even the loudest parts familiar with this parameter, I recommend choosing some
of the signal might only just reach the threshold value of -6 dB, but not exceed it. practice samples similar to the ones used in the following
When I explain all the other parameters, you will learn how to interpret the “Gain Compression Exercises, and to play around with the attack
reduction display” for a quick adjustment of the threshold value. time. The higher the value, the greater the number of ar-
tificial transients that are added. The attack time refers to
The second most important parameter is Ratio. Ratio defines how much the out-
the time between the crossing of the threshold value and
put level is to be reduced once the threshold level has been exceeded. The higher
the beginning of the level reduction. This time offset allows
the value, the greater the compression. The setting “Infinity: 1” equates to limit-
peaks to get through the compressor that are then, once the
ing and is represented by a characteristic horizontal line once the threshold has
attack period is over, processed according to the ratio set-
been exceeded.
tings. The waveform representation lets you actually see the
emphasized transients from the examples.
There are two ways of using the attack time creatively: the
purpose is either to find short unobtrusive settings that
make compression discrete and subtle, or long attack times
to emphasize clearly audible artificial transients. Attack time Compressor with
in ms
long attack time
and strong artificial
transients
Compressor with a ratio of 2:1 and a threshold of Compressor with a ratio of 20:1 and a threshold of
-6 dB -6 dB = near-limited at -6 dB
The input signal level is changed by a ratio of 2:1
compared to the output signal. If the input signal
(horizontal axis) increases by 6 dB, the output sig-
nal (vertical axis) will increase by only 3 dB. Uncompressed bass
In mastering, settings between 2:1 and a maximum of 4:1 are commonly used.
Very rarely are higher ratios such as 12:1 used for individual instruments, which
is quite extreme. Until you have developed a sure instinct in dealing with com-
pressors, you should stick to settings no higher than 8:1. The higher the ratio
setting, the more limiting character you receive and thus, the more noticeable the
dynamic intervention. Low values result in subtle compression.
The Attack parameter allows you to emphasize the transients and therefore to
increase the “snappiness” – or percussiveness – of the recording. In order to get With a long attack time, the transients are easy to see.
134 CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION 135
Long attack times give snares, toms and all other percussive sound events greater The snare drum example in the Compression Exercises gives you a better idea of
assertiveness, bite, and rhythmic definition. This treatment is also well suited for how to use this parameter. The raspy snare sound can’t be heard very well here. A
funky rhythmic Stratocaster style guitars in order to create their typical snappi- shorter release time means that the snare “strainer” sounds – in other words, the
ness. The Urei-1176 is particularly used for editing vocals, because with a long quieter parts of the snare sound – are increased in volume, because the amplifier
attack time it adds an unmistakable bite to the vocals; this is particularly evident springs backs quickly to its original value and thus increases the quieter parts of
in percussive sounds and hard consonants. Many legendary lead vocal sounds the material. Conversely, you can use a long release time to “extend” an acoustic
owe their success to the use of the 1176. Caution, the attack time works the wrong guitar, which fades too quickly due to a lack of sustain. This technique can also
way round; knob to the left equals long attack time, and knob to the right equals be used to move bass notes to the foreground that otherwise fade too quickly. If
short attack time! the compressor makes unwanted sounds – such as headphone crosstalk – too
audible, then try to increase the release time.
In some situations, a compressor will have only a subtle effect on the sound, but
within the context of the whole mix, it might give There are therefore three ways of using compressor release time:
Transients the individual events more punch and greater lo-
u Discrete compression settings depending on song tempo and program ma-
calizability. Long attack times are always welcome
T ransients play a huge role in defining natural
sounds for our ear, for example with a
bowed string. With layered sounds that have
when events sound tired and powerless, or when
an instrument – such as bass – is difficult to rhyth-
terial.
u Increasing the volume of quiet sections that immediately follow transients like
no attack, there are no transients. The more mically localize in the mix. snare drum strainer sounds. This requires short release times.
percussive a sound is, the greater the importance Release is the most important parameter in or- u Increasing the volume of quiet sections that would otherwise be drowned
of the transients. Using too much de-clicking der to keep the pumping sound as low as possible. in the mix due to a lack of sustain. With a decreasing input signal level, the
or de-crackling can negatively affect transients,
Release refers to the time period that passes until VCA (Voltage Controlled Amplifier) jumps back to the starting position, thus
damaging and deforming the natural sound of
the amplifier level is back at its initial setting. Once amplifying the weakening signal. In that way, even a mediocre bass with old
instruments. This is why it is important to pay
the signal has exceeded the threshold, the level is strings gets sufficient sustain for a good foundation. This requires long release
attention to percussive elements in the recorded
reduced in accordance with the ratio setting. Until times.
material when using such tools. Plug-ins such as
the next time that the threshold is exceeded, the
the Sony Oxford Transient Modulator specializes
reduction moves back to the initial setting by in- RMS/Peak is a rarely used parameter, which determines the characteristics of
in processing the transient parts in a signal and is the release and compression behavior. RMS oriented compression is advanta-
creasing the level within the release time. The ac-
therefore able to increase or reduce the snappiness, geous in the mastering process in order to achieve the loudest possible results,
tual shape of this curve – linear, logarithmic, ex-
liveliness, and percussive sound of a piece. whereas a peak-oriented compression is a pure “top-end” compression with little
ponential and so on – determines the character of
a compressor. The ability of a compressor to con- effect on the overall loudness. The only native devices to provide this parameter
dense loudness by increasing the quiet parts of the signal through the release of are the compressor from the Steinberg-Surround edition and MD3 software by
the amplifier is determined by the shape of the release curve. PowerCore.
If you want to achieve subtle results, adjust the release time to the tempo of the Electro/Opto: The electro mode is another description of RMS-oriented release
song. To get an idea of what pumping sounds like, take a song and set the com- behavior and is modeled on analog compressors. With this release behavior, the
pressor to a low threshold, set a ratio of 20:1 and a fast release time. As a result, the return speed of the VCA to the original setting increases, the closer it gets to the
dynamics will rise and fall in rather unnatural waves called “pumping.” original setting 0. Opto works the other way around; here the return speed de-
creases in line with an approach to 0. The Opto behavior is well-suited for drums,
136 CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION 137
for example. Some Waves plug-ins offers the choice between the two modes. complex routings and does not permit the synchronized editing of stereo tracks.
The new VST 3-Standard now allows side-chains; it can therefore be expected
The crossing at the threshold point is referred to as a knee, or more exactly, as
that this function will soon be used in many VST plug-ins. Nuendo 4 and Cubase
a soft-knee or a hard knee. For soft-knee settings, the level reduction is applied
4 now has VST3 plug-ins providing side-chaining flexibility already familiar in
moderately even before the threshold value is reached in order to achieve softer
the analog studio world.
dynamics. Soft-knee settings are particularly good for subtle compression. The
use of soft-knee settings is incompatible with the use of the attack parameter in Gain reduction – or “GR” for short – generally refers to metering and shows
order to generate snappiness because they pursue contradictory targets. level reduction on a display. The greater the displayed level reduction, the greater
the compression. The speed with which the display jumps back to zero indicates
the release time setting. If the ratio is set to 1:1, nothing is displayed since there is
no compression. If the compressor that you use has no automatic make-up-gain
function – in other words, if it does not increase the output level in line with the
compression – you can use the gain reduction display as a guideline for raising
the output level. Once you have cut 6 dB from the peaks, you can raise the out-
put level by 6 dB. Remember, though, that an honest A/B comparison of your
compressor settings is only possible if you listen to the original and the processed
version at the same volume setting.
The gain reduction display is useful for visually monitoring the threshold, ratio,
release, and output gain parameters.
Compressor Types
Single-band Compressors
Compressor with a soft-knee setting Single-band compressors process the entire spectrum as a single band. Traditional
analog devices are always single-band. This type offers all compression modes
The term Side Chain is borrowed from hardware devices and either refers to a relevant to mixing. Single-band compressors tend to produce pumping effects
separate physical input (side chain input) or a signal path captured from the input more than multi-band compressors.
signal for peak detection. This independent signal can generally be processed
outside of the monitored signal with an EQ, generally a double bandpass. For Multiband Compressors
example, you can remove bass parts from the side-chain path to prevent the snare Using bandpass filters, multi-band compressors split the spectrum into 3 to 10
compressor from reacting to every beat of the bass drum. In a practical studio set- bands – similar to a graphic EQ – in order to process them separately. The thresh-
ting, a discrete physical side chain input can be used to add a spoken voice track old of the bass area can be determined by the bass drum beat and the threshold
signal in order to control the background music with the level of this track. The of the mid range by that of the snare drum, and so on. This disassociation allows
music becomes quieter as soon as the spoken voice exceeds the preset threshold stronger compression of individual bands and avoids pumping which would re-
value. This is very often used in radio, for example. On the software side of things, sult in interaction between the various frequencies. The multi-band compressor
the TC Native Bundle with Side chain plug-in has this feature, which is no lon- is best used as a mastering tool. I advise against using it in the mixing process in
ger available. Certain Waves stereo compressors provide the option of using one order to avoid over compression and also to improve the chances for later rectifi-
channel as a side-chain channel and the other for processing. This requires rather cation during mastering. High-quality multi-band compressors for the purpose
138 CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION 139
of loudness optimization work with soft crossover filters whose steepness does
Exercises with Compressors
not exceed 6 dB per octave. This avoids any phase errors caused by steep band-
pass filters. A further feature of a good multi-band compressor is its ability to If you are using Cubase or Nuendo, open “Compression_Exercises.cpr” or
increase density – i.e. loudness – without pumping or sounding artificial. Here, “Compression_Exercises.npr” in your host application. If you are using Logic or
too, the shape of the release curve is crucial: RMS (loudness-oriented) compres- other software, please import the OMF-2 file “Compression Exercises / OMF2.0”
sion has release behavior that is analogous to compressors based on Electro-con- into an empty project.
trolled VCAs. The optional Powercore MD3 compressor is the only multi-band The unprocessed wave files 01 to 07 are on the light green tracks 1 (mono) and
compressor that I know of which offers different RMS and peak-oriented char- 2 (stereo). Insert either the recommended compressor or the best compressor
acteristics. The use of multi-band compressors requires precise settings and har- you have available for working through the exercises.
monization of many parameters; basically, it only makes sense when this broad
palette is used for creatively structuring the sound and not just for compression. (Generally, compressors are to be inserted and only in special cases used as
The different processing and leveling of the individual bands has a great deal of send effects!)
influence on the overall spectral design, comparable to an EQ. If you do not wish
It is up to you if you want to mute the playback tracks or not.
to process the sound spectrum, it is better to use a loudness maximizer, which is
both easier and quicker, provided that all you want is to optimize the loudness. For comparison, the yellow-colored tracks 3 and 4 (mono and stereo) have al-
ready been processed. Naturally, they are muted at first. In some cases I have
Loudness Maximizer EQ’d the examples before compression. Interfering signals that become amplified
The loudness maximizer plug-in from the Steinberg Mastering Edition has given through compression should be removed with an EQ. This includes low-frequen-
its name to a whole group of compressors specialized in maximizing loudness, cy interference and crosstalk. If you are using the UAD-1 1176 or the Sonnox
particularly during the mastering process. Among them are also the L2 and L3 Oxford Dynamics compressors, the presets I use are already in the fxp folder. For
plug-ins from Waves, the Sonnox Oxford Inflator (PowerCore or native) and the this you should use the Load Effect function within the plug-in. Here you will
UAD-1 Precision Maximizer. Like multi-band compressors, these plug-ins are also find the EQ presets, if EQ has been used prior to compression. The names
not suitable for mixing. An exceptional case where they may be used would be FX1, FX2, etc. represent the order of the plug-ins used.
for increasing the assertiveness of a drum group. If you do not have a brick wall
limiter, the loudness maximizers can also be used as reliable limiters as long as There is no surround track on the OMF version. Since the examples come from
you make sure not to be too aggressive. a surround mix, I have included the surround playbacks just for fun.
Before we can start with the actual exercises, there is one final quirk: because the
1176 compressor is a true copy of the Urei compressor, the original’s somewhat
eccentric features have also entered the world of plug-ins. Among them is the
illogical operation of attack and release; the further you turn the knobs to the
right, the shorter the values. For long attack and release times, you have to select
settings between the left limit and 12:00.
140 CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION 141
negative threshold therefore give you more level headroom in achieving your tar- prevent pumping on the vocals. I have also used EQs here. In the mix (with full
get: a bass that can be heard continuously at a consistent loudness. You can choose playback) I copied (doubled) the lead vocal track and treated it slightly different
between short (discrete) and long (to emphasize transients) attack times. Longer (de-correlation) and then mixed both tracks into the center. That way, the vocals
attack times increase rhythmic localization. Set the display on the 1176 to GR for come through more clearly.
showing gain reduction. Adjust the threshold value by changing the input level.
The higher the input, the greater the compression. When the display is in GR Exercise 7: Make a Funky Guitar Sound Crispier
mode, notice how much the dynamics are reduced and how fast the amplifier Here we are also using Attack to create artificial transients, which refresh the sig-
springs back. The longest release time on the 1176 compressor is 1100 ms; you nal. At the same time, the fast release time is used to increase the presence of the
will certainly find the right setting with the UAD-1 1176 or a good native clone. quieter ghost notes, or strumming sounds, to make them clearly audible.
However, in this exercise I prefer the Oxford Dynamics compressor for a thicker,
I have placed a lowcut in front of the compressor in order to avoid bringing
almost artificial result. You can even see the significantly weaker level reduction
out interfering low frequency signals, which frees the sound of any disturbing
of an individual sound. You can also recognize the transients generated by the at-
noises.
tack time. In pop music productions, I often notice extremely compressed tracks
like this. (See page 135 with illustrations of the two wave forms before and after Conclusion
compression.)
Compression on individual tracks in the mix is mainly used for shaping the
Exercise 2: sound and controlling natural dynamic jumps. At this point, it is not used to cre-
Lengthening a Short Acoustic Guitar Sustain ate a thicker sound, as is the case during the mastering phase. Compressor usage
Similar to exercise 1: I dropped in a lowcut filter in front of the compressor to falls into three different categories:
eliminate low frequency interference. u Supporting sustain (presence through level consistency).
Exercise 3: Louder Snare Strainer u Supporting transients (presence through snappy artificial transients).
To bring out quiet parts, set the release time to a very short value. Use this to
u Increasing the level of quieter sections.
bring out the strainer part of the snare sound.
Indirectly, this processing results in restricting the signal dynamics; the com-
Exercise 4: pressed signals can therefore be louder and more even in terms of their respec-
Snappier and More Prominent Snare Attacks tive levels.
Here you will need long attack times. For the 1176, adjust the attack knob to
somewhere between 9 and 11 o’clock. The longest attack time of the 1176 is 800 The use of compressors in groups often requires a more subtle and inaudible set-
ms. ting, with low ratios where the goal is to join individual signals and achieve a subtle
restriction of dynamics. In individual cases, a loudness maximizer may be used in
Exercise 5: Bringing out Tom Attacks the compression of groups as long as this results in the desired punch. Transient
Just as in exercise 4. modulation – first introduced by the SPL Transient Designer and now also avail-
able for DAWs with the same named UAD-1 version and tools like the Sonnox
Exercise 6: Putting “Bite” into Vocals Oxford Transient Modulator or the Duende Transient Shaper (drum strip) – is
You can turn the ratio up to 12:1 of this (with conventional compressors, the val- particularly suitable for percussion and drum groups to inject life and spirit into
ues will generally be somewhat less than on the 1176). The “bite” is generated by flabby-sounding recordings.
a long attack time. Release time should correspond to the song tempo in order to
142 CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION CHAPTER 5: DIMENSION 2: THE VERTICAL DIMENSION = FREQUENCY DISTRIBUTION 143
Only in the master bus should there be a limiter. This could be called a “technical
limiter,” because its purpose is simply to protect the output against overs. As a
basic rule, during the mix you should cap only the highest peaks. That is why it is
always advisable to glance at the sum limiter whenever you have raised the level
of channels or groups, just to make sure that the limiter is still only processing
occasional peaks and not intervening constantly.
If the compressor settings are satisfactory, don’t make any changes to the signal
level prior to compression, since this will indirectly change the threshold level
and therefore, the amount of compression. This could happen, for example, if
you compress in one group and then raise the level of an audio channel routed
to that particular group. Whenever you have to make such a correction, make
sure that you keep an eye – or better an ear – on the relevant group compressor.
Insert compressors either in the second or third slot on the insert rack so that
you keep your options open with regards to dropping in EQs in front of them.
In Cubase and Nuendo, the channel EQs are post, so that the channel EQs are
not suitable for removing undesirable signals upstream from the compressor. In
Nuendo 4 and Cubase 4 you now finally have the possibility of changing plug-in
order simply by drag-and-drop.
G
OOD REVERB PAYS OFF in the stage depth of complex mixes! But
what are the hallmarks of good reverb? In order to identify the features
of a good reverb device, we will first look at the bad ones in so many
reverb plug-ins: Unsatisfactory reverb is characterized by weak stage depth in
the final result. It may sound like reverb, but does not create spatiality within the
mix, only a greater fuzziness of sound. One of the characteristics of bad reverb
is to create the need for a lot of reverb in the mix in order to convey the spatial
information. If 99% of the presets of a reverb plug-in are things like “Big Tunnel”
and “Fantasy Cathedral” and if there are no good ambience presets, you should
be wary!
One look at acoustic reality explains why that is the case. The reverb information
of your current location is made up of an unlimited number of reverb returns that
reach your ears from all directions at various levels. Clap your hands, and you
first hear a loud peak, followed by the highly complex reverb return. However,
the reverb return is extremely quiet compared with the peak of the clap. It could
be that the reverb return is 80 to 90 dB quieter than the peak. Nevertheless, you
perceive this spatial information. This still works, even if other environmental
noises overlay the reverb echo. Our ears are veritable masters in processing and
interpreting this very quiet spatial information subconsciously, that is, psycho-
acoustically!
What distinguishes a reverb device from the real event? The direction of the echo
and the information density or calculation depth!
The second aspect, calculation depth, is of even greater importance. This is the The increased calculation power of DSP-based reverb devices alone results in
area where reverb plug-ins struggle for a compromise between CPU savings and their higher performance levels. The Waves APA 44, for example, “only” calcu-
good calculation depth. Allowing our ears to interpret a reverb echo as such on lates 6 good convolution reverb devices compared with 26 channel strips. This
this subtle level requires a certain information density. If your plug-in cannot do indicates the power hunger of serious reverb plug-ins.
that, increase the level of the reverb return in order to force the spatial informa- If you do not have the CPU resources for simultaneous real-time operation of 4
tion onto the ears for as long as it takes to be accepted and interpreted as spatial first-class reverb devices, you can re-import the reverb response as an audio file
information. This is almost bound to result in a fuzzy sound. and save the preset that you used in order to implement later changes, if the need
Therefore you cannot recognize a good reverb device by the great sound of its arises. This is undoubtedly the better option rather than compromising with re-
presets when stepping through them in solo mode; instead, you need to test it gards to reverb plug-in selection.
to see if the reverb return still stands up in the complete mix at low levels. To do In “Overview of recommended plug-ins” you can read more about the crème de la
this, take a dry drum group or a dry loop and try out your reverb devices in subtle crème of reverb devices.
room settings such as “Ambience” or “Small Booth” by mixing the reverb onto
the drums and switching the reverb on and off while listening to the playback as
a whole. A good reverb does not need to be consciously audible. But you should
Using External Reverb Devices
miss it as soon as it is switched off. Does the spatiality of the drum collapse when If you are lacking in computing power and do not have a DSP card, a top-of-
you switch off the subtle reverb, giving you the impression that the drums are the-range external reverb device is an excellent alternative. Since prices of digital
stuck on a plank at speaker lever? And did you earlier perceive the drum sound devices have dropped significantly, high-class range reverb devices – often no
Sound Example as naturally spatial? If so, then you have an excellent reverb device. (Please refer longer available new – can be bought at a fraction of their purchase price at stu-
#12 to Sound Example #12). dio clearances or Internet auction sites. The only prerequisite for their integra-
tion is an audio interface that provides the required number of input and output
channels (in addition to the stereo bus output). Generally, these are one or two
audio interface outputs in order to send into the reverb mono or stereo, and two
additional audio interface inputs for the reverb return signal or to add it to the
mix as an “open” input channel. In your host application route one or two (for
stereo) auxiliary sends to the audio interface outputs, which are then routed into
the external device. The effect part in the reverb device is set to 100% (completely
“wet”), which is common practice for all send effects. In order to be able to gen-
148 CHAPTER 6: DIMENSION 3: LAYERING WITH REVERB AND DELAY CHAPTER 6: DIMENSION 3: LAYERING WITH REVERB AND DELAY 149
erate the required number of different reverb variations for the layering of your Apart from calculation density, quality and room size of the selected reverb, there
mix with a single device, you need to record the reverb return for different send are two major design concepts available with a significant psycho-acoustic effect
mixes with different reverb settings. This could be a little stereo reverb with delay on the ability of our ears to interpret distance: pre-delay and frequency curve of
tail for the vocals, a subtle little drum booth for the drum and a large reverb just the reverb return. Another important aspect is the quality of “early reflections,”
for the snares. depending on plug-in and preset choice.
This approach has both advantages and disadvantages: The disadvantage is the
experience that is required as well as imagination in order to select appropriate
Aspect 1: Pre-delay as Sound Design Component
reverb types and balance them with each other without being able to listen to Pre-delay time is a basic and important factor for setting the distance in the re-
them simultaneously from the start. The advantage is the ease with which the verb. This parameter is used to set the time span between the direct sound and
recorded reverb tracks can be edited retrospectively and the comparatively low the onset of its first reflection. Pre-delay use is a controversial subject: a high pre-
CPU requirement of audio tracks. It is easier and clearer to use EQ for managing delay time suggests proximity of the sound event, but at the same time tends to
panorama distribution, loudness, and sound structure in audio tracks than to use make the overall sound fluttery and less tight. Pre-delay times between 60 and
reverb return; if required, other effects such as delay can be added to the reverb. 100 ms will already be perceived as a “slap.” If used, they should be synchronized
to the song tempo. Before I confuse you, let’s recapitulate.
DIMENSION 3 AND ASPECTS FOR Case A: You are in the middle of a huge cathedral and the sound source is a meter
DESIGNING STAGE DEPTH (3' 3") away from you. The direct sound reaches you after 3 ms (speed of sound:
With computer-based recording now the norm, the absence of good sounding 1 meter (3' 3") = approx. 3 ms), whereby the first reflections – coming from the
recording studios – with their rich and varied acoustics skillfully recorded with nearest wall 15 meters (50') away – need 90 ms to reach your ears (30 meters x
additional ambience microphones on separate tracks – leads to more or less dry 3 ms). These first reflections are followed by the actual reverb tail. Your subcon-
recordings made in dead-sounding recording booths, or improvised recording scious ability to determine distances gives you an idea of your proximity to the
environments. This forces the mixing engineer to create artificial spaces in which sound source.
to place the sound. Otherwise a mix will sound as flat as a board. This is true for Case B: The sound source is 15 meters (50') away from you and is at the end of a
mixes that strike us as being dry on first listening as well as highly reverberant cathedral and is situated one meter (3' 3") from the wall. The direct sound takes
romantic rock styles. The difference is only that the dry mix uses discrete, small 45 ms to reach your ears, followed 6 ms later by the first reflections, which take
room reverbs with first reflexions, while the romantic ballad additionally em- 51 ms altogether. The direct sound, the first reflections, the reverb tail mix to-
ploys much more obvious spaces. gether, and your subconscious mind interprets this as distance. So much for the
academic explanation.
Reverb as Send Effect
This is applicable to classical music, chamber music and small acoustic ensem-
Reverb is usually used as a send effect, routed through the aux buses post-fader,
bles, all with carefully dosed amounts of reverb.
in other words, the signal after the fader. The advantage of this is that the level
in the signal that is sent adjusts proportionally to any changes effected by the In my experience, this rule functions better the other way around in pop music.
channel fader. If the send was pre-fader, the effect level would remain static, even I believe this may be a result of the significantly lower amount of reverb that can
when you mute a channel or set its level to very quiet. Because the adjustment is be used in a pop mix, as compared to real acoustic phenomena. In this case, gen-
done via the send level, the reverb or effect ratio in the reverb device is generally eral theoretical doctrine cannot be applied to everyday pop music mixing.
set to 100%.
150 CHAPTER 6: DIMENSION 3: LAYERING WITH REVERB AND DELAY CHAPTER 6: DIMENSION 3: LAYERING WITH REVERB AND DELAY 151
Case A: Sounds which you want to appear direct or close by, function well with very
low pre-delay times (between 0 and 10 ms), while distance can be created with
increased pre-delay. You should be careful with placement when using high
pre-delay times with percussive sounds. All sounds relating to rhythm, such as
drums and bass, should have reverb either completely without pre-delay or with
amounts of only up to approximately 10 ms. When increasing pre-delay here,
check for rhythmic consistency.
High pre-delay values of up to approx. 60 ms are particularly suited to chorus and
strings. This puts them spatially toward the back of the “stage.”
You should experiment with different pre-delay values and test the impact of the
results. It is best to do this with a partner, since we tend to hear what we think we
should hear.
Here are some points to take into consideration when dealing with pre-delay:
u With very acoustic, natural mixes, follow the natural behavior of pre-delay;
longer delay times for nearby events and shorter values for far away ones.
In a natural listening environment a long pre-delay
indicates a close sound source u In pop music, use the opposite approach; short delay times for nearby events
and longer values for far away ones.
Case B: u With percussive sounds and long pre-delay times, sync delay times to the song
tempo; avoid “slap delays.”
u If the reverb signal muddies up the dry signal, try a higher pre-delay value, also
synched to the song tempo
Since we are focusing exclusively on parameters of importance for the design of
stage depth in this section, we will also briefly look at EQing the reverb signal,
which also plays a crucial design role.
152 CHAPTER 6: DIMENSION 3: LAYERING WITH REVERB AND DELAY CHAPTER 6: DIMENSION 3: LAYERING WITH REVERB AND DELAY 153
all sounds, except those which pass over our heads due to the curvature of the u Lead vocals and reverb: it's all about the mix! Where a lot of reverb is needed
earth. (for example with ballads), create a dry counterweight by doubling the lead
vocal track; de-correlate the track by adding EQ, compression, and possibly a
On the basis of these two laws of physics, it is easy to deduce that high frequencies
short delay and mix the dry lead track back in. This way, the lead vocals don't
lose more level over the same distance than low ones. The frequency curve for
get pushed to the back too far, but at the same time sound fatter.
the decrease in natural reverb is therefore not linear, because the high frequen-
cies decrease faster than the low frequencies. The greater the distance between u Rich treble content in the reverb indicates nearness, lack of treble communi-
the listener and the sound event, the lower the proportion of high frequencies cates distance.
in the reverb signal. This is why treble roll-off of the reverb signal is one of the
u The main ambience – usually drum ambience – should be discretely mixed in
most effective psycho-acoustic means of representing distance to a sound source,
all important tracks to create acoustic cohesion.
since our ears interpret this information subconsciously. Reducing the trebles in
the reverb of the lead vocals – which you want to place right in front of your mix
– gives the listener a contradictory message. On the other hand, you will also run Basic Delay Rules:
into trouble if you place the chorus in a large room, select a pre-delay of 30 ms, u Delays that are intended to generate space should be tempo-synched even if
and make the reverb very brilliant. Here, too, listeners are given contradictory they are short (quarter, eighth, sixteenth, etc.; dotted and ternary versions).
data and therefore have no clear spatial information to interpret. This is why your Make notes concerning the appropriate delay times in relation to note values
reverb settings should have more treble for sounds that are in the front of the mix, for every song. With ternary rhythms, calculate the delay times for triplet quar-
and less for those placed in the back. ter notes, eighth notes, and so on. Divide 60,000 milliseconds (one minute) by
the song tempo (quarter notes per minute) and you will have the number of
LAYERING STRATEGIES milliseconds per quarter note.
Here are some basic rules to help guide you. Keep in mind, however, that since u Slight variations in delay time can drive (shorter delay times) or drag (longer
mixing music is such a varied and lively process, exceptions prove the rule. delay times) the rhythmic feeling of the song.
u Beware of phase cancellation (comb filter effect) with delay times of under 10
Basic Reverb Rules:
ms.
u When using more than one reverb device, organize the programs or presets by
room size (small, medium, large, etc.) u Single delays between 10 and 30 ms long can discretely "thicken up" the sound,
while the localization of the first sound event remains intact independent of
u Reverb tends to blur the mix more than delay. where the delay comes from. The human ear perceives single delays between
u The balance between space and distance can be controlled with the effect 10 and 30 ms as direct sound events.
level. u Delays up to 30 ms – especially stereo delays – are well suited for making the
u Reverb length – particularly with gated reverbs and snare reverbs – should be sound richer with a slight room effect.
tempo-synched. Snare reverb usually ends on the next full beat. u Delays between 30 and 60 ms produce what is called the doubling effect
u Reverb with very short decay times create discrete spaces. (Beatles sound).
u The longer the decay time, the earlier (with regard to effect level) distance is u Delays between 60 and 100 ms are called slap or slapback echo (Elvis sound).
created by the effect.
154 155
u Stereo delays of up to about 100 ms create the impression of an acoustic overall mix. In rare exceptions, pre-fader aux-sends for effects do make sense.
space. For example, when you want a sound to disappear in the reverb, set the aux-send
to pre-fader and slowly bring the channel fader down. This way, the dry signal
u A delay time above 100 ms is usually referred to as echo and tends to suggest
disappears in a soup of reverb. This effect was popular in the 1970s, but is seldom
distance and space.
used today. Occasionally, using reverb post-fader for quiet, legato strings that are
u The longer the delay time, the more the sound appears to be indirect. meant to be in the back of a large room can be problematic, because the signal
sent to the effect device is too weak, and too little reverb comes back. In this case
u Delay tends to blur sound less than reverb. switch the send to “pre” and reduce the channel fader level until the relationship
u To discretely create space, delay should be very subtly used, so that you miss is correct.
the muted FX channel, but you don't really consciously perceive the delay.
How Many Reverb Devices Do You Need?
u Echo (delay longer than 100 ms) which is not tempo synched is good for creat- Planning Depth Design
ing an effect that is clearly heard as such in the mix.
I know of no logical rule for this, other than that there is no rule. If the tracks al-
So that all of this does not remain dry theory, open a project with vocals and try ready have a good, natural ambience from a recording room with a good sound,
out various settings. and if the final aim is for the production of an intimate sound, it may be possible
in extreme cases to do without reverb altogether. However, if you have accepted
Pre-fader or Post-fader? the challenge of a very complex mix with 80 to 100 tracks with many events that
Why is the reverb effect used as a send effect? There are several reasons for this: it overlap spectrally, using up to 8 reverb devices can make sense. Whatever the
is the only way for you to have the option of sending several sources to one effect scenario, at some point – apart from reaching the limits of computing capacity
device; the only way to edit the effect-return signal separately with panorama, – you loose the ability to differentiate and using more reverb devices does not
EQ, or other effects, as may be required, without affecting the placement of the make any sense.
dry signal in any way. In rare individual cases it may make sense to insert a reverb For a typical pop music setting with a five-piece band, a handful of doublings and
device. To ensure that the send works properly, make sure that the reverb device a bit of ornamentation, I imagine the following scenario: in line with the panning
is set to 100% effect – often referred to as “wet” – and the aux-send to “Post- strategy outline as developed in the section on “Panorama” regarding the place-
fader.” “Post” and “pre” refer to where the signal is tapped on the channel signal ment of individual instruments, the drums are placed at the back of the stage. For
path with regards to the channel volume fader. this, we use our send effect 1 with a subtle little ambience reverb or stage reverb.
Generally, the default setting for aux-sends in DAWs is post – in other words, af- This reverb creates just enough information to bring the dry-recorded drums (or
ter the channel fader. The advantage is that the level of the tapped signal remains carefully programmed and totally dry samples) out into the spatial dimension.
proportionate to any change in the channel fader. If you have already determined In context, you will only be able to identify the reverb as such if you listen very
the right proportion of reverb, this proportion would change if your aux-send carefully. Only by switching it off again will you notice the absence of spatiality.
is pre-fader and you make any changes on channel fader, since the dosage is no To make sure that the bass does not sound like it’s coming from another room,
longer linked to the channel level. The effect amount remains the same while the it is only given enough of this reverb to appear as a unit with the drums. Please
dry signal changes. remember the advice given at the beginning of this book: use the channel send
paths for controlling the effects in order to adjust individual doses more accu-
The “pre” setting makes sense with headphone paths, so that the engineer can rately and use the group track/solo function for dry solo listening. I like to give
create independent studio mixes without affecting the headphone signal in the the bass drum a little less ambience than the other tracks to keep from having too
156 CHAPTER 6: DIMENSION 3: LAYERING WITH REVERB AND DELAY CHAPTER 6: DIMENSION 3: LAYERING WITH REVERB AND DELAY 157
much ambience from the bass drum skin sound (upper mid-range). Otherwise, sized reverb space. Combinations of reverb in delay or delay in reverb can also be
the bass drum can sound fluttery. If the mix does not sound cohesive, then it may used to this end (see below). Or combining a discrete early reflection reverb with
help to give the other important instruments some of this basic ambient reverb. a delay sent to a spacious room reverb can make the vocals sound bigger. If you
This creates a feeling of unity. find good settings for this kind of work, I recommend creating your own effects
channel presets, so that you have quick access to your reverb setups.
Depending on style and genre, an additional, more obvious reverb is often used
for the snares, giving them more “space.” Be careful, though, because you might In the following scenario, 4 to 5 reverb devices are all that you need, provided
find yourself on a trip back to the 1980s. For ballads, we can also apply the send- that for the large reverb space, you just need one reverb device with a stereo Left-
2-reverb to the toms, making it more impressive. If necessary, reverb 2 can be Right return. It goes almost without saying that send effects 1 and 2 have no, or
gated by inserting a noise gate into the reverb return path and using the gate to only a very short pre-delay. Send effect 1 can have a slight treble roll-off to define
reduce the decay length. This gives you the option of using a full, long reverb that the sound’s position, whereas the snare reverb can be without EQ (trebles not
is artificially shortened. This trick also originates in the 1980s and must be ap- filtered) in order to place the snare at the front.
plied with caution and sensitivity.
Here is a quick summary for the setup used in this example:
The gate – or decay – time should be synched to the song tempo.
Send effect Decay Varieties of reverb/FX Area of application Pre-delay Treble roll-off
If the snare you want reverb on is within a loop or is recorded live (un-gated) with
crosstalk from other drum tracks, then you should not use reverb on the whole Send FX 1: <600ms small reverb/ ambience drums and some bass 0 to little slight
track. To avoid sinking the track into a muddy reverb soup, copy the track and
specifically gate the signal to send a short, punchy impulse to the bigger send-2- Send FX 2: 1/4note medium–large reverb space snare 0 no
reverb. If, after creating consistent panorama distribution, you want a modern, (gate)
relatively dry sound style, you can pan the guitars and keyboards without reverb
Send FX 3: >1200ms big room background events up to 60 ms strong
right and left. This way, they sound like they are very much in the front. A delay
(chorus, strings)
is sometimes more helpful for a “mud-free” and differentiated creation of space.
The sounds that are treated this way will stay up front. If backing vocals, percus- Send FX 4: 600-1200 ambience lead vocals 0 or 1/8th note no
sion, or other instruments are placed consciously toward the rear, we will need a
large reverb with some pre- delay and filtered trebles. Reverb can be generously Send FX 5: depends delay or reverb/delay lead vocals (if nec.)
on style combination
applied here so that the spatial information is perceived by our ears, since the
masking effect tends to minimize the perceived reverb amount. Try using a ste- Send FX 6: for ex: (decent) tempo-synched delay guitar & keyboard (if nec.) L10/R20
reo expander in the reverb return. If the result is unconvincing or not mono com-
patible, try two identical reverb presets from two different devices, panned left Send FX 7: delay (strong) solo
and right. Both reverb devices receive opposite send signals so that the cabassa in
the left of the panorama is reverbed to the right and vice versa. Send FX 8: chorus
The best comes last - reverb for the lead vocals! Especially in mixes that tend to be
dry, vocals require a particularly high-quality device to prevent them from being
pulled into a cloud of reverberation. Sometimes all you need is a subtle, small,
and unobtrusive reverb with attributes similar to a drum booth. Often combin-
ing with an additional delay works well, which might blur less than a medium-
158 CHAPTER 6: DIMENSION 3: LAYERING WITH REVERB AND DELAY CHAPTER 6: DIMENSION 3: LAYERING WITH REVERB AND DELAY 159
If you decide to go for the option of a left/right split reverb return for the large well known for this very purpose. Unfortunately it is no longer produced, and I
reverb, you need an additional device. I cannot give you any detailed information am not yet aware if it is available as a virtual plug-in. However, this is not a big
regarding delay parameter settings – this depends too much on the individual problem for computer-internal mixing because the effect is easy to reproduce by
song and style. Keep in mind that instruments placed laterally in the panorama exporting the delay as a single track (mono or stereo) and then – using the “im-
will be pulled towards the center if their effect returns have centered pan set- port into audio track” function – linked back into the mix as an audio track. Then
tings. This phenomenon increases with effect amount and is particularly notice- it is very easy to edit the desired effects manually using cut, fade, mute, and level
able with modulation effects. You can counteract this with extreme panning on tools. Working with computer keyboard shortcuts is conducive to speedy work.
the dry tracks. If this is not possible, use a second panorama-weighted effect, or
While delay also creates a spatial impression, it is not suited for accurately creating
insert the effect into the track instead of using it as a send-effect.
specific stage depth for different sound events. This is why delay should only be
Since music styles and tastes are both individual and varied, please use the ex- used to as a stylistic aid. In some British styles, delay plays a larger role in layering.
ample here as an inspiration for employing your own selected effects, which Dub delays, for example, as used in dub and reggae, give the im-
should serve to support your layering strategy in accordance with your outline pression of a certain distance. The device customarily used for
(see “Dimension 1”). Make notes on the applied effects – like “Send FX4 – small this is the Roland RE 201 Space Echo. The combination of tape
ambience; only for lead vocals” – on a separate sheet of paper. A number of DAW echo with spring reverb gives this effect its unique character.
programs let you make notes on each track, which might help you to avoid hav- Every good dub band has an extra member just to operate this
ing too many paper notes lying around. device. Because for technical reasons, this tape-based device is
quite stingy in the higher frequency domain, it communicates The Roland RE 201 Space Echo
The time and effort of using left/right-split effects is only justified if the reverb
distance and spatial depth when heavily used. An authentic reproduction of the
part becomes too large to convey all the spatial information as a result of the
Space Echo is available as an optional plug-in for the UAD-1 card.
masking effect. The more complex a mix, the more time and effort is often re-
quired for placing all events accurately within the three dimensions.
Delay in Reverb or Reverb in Delay
Delay Instead of Reverb Ever since it has been possible to make complex effect cascades with most DAWs,
a whole field of possibilities has been opened which has always been part of the
A delay tail has the advantage that it can make lead vocals sound fuller, giving them
standard repertoire of traditional mixing consoles. The effects return can be rout-
more volume without endangering their frontal placement. The more obvious the
ed back to a full channel and, using aux-sends, be routed to any other effect. In
delay appears in the mix, the more attention you need to pay to the stereo width of
a virtual mixing console, this is not quite as straightforward, because the host
the effect in order to ensure that the lead vocals retain their center position.
application would have to constantly maintain correct latency time correction
In conjunction with delays, there are two interesting variants of ducking, or sup- throughout the signal path; so far, this has not yet been achieved on such a com-
pression. Either the tail only appears when the lead vocals are silent – in other plex level.
words, there is no input signal over a certain period – or the tail is suppressed as
In order to use effect cascades in spite of this limitation, some host applications
soon as there is no signal. The first option is useful if the delay blurs pronuncia-
make it possible to cascade effects – one after the other – on a single channel. In
tion clarity too much or if it is stylistically undesirable,
Nuendo or Cubase, so-called effect channels can be opened which let you plug
but does however create a good filler effect during
the main effect into the first insert slot. Effect channels are full-fledged channel
pauses. The second option is useful for fattening the
strips, which, for reasons stated above, have no send-target choices available in
vocals without being too obvious in the vocal pauses.
the aux-sends, but serve as effect-return channels. The seven remaining insert
The T.C. Electronic TC2290 Dynamic Delay On the hardware side, the TC 2290-Dynamic Delay is
slots in this channel can be used for wide-open sound design possibilities.
160 CHAPTER 6: DIMENSION 3: LAYERING WITH REVERB AND DELAY CHAPTER 6: DIMENSION 3: LAYERING WITH REVERB AND DELAY 161
I recommend trying different variations for yourself and to develop your own been copied and the snare manually shortened, in order to create a short send
presets for various set-up uses. This is very easy to do by using the option to save signal for the somewhat larger snare reverb. Here you can experiment with a
complete channel settings. If your DAW does not allow effect cascades, then ex- thick gated reverb.
port and re-import the effect return as an audio file. Effects can then be inserted
Send FX 3 is a normal tempo-synched delay, providing some movement on
or the signal can be sent to any desired effect. The classic combination is a rever-
tracks 11 (clave), 12 (triangle), and 15 (Lounge Lizard arpeggio) and places both
bed delay or a delayed reverb. Other combinations with chorus, flanger, compres-
events slightly toward the back as a result of the space that has been created.
sor, gate, and stereo expander effects are also possible.
Send FX 4 gives the triangle and the Lounge Lizard e-piano on track 14 a discrete
SPATIAL EXERCISE spacey sound with a filter-modulated OhmBoyz delay.
On the accompanying DVD you will find a special “Layering Exercise” as a prac- Send FX 5 finally addresses the issue of reverb layering. This send gives the per-
tical application of the theoretical information on stage depth we have been dis- cussion tracks a medium, thick room from the Waves IR1 convolution reverb
cussing. plug-in. The return is processed with a little stereo basis widening in order to
counteract the masking effect and to consciously place the percussion behind
This exercise is provided as a .cpr file for Cubase, an .npr for Nuendo, and OMF the drums. I have put a little pre-delay on the reverb and slightly attenuated the
for all other programs. You will find it in the “Exercises” folder. trebles. The bongos are consciously far to the left and the conga fills far to the
In the .cpr and .npr files, the effects used have been left in place. If you get a mes- right in order to remain unmasked by the other sounds.
sage saying that one or another plug-in is not available on your system, save the Send FX 6 gives the chorus a warm room with the UAD-1 Plate 140 Reverb and
message in a text file and replace the plug-ins with preferred plug-ins available on the Send FX 7 provides the lead vocals with the impulse-response based Vocal
your system. Chamber from the Wizoo W2 Reverb.
The aim of the exercise is to demonstrate how various kinds of reverb provide The tracks “Mixdown wet” and “Mixdown dry” have been created for reasons of
different results in the mix. Try different devices and compare the results. Make comparison. Naturally, the amount of reverb – like so much in the mix – is a ques-
sure that the reverb devices and presets you use deliver the desired spatial in- tion of personal taste; other results are not necessarily better or worse. Have fun
formation in the context of an entire mix. setting this up!
The Send FX 1 is being used for a medium-sized drum chamber. The goal here If you are using OMF: Tracks 6, 13, and 17 are group tracks in the Nuendo and
is to create an unobtrusive space for the drums. You may want to use an SIR2 Cubase projects and therefore have no track numbering in your OMF project.
convolution reverb or another convolution reverb device, such as Altiverb, for ex-
ample. The sends of channels 1 through 5 (overhead, bass drum 2, loop, fretless,
and snare) go to a medium-sized room. Use the solo button on the folder track to
hear “wet” drums and the group track solo button to hear them ‘’dry.’’ This works
because the send level is taken from the audio tracks and not the drum group
track.
(Bass drum 2 on track 2 is slightly gated in order to make it crispier and the loop
track 3 has a little bass reinforcement at 100 Hz. The drum and percussion group
tracks have a little compression).
Send FX 2 is only for reverb on the already shortened snare track 5. The loop has
162 CHAPTER 6: DIMENSION 3: LAYERING WITH REVERB AND DELAY CHAPTER 6: DIMENSION 3: LAYERING WITH REVERB AND DELAY 163
CHAPTER 7:
ADDITIONAL ARTISTIC ASPECTS:
DESIGN AND THE ACCESSORIES
MUTE AND SPECIAL FX
MUTE
S
OME PRODUCERS TEND TO RECORD more in a recording session
than can be intelligently used in the mix. At the beginning of a mixing
session, it is not uncommon for many instrument tracks to be present
from start to finish. The mixing process can then be used to make a selection for a
dynamic and interesting song structure. Few things are more boring than a song
that starts with full instrumentation from start to finish. The mute switch is one of
the most effective tools for accomplishing a good, dramatic song structure.
A division of labor between recording engineer and mixing engineer ensures
the required dose of impartiality needed to experiment enthusiastically with
muting some tracks in different parts of the song. This also opens new design
options for the creation of entirely new material, such as vocal or groove parts
(Sound Example #01). Interrupting loops, shaker figures, etc. can improve the Sound Example
song. For genres like techno and hip-hop, these methods are even part of the style #01
techniques. Numerous variations are first created on top of a loop and are then
arranged into the mix or simply eliminated. This method involves the creative
aspects of arranging deep in the mix process. Fear of the mute button can quickly
result in overloaded and unclear mixes. You must emotionally remove yourself
from the painstakingly recorded and edited tracks in order to work towards the
best possible result, and not be influenced by egos which might be affected by
using the mute switch.
In the mixing process, the mix engineer and producer digs into the large quantity
of tracks, which are in individual parts or individual positions unmuted. Back in
the days of analog mixing, the engineer or producer danced back and forth in
164 CHAPTER 7: ADDITIONAL ARTISTIC ASPECTS: DESIGN AND THE ACCESSORIES MUTE AND SPECIAL FX 165
front of the mixing console, hammering mute buttons, and pulling faders up just therefore not clearly perceptible. Why should an event be allowed to create mud-
at the right moment. Today we have it easier even though the athletic aspects of diness and not be readily recognizable on your studio monitors, even though you
the activity are unfortunately replaced by the computer mouse. Regardless of the know that the event actually exists? Every time you mute an event that does not
DAW we are using, we have three possibilities: have its place in the mix, your overall sound will become clearer.
u Clip-based muting of particular events If you want to keep the individual events, look for other places in the song where
u Mute switch (in automation mode) there is enough room for them. That is how your mix becomes transparent and
interesting!
u Fader automation
If the song does not have an end, you can sometimes creatively mix interesting
For the sake of a logical workflow, I suggest starting with clip-based muting. In endings into the arrangement. If the producer doesn‘t approve, go for the good old
the section “Automation,” we will explore this method more deeply. I can already fade-out.
tell you one thing; I strongly suggest avoiding automation until the static mix is
finished. The static mix represents the best possible result without resorting to Speaking of fade-out! Please, never automate or process fade-outs into your
automation. Personally, I do not believe in mute automation – this feature comes tracks! Fades are only to be done after dynamic and EQ processing has been dealt
from the days when muting individual events in an arrangement window was not with in the mastering phase! The only exception is arranged fade-outs, which
possible. Avoiding unnecessary automation prevents potential irritation through cannot be done on the finished master. These are fade-outs where individual
accidental deactivation of automation or moving the parts in the arrangement tracks are faded out against other tracks, or where the individual track fade-outs
independently of the automation data. are staggered with respect to one another.
Fader automation should be reserved for detail work, rather than bringing tracks What is the opposite of mute?
in and out. We will discuss this further in the chapter on automation.
Courage! Even though courage is needed for muting, more courage is neces-
The question raised is such; does it make more sense to “turn all tracks on” and sary for the opposite – conscious emphasis. Have the courage to bring individual
then reduce with the mute switch or is it better to develop the mix by gradually events such as effects, percussion instruments, and sound design elements briefly
unmuting tracks and events? to the extreme front of your mix. That could turn your song into a hit because
something out of the ordinary grabs the attention of your listeners. The strat-
Both methods are possible! However, you should decide for one or the other, de- egy of first creating a static mix is one thing. But the last 10 percent of the mix
pending on the style and the situation. If you use the first method, then try to find involves the really hard work and at the same time the evolution from a good
a valid place (within the three dimensions) in the mix for every event. Once you rough mix to a release-ready end result. Don‘t be misled by the results of the static
have found a place for all events and mixed them, reduce! You especially reach for mix – maybe you need to play around with some percussion parts, for example;
the mute switch if you have come to a point where it is no longer possible for all there could be three clave beats placed in a pause right before the refrain with a
events to transparently coexist. This way of working is often used with electron- tenfold level increase, giving your song the decisive kick. Or a bongo lick during
ic-oriented styles, while the second approach is better for a pop song. With the a vocals pause right after the bridge. What about the screwy guitar sound in the
latter, start first with the framework – the rhythm group, then the warmth-giv- intro? Couldn’t it be rearranged and placed right up front to make it another song
ing harmony instruments, then the lead vocals, and finally any ornamentation. hook? How many seconds does it take you to recognize a hit that you have heard
Pauses in the vocal tracks can often be filled with less important events like guitar only twice on the radio? Go for the extreme: modify your sound with courageous
riffs or percussion figures. changes in individual spots. Keep your ears open to sound events in the mix that
For transparency in a mix, you should radically mute all events for which you have something special, and bring these sounds out. Act like a stage actor who
have found no clearly-defined place within the three dimensions and which are
166 CHAPTER 7: ADDITIONAL ARTISTIC ASPECTS: DESIGN AND THE ACCESSORIES MUTE AND SPECIAL FX CHAPTER 7: ADDITIONAL ARTISTIC ASPECTS: DESIGN AND THE ACCESSORIES MUTE AND SPECIAL FX 167
exaggerates gestures so that the people in the very last row will get the message. tach them to an empty bottle sitting on a bass box. And so on. Take your time
Consciously listen for such stylistic techniques on your favorite CD. What makes looking for the right intro for a track that you hope will get airplay. Many legend-
a sound engineer successful? Hundreds of hit song mixes demonstrate the skillful ary sounds were created accidentally. Be open to acoustic accidents in the studio
feel that professional engineers have for bringing out that particular thing in the and cultivate them. Maybe you‘ll create a new cult sound!
song, emphasizing its charm and character, and turning a good mix into a hit mix.
Playing with space is certainly part of the special FX bag of tricks. Make some
special events of your mix occur in an other room (where it makes sense with
SPECIAL FX regards to content).
Now that you have exhausted all of the structuring possibilities with muting or
For inspiration, listen to Trevor Horn’s mixes. On the Seal albums you can clearly
event exaggeration, we still have the infinitely deep toy box called “Special FX” to
hear how the vocals room changes for just a measure or a few words. This draws
make our mix more interesting. The possibilities are so limitless, that it is difficult
attention and makes listening to the song interesting, even after the fiftieth time.
to sum them up in words and to logically present them. Use your playfulness and
Two Sound Examples are included (#03 and #04). Sound Examples
inventiveness and process the available audio material in your mix by chaining all
imaginable plug-ins together. #03 & #04
Since creating good stage depth requires good reverb plug-ins, you can export
Sound Example
the reverb return into an audio file to save computing capacity. That way you can
#02 The repertoire includes the extreme use of all known effects, deforming sounds
have two to four real-time reverb plug-ins for the basic layering of your project.
through uncommon chains, reversed sound, using synth filters, vocoders, and
much more (Sound Example #02). The optional and relatively expensive VSS3 Reverb from PowerCore is a real
Using guitar amp simulation such as
NI’s Guitar Rig or Leslie cabinets are
also popular techniques.
Extensive filter effects are possible with
the “Filtroid” plug-in from PowerCore
(illustrated). Along with a reversed au-
dio file, you might create an interesting
song intro. Make a catchy intro with
recognizable elements, preferably from
the refrain. There are no limits to exper-
imentation! The lack of large recording
rooms has also resulted in increasing
use of a classic method; re-amping. This
involves playing a track into an amp
through re-recording in a microphone
booth. Play the track through guitar
Filter plug-ins are amps, bass amps, or portable radios and
suitable for creating re-record the result. Or you can play a track through monitors in an interesting
interesting effects. Here
is the “Filtroid” plug-in acoustic space and experiment with capturing the room acoustics with a bound- The VSS3 has particularly high-quality early reflections. The screenshot shows that the
actual reverb tail (Rev. Level) is off and the original (Dry Level) mixed only with the early
from PowerCore. ary layer microphone. Maybe use a pair of headphones as microphones and at- reflections (Early Level). In this case, the device is used as an insert.
168 CHAPTER 7: ADDITIONAL ARTISTIC ASPECTS: DESIGN AND THE ACCESSORIES MUTE AND SPECIAL FX CHAPTER 7: ADDITIONAL ARTISTIC ASPECTS: DESIGN AND THE ACCESSORIES MUTE AND SPECIAL FX 169
Early reflections find. It is one of the rare reverb devices that boasts excellent qual-
ity for the early reflections, which can be totally separated from the
are very actual reverb tail.
important for Early reflections are very important for representing space and many
representing devices do not do a very good job of simulating them. The VSS3 cre-
ates good, discrete effects that can subtly breathe new life into other-
space! wise boring recordings.
I n case you don’t create a hit on your first try, don’t worry. It is certainly
because of the arrangement. No joke! A well-arranged composition
is easier to mix than a badly-arranged B-Side track. If you are not under
time limits, stand back and try to achieve some perspective. This will give
you a more fresh and unbiased approach to your work.
170 CHAPTER 7: ADDITIONAL ARTISTIC ASPECTS: DESIGN AND THE ACCESSORIES MUTE AND SPECIAL FX 171
CHAPTER 8: AUTOMATION
O
NLY FIFTEEN YEARS AGO a studio owner or engineer would
have felt lucky to have motorless fader automation with just mute
and VCA automation.
Motorized faders were lovingly called flying faders, which – as opposed to VCA
automation – equipped with servos would follow the actual fader position. These
did not have all that much in common with motorized faders of modern digital
mixing consoles or DAW-controllers that serve only to enter and visualize the
electronically processed digital values.
Today – keeping with our theme of internal mixing – DAWs make dynamic au-
tomation of practically every imaginable event possible. However wonderful this
might be, the other side of the coin is not just the lower caloric expenditure of
active mixing engineers. A mix without automation on a console that is several
meters long could be a sporting event that today is reduced to training the index
finger.
A clear work strategy is the prerequisite for taking full advantage of automation.
We will examine automation strategies; automating too early inevitably creates
more work than necessary and obstructs the workflow.
This is why I believe it to be urgent to first create a static mix that reaches a 90%
satisfaction level, especially in terms of panning. As the further development of
the rough mix, the static mix is both a waypoint of departure and a guide for any
automation that is to be added.
If you start with automation before all the basic relationships are adjusted, you
will most likely have to redo most of it. That is only fun with a touch-sensitive
motorized fader, which begins writing data when you touch it and stops when
you let go. But it can be nerve-wracking when you begin with automation before
the arrangement is even finished. If this is the case, make sure that the automa-
A
T THE EXPENSE OF REPEATING certain details from other parts of
the book, I believe it is important to provide you with systematically or-
ganized information and tips for working with individual instruments.
This way, you have a reference text that may be useful if you find yourself in a dead
end with your mix and you need to find some inspiration and solutions to get you
moving again.
DRUMS
Processing live recorded drums is a big challenge. But you can learn and get ideas
from working with programmed drums and loops. On the tutorial DVD Internal
Mixing Vol. II, the techniques I am describing will be developed step-by-step
with the help of practical examples.
Bass Drum
Pan
The bass drum definitely belongs in the center.
EQ
Often there are two tracks concerning the bass drum. One is recorded with a
microphone placed close to the head representing the “skin” sound, and then
another, large diaphragm mic on the resonance head or hole, delivering the lower
frequencies. Whether there are two tracks or just one – in both cases, the bass
drum has the same characteristic core frequencies which have their importance
and need to be treated as such; the lower frequency “bottom” range between 65
and 110 Hz and the skin sound between 3 and 5 kHz, occasionally up to 8 kHz.
For a good bass drum sound you can use a bell filter to support the lower end. Use
182 CHAPTER 9: WORKING WITH INDIVIDUAL INSTRUMENTS CHAPTER 9: WORKING WITH INDIVIDUAL INSTRUMENTS 183
For boosting the snare’s overtones, I recommend using a high-resolution high-
Snare
shelf filter or a parametric filter with a broad band Q-factor.
Pan
Compressors and Gates
The snare should also be in the center.
Compressors fill two roles for snares:
EQ
u Top end limiting or compression to decrease dynamic range. Here you need to
Like the bass drum, the snare drum has two characteristic core frequencies. The make sure that the natural transients are not affected by compression, since this
bottom is a little higher than that of the bass drum and is situated in the lower can lead to a tide-drum effect (imagine a sound that a cardboard box makes
mid-range. Snare characteristic core frequencies are between 120 and 260 Hz, when played), sounding like someone took the air out of the snare; it destroys
depending on head size or samples being used. Snares are also often recorded the natural transients by too much dynamic processing.
with both a top and a strainer microphone. The top mic generally picks up the
punch in the lower mid-range and the strainer mic is used for getting the strainer u Bringing out snappiness and percussiveness with longer attack times.
sounds in the upper mid-range to the higher frequencies. In the upper frequency
u Controlling the quiet parts of the signal (strainer) by fast release times (the
spectrum, the strainer can sound broadly noisy up to 20 kHz and beyond. When
faster the release time, the more strainer sound).
using two microphone tracks, it is important to pay attention to phase. If there
is any phase-reversal of the strainer mic during recording, then both tracks can When working with live drums, it is standard to use a gate on the snare track or
cancel each other out in the mid-range, possibly taking all the power out of the to create a copy of the snare track to be gated.
snare sound. Try this out by simply switching on the phase button on the strainer
track while listening to both tracks at the same loudness levels. I find it interesting Doubling Bass Drum and/or Snare Tracks
that with live recordings, snares first start sounding really open and good when I have already mentioned this theme in passing when talking about gates with
the overhead tracks are mixed in. Snares seem to develop their true, full high-fre- bass drums. The trick of doubling (or “multing”) is very often used with tracks
quency sound with a little distance, making a second mic necessary to adequately (bass drums and snares, for example) with microphone crosstalk – particular-
record the sound. ly when there is only one microphone track (head/bottom or top/strainer). By
processing the doubled tracks differently, you can more intensively manipulate
If you are unhappy with your snare sound, there are two areas to look into: the
the signal, without making things sound artificial. If you put a short gate on the
bottom and the overtone spectrum of the strainer. In addition, I recommend
copied track, you can use extreme EQ and compressor settings. Strong EQ set-
using a lowcut filter at 80 Hz and also some attenuation to create space in the
tings on short, impulse-type sounds are not as noticeable as they are on ungated
mid-range. Snare sounds from metal snare drums add an additional interesting
sounds. Signals freed of crosstalk can also be more heavily compressed. Strong
element into play. These frequencies can be boosted or attenuated with a bell filter
compression with quick release time of an ungated track with crosstalk has the
set to a medium Q-factor. If the snare tone has a significant relationship to the key
undesirable effect of boosting the quieter (crosstalk-laden) signal parts.
of your track, then it is easier to integrate such a stylistic element into the song. If
the sound is a half tone lower, for example, it will probably sound terrible. Don’t A track processed in this way is then mixed with the ungated, “natural” sounding
be afraid to pitch-shift the whole track with your DAW to tune it to the song’s track, resulting in more punch.
fundamental key, or to a fifth above the key fundamental. The Red Hot Chili
This technique can also be used to split the instrument in its basic parts (bottom/
Peppers have tuned and integrated the tonal sound of a metal snare into several
skin) and then put it back together. In this case, one track would have the low
of their songs. If the sound is annoying, you can use the track doubling technique
frequencies and the other track the high frequencies.
described below to push the sound into the background.
184 CHAPTER 9: WORKING WITH INDIVIDUAL INSTRUMENTS CHAPTER 9: WORKING WITH INDIVIDUAL INSTRUMENTS 185
The hi-hat needs to have a lowcut filter to eliminate any low-frequency air sounds.
Reverb
In order to reduce crosstalk from the snare, you can also set the lowcut to 250 Hz.
The same method can be used when dealing with big reverbs on snares: The cop- This really depends on what kind of hi-hat is used and the style you are dealing
ied and gated snare track drives the reverb with the short, “cut” snare sounds with. A heavy rock hi-hat definitely would need a lower cutoff frequency setting.
which are powerful and clean, and not washed out with the crosstalk signals. The I have never used a noise gate on a hi-hat track; it seems rather pointless to me
open, ungated track is then mixed in with a small amount of ambience reverb. that when in certain parts of a song there is no hi-hat, the artificial switching on
Certain styles demand that the snare is a bit more “meaty” by putting the gated and off of the track would create sonic inconsistency. The hi-hat mic also creates a
track in turn through gated reverb. You can create such a sound by putting a noise kind of super-close ambience track, which contributes to the global sound of the
gate after the reverb device. When both tracks are mixed together, the snare still drums. The more deep frequencies you leave in and the more you pan the hi-hat
sounds natural enough. The gate release time must be synched to the song tempo to the right, be aware of the center balance of the whole drumset because of the
(usually to a quarter note). snare crosstalk on the hi-hat track.
The cymbal track is already a kind of close-ambient room sound. Here, again,
Other Aspects of Drums you’ll need a lowcut filter. A gentle roll-off of 12 or 24 dB per octave is best in
Pan order to avoid unnecessary phasing effects. Use your ears to determine the cutoff
I like to mix drums from the listener’s perspective, so that the hi-hat is – for a frequency. In some situations, it serves the sound aesthetic best when the whole
right-handed drummer – slightly to the right in the panorama. You must be frequency spectrum comes through. In other cases, a cutoff at 400 Hz is appropri-
sure that the cymbals are organized so that the hi-hat is also on the right chan- ate. Keep in mind that overhead cymbal mics have a 6 ms acoustic delay (because
nel. This is where mix-ups can occur; one engineer might think in terms of the of being a distance of 2 meters (6’ 7”)) with regards to the snare and this can result
drummer’s perspective and another engineer might have the listener’s perspec- in a slightly “fluttery” bass drum sound if the cutoff frequency of your lowcut is
tive in mind. Track labeling can lead to misunderstandings. Make the panning set too low. If the drummer forgot cymbal cleaning fluid for the recording session,
of the cymbals as wide as possible. Very wide left-right panning is usually im- you might want to try a gentle-sounding highshelf filter that brings some brilliance
possible for reasons of phase problems. This is the time to watch the phase be- starting at about 12 kHz. In this case, pay attention to the quality of the plug-ins
havior with your correlation meter; make an acoustic check by switching the your are using. Try the UAD-1 Precision, the Helios 69 or the Neve 1073 EQ.
control room mix to mono as well. I also spread the tom-toms from right (rack Simulating Ambience Tracks with
tom) to left (floor tom) in the panorama. To bring out the authenticity factor,
Offline-processed Convolution Reverb
the hi-rack tom should be only slightly right, the mid-rack tom slightly left, and
the floor tom farther left. When more toms are used, you can simply follow the Complex high-end productions get their depth from recording ambience and
natural placement on stage. early reflections in large, expensive, and flexible recording studio rooms onto
separate tracks. That makes it easier for the mixing engineer to mix in a subtle
Gating toms is mostly done manually (scissors tool in your DAW). With a small amount of room ambience and depth, without oversaturating the recording with
amount of fills, this is accomplished rather quickly. If there is a lot of tom playing room ambience from artificial reverb devices. Often, three different stereo ambi-
in the drum part, then a noise gate can save you time. If the toms sound a little ence tracks are recorded – close ambience with small diaphragm mics about a
weak, an extreme (good-sounding) bell filter with a medium-to-strong Q-factor meter (3´3´´) away; a mid-ambience with big diaphragm mics about three to
can provide the necessary punch. In this case, try to find the exact characteristic four meters (10´ to 13´) away; and then boundary layer mics (PZM – Pressure
core frequency of each tom. Depending on the style, you can also mix in a bit of Zone Microphone) up to 8 meters (26´) away from the drums. If you don’t have
the big-room snare reverb to make the toms sound bigger. access to such big, expensive rooms, you can reach into our bag of tricks: As
usual, our drums are coming out of a sampler, a software instrument, or a re-
186 CHAPTER 9: WORKING WITH INDIVIDUAL INSTRUMENTS CHAPTER 9: WORKING WITH INDIVIDUAL INSTRUMENTS 187
ally small and dry recording room. Compared to your favorite CD, your drums ing the bass drum sound mushy?
might have a good amount of punch, but they still sound flat. This phenomenon
This is all possible by simply copying the loop a number of times! Use each copy
has nothing to do with the quality of the samples or the recordings, but simply a
of the loop for shaping one particular aspect of the sound. In this way you can
result of a lack of room information in the sound.
use a copy to process the bass drum, another to separate out the snare, and yet
Now create a preliminary stereo mix of the drums where you mix the bass drum another one for the rattling sounds. Shape the desired features with gates and
in much more quietly than you would in the final mix. Depending on what DAW compression, with lowcut, shelving, and bell filters, and with appropriate use of
you are using and your general workflow, there are various methods that all have reverb and delay.
the same objective – to create one or more artificially created ambience tracks hav-
(Sound Example #08 and Exercise “Loop Editing”)
ing a large amount of room ambience, which can be mixed into the drum tracks.
One possibility is exporting and re-importing the above-mentioned drum mixes Sound Example
along with offline-rendered reverb. For this you can use the internal convolution BASS #08
reverb in Nuendo or any good convolution reverb in offline-processing mode. If I have already mentioned the most important thing for a good bass sound: The
you do not have this possibility, you can switch the drum group to solo and insert bass needs room! Every unnecessary low-frequency event robs the bass of the
the reverb into the group track. Then you can export the reverbed drum mix room it needs for clarity. Once again, use as many lowcut filters as you have in-
and use it as an ambience track to mix in. By choosing the right impulse file and dividual tracks! If you are recording your own bass guitar tracks, use fresh, thick
amount, you can obtain a stereo ambience track which is comparable to a stereo strings, with a high string action (unscrew the truss rod ½ to 1 turn), so that
ambience track made from two boundary layer mics placed far enough away the strings can vibrate freely with less noise. New strings ensure that the strings
from a drum kit in a good-sounding room. The bass drum can remain latently start vibrating more quickly, helping intonation and presence. A good front-end
present in this ambient track, so that it is part of the entire sound without creat- should also be used – at least a very good DI box or tube DI box (for example
ing a flutter-effect. With high bass drum levels, the skin sound can tend toward BSS). If you have a UAD-1, the 1176 and/or the LA-2A in the bass channel is a
flutter, and sounds old-fashioned and “wet.” The result of this method is simi- must. Here you can adjust the crispiness and therefore the localizability of the
lar to using a good reverb device in real time. It saves you computer power and note beginnings by playing with the attack times.
simulates a realistic situation, depending on the reverb used. The loudness of the
ambience track or of the ambience reverb should be adjusted so that it is neither In addition, you can control the balance between the “heavy” sounding notes and
obvious nor blatant, but is immediately missed if muted. Successfully layering the stops, damping sounds, and dead notes with the compression amount. If the
drums in discrete room ambience makes it clear what is meant by flat-sounding bass is played well, the more quiet dead notes can be brought out with the com-
drums. pressor. If it is badly played, or if dead notes disappear completely from the mix
from time to time, you can manually cut out and mute the annoying elements.
Refining Loops with Doubling, Gating, In addition to the timing of the note beginnings, note duration is also particularly
Compression, and Reverb important regarding interaction with the bass drum. This is achieved through the
Not every loop that sounds good on a sample CD is also good for using in a mix. player’s ability to skillfully dampen the notes. On the strong beats, particularly
Often loops sound “too good” and dominate the whole frequency spectrum in with the snare (usually on 2 and 4 of a 4/4 measure), the sounds in a bass line
a way that the mix would be perfect with the loop, if only there weren’t other should be ended or stopped, so that the line grooves. If necessary, use the scis-
instruments involved! sors tool in your DAW along with a short fade-out and muting to create room.
Remember our supreme aim of only including events in a mix that have a well-
How do you give the clap on a loop some reverb, without putting reverb on the defined place!
hi-hat and cabassa? How do you make the snare sound punchier, without mak-
188 CHAPTER 9: WORKING WITH INDIVIDUAL INSTRUMENTS CHAPTER 9: WORKING WITH INDIVIDUAL INSTRUMENTS 189
Compression
The compressor is a versatile dynamic shaping tool for the bass guitar as well, and
can be used in the following ways:
u Dynamic limiting of sudden level peaks and irregular playing.
u Creation of sustain with long notes (particularly in songs with slow tempi).
With a long release time – synched to song tempo – the level reduced by the
A bass EQ curve might look like this. Above the lowcut filter the bottom is boosted and the thumpy mid-range is compressor amplifier slowly comes back up until the next note creates another
attenuated. Around 1 kHz, “comprehensibility” is accentuated. reduction of level. This way, nice, long tones can be drawn from a lower-qual-
ity instrument.
Pan
u Boosting quieter side noises (funky or Jaco style) with fast release times (this
Because of the low frequency content of the bass, it always belongs in the center. excludes creating sustain as described above).
With stereo tracks from synthesizers or software instruments, watch for a good
balance because an unevenly panned bass leads to a left-right imbalance resulting u Supporting rhythmic localization and percussiveness with long attack times.
from the high energy contained in the bass sound. Use a goniometer or the peak
meter in the master bus while soloing the stereo bass track.
Reverb
With normal pop music, you should give the bass – at the most – a subtle and
EQ barely audible, but just perceivable, room ambience from the drum kit. This em-
A bass can only handle a lowcut filter with a steep slope – 36 dB per octave is phasizes the unity between drums and bass.
often not enough, with the result of dampening the useful frequencies starting at
Especially fretless basses can do well with a subtle bit of room reverb. Other bass
about 30 Hz. Some basses that don’t have enough bottom end can benefit from
reverb techniques fall under the category of stylistic particularities.
a boost between 40 and 80 Hz with a medium Q-factor. You can dampen the
lower, thumpy mid-range between 120 and 350 Hz if the bass tends to take space Multiband Compression for Bass
away from other instruments. In the area between 800 Hz and 1.2 kHz you can
In particular cases, a multiband compressor with expander function can be used
adjust the nasal, woody part of the sound. This also helps with tonal localization.
to bring out specific spectral regions of an instrument. If a standard compressor
Above that, there are just a few very quiet overtones and mostly dampening and
is unable to boost the dead notes enough that they can be heard in context, then I
Sound Example other noises. In Sound Example #09 you can hear a soloed bass with and with-
like to use the multiband compressor from Steinberg/Spectral Design by copying
#09 out EQ, then in the mix with and without EQ. The Oxford EQ was used for this
the bass track and running the copy through the multiband compressor which is
example.
set for bringing out the frequency bands in question. In the Steinberg compres-
sor, you can keep the solo switch on for certain bands, so that only the critical
frequencies come through, which are then mixed to the “full range” channel.
This emergency tool can also be used with other instruments to undertake even
more extreme “first aid” operations.
190 CHAPTER 9: WORKING WITH INDIVIDUAL INSTRUMENTS CHAPTER 9: WORKING WITH INDIVIDUAL INSTRUMENTS 191
Doubling the Bass Track Pan
Since the bass also consists of two components (bottom and top), it can be help- Individual lead vocal tracks are usually placed in the center in pop music produc-
ful in some cases to split it into two tracks in order to work on the bottom with tions. In parts of songs that are meant to deliver a feeling of intimacy, single lead
one track (using, for example, the Pultec-EQ on the UAD-1) and shape the tonal vocal tracks are always better than doublings (different takes). If a single track
comprehensibility of the other track. Interesting results can be obtained by dou- is not enough to create the presence needed, you can also duplicate the track
bling tracks in this way, sometimes making it possible to save a mediocre bass and decorrelate the two – in other words make them dissimilar – with various
recording. compressors and EQs, then slightly split them between left and right. Be careful
to maintain the phantom middle placement – the impression that the lead vocal
Bass and Chorus track comes from the center. This strategy gives the voice more punch and pres-
Doubling the bass track is a great way to add chorus, flanger, or other effects. ence. To create intimacy you can extremely compress one track (for example with
Leave the bottom track without effects and put a roll-off with a soft 12 or 24 dB/ a Sonnox Inflator), make more presence with boosting high frequencies and then
Octave highcut filter at around 180 Hz, for example. Then do the opposite with mix with the main track. For pop stars who have thin voices, the critical mass is
the copied track by dropping on a lowcut with a steep slope and then insert the often exceeded with vocals doubled 40 to 60 times, and then putting AutoTune
modulation effect. This way you maintain a clean mono bass without phasing on each track along with a good reverb plug-in. After quite a bit of editing you
problems along with a full-sounding, modulating stereo effect. can also get such results and generate the impression of one full voice. Maybe you
know the phenomenon where a large number of single voices sound at one point
VOCALS like one voice. That can even happen with amateur choirs, when enough voices
come together.
The better vocals are recorded, the easier it is for you to place them in the mix.
This can be compared to digital photos: If you take a picture of something with In pop music, refrains are very often doubled. Four doublings per singer and
a 2 megapixel camera for less than $100, you might have a nice little photo. But voice is generally the average. With a lead singer and three backup voices you can
if you want to make a poster out of the picture, then you quickly see the limits of easily end up with 16 voice tracks. To organize sonic separation, I recommend
the image quality – you can easily see the individual pixels and the lack of depth. routing the backup vocals to a separate group. The doubled lead vocals should be
Pixels to the eyes are bits to the ears. A recording, which does not have a high panned in a way to keep the phantom middle position, with the backup vocals
resolution, cannot be naturally placed in a three-dimensional room. With a high imbedding the lead in the panorama. The backups should be panned so that the
resolution, lead vocals come right out of the loudspeakers. higher voices are more spread and the lower ones are more towards the mid-
dle. Mix the group first together, before putting them into the whole mix. In the
Even if you do not own a first-class front end chain (microphone, cable, pre- group track, you can insert some stereo expansion in order to make sure that the
amp, cable, converters), then I strongly recommend that you borrow or rent such backgrounds and lead have enough separation in specific passages by increasing
equipment for the recording session so that you can compare the results to those the width of the backups.
of lower-quality equipment. This way you’ll get a feeling for the reserves that are
hidden in good technology and the limits and possibilities which good engineer-
ing can provide. Good performance together with excellent recording equipment
makes good engineering easier.
192 CHAPTER 9: WORKING WITH INDIVIDUAL INSTRUMENTS CHAPTER 9: WORKING WITH INDIVIDUAL INSTRUMENTS 193
EQ
All vocal tracks need a lowcut filter – 80 Hz is enough for the lead vocals and
I like to give background vocals a softer 12 or 24 dB per octave filter, setting it
higher (up to 400 Hz) depending on the style and situation.
Vocal recordings often suffer from resonance in the bass and lower mid-range
that arise from suboptimal room acoustics, which result in room modes and
comb filter effects. Since recording situations often only eliminate disturbances
in the foreground affecting the mid and high frequency ranges, the recording
can sound thumpy. The sound could also be described as being “closed” instead
of open. In these cases, the resonance frequencies need to be scanned between
120 and 350 Hz with a very extreme and steep-sloped boost in order to find
them then reduced with a notch or bell filter (with high Q). The result is singing
Here you can see extremely pronounced sibilants resulting from a gap between the upper two front teeth. They are
that sounds more free. This mid-range band must not sound too thin! Here is manually reduced in the vocal track. The words are: “She was so strong...”
where warmth is located, especially from male voices.
In the mid-range region up to 2 kHz for men and 3 kHz for women you should De-essing
boost with a wide (low Q) EQ for speech comprehensibility. A soft boost in A de-esser can be used to handle a moderate amount of unwanted “s” sibilant
this range is standard, since putting the vocals in the phantom middle creates a sounds. There is a wide range of high-quality native and DSP de-essers, from
comb filter effect through the dampening of the human head, which weakens Waves, Steinberg/SPL, UAD-1, and TC. A de-esser is a kind of higher frequency
these frequencies. If the signal coming out of a center loudspeaker sounds like expander or compressor, which reduces sibilants with a bandpass filter on the up-
it’s coming from the front, then the signal comes into the left and right ear in per mid-range level peaks without making the rest of the frequency range sound
an equal and unified manner. If, during simultaneous playback from the left dull through dynamic reduction. That is the objective of de-essing and it is pos-
and right channel, there are time differences between the two channels; then sible without a pumping sound, as long as the sibilants are not too much in the
the signal coming from the one speaker arrives in the opposite ear with a slight foreground and therefore must be reduced too strongly.
delay and is dampened, and together with the other speaker creates a comb
Another kind of de-esser is a dynamic EQ. A native version is the Sonalksis DQ1,
filter effect.
for PowerCore there is the excellent TC Dynamic EQ. With dynamic EQs, a causal
Frequencies between 6 and 8 kHz are sensitive. Here are the “s” and other sibilant relationship with dynamic events is established. In other words, the frequency band
sounds that can be found up to 12 kHz. Boosting should always be subtle and in question is processed with a bell filter when a certain threshold value is exceeded.
broad-banded, while paying careful attention to the sibilant sounds. Sometimes Threshold recognition can be set up independently of the band that is to be pro-
combining with a de-esser can help. You can create a certain openness by gently cessed, via side chain (detector input). This opens up a huge field of possibilities.
increasing the highs from 10 to 12 kHz and beyond. For this, a shelving filter is
Manual de-essing is the most effective and least damaging for the sound. In this
often used. For processing high frequencies, I highly recommend using EQs that
case, enlarge the waveform representation of the soloed track to be processed and
are gentle and internally operate on the basis of oversampling. The Pultec and the
cut every sibilant in order to manually reduce levels. This might sound like never-
Precision-EQ both have oversampling engines and sound very silky (UAD-1).
ending editing drudgery, but it actually can be done quickly if you consistently work
Sound Example Sound Example #11 is a successful case of vocal sound first aid. You can test your with keyboard shortcuts. Sibilants are optically recognizable by having an extreme
#11 EQs with this example. degree of sound density, while being relatively weak in level.
194 CHAPTER 9: WORKING WITH INDIVIDUAL INSTRUMENTS CHAPTER 9: WORKING WITH INDIVIDUAL INSTRUMENTS 195
Compression Making Different-sounding Takes Similar
You should reserve the best plug-ins for lead vocals. Here you can also use the Sometimes vocal takes are recorded on different days. Along with the mental
LA-2A or the 1176. The best copies are the “originals” on the UAD-1 platform, mood and the physiological condition of the voice, the slightest variation in mic
but there are numerous imitations of these legendary devices as native plug-ins placement or the singer’s position with respect to the mic can have a major influ-
as well. The 1176 is THE vocal compressor! Only with this device is it possible to ence on the vocal sound, so that attempts to improve things later sound quite
get the Bob Clearmoutain vocal sound (http://www.mixthis.com). When talking different. It is easiest to make takes sound similar on the spectral level. If a para-
about this, often the term “bite” is used, which can be controlled by using a long metric EQ and good hearing don’t lead to the desired results, then so-called intel-
attack time. This way, the beginning impulse of the vocals comes through un- ligent EQs may help. Intelligent EQs examine both the file to be processed and
Sound Example processed before the compressor kicks in. This is especially audible with plosive the target file in order to make a suggestion for processing. The TC Assimilator is
#10 consonants (Sound Example #10). such a device on the PowerCore platform. It allows you to morph between 100%
original and 100% processed. If the files are carefully read in, you can get good
For lead vocals you can also use the 1176 compressor with high ratios at 12:1.
results. However, fine sound variations stemming from day-dependent timbre
Even when the compression sounds extreme in solo mode, it sits into the mix
changes as well as differences in room acoustics resulting from different record-
well and “sticks to the front.”
ing positions cannot be corrected with such a device.
Reverb and Delay If spectral alignment is not enough, then you can experiment with plug-ins
Most of the time, the lead vocals are placed way in front of the mix. Using a large that focus on vocal manipulation, such as the TC Voice Modeler or Celemony’s
room reverb is a common mixing mistake because then the lead vocals sound Melodyne. There is also a number of voice processing plug-ins on the native
less direct, and it seems as if the singer has taken a few steps back! level.
One of the many possibilities is to use large amounts of small room reverb. These Processing Breathers
rooms should be subtle and unnoticeable, so that the listener is not consciously
The amount of “breathers” in a track is a question of style that I cannot answer
aware of them. Often these small rooms are combined with bigger rooms and de-
here. But breathers that are simply cut off are not a question of style, but a ques-
lay. A well-employed delay can help make the vocals sound fuller without push-
tion of a lack of skill! Decide for every new production how you want to deal with
ing them into the background.
the issue: Should breathers make up a major part of the vocal sound or should
In Chapter 6 (Dimension 3 – Layering with Reverb and Delay) I have already talk- they be unobtrusive? Carefully controlling this in “big productions” is essential.
ed about this in more detail. When doubling tracks, I recommend avoiding breather doubling. Edit out the
breathers from the copied tracks (make sure that the breathers are placed in the
center). If some breathers are unattractive or inappropriate, then replace them
with other breathers. With background vocals I like to split two or four synchro-
nous breathers left and right in the panorama and remove all other breathers
from the vocal tracks.
196 CHAPTER 9: WORKING WITH INDIVIDUAL INSTRUMENTS CHAPTER 9: WORKING WITH INDIVIDUAL INSTRUMENTS 197
Editing Sloppy Syllable Endings in Vocals ACOUSTIC GUITAR
I recommend that you devote as much attention to syllable endings like “t” and Because all other instruments are handled very individually according to genre,
all other rhythmically significant consonants just as you would breathers. If “t” taste, sound aesthetic, and arrangement, it is difficult to make generalizations.
syllable endings chatter in the background vocals or if there is no consistency Therefore I will mention the essential aspects that could be useful for your work.
with the lead vocals, then it sometimes helps to simply cut them off and mute. To
avoid artifacts, put short fade-outs on the edits. This way there will be a rounder EQ
sound and the background vocals frame the singing better. Often I completely Acoustic guitars have a very wide spectral range and often take up too much
fade-out the end consonants of individual vocals, so that the soloed vocals sound space in the mix if they are not processed in some way. Use a lowcut filter to
somewhat weird. In the context of the whole mix, however, such edits are not eliminate unwanted low frequency noises. If an acoustic guitar is placed pre-
perceptible – to the contrary, they create greater transparence. dominantly in the foreground, a lowcut at 80 Hz is sufficient, so that there is a
good deal of warmth remaining in the lower mid-range. If, on the other hand,
Particularities of Working with Backing Vocals
you wish to have only the soft and velvety strumming (with a soft plectrum)
If you do not have enough tracks available to simulate a large, full choir, you can sound as a rhythmic accent, then a lowcut at 400 Hz can be just right, as long
copy a few tracks and process them with AutoTune or a Voice Modeler, so that as the lower mid-range is already occupied by other events. This way, you avoid
the only thing that changes is the color of the copied tracks in order to bring more overlapping and lack of transparency of this sensitive frequency band. In this
fullness. second case, I recommend panning far from the center. In cases where there is
In R&B productions 60 to 90 individually sung vocal tracks are not uncommon. an acoustic guitar alone in the intro or with singing that later comes in with the
Recently, the trend has been to have up to 300 vocal tracks. whole band, I automate the frequency parameter of the lowcut EQ so that the
cut off frequency shifts upward as soon as the other instruments start playing in
Proximity Effect order to make room for them.
The proximity effect refers to the physical characteristic of microphones to sound The rich overtone spectrum of acoustic guitars can often be brought out with
very bass-laden when they are very close to the source. The distance between a high-quality EQ. In this case, a shelving filter starting at 8 kHz is appropri-
mouth and microphone is extremely small. Voices recorded with proximity ef- ate. Remember that a large amount of high frequency content psychoacoustically
fect can give the impression of “crawling out of the speakers.” It is very useful to suggests closeness!
record a very dry signal in order to have the possibility to place the sound in the
front. Because of the close proximity of the sound source to the microphone dia- Compression
phragm, there is a large level difference between the first signal and the room re- Use the Compressor Exercise #07 to get familiar with the many uses of compres-
sponse. The room sounds are therefore much less significant in recordings where sors. Since acoustic guitars are very dynamic, a top quality compressor can help
the distance is 10 to 30 cm (4’ to 12”) from a large diaphragm mic. Because of narrow down the dynamic range. With fast release times you can bring out the
the very small amount of room sound, vocal tracks that are very close-mic’d are various playing noises, so long as they have musical significance. Lengthening at-
easily placed in the mix and therefore can be compensated for bad recording tack time makes the guitar more percussive. Long release times even out the level
room acoustics. Using this recording method for lead vocals can give a feeling of of loud sounds that naturally have little sustain and lose their loudness quickly
intimacy and bring out the vocals in front of the mix. (Exercises 01 and 02). Uncompressed guitars are difficult to fit into complex mix-
es. The typical level differences created while playing are often too big. The sonic
luxury of natural, uncompressed guitars is only for puristic mixes with small in-
strumentation and skillful performance.
198 CHAPTER 9: WORKING WITH INDIVIDUAL INSTRUMENTS CHAPTER 9: WORKING WITH INDIVIDUAL INSTRUMENTS 199
100% with each other. When looking for the right sound, keep the effects off so
Reverb, Delay, and Chorus
that when you do find the right sound, the effects are the icing on the cake.
Use reverb and delay for all types of layering. Chorus is often used for acous-
tic guitars. The detuning effect of chorus gives 6-string acoustic guitars a more If you value good electric guitar sounds, I recommend seriously studying amp
“floating” sound, similar to 12-string guitars. micing techniques. Sometimes 2 to 3 cm (1´´ to 2´´) makes the all the difference
in the world with guitar sounds. The effort is certainly worth it, providing results
Other that still leave amp modeling in the dust.
If you have any influence on the recording process, make sure that fresh strings
Guitar sound freaks might want to look into the SPL Transducer. This analog
are used. Also, verify the combination of microphone and pickup recording
speaker and microphone simulation gives a powerful, latency-free guitar sound.
methods. Using only pickups does not sound as natural as micing. The ceram-
ic piezoelectric pickups often built into acoustic guitars sound a bit artificial The only rule I can give as far as EQs are concerned is that using lowcut filters is a
and hard. Spend time playing around with the microphone position until you must! Guitar sounds should be recorded as close as possible to the “target sound.”
have found the optimal placement. Sometimes just 5 cm (2´´) here or there can EQs allow small corrections, but won’t work wonders. With increasing amounts
make a huge difference! If the acoustic guitar is playing a major role in the ar- of distortion, compressors become less necessary. Distortion leads to a natural
rangement, it is worth looking into M/S micing. You’ll find more information kind of compression. On clean guitar sounds, use compressors as you would on
in technical literature on mics and microphone techniques. acoustic guitars. If your sound isn’t right, experiment with all amp modeling pos-
sibilities, including for example the Leslie from the B4II of Native Instruments.
ELECTRIC GUITAR Such experimenting can easily lead to interesting results.
It is very difficult to make blanket statements because of the variety of sonic pos-
sibilities of the electric guitar, even though I am myself a guitar player. Modern
KEYBOARDS
modeling techniques (POD and others) have made it relatively easy to get good Dealing with keyboards in a mix is similar to electric guitars because of the in-
sounds down “on tape.” This way of working usually leads to recording stereo guitar finite variety of available sounds: watch out with stereo tracks so that you place
tracks, which in turn lead to inconsistent panning in the mix. Do not be tempted keyboards as far from the center as possible. Strictly follow your mix strategy
to place guitar tracks in the center just because they are stereo, so that the full ste- with panning. Always use lowcut filters – even the best samples can have DC
reo effect can be heard. A good, solid mono track is usually a better basis for a good Offset and sub-bass artifacts.
sound; a stereo guitar that doesn’t hold up under mono because the stereo effects
are no longer there to support it is questionable. I often find it more interesting to
work with the combination of a single dry guitar track recorded simply through
a DI box along with a mono guitar track with the amp sound – whether recorded
with physical amps and mics or modeled. This way, you have an amp track without
effects giving you sound information as well as a dry track, which gives you the
possibility of refining this sound conception with other amp simulations, or real
amps that can either augment or replace (re-amping) the amp track. By recording
in this way, you can at least rectify possible recording mistakes. I believe that the
many possibilities of the numerous amp simulations are only really useful when
you have intensively worked with – ideally – the original amp models. Often using
presets leads to mediocre results because the preset and guitar don’t always match
200 CHAPTER 9: WORKING WITH INDIVIDUAL INSTRUMENTS CHAPTER 9: WORKING WITH INDIVIDUAL INSTRUMENTS 201
CHAPTER 10:
OVERVIEW OF
RECOMMENDED PLUG-INS
T
HE SHORT HISTORY OF PROFESSIONAL computer-based produc-
tion platforms has been characterized by quick, uninterrupted develop-
ment and continual improvement of all plug-in types. This has led to
an almost unfathomable saturation of the plug-in market. In the following de-
scriptions, I therefore can neither provide a complete picture of what exists nor
guarantee that there isn’t now a new, better plug-in somewhere in the world. My
focus is on plug-ins that work on the DSP devices that I have already mentioned.
Additionally, I will mention a few native-based plug-ins that do not use non-na-
tive DSP units. With native plug-ins, I do not mean it negatively when I say that
they require a lot of processing power. When this is the case, then the price we
pay is simply more offline processing steps. Some of the plug-ins mentioned are
included on the DVD as demo versions. If needed, check on the Web sites of the
corresponding manufacturers to see if there are more recent versions available
for download.
With the variety of available plug-ins, I do not want to create a buying frenzy nor
give you a feeling that you are lacking a specific type of plug-in. My aim is rather
to incite you to critically examine individual components in terms of quality and
to put together a small number of tools that you have not only mastered how to
use, but to do so at the right time and place. In the end, you do not need any more
than a few dozen plug-ins that you can use efficiently by knowing them well. Take
the time to separate the wheat from the chaff in your plug-in folder so that in the
future you only use devices that lead to satisfying results!
Often in online forums I am criticized for not talking about native or freeware
plug-ins and that my methods require an investment in DSP hardware. My first
concern is to show you how to achieve professional results using your DAW. I
cannot justify ignoring high-quality plug-ins simply because they usually require
EQS
You can use Sound Example #11 to test your EQs. Insert your various EQs one
after the other and find a good setting for one of them. Then transfer this setting
onto the others and compare the sound of your plug-ins using the bypass switch.
In my testing, I have used the snapshot function of the Pinguin graphic analyzer
and pink noise in order to create the closest possible EQ curves between the plug-
Sound Example
ins, also for those with limited parameter choices. I put the pink noise through Universal Audio Cambridge-EQ
#11 the first EQ that is already set, then froze the frequency curve on the metering,
and then adjusted the other EQs to this curve.
Universal Audio Cambridge EQ (UAD-1: optional)
The most aurally apparent difference in quality of EQs can be heard with extreme
The 5-band full parametric Cambridge EQ is very similar to the Oxford EQ
high-frequency boosting. The palette goes from harsh to silky-soft. All other
from Sonnox for PowerCore in terms of sound character. The difference lies in a
differences in sound that go from subtle to distinct can be described with very
slightly different focus: it is flexible with regards to the cut and shelving filtering.
flowery language. Decide for yourself, what is pleasant and natural and what is
There are 19 different filter variations in the cut filter section to choose from, for
artificial, harsh, and sounds like filter noise. With strong bell filtering, you can
example Bessel and Butterworth. The E6 lowcut stands out from all other lowcut
hear the various sound characteristics of the different “circuitry” in the different
filters that I know, making it a very valuable tool for removing unwanted low fre-
EQ plug-ins. In the following descriptions, I discuss the most visually and aurally
quency noises, especially when the desired signal is itself very low. The extreme
apparent characteristics of each plug-in.
slope of the E6 is paid for with phase shift and therefore can tend to color the
sound, so that you should consider carefully which lowcut filter is best in a given
situation. All five bands can be switched into shelving mode, giving you three
different modes for different resonance peaks (overshoots). This analog-inspired
EQ is a flexible high-class multi-purpose tool.
204 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 205
The Precision Equalizer creates silky highs through internal oversampling.
206 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 207
Along with using the Pultec for mastering, it is also good for vitalizing vocals and
pumping up weak bass drums. The bass boost at 60 or 100 Hz is very special and
gives powerless drums the foundation they are lacking.
The UAD-1 Pultec Pro is a Pultec that is augmented with an MEQ-5 – a three
band mid-range EQ. Two bands are for boosting fixed frequencies and one band
Neve 1073 Channel Strip EQ
is for attenuation. The MEQ-5 is useful for boosting the upper mid-range at 4
kHz and at the same time the air band. Here, the basic version is missing a band.
Universal Audio Neve 1073 EQ (UAD-1: optional)
The 1073 has a lot of sound for so few controls: the input level control on the
left is followed by a highshelf knob with a range of plus/minus 18 dB at 12 kHz.
The semi-parametric adjustment of the mid-range has a frequency-dependent
Q-factor that increases with frequency. The low shelving filter is followed by a
lowcut with a slope of 18 dB per octave. Five mono EQs need the power of an
entire UAD-1 card, while the more economical SE version allows 16 mono EQs
per card. This EQ is good for velvety-soft and, at the same time, discrete highs:
vocals, acoustic guitars, strings, and a lot more. It really peps up dusty or medio-
cre recordings.
208 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 209
use the “E-button” in the EQ section. The filter section is generally very flexible
and sounds pleasant, soft, and British. For this test I mixed an album with the
Duende. That certain British SSL charisma is pleasant and full of character in the
final mix.
210 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 211
section. Since such complex devices tend to be unintuitive, I recommend first
getting to know the dynamic EQ section only at the beginning. This highly com-
plex EQ is a top-level tool for difficult situations and makes you really want to
experiment! The Dynamic EQ is very good for processing the sum and for mas-
tering. Although not an everyday EQ because of its complexity, the Dynamic EQ
is certainly a special tool for particular situations.
Another special feature of the Dynamic EQ is a switchable – although CPU-hun-
gry – linear phase circuit.
212 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 213
as a high knob. This is a very spartan parameter set, but allows you to obtain the in over display never lights up. The Oxford EQ gives you a
desired warm-sounding results quickly. For Sound Example #11, which requires choice of four different filter types that are chosen for all 5 Sonnox Bit-Depth
P
a great deal of intervention for saving the recording, the VoiceStrip does not offer frequency bands at the same time. The default setting (type lease note that the Sonnox versions
enough fine processing possibilities. The dB display in the Lo-Gain setting seri- 1) sounds very digital and analytical because of the decou- for ProTools and PowerCore are
ously contradicts the values that are entered. In such cases, a simple numerical pling of gain and EQ, inspired from the SSL4000 series EQs working in 24-bit fixed point and
Sound Example readout from -10 to +10 would be a less irritating solution. (for “E-Sound”, see SSL Duende channel strip above). The the native versions in 32-bit floating
#11 type 3 setting establishes a dependency between gain and point (both in double precision). (Cross
Q-factor (well-known in the analog world), and is inspired reference): Read more in Chapter 2 under,
from the character of Neve filters. The type 4 filter setting Incompatibility Between UAD-1 and
is so gentle that it is not appropriate for any precise opera- PowerCore.
tions, but is well suited for boosting highs, for example. The
sound is pleasantly silky and warm. For vocals processing I prefer the type 3 filter
because it provides a particularly pleasant and open sound. The cut filters can
be adjusted in 6 dB steps to 36 dB per octave and have an overshoot in the cut-
off frequency area that is controlled with the Q parameter. For a standard EQ,
the lowcut is good, but does not attain the slope of the E6 lowcut filter of the
Cambridge EQ. The LF and HF bands can also be used as shelving filters with
various controllable overshoots.
Sonnox Oxford EQ
(optional PowerCore and native) The classic among general-purpose EQs
The Oxford EQ from Sonnox (formerly Sony) is adapted from the famous Waves Renaissance EQ (REQ4)
Oxford mixing console. Like all PowerCore plug-ins, processing is at 24-bit reso-
Since the Renaissance bundle is widely used, I have chosen the REQ4 with four para-
lution. This means that it is sensitive to overs coming from 32-bit signals. You can
metric bands as an example of a channel EQ. Bundled with smaller and bigger EQs to
solve this problem by using level reduction upstream, so that the internal plug-
214 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 215
choose from, the CPU power requirements can be matched to the needs of the prehensible frequency band limitation of the bands to choose from. If the lowest
various bands you need to deal with. Included are low and highcut, bell, and band is already used for a lowcut filter, the second lowest band only goes down
shelving filters to choose from, making the REQ a universal tool. In the high to 258 Hz. This is often not low enough to eliminate problems in the thumpy
frequency ranges there is a noticeable difference between the Channel EQ and lower mid-range. Also, the idea of copying legendary passive mastering EQs with
the REQ, but still lacks the velvety character of an Oxford filter. Here there are incremental frequency settings is an unnecessary limitation for this EQ, which
limits. otherwise is very good.
Waves Linear Phase EQ Neutral-sounding high-end EQing with the phase linear PEQ-Orange by Algorithmix
Waves Linear Phase EQ (LinEq Broadband) Algorithmix Linear Phase PEQ Orange and Red
The advantage of linear phase EQs is their neutral sound. Analog filters always Beneath the Waves Linear Phase EQ it´s worth mentioning both of these excel-
have a slight phase shifting because of their physical structure, leading to desirable lent 10 band linear phase EQs from Algorithmix. They belong to a small group
or undesirable sound coloration, which can make up part of the sound character of professional native high-end EQs, and to a lesser degree to analytical and neu-
of a particular EQ circuit. Only in the digital domain is it possible to go beyond tral sounding tools, which allow working in the ultrasound regions with internal
these physical limits and to build phase-neutral – and therefore sound-neutral sampling rates of 384 kHz. The range of uses of these special EQs include – other
– EQs. Because of the large amount of CPU power required, it makes sense to use than mastering – extreme processing of single bands from individual tracks with-
such EQs in groups or in the sum and only by extreme EQ-needs as an insert in out the side effects of the sound-influencing character of otherwise typical phase
an individual track. When standard EQs create unwanted phase effects through effects. For example, more extreme bass boosting or subtle notch filtering is pos-
strong boosting or attenuation, this kind of EQ is a good choice. The huge choice sible than with standard EQs.
of special filters also suggests many uses. One clear negative factor is the incom-
216 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 217
Native all-rounder: the SC-517 Mk2 by Sonalksis
218 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 219
EQ: Conclusion
Even small nuances can make a world of difference in the sum. Choose your EQs
accordingly. Along with the difficult to describe details in sound, there are two
main characteristics that stand out with regards to EQ sound: the quality of being
able to produce silky highs on the one hand, and on the other, the sound of a bell
filter with strong boosting. The latter is important when you need to emphasize
the bass drum or tom, without creating an unpleasant filter sound. The phase
problems of EQs in the mix are often exaggerated and are no problem in reality.
Phase changes can even create pleasant side effects and create a certain charm.
Phase linear EQs are only appropriate for neutral-sounding material that should
be manipulated as little as possible. This may be the case when dealing with pop,
jazz, or classical acoustic recordings or when it’s necessary to strongly boost while
keeping a neutral sound. Generally, in order to cover all processing requirements,
we need a workhorse EQ for basic work (for example, the channel EQ in your
DAW), a universal parametric EQ with cut and shelving filters (for example, the
Cambridge, Oxford, SSL, or Sonalksis EQ – there are of course other creators of
good EQ plug-ins), a tube EQ like the Pultec, and a phase linear EQ. With four
good plug-ins – which you know well in terms of sound characteristics and filter
types – you can master the frequency domain.
Nuendo Channel EQ
Using Sound Example #11, a direct comparison between the Oxford or Precision
EQs and the efficiency-optimized Channel EQ clearly shows the weaknesses of
the Nuendo plug-in. The highs are harder, and the vocals don’t breathe and do
not have the assertiveness they need. Nevertheless, the Channel EQ does its job
quite well, making it useful for tracks that are less important than the lead vocals.
Sound Example The more extreme EQ settings and the more important the track, the more I
#11 recommend using “external” EQ plug-ins. According to Steinberg, the Channel
EQs of Nuendo 4 should sound much better, but I have not yet had the chance
to test them.
220 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 221
mid-range. In that case, another compressor should be used. If you want to create
COMPRESSORS extreme sustain effects (bass sound example), you would be better off using the
Judging and comparing compressors is much more difficult than EQs because Sonnox Dynamics compressor.
of the many different needs these devices fill. The “normal” single band com-
pressor – with its usual basic parameters threshold, attack, release, and ratio
– should be able to handle all the jobs described in the section on Compression
Exercises. Another category is the “beautification compressors” like the LA-2A
or the FA-770, which make the signal not only louder, but also particularly more
present and pleasant through discrete tube circuitry. I have purposely left out the
category of multiband compressors. They belong to mastering and not mixing.
Limiters are important to mixing, since for this purely technical task, it is impor-
The Neve 2254 Compressor emulated on the UAD-1 platform
tant to be able to use a good-sounding and properly functioning plug-in which
tends toward coloration as late as possible. Like multiband compressors, loud-
ness maximizers also belong to mastering. In these times of over-compressed Universal Audio Neve 33609 Master bus
mixes, I would nevertheless like to mention this category. Finally, I will mention Compressor/Limiter (UAD-1: optional)
the Sonnox Transient Modulator and the SPL Transient Designer on the UAD-1 The emulation of the classic British Neve 2254 compressor of the 1970s is im-
platform, which – as far as I know – provide unique transient modulation pos- pressively authentic. In addition to compression, it allows you to sweeten up the
sibilities in DAWs. sound the non-linear distortion, also known as warmth.
Please use the Compression Exercises with your own audio files to test your com- The cost of such a close copy of the original is extremely high DSP power us-
pressor plug-in repertoire in order to weed out and optimize what you use, so age. You can only use one single stereo plug-in per DSP card. For this reason, a
that you have a small choice of reliable devices which you completely master. DSP-optimized “SE” version is also included which lets you use up to 8 stereo
compressors per channel. In this case, you have to do without the details of the
charming distortion and the internal oversampling. It is worthwhile to resort to
offline processing in order to fatten up individual tracks such as lead vocals or
the drum group with some analog warmth. As long as you follow my advice and
do not over-compress the mix, so as to leave further dynamic processing to the
mastering engineer, you can use the 33609 perfectly well as a summing compres-
sor. Single band compressors are not designed for material that is already heavily
compressed; the result quickly tends towards “pumping.”
Universal Audio 1176 Compressor – the Original
Here are two tips for users who do not like to read manuals: the signal path of the
33609 does not follow the front panel design from left to right, but starts instead
Universal Audio 1176 LN (UAD-1: optional) – logically – first at the compressor on the right, whose output level manipulates
I do not need to say much about my favorite compressor, since I’ve already heav- the threshold of the limiter on to the left. At the lower right is a three-position
ily praised it in this book. It is a good choice for nearly all compression jobs. toggle switch labeled with the numbers 22, 18, and 14. Since digital emulations
The 1176 is well known for its “bite,” which is created by emphasizing transients. of analog devices does not allow the compressor to be operated when “hot” in
With bass guitars and bass drums, it can sound a little thin in the bass and lower order to create the desired distortion artifacts, this switch gives you the choice
222 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 223
between three different level settings, which simulate the compressor’s behavior
at three different input levels. With 22, the compressor is “hot,” while 14 is better
for a cleaner sound.
224 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 225
Waves C1 Compressor / Gate
226 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 227
Voxengo Marquis Compressor (native)
Like the Waves C1, the Marquis compressor sports a large number of sometime
unusual parameters, which makes reading the manual rather necessary. Despite the
variety of different parameters, you can accomplish many tasks quickly and intui-
tively. The extra functions provide extended processing possibilities for shaping the
release curve for fine-tuning the whole sound. There is also a phase linear mode.
The Marquis covers all the details of many devices in one and lives up to the term
“universal compressor.”
228 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 229
The legend lives on: The UAD-1 LA-2A is extremely close to the original
230 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 231
Magneto (native, internal Cubase and Nuendo)
The Magneto is also not a compressor in the modern sense of the term. The
Magneto is good for bringing in a bit of “analog” life to the digital environment
here and there. It also does not have the same degree of assertiveness as the LA-
2A. However, it is an easy-to-use sound improvement tool. It is well suited for
group tracks.
232 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 233
LIMITERS
Only in the past few years have audio engineers become conscious of the impor-
tance of staying under specific peak levels. People started to realize that digital
recordings, which are too hot, lead to a large number of artifacts during D/A con-
version as well as data reduction (DTS, AC3, MP2, MP3, etc.) processing. Since
digital fixed-pointed calculation can only deal with values up to full scale, only
virtual overs can be recorded, consisting of a chain of full scale bytes. Mastering
professionals consider three full scale bytes in a row as an over. Even this conser-
vative value does not guard against so-called “interleaved sample overs,” which
can only be effectively prevented when the signal goes through a higher sampling
rate (oversampling) in the limiter. This is exactly what a brickwall limiter does.
A brickwall limiter is therefore placed in the master bus, before any dithering oc-
curs and after all other processing steps. It also makes sense to keep an eye on the
operational range of the limiter. If more than just a few individual peaks need to
be reduced, then the input level should be reduced, since the limiter should only
have a technical, protective role. If the peaks (transients) are capped too much
or too often, then the result is unwanted loss of dynamics as well as undesirable
sound transformation.
234 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 235
A good limiter should have attack and release times that are so well adjusted that
TC MD3 Brickwall Limiter
transients are deformed as little as possible.
(PowerCore: optional, in the MD3-Bundle)
The excellent Sonnox Oxford Limiter goes a step further: if desired, it makes a The technical and sonic superiority of the MD3 Brickwall Limiter from PowerCore
new transient interpolation. This is an ingenious function that restores cut off level is the undefeated champion of the TC Electronic company. Like the UAD-1 lim-
peaks. After this little experiment you know which brickwall limiter you will use in iter, it cannot be overdriven, but tends to deform transients much later. Without a
the future. doubt, the PowerCore’s MD3 software is the top solution in terms of limiting and
multiband compression. The algorithm comes from the TC System 6000.
UAD-1-Precision Limiter
236 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 237
Nomad Brick Wall Limiter – no frills
238 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 239
Steinberg Loudness Maximizer shows that the L2 tends to create artifacts, distor-
tion, and changes in stage depth earlier than the Steinberg device. This is why you
need to be careful with dosage.
Waves L2
Waves L2 (native)
The Waves L2 belongs to the most widely-used loudness maximizers and is
quite intelligently laid out. Between the two main knobs there is a link switch ME Loudness Maximizer
that makes both threshold and output simultaneously reducible. Since the L2 is
level precise – in other words, 3 dB peak reduction is automatically made up for Steinberg Loudness Maximizer (native)
by 3 dB in the output – you save yourself the complication of doing a loudness- The ME (Mastering Edition) Loudness Maximizer from Steinberg / Spectral
corrected A/B comparison (this does not apply with very strong limiting). The Design is very easy to use and gives the user a quick idea of the amount of loud-
output ceiling fader works pretty much like a leveler; with increased loudness ness available. The “Possible Gain” display shows the amount of loudness that
through lower thresholds, level is reduced by the same proportion. This lets you can be obtained in decibels. When all parameters are in their default position,
easily find the limits of good taste, since when bypassed, the original sounds just this display is a sure indication for a sound, which would not be degraded, when
as loud as when processed and you won’t be fooled by a difference in loudness the LED is still green in the loudest passages. This way, you can see at a glance if
when you do an A/B comparison. After you have carefully adjusted the tolerable the loudness to be gained is acceptable and you can increase loudness without
loudness, you can set the output ceiling at an appropriate value. The L2 has an negatively changing sound, depth, and spatial information. The other parame-
advantage over Steinberg’s Loudness Maximizer: the ability to reliably adjust out- ters should be used with more care and require careful acoustic monitoring. The
put levels with the “output ceiling” parameter, so that you do not need to resort to amount of “Possible Gain” can be increased with “More Density,’’ but the resulting
leveling or brickwall limiting later on, as long as you have left 0.3 dB headroom processing can quickly make the Loudness Maximizer begin to audibly change
(see Mastering book). When using the L2 as an insert effect in a mix, don’t forget the sound quality. “Boost” also allows more loudness and makes the signal harder
to turn off the L2’s dithering, so that the signal does not continue on in the 24-bit and more transient-rich. The toggle switch lets you choose between a harder or
resolution format. softer sound character – you should experiment with this. Compared to the L2 or
The L2 also functions as a pure limiter, which does not affect the lower dynam- L3, the Loudness Maximizer tends toward undesirable changes in sound quality
ic levels at all (provided that you do not exaggerate). Blind testing against the a bit later. Unfortunately there is no adjustable “Output Ceiling” on the Loudness
240 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 241
Maximizer. Otherwise it can be used as a reliable limiter, safely limiting to minus
0.0 dB. In the mix it can be used in the master bus for checking the amount of re-
verb while temporarily increasing the loudness, or for moderately being applied
to the drum group track. Unfortunately, the Loudness Maximizer is no longer
for sale officially; with a little luck you might be able to find a “second hand” ver-
sion. It used to be sold in the Mastering Edition bundle and is the predecessor to
a plug-in of the same name, which is now included in the mastering workstation
“AudioCube”.
242 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 243
sion tracks as well as all transient-rich sounds.
REVERB
Keep in mind that transient processing can result in extreme peaks, which some-
In the chapter on “Dimension 3” I already mentioned the criteria for good (in
times must be eliminated later. For that you need an excellent limiter that won’t
terms of effective) reverb. Here I would like to examine whether – in a working
make the freshly pepped up transients dull. The Transient Modulator should be
context – the following reverb devices deliver the psychoacoustical room and
used later in the mix, after the level relationships are already established, so that
depth information with their presets.
the modulation can be adjusted in accordance to the track level, as well as any
limiting that might be necessary. For testing your own choice of reverb units, take a mix that needs a small room
for drums or percussion and a large room for the vocals. Insert all the devices to
SPL Transient Designer (UAD-1: optional) be tested in the corresponding effect channels. In order not to overtax the CPU,
switch on one device (per chain) on at a time. Now look for rooms in all the re-
In the hardware world, the SPL Transient Designer is already a classic in this
verb plug-ins, which should be good for the tasks, sound similar, and have similar
special category and is supposed to sound even better on the UAD-1 platform. At
parameter value ranges. Using a short, 4 or 8 measure long loop, compare the
the laboratory and test studio Universal Audio, I was convinced of this. The two
perceived spatial information of the various reverb devices by switching them on
parameters Attack and Sustain make it easy and intuitive to control transients
and off. Make sure that the reverb return level is the same in all devices in order
and are like a refreshing drink during a summer heat wave. This great plug-in
to make a fair comparison!
makes the list of must-have hardware devices even shorter!
Surprisingly, you can hear considerable differences in stage depth. The layering
exercise can also be used for this test.
244 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 245
post pro library. Here you can find discrete rooms that make an inconspicuous the impulse file representation. Here you can make the reverb tail duller as time
but effective layering possible. The device features true stereo processing which goes on. Or you can make the reverb tail dull at the beginning and then become
has become a standard even in mastering (for true stereo, see SIR2). brighter towards the end, as if the reverb effect quickly approaches the listener.
This is an interesting parameter for experimentation. The amplitude of the im-
pulse file can be shaped as well (red line) and the reverb can be processed with
EQ. The SIR2 creates impressive spatial depth and provides good, small “ambi-
ence” rooms for discrete layering.
Cubase RoomWorks
Faltungshall SIR² from Knufinke Among the large number of plug-ins that I have been able to examine, RoomWorks
by Yvan Grabit and Charly Steinberg stand out. It is pleasing to see that there are
SIR2 many small rooms included, which sound good and make intelligent layering
In the first edition of this book I demonstrated my enthusiasm for the W2 reverb possible. However, the good reverb quality has its price in terms of CPU pow-
from Wizoo, which unfortunately is no longer available. Now there is a worthy er, which nevertheless can be reduced with an “Efficiency” knob (smaller value
replacement similar to that of the W2; though the SIR2 is not a mixture of convo- = higher CPU needs). When the export button is on, then the highest possible
lution and algorithmic reverb. The SIR2 convolutions reverb plug-in by Christian quality will be used for the mixdown. The simple interface which is not graph-
Knufinke is a real stereo reverb device with four channels. Both input channels ics oriented (for a change!) is also nice to work with. Spatial information is also
L/R are convoluted with two (altogether four!) impulse files and then mixed back transported well with small doses of reverb. RoomWorks can also be used as a
together. In the diagram you can see the “true stereo” routing. surround reverb unit.
246 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 247
The UAD-1 RealVerb’s big brother: DreamVerb
Plate 140 – the legendary EMT plate reverb on the UAD-1 card
248 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 249
metal plates that transmitted the signal via induction. At the time, spring reverb
technology was the usual fare. The software version contains three completely
independent plate reverb devices that can be controlled from one unit, and
are enhanced by a number of modern functions in order to address today’s
operational needs. The wonderful, soft sound of the EMT 140 has already been
emulated in many later reverb devices. Now the “original” is available with its
unusually thick and full sound. The Plate 140 creates credible depth and is a joy
to the ears. I like to use this reverb for backing vocals and guitars.
250 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 251
Digital Vintage Reverb – the EMT the “Space Heater” for DAWs The NonLin2 is best suited for special effects and sound design.
252 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 253
Waves IR1 convolution reverb
254 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 255
The Nomad BlueVerb has a vintage design.
256 CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS CHAPTER 10: OVERVIEW OF RECOMMENDED PLUG-INS 257
OTHER PLUG-INS
Under the category “other plug-ins” come all send effect devices like delay, chorus,
flanger, and filter effects. There are so many of these that it would go beyond the
scope of this book to describe them, let alone to find common criteria for evalua-
tion. If the large selection of plug-ins in your host application is not enough, then
please refer to specialized magazines, books, and other publications, or on the
Web sites provided in the appendix.
I am pleased that you have read this book with interest. Have fun using the tips
and advice in your productions.
The tutorial DVDs of the same name dealing with mastering and mixing are a
perfect multimedia addition to both books. For more information, go to www.
All of the suggested ways of working are not always suitable for all users or all proworkshops.de or sign up for the newsletter. On the accompanying DVD you
production situations. Find your own working style that might be a mix of your will find a QuickTime trailer in the “Trailer” folder.
methods, my methods, and methods of other engineers and producers. Recently
I wish you enjoyment, success, plenty of good work, and happy clients!
a successful American producer and engineer explained to me that he always
starts his mix with the vocals and never switches them off until the mix is done. I Friedemann Tischmeyer
have described an entirely different way of working, but he is of course also right.
As I say: find YOUR way of working.
Another decisive factor in terms of the quality of results is beyond our control.
Remember that a good production starts with a good composition and that the
arrangement, the quality of the recording, and the musical performance itself all
determine the quality of the entire work. If one day you have the feeling that you
have followed all the rules of your craft while using good equipment and you still
haven’t achieved the desired results, then change something that occurs before
the mix. Keep in mind that a good mix can reveal deficiencies in the arrange-
ment. If the wrong vocals disappear in a mush of sound, you can bring them
right into the ears with a clearly structured and well-layered mix, but you give the
untrained ear a (subjectively wrong) impression of a bad mix. Then it becomes
necessary to extend your mixing capacities into musical ones and correct the ar-
rangement. Don’t underestimate ear training, even though it might seem as if it
has become very unpopular.
You are welcome to sign up for the newsletter mailing list.
Along with my activities as a sound engineer specialized in mastering, I also offer
private and group workshops for individuals and companies in my studio near
Hamburg, Germany. The group courses are limited to 6 participants, so that each
person can take part and really hear what is going on. In the workshops it is also
possible to use examples that could not be included on the accompanying DVD
or tutorial DVD for licensing reasons. It is particularly engaging to combine a
mixing job with a workshop (Mix & Learn), where the first 1-3 tracks of your
album are worked on together. This gives you enough experience and know-how
to mix the rest of the tracks in your own studio after being helped out at the be-
ginning.
ACKNOWLEDGEMENTS
I would like to thank all of the designers and programmers who are responsible
for the development of all these wonderful tools; for their friendly assistance
and answers to my questions. A special thanks to my wonderful wife Angela for
proofreading and for the many great suggestions as well as her loving support for
this book, and my many other projects.
Adlib / ad libitum
The term “ad libitum“ comes from classical music and indicates the
musician should add his or her own ornamentation and/or ideas to
the melody. In pop music it refers to variations of the actual melody or
improvisation that is added when the refrain is played for the umpteenth
time.
AES/EBU
Audio Engineering Society / European Broadcasting Union. AES/EBU
refers to the symmetrical digital signal standard established by this
organization for data transmission via XLR plug and 110 Ohm cables.
It is the professional equivalent of the S/P-DIF standard.
Ambience
Room or environment reverb. Ambience tracks are often recorded
in studios with good-sounding recording rooms. These tracks are
created for capturing room acoustics. Distinctions are made between
closed, mid, and far ambience. Far ambience is often recorded using
boundary layer microphones (PZM). In reverb devices, presets in the
“ambience” category often imitate discrete recording rooms.
Amp
Amplifier
D.I. Box
Direct injection box. This is a small device for converting impedance,
creating balanced signals, and removing grounds from audio signals.
D.I. boxes are used in studios to convert signals from guitars, basses,
or keyboards into symmetric line-level signals suitable for mixing
DC-Offset displayed on the Pinguin Metering consoles or preamplifiers. Ground removal eliminates humming which
often occurs on the stage through the use of different electrical circuits
(wind noise, breathers), synthesizer sounds from analog or plug-in that have different phases.
instruments, as well as string sounds from the most costly sampling
libraries. Sometimes the fault lies in faulty electronic circuitry or defect Flip
cables. In the following illustration you can see DC offset from the Switching function built into inline mixing consoles, which switches
vocal track pictured above (between takes) displayed on the Pinguin (“flips”) the input assignments between inline and main channels.
analyzer. The low frequency signal cannot even be heard on a high-end
audio monitor system. FOH
The lower the interfering signal the closer it is to 0 Hertz, which
represents direct current (DC). Front of house
Effects of DC offset: This is the mixing position for live concerts.
DC offset signals do not have a type of tonal or other relevant content Gate / gated
and “mush up” the bass frequencies. Power amplifiers and (sub) woofers
The short form of noise gate. A gate is a dynamic processor that –
require a great deal of energy to reproduce these unnecessary signals.
ReWire
Internal interface for computers that allows audio signals to run
between two applications running in parallel. The program Reason by
Propellerheads was the first application to have a simple connection
to Cubase. Since ReWire is an open standard, it is now a standard for
computer-internal audio communication.
S/P-DIF
Sony/Philips-Digital Interface is the name of a digital stereo data
transmission standard for consumer devices. Transmission occurs at
least with 16-bit resolution, but can also go to 24 bits. Data transfer
occurs over either an unbalanced 75-Ohm co-axial cable and RCA
plugs (also known as IEC 958), or over optical glass fiber cables
(Toslink).
Sustain
Time during which the signal remains audible before attenuating
(release).
Transients
Transients are very quick and impulse-like electrical or acoustical
signal attacks or beginnings. Transients often provide the acoustic
information necessary for our ear to identify certain natural sounds
such as a bowed string or piano. Particularly percussive sounds are
characterized by short, energy-laden attacks and are therefore rich in
transients. Transient modulation makes it possible to either expand
or compress transient dynamics (SPL Transient Designer / Sonnox
Oxford Transient Modulator). The main use of transient modulation
making transients more fresh, i.e., expanding them. When too strongly
or badly limited, or signals over 0 dB in fixed-point calculation,
the natural waveform of transients is destroyed, making the sound
unnatural beyond these levels. Special interpolation algorithms exist
to prevent this kind of damage by restoring the original waveform.
Along with restoration software tools, the Oxford Limiter also has
such a function.
2) Miscellaneous
Bass Sequence
Frequency Chart
Links
Summing Unit Test
Trailer of Tutorial DVDs
http://www.rme-audio.de technical information on digital audio http://www.tcelectronic.com PowerCore plug-ins /Air Monitor Software
Online shop:
North & South America: www.tischmeyer-publishing.com
Europe, Africa, Asia & Australia: www.tischmeyer-publishing.de
To inquire about rebates offered to universities and audio schools
please feel free to contact [email protected]