I’ve said it before and I’ll no doubt say it again, but the way people engage the services of voiceover artists is changing. Once upon a time a VO would always be employed for a job in conjunction with a production studio, so all the VO would have to do is the actual voicing bit. More and more now the production studio or producer is sidelined and the client approaches the VO direct and expects them to record their audio at their home studio and do all the editing and processing themselves. This is obviously a whole new set of skills that voiceovers need to learn and master if they want their business to be a success.
Over the time that Bee Productive has been in business we’ve offered training to VOs to help them get on top of the new skill-sets and a few common themes have come to light. So this month I want to take a quick look at some audio pointers that are particularly pertinent to jobbing VO artists. Needless to say if you want further clarification or to have some one-to-one training on producing your voiceovers then get in touch.
Resolution- The first thing to consider is the format that we record to. Most DAWs (Digital Audio Workstation – the software environment that we use to capture/edit/process the audio) will record to either a wav or aiff file. We should always make sure that we are recording to the highest resolution that we can reasonably manage. Wav and aiff files can very massively in quality depending on what resolution they are. There is a huge difference between an 8KHz 8-bit wav and a 48KHz 24-bit one. Here I pause to explain some more terms.
Bit depth and sample rate- These are terms that we should be paying close attention to. They totally dictate the quality of the audio that we produce. But what are they? In short they are the X and Y axis of a graph. The X axis is time and this is measured in hertz or kilohertz. In terms of physics this is a number of cycles per second and is used in music to denote pitch (concert A is 440Hz). In digital audio it states how many times a sound wave is measured per second. So as we progress along the X axis (pass through time) a reading is taken of where the sound wave is on the Y axis. The Y axis measurement is a binary reading and is known as the bit rate of the audio. A 16-bit reading has 16 digits, a 24-bit reading has 24. The higher the bit rate, the bigger the dynamic range (the space between the loudest and quietest bits the system can manage). The combination of these 2 qualities gives a higher or lower resolution of sound as the wave is plotted out on the ‘graph’. In lower resolution sound the wave is drawn with lots of square corners rather than the smooth line that would be the analogue wave and therefore is a less accurate replication of the original. The difference between low resolution and high resolution audio is the equivalent of the difference between Disney/Pixar animation and teletext pictures.
Back to thinking about resolution –
My first point was that we should make sure that we’re always recording to the highest resolution we can manage. Hopefully now you will see that better resolution will produce better audio, but the flip side to that is that it takes more processing power from our computers. So as a default position I’d say it’s always best to record in the highest possible resolution that we’re likely to be asked for. To convert audio to a lower resolution is perfectly fine, but if you convert up you will not – I repeat, you will not – improve the quality of the audio, you’ll just make it compatible with whatever system it’s going to be used on. I would recommend that you when you open a new file you set it to 48KHz and 24 bit (or 32 bit float if you can – I won’t go into that here as it’s quite complicated). Once you’ve voiced and processed it depending on your client you can either just save it at whatever resolution and format they have requested from you or – if you think you may need to re-use the audio or visit it again for whatever reason – save it at that full resolution then ‘save as’ whatever format your client requires. that way you can easily re-work the original recording if you need to without compromising quality.
Levels- In the olden days of analogue when everything was recorded to tape it was important to push the levels as much as possible to maximise the dynamic range of the tape and hide the tape hiss behind audio you wanted. This is still widely done with digital audio, but there really in no need to. A good reel of 2″ tape has a dynamic range of 50 – 60 dB. this means that between the tape hiss that constitutes the noise floor and the point where the tape if overloaded and distorts there is 50 – 60 dB of room for the program material you want. With 16-bit audio you get 96dB and with 24-bit you get 145dB. Seeing that the dynamic range of the human ear is somewhere around 136dB I think you can see that there is plenty of room in a 24-bit system to not have to push the levels to the max to get the best out of the audio. In fact the thing you really have to avoid is digital clipping – when the level of the signal gets too loud. The loudest level of any digital system is 0dB and all signals are measured in minus numbers, so I would advise that you want to record your voiceovers so that the highest level is between -12dB and -6dB. This is plenty high enough and it still gives you lots of headroom for further processing as every process will change the waveform and may increase the levels.
One final thing –
Phantom Power – As a professional VO you will use a condenser mic that has to have the phantom power activated or it doesn’t work, but I’ve spoken to quite a few VOs who don’t know what it is or why they need it. Phantom power is activated on your desk, audio interface or preamp by a switch that will either say ‘phantom power’ or ‘+48v’ and by switching it on you will be putting a current of 48v down your mic lead to your condenser mic (or if you’re using a mixing desk with one universal switch to everything plugged into the XLR inputs on your desk). Condenser mics are also known as capacitor mics, and that gives a small clue as to why phantom power is necessary. A capacitor is an electrical component that stores a small amount of electricity, the capsule of your mic acts as such a component. The capsule consists of the diaphragm and a backplate. The diaphragm of your mic carries a charge across it and as you speak the diaphragm vibrates and the gap between it and the backplate changes. This has the effect of altering the amount of charge the ‘capacitor’ can hold and as it fluctuates an electric current is generated which passes through the mic and sends the signal created by your voice vibrations into the rest of your set-up. Simples!
The thing you need to remember about phantom power is that it can create a bit of a surge through your system, so you should take care when switching it on and off that your speakers and headphones are all muted or switched off. You also need to take care that all your mics are plugged in before switching it on or off as you can put a permanent charge across the diaphragm of any mic regardless of whether it’s a condenser or not by plugging things in or out badly. And once your mic’s got a permanently charged diaphragm it’s a write-off.
That’ll do for today. If you’ve found this useful let me know and I’ll do some more tips at a later date.