Real or Surreal?



What is (are) the distinguishing feature(s) of a high-end audio system? To some it the retrieval of each and every musical detail. To others, it is flat frequency response. To still others, it is full-range capability. To some, it is bass which rocks the foundation. Then there are those who feel that the relevant criterion is the ability to play loudly with minimal distortion. Last but not least, there is a school of thought - to which I adhere - which believes that the raison d’etre of high end audio is something less tangible, namely the ability to communicate the emotional content of the music  - a property we might call musical realism. (Musical realism doesn’t seem to correlate with any easily measurable parameter.)

I am indebted to Jonathan Halpern of Tone Imports, discussions with whom provided the impetus for this article.

A few years ago, in an article published prior to my joining Stereo Times, I addressed the issue of “truth versus beauty” in high-end audio. (stereophile) I noted there that designers often intentionally deviate from apparent accuracy to the source in the hope of obtaining a more pleasant sound, and I speculated that this might be to compensate for (as one example) the harshness of digitally recorded music, or to provide better balance in a speaker that is not full range (i.e., the so-called “rule of 400,000”). A variant of this phenomenon is what I refer to as “hyper-realism,” which is characterized by being “over the top” and “realer-than-real.” Hyper-realism has become so prevalent in high-end audio as to become something of the norm.

In this article I will discuss some of the ways in which hyper-realism manifests itself, and will attempt to shed some light on why it has become so pervasive.
To understand what I mean by hyper-realism, video analogies might be helpful. A few years back go I entered the world of high definition with the acquisition of a 1080p projector and hi-def cable box. As is undoubtedly common amongst first-time hi-def viewers, I spent the first few weeks skipping from channel to channel, ooh’ing and aah’ing at the incredible resolution and color palette. By chance I stumbled upon CSI Miami, and was immediately riveted (by the image; certainly not by the acting or story line!). The colors seemingly jumped off the screen at me in a way I had never before witnessed; blues were bluer than I knew possible, as were the reds and the greens. (Like being on an LSD trip, as a friend later described it.) I couldn’t take my eyes off the screen, despite the clear realization that the image was quite unnatural
.2

2 - In his review of the Oppo DV-970HD Universal player (Stereophile, May 2007), Wes Phillips stated “...I was reminded of the difference between video and film. Video always looks realer, whereas film always looks better.” Of course, Wes was comparing two varieties of recorded media, not a recorded medium to reality, but it does remind us that in video, as in audio, beauty can sometimes be preferable to truth.

I initially though my projector was mis-calibrated but soon learned that the show is intentionally shot in that manner. But in fact, oftentimes video displays are the culprits. It is well known that the factory settings for brightness and contrast on televisions are way too hot. Conventional wisdom (urban legend?) says that this is a marketing ploy, designed so the sets catch the eye of consumers even in brightly lit stores. In and of itself this is no big deal; unfortunately however, many consumers are unaware of the out-of-whack settings, and fail to properly adjust the displays once they get the set home. Many of us with an interest in video have had the experience of going to the home of a friend or colleague and, upon noting the misadjusted picture, offer to properly adjust it for them, only to have them complain that the picture now looks drab and lifeless. (One could similarly draw an analogy to those who heavily salt their food.) The good news is that with time, viewers often become accustomed to the proper video adjustments (ditto for food not drowning in salt); the bad news is that it doesn’t happen overnight and moreover, one has to have the realization that the contrast/brightness/salt are masking the true “flavor” of the medium. That is, one has to be willing to deviate from their comfort zone to discover what one was missing.

Returning to audio, hyper-realism comes in many guises. Let’s begin at the beginning: the recording studio. Some of the best, most realistic-sounding recordings were made in the 1950s. While there are undoubtedly numerous differences between those and modern recordings, one especially important factor contributing to the earlier recordings’ magic is the use of minimal-mic'ing techniques. Nowadays, many (and probably most) recordings are made with multiple microphones, often more than one per instrument, positioned in very close proximity. Because of this proximity the mics' “hear” the instrument very differently from the way we would. In the case of string instruments (as an example), this arrangement highlights the strings and minimizes the contribution of the instrument body, resulting in a sound that is exaggerated in some sonic attributes, and lacking in others. Using multiple mics in this way makes the situation ever worse, as the sound of the instrument is effectively broken up into sections, resulting in a disjointed sound.

Close-mic'ing also exaggerates the soundstage, a topic I have discussed elsewhere (www.dagogo.com). First, close mic'ing drastically distorts image size; we have all experienced pianos that seemingly extend across the room, and I can’t begin to count the number of times I walked into a room at an audio show and was greeted with a ten foot Diana Krall. Closely mic'd recordings alter the soundstage in other ways as well. For example, they artificially enhance the space between the instruments, and - in conjunction with modern speakers - present etched borders that, in my experience, never exist in live music.3 While it is annoying that such artifacts exist on the recordings, the far greater problem is that too many audiophiles have come to regard them as normal and worse still, crave them. Thankfully, there is a resurgence of minimal mic'd recordings, such as those from Unipheye Music (unipheyemusic; see stereotimes) and Mapleshade Records (mapleshaderecords).

3 - Until their recent closure, I regularly attended jazz and chamber music performances at the Bosendorfer NYC showroom. The performance area is approximately 20 feet wide, and the performers often stood close to one another. It occurred to me that if the soundstage were accurately reproduced on a recording and playback system, many audiophiles would complain that it was too small.

Another example of hyper-realism relates to resolution. The Stereophile glossary (www.stereophile.com) defines resolution (also called definition) as

“That quality of sound reproduction which enables the listener to distinguish between, and follow the melodic lines of, the individual voices or instruments comprising a large performing group.”

While audiophiles agree on very few things, there seems to be a nearly unanimous consensus that higher resolution is a desirable property of audio systems. But far too common are systems that merely give the impression of resolution. Such systems are often spoken of as being ruthlessly revealing but, as my friend Jules Coleman has pointed, this is in fact a misnomer in that they are actually the antithesis of high resolution. False resolution typically results from a system that emphasizes certain frequencies or frequency ranges, or the notes’ leading edges (more often than not a function of the amplification). While these are most assuredly ear catching, they are of course ersatz and once recognized as such, unrewarding.

Last but not least is bass, without which no discussion of hyper-realism would be complete. Good bass can be a wonderful thing, and we all know that achieving it can be especially difficult because of room interactions. That however, is separate and distinct from bass hyper-realism, which comes in two flavors: quantitative and qualitative. On the quantitative side we are confronted with those individuals who want bass that flaps their pants, pounds them in the chest, shakes the house and rattle the rafters. The popularity of action/adventure movies, filled with explosions of every shape and form, might have something to do with it, but that is just a hunch. I once asked a friend - a well-respected home theater enthusiast - why fans spend so much time and energy on bass. His reply? “Because they can.”

On the qualitative side, things are equally odd. Many audiophiles want bass that is “tight” to the nth degree, with zero overhang. In over three decades of attending live performances, I have never heard bass with those properties. By analogy with the situation with soundstage I mentioned above, I firmly believe that if the bass from a live venue - be it jazz club, concert hall or stadium - were accurately reproduced, many audiophiles would be dissatisfied with it. This of course raises the question, “what exactly is it audiophiles want?,” a topic beyond the scope of this article. I can’t help but feel that far too many audiophiles have lost sight of the fact that the double bass (and electric bass) is a musical instrument, not a device for demolition or the generation of sound-effects.

Having outlined some of the ways in which hyper-realism manifest, it’s time to turn to the $64,000 question, namely, why is it so common in high-end audio? There are undoubtedly numerous causes, and I don’t claim to know them all. I will however address a few that I feel are especially important. The first factor relates to what I refer to as the “supersize me” mentality. The most obvious manifestation of this phenomenon of course pertains to the obsession with food, evidenced by the quantities of food served at eateries (especially fast food restaurants and movie theatres), and the near-epidemic proportions of obesity (and concomitant heart disease). Food is no longer just for sustenance, nor just a treat for the taste buds; rather, it has become an addiction for those for whom satiety is nearly unobtainable. The desire for “more, more, more” extends beyond food to many aspects of life, including recreation and entertainment. Whereas folks once turned to books and radio - activities which required imagination, and thus active participation - far too many people now want - and sadly, need - constant bombardment of the senses, be it from a telephone, or gimmick-laden surround sound movies.

Another reason for hyper-realism is audio-related, specifically having to do with dynamics. While I never doubted the importance of dynamics, its primary importance became clear to me in, of all places, the New York City subway system. In many stations, especially the busier ones, a variety of musicians - some quite talented - perform on the subway platforms, offering weary commuters a much needed respite from their hectic days. The subways are a cavernous space, filled with massive echoes and all manners of background noise, from rumbling trains to children crying. And yet, even when the musicians were out of eye range and in fact, a considerable distance away, I could tell immediately that I was hearing live music. What was it about the music that made it so readily identifiable as being live? Surely it wasn’t the tonal quality, as this was massively distorted by the time it reached my ears. What was making it through all the din was the dynamics. By dynamics I mean not only the ability go from loud to soft but the rate at which music does so; indeed, what clearly identified the music as being live was the notes’ rapid rise times. It really was an eye (ear) opener, and dramatically changed my views on what matters most in musical reproduction. I am of course hardly the first to describe the primary importance of dynamics.

     

Sadly, and somewhat ironically, dynamics is the area in which most modern speakers are weakest. My realization of this occurred not in NYC but a few thousand miles to the west, specifically Las Vegas, at CES 2003. I remember the fateful moment as if it happened yesterday: I wandered unsuspectingly into the Damoka Audio room, whereupon my eyes were struck by speakers with the largest baffles I had ever seen, measuring greater than 6’ in both height and width. Contained within each baffle were a horn and two large woofers. I later learned that these behemoths were Klangfilm Bionors, which David Karmelli had painstakingly dissembled, shipped across the country, and reassembled solely to for our listening enjoyment, as they were not for sale. I had no idea what to expect so I took a seat and listened. What greeted my ears was reproduced music unlike any I had heard previously. The distinguishing feature was not the tone (though this was quite good, due in no small measure to the associated Lamm amplification), nor the frequency extension (which was limited), nor the soundstage (which was odd, to say the least) but rather, the sense that the music simply flowed from the speakers, unimpeded. After listening to the Klangfilms, virtually all other speakers - including the very large, top-of-the-line speakers from the world’s most respected manufacturers - seemed to be running with the brakes on.

Soon thereafter I learned that a number of other speakers - predominantly high-efficiency, in particular horns - also displayed breathtaking dynamics. To say the least, it was disconcerting to realize how severely dynamically compromised were all the speakers I had heard, and enjoyed, to that point.
So what does this have to do with hyper-realism? Just this: I believe that hyper-realism is frequently employed as a means to compensate for the lack of dynamics which characterize many modern audio systems. Speakers (in particular, though electronics can also contribute) with poor dynamics drain music of its vitality, its energy, its life. In response, designers crank up the treble or presence region, while listeners often crank up the volume “to 11” in an attempt to bring life to the music. Unfortunately, exaggerated detail, while initially ear-catching, is a poor substitute for true dynamics.

Oddly enough, the dynamic limitations of most speakers is rarely acknowledged in mainstream audio. It is not entirely clear whether this is due to ignorance or to a state of denial, though I suppose it is a combination of the two. It seems that nearly every speaker review praises its dynamics, despite the fact that, at least to my ear, this is often the speaker’s weakest property. A few months ago I read a review in a prestigious print magazine, of a monitor speaker. The reviewer spoke about the speakers’ excellent dynamics, commenting on how there were no dynamic limitations up to (if memory serves) a listening level of about 94 dB. Presumably, what this reviewer (and many others like him) meant was that the speakers had no overt distortions up to that volume, but this is to miss half the point. The reviewer was focusing on distortions of the commission variety, whereas dynamic compression is a distortion of omission. That is, the system is truncating information, rather than adding it. A musical signal is not a sine wave; rather, it comprises multitudes of tiny peaks and valleys. A system with dynamic compression cannot move quickly enough to reproduce all those speaks and valleys, so it in effect smoothes out the signal. Contained within those tiny hills and valleys is the musical detail; thus a system that smoothes them, lacks resolution. To compensate for the loss of actual resolution, designers create systems with artificial resolution - hyper-realism - which brings us full circle.

Is hyper-realism just another example of truth over beauty? Alas, I think it is not. For one thing, I do not regard hyper-realism as in any way beautifying the music; if anything, it is the antithesis of beauty in that it often contributes to listener fatigue, and fosters a dependence on excess. Another fundamental difference has to do with the “why” of hyper-realism. As discussed in my earlier article, certain deviations from accuracy are responses to inherent limitations (inherent that is, to today’s technology) in the recording or playback chain. Hyper-realism, in contrast, is used to compensate for a weakness that is in fact easily correctable. In medicine, certain diseases are classified as iatrogenic, which is defined as “induced inadvertently by a physician or surgeon or by medical treatment or diagnostic procedures .” So it is with modern audio; in trying to improve certain parameters, designers sacrificed other parameters that are more important. In essence, the baby was thrown out with the bath water. Hyper-realism appears to be a means of compensating for this one step forward, two step backward approach. The way out this conundrum requires a paradigm shift, in which designers and consumers once again focus on the reproduction of music, rather than sound.