2012-03-11 19:27:40 -07:00
|
|
|
HRTF Support
|
|
|
|
============
|
|
|
|
|
|
|
|
Starting with OpenAL Soft 1.14, HRTFs can be used to enable enhanced
|
|
|
|
spatialization for both 3D (mono) and multi-channel sources, when used with
|
|
|
|
headphones/stereo output. This can be enabled using the 'hrtf' config option.
|
|
|
|
|
|
|
|
For multi-channel sources this creates a virtual speaker effect, making it
|
|
|
|
sound as if speakers provide a discrete position for each channel around the
|
|
|
|
listener. For mono sources this provides much more versatility in the perceived
|
|
|
|
placement of sounds, making it seem as though they are coming from all around,
|
|
|
|
including above and below the listener, instead of just to the front, back, and
|
|
|
|
sides.
|
|
|
|
|
2014-02-23 20:51:30 -08:00
|
|
|
The default data set is based on the KEMAR HRTF data provided by MIT, which can
|
2020-02-11 00:37:21 -08:00
|
|
|
be found at <http://sound.media.mit.edu/resources/KEMAR.html>.
|
2012-03-11 19:27:40 -07:00
|
|
|
|
|
|
|
|
2014-02-23 20:51:30 -08:00
|
|
|
Custom HRTF Data Sets
|
|
|
|
=====================
|
2012-03-11 19:27:40 -07:00
|
|
|
|
|
|
|
OpenAL Soft also provides an option to use user-specified data sets, in
|
2020-02-11 00:37:21 -08:00
|
|
|
addition to or in place of the default set. This allows users to provide data
|
|
|
|
sets that could be better suited for their heads, or to work with stereo
|
|
|
|
speakers instead of headphones, for example.
|
2012-03-11 19:27:40 -07:00
|
|
|
|
2012-09-11 01:59:42 -07:00
|
|
|
The file format is specified below. It uses little-endian byte order.
|
2012-03-11 19:27:40 -07:00
|
|
|
|
|
|
|
==
|
2020-02-11 00:37:21 -08:00
|
|
|
ALchar magic[8] = "MinPHR03";
|
2012-03-11 19:27:40 -07:00
|
|
|
ALuint sampleRate;
|
2017-10-22 15:36:42 -07:00
|
|
|
ALubyte channelType; /* Can be 0 (mono) or 1 (stereo). */
|
|
|
|
ALubyte hrirSize; /* Can be 8 to 128 in steps of 8. */
|
|
|
|
ALubyte fdCount; /* Can be 1 to 16. */
|
|
|
|
|
|
|
|
struct {
|
|
|
|
ALushort distance; /* Can be 50mm to 2500mm. */
|
|
|
|
ALubyte evCount; /* Can be 5 to 128. */
|
|
|
|
ALubyte azCount[evCount]; /* Each can be 1 to 128. */
|
|
|
|
} fields[fdCount];
|
|
|
|
|
2020-02-11 00:37:21 -08:00
|
|
|
/* NOTE: ALbyte3 is a packed 24-bit sample type,
|
2017-10-22 15:36:42 -07:00
|
|
|
* hrirCount is the sum of all azCounts.
|
|
|
|
* channels can be 1 (mono) or 2 (stereo) depending on channelType.
|
|
|
|
*/
|
2020-02-11 00:37:21 -08:00
|
|
|
ALbyte3 coefficients[hrirCount][hrirSize][channels];
|
2017-10-22 15:36:42 -07:00
|
|
|
ALubyte delays[hrirCount][channels]; /* Each can be 0 to 63. */
|
2012-03-11 19:27:40 -07:00
|
|
|
==
|
|
|
|
|
2020-02-11 00:37:21 -08:00
|
|
|
The data layout is as follows:
|
2012-03-11 19:27:40 -07:00
|
|
|
|
2020-02-11 00:37:21 -08:00
|
|
|
The file first starts with the 8-byte marker, "MinPHR03", to identify it as an
|
2012-03-11 19:27:40 -07:00
|
|
|
HRTF data set. This is followed by an unsigned 32-bit integer, specifying the
|
2020-02-11 00:37:21 -08:00
|
|
|
sample rate the data set is designed for (OpenAL Soft will resample the HRIRs
|
|
|
|
if the output device's playback rate doesn't match).
|
2012-03-11 19:27:40 -07:00
|
|
|
|
2020-02-11 00:37:21 -08:00
|
|
|
Afterward, an unsigned 8-bit integer specifies the channel type, which can be 0
|
|
|
|
(mono, single-channel) or 1 (stereo, dual-channel). After this is another 8-bit
|
|
|
|
integer which specifies how many sample points (or finite impulse response
|
|
|
|
filter coefficients) make up each HRIR.
|
2012-03-11 19:27:40 -07:00
|
|
|
|
2017-10-22 15:36:42 -07:00
|
|
|
The following unsigned 8-bit integer specifies the number of fields used by the
|
2020-02-11 00:37:21 -08:00
|
|
|
data set, which must be in descending order (farthest first, closest last).
|
|
|
|
Then for each field an unsigned 16-bit short specifies the distance for that
|
|
|
|
field in millimeters, followed by an 8-bit integer for the number of
|
2017-10-22 15:36:42 -07:00
|
|
|
elevations. These elevations start at the bottom (-90 degrees), and increment
|
|
|
|
upwards. Following this is an array of unsigned 8-bit integers, one for each
|
|
|
|
elevation which specifies the number of azimuths (and thus HRIRs) that make up
|
|
|
|
each elevation. Azimuths start clockwise from the front, constructing a full
|
|
|
|
circle. Mono HRTFs use the same HRIRs for both ears by reversing the azimuth
|
|
|
|
calculation (ie. left = angle, right = 360-angle).
|
|
|
|
|
2020-02-11 00:37:21 -08:00
|
|
|
The actual coefficients follow. Each coefficient is a signed 24-bit sample.
|
|
|
|
Stereo HRTFs interleave left/right ear coefficients. The HRIRs must be
|
|
|
|
minimum-phase. This allows the use of a smaller filter length, reducing
|
2017-10-22 15:36:42 -07:00
|
|
|
computation. For reference, the default data set uses a 32-point filter while
|
2012-03-12 17:14:42 -07:00
|
|
|
even the smallest data set provided by MIT used a 128-sample filter (a 4x
|
2017-10-22 15:36:42 -07:00
|
|
|
reduction by applying minimum-phase reconstruction).
|
2012-09-11 01:59:42 -07:00
|
|
|
|
2020-02-11 00:37:21 -08:00
|
|
|
After the coefficients is an array of unsigned 8-bit delay values as 6.2 fixed-
|
|
|
|
point integers, one for each HRIR (with stereo HRTFs interleaving left/right
|
|
|
|
ear delays). This is the propagation delay in samples a signal must wait before
|
|
|
|
being convolved with the corresponding minimum-phase HRIR filter.
|