Microphone Quality Metering & Enhancement
Start date: 2012
End date (phase-1): 2017 (collaboration ongoing)
Research Student: Andy Pearce
Principal Supervisor: Dr Tim Brookes
Co-Supervisor: Dr Martin Dewhirst
Co-Supervisor: Dr Russell Mason
Supported by: EPSRC and Cirrus Logic
Project Outline
The perceptual characteristics of a microphone are not always clear from its technical specification. This thesis documents a first step towards creating more perceptually relevant measures.
Consideration of relevant criteria revealed that the most appropriate method for recording stimuli for perceptual microphone comparisons is to use all microphones under test simultaneously. Experiments determined that a maximum array size of 150 mm will ensure that the perceptual differences between the recorded stimuli are predominantly due to the characteristics of the microphones and not artefacts of the spacing between them.
It was established that there are eight standard physical differences that exist between microphones which may impact the perceived characteristics of a recording. These differences, supplemented with expert opinions, indicated that recording five programme items with eight studio and two MEMS microphones would allow for determination of the most prominent inter-microphone perceptual differences. A combination of indirect and direct elicitation experiments on the resulting 50 recordings identified a hierarchy of 40 perceptual attributes that describe the differences between microphones. A novel attribute contribution experiment conducted on the 31 lowest-level attributes in the hierarchy showed that brightness contributes the most overall to the inter-microphone difference.
The spectral centroid and ratios comparing the relative level of high frequencies were previously used to predict brightness; however, these metrics did not predict subjective ratings of microphone-related brightness as well as a newly proposed combination metric: the product of the spectral centroid above 3 kHz, and the ratio of energy above 3 kHz compared to all energy. This model performed well on training data (r = 0.909). Validating it on independent microphones and programme items suggested that improvements may be necessary for error-free prediction of programme-related aspects of brightness, but showed good correlation with each programme item and overall (r = 0.854), indicating that the model predicts microphone-related brightness well.
Publications
- A.Pearce, T.Brookes, R.Mason: "Modelling the Microphone-Related Timbral Brightness of Recorded Signals", Applied Sciences Special Issue on Applications of Machine Learning in Audio Classification and Acoustic Scene Characterization, vol.11, iss.14, article number 6461, 2021
full text available at DOI 10.3390/app11146461 - A.Pearce, "Perceived differences between microphones", PhD Thesis, Institute of Sound Recording, University of Surrey, 2017
full text available at epubs.surrey.ac.uk/814006 - A.Pearce. T.Brookes, R.Mason, M.Dewhirst, "Measurements to determine the ranking accuracy of perceptual models", Audio Eng.Soc. 140th Convention, Paris, France, 4-7 June 2016
- A.Pearce, T.Brookes, M.Dewhirst, R.Mason, "Eliciting the most prominent perceived differences between microphones",
The Journal of the Acoustical Society of America, Vol.139, No.5, May 2016.
full text available at DOI 10.1121/1.4950820 - A.Pearce. T.Brookes, M.Dewhirst, "Validation of experimental methods to record stimuli for microphone comparisons", Audio Eng.Soc. 139th Convention, New York, USA, 29 Oct - 1 Nov 2015
- A.Pearce. T.Brookes, M.Dewhirst, "Timbral differences between microphones", BBC Sound Now & Next Technology Fair, London, UK, 19-20 May 2015
Data Archive
The data on which the findings of this project are based will be made available as each stage of the project is completed. Data published so far are available in these repositories:
- Microphone Comparison Array Validation Dataset doi: 10.5281/zenodo.20547
- Perceived Differences Between Microphones Dataset doi: 10.5281/zenodo.30621
- Monte Carlo Simulation Code doi: 10.5281/zenodo.47245
- Microphone Brightness Modelling Dataset doi: 10.5281/zenodo.322747