Investigating room impulse response characterization from the partial coherence properties of speech and music signals
×
Cite This
Citation & Abstract
E. Driscoll, M. Bocko, and S. Smith, "Investigating room impulse response characterization from the partial coherence properties of speech and music signals," Paper 10655, (2023 May.). doi:
E. Driscoll, M. Bocko, and S. Smith, "Investigating room impulse response characterization from the partial coherence properties of speech and music signals," Paper 10655, (2023 May.). doi:
Abstract: Characterizing acoustic spaces is important for multiple applications, from architectural acoustics to augmented and virtual reality, sound design, and others. Traditionally, this process involves making impulse response measurements in an unoccupied space, which is time consuming and potentially inaccurate because the acoustics of spaces may be affected significantly by the presence of an audience or other occupants. The ability to “blindly” retrieve acoustic information about a space from recordings made during typical use scenarios would be more practical and potentially it could provide a more accurate picture of the acoustics of the space when partially or fully occupied. In this paper we discuss how an analysis of the coherence properties of the harmonic partials of naturally occurring speech and music signals recorded in a space may provide information about the impulse response of the space. A broad class of naturally occurring sounds, such as speech and music, contain sets of harmonically related overtones with mutually correlated amplitude and phase modulations, i.e., vibrato. This is a reasonable assumption based upon the physics of sound generation in many realistic sources, such as wind musical instruments or the human vocal tract. When such acoustic signals are filtered by a space, we show how the autocorrelation of individual overtones and the cross-correlation between pairs of harmonically related overtones can provide information about the impulse response of the space. In this paper we explore this hypothesis by analyzing a few simplified cases and we discuss how these methods may be extended to more complex, and realistic, scenarios.
@article{driscoll2023investigating,
author={driscoll, erin and bocko, mark and smith, sarah},
journal={journal of the audio engineering society},
title={investigating room impulse response characterization from the partial coherence properties of speech and music signals},
year={2023},
volume={},
number={},
pages={},
doi={},
month={may},}
@article{driscoll2023investigating,
author={driscoll, erin and bocko, mark and smith, sarah},
journal={journal of the audio engineering society},
title={investigating room impulse response characterization from the partial coherence properties of speech and music signals},
year={2023},
volume={},
number={},
pages={},
doi={},
month={may},
abstract={characterizing acoustic spaces is important for multiple applications, from architectural acoustics to augmented and virtual reality, sound design, and others. traditionally, this process involves making impulse response measurements in an unoccupied space, which is time consuming and potentially inaccurate because the acoustics of spaces may be affected significantly by the presence of an audience or other occupants. the ability to “blindly” retrieve acoustic information about a space from recordings made during typical use scenarios would be more practical and potentially it could provide a more accurate picture of the acoustics of the space when partially or fully occupied. in this paper we discuss how an analysis of the coherence properties of the harmonic partials of naturally occurring speech and music signals recorded in a space may provide information about the impulse response of the space. a broad class of naturally occurring sounds, such as speech and music, contain sets of harmonically related overtones with mutually correlated amplitude and phase modulations, i.e., vibrato. this is a reasonable assumption based upon the physics of sound generation in many realistic sources, such as wind musical instruments or the human vocal tract. when such acoustic signals are filtered by a space, we show how the autocorrelation of individual overtones and the cross-correlation between pairs of harmonically related overtones can provide information about the impulse response of the space. in this paper we explore this hypothesis by analyzing a few simplified cases and we discuss how these methods may be extended to more complex, and realistic, scenarios.},}
TY - Room Acoustics
TI - Investigating room impulse response characterization from the partial coherence properties of speech and music signals
SP -
EP -
AU - Driscoll, Erin
AU - Bocko, Mark
AU - Smith, Sarah
PY - 2023
JO - Journal of the Audio Engineering Society
IS -
VO -
VL -
Y1 - May 2023
TY - Room Acoustics
TI - Investigating room impulse response characterization from the partial coherence properties of speech and music signals
SP -
EP -
AU - Driscoll, Erin
AU - Bocko, Mark
AU - Smith, Sarah
PY - 2023
JO - Journal of the Audio Engineering Society
IS -
VO -
VL -
Y1 - May 2023
AB - Characterizing acoustic spaces is important for multiple applications, from architectural acoustics to augmented and virtual reality, sound design, and others. Traditionally, this process involves making impulse response measurements in an unoccupied space, which is time consuming and potentially inaccurate because the acoustics of spaces may be affected significantly by the presence of an audience or other occupants. The ability to “blindly” retrieve acoustic information about a space from recordings made during typical use scenarios would be more practical and potentially it could provide a more accurate picture of the acoustics of the space when partially or fully occupied. In this paper we discuss how an analysis of the coherence properties of the harmonic partials of naturally occurring speech and music signals recorded in a space may provide information about the impulse response of the space. A broad class of naturally occurring sounds, such as speech and music, contain sets of harmonically related overtones with mutually correlated amplitude and phase modulations, i.e., vibrato. This is a reasonable assumption based upon the physics of sound generation in many realistic sources, such as wind musical instruments or the human vocal tract. When such acoustic signals are filtered by a space, we show how the autocorrelation of individual overtones and the cross-correlation between pairs of harmonically related overtones can provide information about the impulse response of the space. In this paper we explore this hypothesis by analyzing a few simplified cases and we discuss how these methods may be extended to more complex, and realistic, scenarios.
Characterizing acoustic spaces is important for multiple applications, from architectural acoustics to augmented and virtual reality, sound design, and others. Traditionally, this process involves making impulse response measurements in an unoccupied space, which is time consuming and potentially inaccurate because the acoustics of spaces may be affected significantly by the presence of an audience or other occupants. The ability to “blindly” retrieve acoustic information about a space from recordings made during typical use scenarios would be more practical and potentially it could provide a more accurate picture of the acoustics of the space when partially or fully occupied. In this paper we discuss how an analysis of the coherence properties of the harmonic partials of naturally occurring speech and music signals recorded in a space may provide information about the impulse response of the space. A broad class of naturally occurring sounds, such as speech and music, contain sets of harmonically related overtones with mutually correlated amplitude and phase modulations, i.e., vibrato. This is a reasonable assumption based upon the physics of sound generation in many realistic sources, such as wind musical instruments or the human vocal tract. When such acoustic signals are filtered by a space, we show how the autocorrelation of individual overtones and the cross-correlation between pairs of harmonically related overtones can provide information about the impulse response of the space. In this paper we explore this hypothesis by analyzing a few simplified cases and we discuss how these methods may be extended to more complex, and realistic, scenarios.
Authors:
Driscoll, Erin; Bocko, Mark; Smith, Sarah
Affiliation:
University of Rochester, NY, USA
AES Convention:
154 (May 2023)
Paper Number:
10655
Publication Date:
May 13, 2023Import into BibTeX
Subject:
Room Acoustics
Permalink:
http://www.aes.org/e-lib/browse.cfm?elib=22062