M.N. Bin Mohd Nor, T. Matsumura, and T. Onoye (Japan)
Acoustic, speech processing, speech direction of arrival, vowel harmonic structure, and two-microphone array
This paper proposes methods to achieve improved accuracy of speech direction of arrival estimation. Previous research has proposed a high resolution DOA estimation system for human vowels using only two microphones. However, in real environment, the conventional DOA estimation system is not robust enough to provide accurate results for human speech. To increase the robustness of the system for speech, non utterance frame omission method and steering frequency selection method are proposed. Non-utterance frame omission evaluates the strength of speech in each frame and omits frames that have no or weak speech presence. Steering frequency selection is applied to determine the frequency that is imperative for DOA estimation based on harmonic product spectrum. Finally, the proposed system is evaluated both through simulation and real environment test. Proposed system shows a distinct improvement for speech DOA estimation amounting to about 46% decrease in estimation error compared to the conventional system for sound sources present at the side of the array.
Important Links:
Go Back