For the current REF see the REF 2021 website REF 2021 logo

Output details

13 - Electrical and Electronic Engineering, Metallurgy and Materials

University of Surrey

Return to search Previous output Next output
Output 0 of 0 in the submission
Article title

Source Separation of Convolutive and Noisy Mixtures Using Audio-Visual Dictionary Learning and Probabilistic Time-Frequency Masking

Type
D - Journal article
Title of journal
IEEE Transactions on Signal Processing
Article number
-
Volume number
61
Issue number
22
First page of article
5520
ISSN of journal
1941-0476
Year of publication
2013
URL
-
Number of additional authors
-
Additional information

Speech-on-speech audio mixtures in typical (reflective) acoustic conditions face the permutation problem (i.e., pulling out the wrong voice in place of the desired target speaker), which is a major obstacle in the wide-spread deployment of source separation techniques. The proposed method solves it using visual information about a speaker and a practical machine learning technique to help extract the relevant speech, which has the potential to improve the real-world performance of many speech technologies in a vast array of application scenarios including: biometric security, automatic speech recognition, speaker localisation, speech enhancement for hearing aids, audio re-mastering, surveillance and telecommunications.

Interdisciplinary
-
Cross-referral requested
-
Research group
None
Proposed double-weighted
No
Double-weighted statement
-
Reserve for a double-weighted output
No
Non-English
No
English abstract
-