You are in : Home » Results & submissions » Select UOA » 13 - Electrical and Electronic Engineering, Metallurgy and Materials » View submission: University of Surrey » Outputs » Detail

Output details

13 - Electrical and Electronic Engineering, Metallurgy and Materials

University of Surrey

Return to search Previous output Next output

Output 0 of 0 in the submission

Article title

Source Separation of Convolutive and Noisy Mixtures Using Audio-Visual Dictionary Learning and Probabilistic Time-Frequency Masking

Type

D - Journal article

DOI

10.1109/TSP.2013.2277834

Title of journal

IEEE Transactions on Signal Processing

Article number

Volume number

Issue number

First page of article

5520

ISSN of journal

1941-0476

Year of publication

2013

URL

Number of additional authors

Additional information

Speech-on-speech audio mixtures in typical (reflective) acoustic conditions face the permutation problem (i.e., pulling out the wrong voice in place of the desired target speaker), which is a major obstacle in the wide-spread deployment of source separation techniques. The proposed method solves it using visual information about a speaker and a practical machine learning technique to help extract the relevant speech, which has the potential to improve the real-world performance of many speech technologies in a vast array of application scenarios including: biometric security, automatic speech recognition, speaker localisation, speech enhancement for hearing aids, audio re-mastering, surveillance and telecommunications.

Interdisciplinary

Cross-referral requested

Research group

None

Proposed double-weighted

Double-weighted statement

Reserve for a double-weighted output

Non-English

English abstract