Please use this identifier to cite or link to this item: http://theses.ncl.ac.uk/jspui/handle/10443/5449
Full metadata record
DC FieldValueLanguage
dc.contributor.authorXian, Yang-
dc.date.accessioned2022-06-15T10:44:35Z-
dc.date.available2022-06-15T10:44:35Z-
dc.date.issued2021-
dc.identifier.urihttp://hdl.handle.net/10443/5449-
dc.descriptionPh. D. Thesis.en_US
dc.description.abstractMonaural speech separation and enhancement aim to remove noise interference from the noisy speech mixture recorded by a single microphone, which causes a lack of spatial information. Deep neural network (DNN) dominates speech separation and enhancement. However, there are still challenges in DNN-based methods, including choosing proper training targets and network structures, refining generalization ability and model capacity for unseen speakers and noises, and mitigating the reverberations in room environments. This thesis focuses on improving separation and enhancement performance in the real-world environment. The first contribution in this thesis is to address monaural speech separation and enhancement within reverberant room environment by designing new training targets and advanced network structures. The second contribution to this thesis is on improving the enhancement performance by proposing a multi-scale feature recalibration convolutional bidirectional gate recurrent unit (GRU) network (MCGN). The third contribution is to improve the model capacity of the network and retain the robustness in the enhancement performance. A convolutional fusion network (CFN) is proposed, which exploits the group convolutional fusion unit (GCFU). The proposed speech enhancement methods are evaluated with various challenging datasets. The proposed methods are assessed with the stateof-the-art techniques and performance measures to confirm that this thesis contributes novel solutionsen_US
dc.language.isoenen_US
dc.publisherNewcastle Universityen_US
dc.titleAdvanced deep neural networks for speech separation and enhancementen_US
dc.typeThesisen_US
Appears in Collections:School of Engineering

Files in This Item:
File Description SizeFormat 
Xian Yang e-copy.pdfThesis8.92 MBAdobe PDFView/Open
dspacelicence.pdfLicence43.82 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.