基于深度学习的单通道音频分离系统研究进展-上海大学

基于深度学习的单通道音频分离系统研究进展

2018.06.07

投稿：杨秀丽部门：通信与信息工程学院浏览次数：

活动信息

时间： 2018年06月08日 09:30

地点：校本部东区翔英大楼T516室

行健讲坛学术讲座

第332期

时间: 2018年6月8日（周五）上午9：30

地点: 校本部东区翔英大楼T516室

讲座: 基于深度学习的单通道音频分离系统研究进展

演讲者: 罗艺博士哥伦比亚大学

演讲者简介：

罗艺现为哥伦比亚大学的博士生。他的研究重点是音频前端处理中的机器学习以及深度学习系统，主要从事的工作有音频分离，语音增强，语音去混响等，同时还致力于构建BMIs（Brain-Machine Interfaces）以建立机器与生物系统之间的联系。

讲座摘要：

Single-channel audio separation has been an active research area for decades. Recently, deep learning-based systems have greatly advanced the state of this problem. In most of the deep learning systems, the separation is performed in time-frequency (T-F) domain where a T-F mask is estimated for each of the target source. However, the use of T-F masks upper-bounds the system’s performance and introduces difficulties in end-to-end separation. In this talk, I will first make an overview of several deep learning approaches in T-F domain, address their main disadvantages, and introduce the recently proposed Time-domain Audio Separation Network (TasNet) which separates the mixture directly in time-domain.

邀请人：通信与信息工程学院朱梦尧副教授

欢迎广大教师和学生参加！

校园看点

快速链接

扫码关注