自然语言处理深度学习网络设计方法

2019.05.20

投稿:杨秀丽部门:通信与信息工程学院浏览次数:

活动信息

时间: 2019年05月22日 10:30

地点: 校本部东区翔英大楼T706室

行健讲坛学术讲座

第393期

时间:   2019年5月22日(周三)上午10:30-11: 30

地点:   校本部东区翔英大楼T706室

讲座:   自然语言处理深度学习网络设计方法

 

演讲者: 佛罗里达大学  吴大鹏教授  IEEE Fellow

演讲者简介:Dapeng Oliver Wu received Ph.D. in Electrical and Computer Engineering from Carnegie Mellon University, Pittsburgh, PA, in 2003. Since 2003, he has been on the faculty of Electrical and Computer Engineering Department at University of Florida, Gainesville, FL, where he is currently Professor.  His research interests are in the areas of networking, communications, video coding, image processing, computer vision, signal processing, and machine learning. He received University of Florida Term Professorship Award in 2017, University of Florida Research Foundation Professorship Award in 2009, AFOSR Young Investigator Program (YIP) Award in 2009, ONR Young Investigator Program (YIP) Award in 2008, NSF CAREER award in 2007, the IEEE Circuits and Systems for Video Technology (CSVT) Transactions Best Paper Award for Year 2001, the Best Paper Award in GLOBECOM 2011, and the Best Paper Award in QShine 2006. 

讲座摘要:In this talk, I present a new approach to the design of deep networks for natural language processing (NLP), based on the general technique of Tensor Product Representations (TPRs) for encoding and processing symbol structures in distributed neural networks. A network architecture--- the Tensor Product Generation Network (TPGN) --- is proposed which is capable in principle of carrying out TPR computation, but which uses unconstrained deep learning to design its internal representations. Instantiated in a model for image-caption generation, TPGN outperforms LSTM baselines when evaluated on the COCO dataset. The TPR-capable structure enables interpretation of internal representations and operations, which prove to contain considerable grammatical content. Our caption-generation model can be interpreted as generating sequences of grammatical categories and retrieving words by their categories from a plan encoded as a distributed representation.

邀请人:上海大学通信与信息工程学院 沈礼权研究员

欢迎广大教师和学生参加!