院长信箱
当前位置: 首页>师资队伍
教师详情
  • 个人信息
    殷绪成

    Xu-Cheng Yin

    系      所:
    |计算机科学与技术系|
    职      称:
    教授  
    职      务:
    院长
    办公地点:
    信息楼623B
    办公电话:
    010-62332873
    电子邮箱:
    xuchengyin@ustb.edu.cn
    本 科 课 程:
    离散数学 软件工程及课程设计 人工智能与互联网大数据技术前沿研讨
    研究生课程:
    机器学习 人工智能前沿技术
    科 研 方 向:
    模式识别与计算机视觉 文字识别(文档图像分析与识别) 信息检索与自然语言处理 人工智能芯片技术及应用
    学术与社会兼职:
    中国图象图形学学会文档图像分析与识别专委会副主任/秘书长 中国人工智能学会模式识别专委会委员
  • 简   历

    殷绪成,人工智能专家,国家杰出青年科学基金项目获得者,北京科技大学教授、博导,北京科技大学模式识别与人工智能技术创新实验室主任,中国图象图形学学会文档图像分析与识别专委会副主任/秘书长、中国自动化学会模式识别与机器智能专委会委员、中国计算机学会计算机视觉专委会委员、中国人工智能学会模式识别专委会委员。主要研究领域包括模式识别、文字识别、计算机视觉及人工智能芯片技术,近五年来在中国计算机学会推荐国际期刊和会议上发表论文五十余篇,连续四届荣获国际文档分析与识别大会技术竞赛共15项冠军,获2019年度北京市科技进步一等奖(第一完成人)、2018年度教育部科技进步二等奖(第一完成人)、2005年度北京市科技进步一等奖(主要成员)。

     

    1995.09 - 2002.06  北京科技大学计算机系    学士、硕士

    2002.07 2006.07  汉王科技股份有限公司研发中心     研发工程师/技术经理

    2003.09 - 2006.07  中国科学院自动化研究所    博士

    2006.08 - 2008.06  富士通研究开发中心信息技术部      研究员

    2008.07 - 今     于北京科技大学计算机系从事教学和科研工作(副教授、教授)

    2013.01 - 2014.01  Center for Intelligent Information Retrieval, School of Computer Science, University of Massachusetts Amherst, USA, Visiting Associate Professor

    2014.07 – 2014.08  Computer Vision Lab, School of Computer Science, University of MassachusettsAmherst, USA, Visiting Professor

    2016.07-2016.09  BioNLP Lab, Department of Quantitative Health Sciences, University of Massachusetts Medical School, USA, Visiting Professor

    详情请参见实验室(模式识别与人工智能技术创新实验室)主页(http://prir.ustb.edu.cn)和个人主页(http://prir.ustb.edu.cn/yin/

     

  • 代表性论文

     

    [J1] Xu-Cheng Yin (殷绪成)*, Xuwang Yin, Kaizhu Huang, and Hong-Wei Hao, Robust text detection in natural scene images, IEEE Trans. Pattern Analysis and Machine Intelligence (T-PAMI), vol. 36, no. 5, pp. 970-983, 2014. (2020 Impact Factor: 17.861)

    [J2] Xu-Cheng Yin (殷绪成)*, Wei-Yi Pei, Jun Zhang, and Hong-Wei Hao, Multi-orientation scene text detection with adaptive clustering, IEEE Trans. Pattern Analysis and Machine Intelligence (T-PAMI), vol. 37, no. 9, pp. 1930-1937, 2015. (2020 Impact Factor: 17.861)

    [J3] Shu Tian, Xu-Cheng Yin* (殷绪成), Ya Su, and Hong-Wei Hao, A unified framework for tracking based text detection and recognition from web videos, IEEE Trans. Pattern Analysis and Machine Intelligence (T-PAMI), vol. 40, no. 3, pp. 542-554, 2018. (2020 Impact Factor: 17.861)

    [J4] Xu-Cheng Yin (殷绪成)*, Ze-Yu Zuo, Shu Tian, and Cheng-Lin Liu, Text detection, tracking and recognition in video: A comprehensive survey, IEEE Trans. Image Processing (T-IP), vol. 25, no. 6, pp. 2752-2773, 2016. (2020 Impact Factor: 9.34)

    [J5] Chun Yang, Xu-Cheng Yin* (殷绪成), Wei-Yi Pei, Shu Tian, Ze-Yu Zuo, Chao Zhu and Junchi Yan, Tracking based multi-orientation scene text detection: A unified framework with dynamic programming, IEEE Trans. Image Processing (T-IP), vol. 26, no. 7, pp. 3235-3248, 2017. (2020 Impact Factor: 9.340)

    [J6] Jie-Bo Hou, Xiaobin Zhu, Chang Liu, Kekai Sheng, Long-Huang Wu, Hongfa Wang, and Xu-Cheng Yin* (殷绪成), IEEE Trans. Image Processing (T-IP), vol. 29, pp. 7904-7916, 2020. (2020 Impact Factor: 9.340)

    [J7] Bo-Wen Zhang, Xu-Cheng Yin* (殷绪成), and Fang Zhou, A generic pseudo relevance feedback framework with heterogeneous social information, Information Sciences, vol. 367-368, pp. 909-926, 2016. (2020 Impact Factor: 5.910)

    [J8] Zan-Xia Jin, Bo-Wen Zhang, Fang Zhou, Jingyan Qin*, and Xu-Cheng Yin* (殷绪成), Ranking via partial ordering for answer selection, Information Sciences, vol. 538, pp. 358-371, 2020. (2020 Impact Factor: 5.910)

    [J9] Song-Lu Chen, Chun Yang, Jia-Wei Ma, Feng Chen, and Xu-Cheng Yin* (殷绪成), Simultaneous end-to-end vehicle and license plate detection with multi-branch attention neural network, IEEE Trans. Intelligent Transportation Systems (T-ITS), August 2019, published online. (2020 Impact Factor: 6.319)

    [J10] Jie-Bo Hou, Xiaobin Zhu*, Chang Liu, Chun Yang, Long-Huang Wu, Hongfa Wang, and Xu-Cheng Yin* (殷绪成), Detecting text in scene and traffic guide panels with attention anchor mechanism, IEEE Trans. Intelligent Transportation Systems (T-ITS), June 2020, published online. (2020 Impact Factor: 6.319)

     

    [C1] Shi-Xue Zhang, Xiaobin Zhu, Jie-Bo Hou, Chang Liu, Chun Yang, Hongfa Wang, and Xu-Cheng Yin* (殷绪成), Deep relational reasoning graph network for arbitrary shape text detection, Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR, Oral), 2020. (CCF A)

    [C2] Bowen Yang, Chun Yang, Qi Liu, and Xu-Cheng Yin* (殷绪成), Joint rotation-invariance face detection and alignment with angle-sensitivity cascaded networks, Proceedings of the 27th ACM International Conference on Multimedia (ACM Multimedia), 2019. (CCF A)

    [C3] Shu Tian, Wei-Yi Pei, Ze-Yu Zuo, and Xu-Cheng Yin* (殷绪成), Scene text detection in video by learning locally and globally, Proceedings of 25th International Joint Conference on Artificial Intelligence (IJCAI, Oral), 2016. (CCF A)

    [C4] Bo-Wen Zhang, Xu-Cheng Yin* (殷绪成), Fang Zhou, and Jianlin Jin, Building your own reading list anytime via embedding relevance, quality, timeliness and diversity, Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval (ACM SIGIR), 2017. (CCF A)

    [C5] Xu-Cheng Yin (殷绪成), Xuwang Yin, Kaizhu Huang, and Hong-Wei Hao, Accurate and robust text detection: a step-in for text retrieval in natural scene images, Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval (ACM SIGIR), 2013. (CCF A)

    [C6] Bo-Wen Zhang, Xu-Cheng Yin* (殷绪成), Xiao-Ping Cui, Jiao Qu, Bin Geng, Fang Zhou, Li Song, and Hong-Wei Hao, Social book search reranking with generalized content-based filtering, Proceedings of the 23rd International ACM Conference on Information and Knowledge Management (ACM CIKM, Oral), 2014. (CCF B)

    [C7] Fan Fang, Bo-Wen Zhang, Xu-Cheng Yin* (殷绪成), Hai-Xia Man, and Fang Zhou, TED-KISS: A known-item speech video search benchmark, Proceedings of the 27th International ACM Conference on Information and Knowledge Management (ACM CIKM), 2018. (CCF B)

    [C8] Miao Cao, Chun Yang, Fang Zhou, and Xu-Cheng Yin* (殷绪成), Pyramid memory block and time-step attention for speech emotion recognition, Proceedings of 20th Annual Conference of the International Speech Communication Association (INTERSPEECH, Oral), 2019. (CCF C)

    [C9] Miaotong Jiang, Jie-Bo Hou, Chun Yang, Xiaobin Zhu, and Xu-Cheng Yin* (殷绪成), Detecting text in news images with similarity embedded proposals, Proceedings of 15th International Conference on Document Analysis and Recognition (ICDAR), 2019. (CCF C)

    [C10] Chun Yang, Xu-Cheng Yin* (殷绪成), Hong Yu, Dimosthenis Karatzas, and Yu Cao, ICDAR2017 Robust Reading Challenge on Text Extraction from Biomedical Literature Figures (DeTEXT), Proceedings of 15th International Conference on Document Analysis and Recognition (ICDAR, Oral), 2017. (CCF C)

     

  • 科研业绩

     

    横向项目:

    (1)“网络图片文字识别与广告视频内容理解研究”(2016~2021, 腾讯科技合作项目,负责人)

    (2)“面向AI芯片的人工智能技术”(2018-2021,亿智电子合作项目,负责人)

    (3)“教育行业复杂英文文档分析与识别技术”(2014~2015,科大讯飞合作项目,负责人)

    纵向项目:

    (1) “大规模网络图像的文本识别方法与关键技术研究”(2022-2026,国家杰出青年科学基金项目,负责人)

    (2) “多语言场景文本检测与识别关键技术研究”(2021-2024,国家自然科学基金面上项目,负责人)

    (3) “结合前馈和反馈机制的自然场景文本识别技术”(2015~2018,国家自然科学基金面上项目,负责人)

    (4) “网络图片视频文本识别与理解技术”(2016~2019,国家XXX工程子任务,负责人)

     

  • 获得奖励/专利

     

    2019年度北京市科技进步一等奖(第一完成人),“网络图像视频大数据的智能识别关键技术及应用”;

    2018年度教育部科技进步二等奖(第一完成人),“大规模网络图像的文本识别技术及应用”;

    连续四届(2013/2015/2017/2019年)荣获国际文档分析与识别大会技术竞赛“场景文本检测”、“场景文本识别”、“网络图片文本检测”、“网络图片文本识别”等15项冠军;

    连续四年(2015/2016/2017/2018年)荣获国际生物信息文本语义检索与问答技术挑战平台BioASQ Challenge多项第一名;

    2005年度北京市科技进步一等奖(主要成员),“汉王OCR技术及应用”;

    2006年度富士通研究开发中心优秀发明奖;

    2006年富士通研究所社长奖,2007年富士通研究所社长奖。

     

  • 计通NEWS
  • 索思