
Academic Talk: Speech Recognition Development: A Dataset and Benchmark Perspective

Published: 2021-12-11

Speaker: Dr. Guoguo Chen (陈果果), SpeechColab

Time: Tuesday, 2021/12/14, 2:00-3:00 PM

Location: Tencent Meeting 842 5204 8009

 

Title: Speech Recognition Development: A Dataset and Benchmark Perspective

 

Abstract: The previous decade saw remarkable development in automatic speech recognition technologies. While many technical articles explain the improvements from the model point of view, the impact of datasets and benchmarks on speech recognition development is not well studied. In this talk, we first investigate the contribution of datasets and benchmarks to speech recognition development. We then introduce a large-scale English speech recognition dataset named GigaSpeech. We will demonstrate the data creation pipeline, as well as initial benchmarks on this dataset. Finally, we close the talk by outlining our ongoing work on speech recognition benchmarks.

 

Speaker bio: Dr. Chen holds a Ph.D. degree in Electrical and Computer Engineering from Johns Hopkins University and a B.Eng. degree in Electronic Engineering from Tsinghua University. During his Ph.D., he spent 5 years at the Center for Language and Speech Processing, Johns Hopkins University, where he worked on various aspects of speech recognition and was one of the key contributors to the open source speech recognition toolkit Kaldi and the open source deep learning toolkit CNTK. He was the author of LibriSpeech, one of the most cited (2,500+ Google Scholar citations) speech recognition datasets/benchmarks. He also spent two summers at Google Inc., where he developed the prototype of Android's wake word detection engine for "Okay Google", serving billions of Android/Google Home users. After graduation, Dr. Chen co-founded KITT.AI, a CBInsights AI 100 company in 2017, which was acquired by Baidu. In 2020, Dr. Chen co-founded Seasalt.ai. Dr. Chen also initiated SpeechColab, a volunteer organization for the speech recognition community, which released GigaSpeech, one of the largest speech recognition datasets, covering 10,000 hours of transcribed audio and 33,000 hours of total audio for speech recognition research.


