家电科技 ›› 2020, Vol. 0 ›› Issue (zk): 222-224.doi: 10.19784/j.cnki.issn1672-0172.2020.99.054

• 智能与健康 • 上一篇    下一篇

一种基于少量训练数据的口语语义理解技术

李明杰1,2, 贾巨涛1,2, 宋德超1,2, 吴伟1,2, 韩林峄1,2   

  1. 1.珠海格力电器股份有限公司 广东珠海 519070;
    2.空调设备及系统运行节能国家重点实验室 广东珠海 519070
  • 出版日期:2020-11-10 发布日期:2021-01-05
  • 通讯作者: 李明杰 jaylingli@126.com

An oral semantic understanding technology based on a small amount of training data

LI Mingjie1,2, JIA Jutao1,2, SONG Dechao1,2, WU Wei1,2, HAN Linyi1,2   

  1. 1. GREE ELECTRIC APPLIANCES, INC. OF ZHUHAI Zhuhai 519070;
    2. State Key Laboratory of Air-conditioning Equipment and System Energy Conservation Zhuhai 519070
  • Online:2020-11-10 Published:2021-01-05

摘要: 随着人工智能的发展,语音交互技术在智能家居、智能助理等领域得到长足的发展,口语化短文本的语义理解技术成为研究重点。口语文本中存在句法不规范、关键信息少、句子间差异小等特点,影响语义识别准确率,模型需不断更新迭代。本文将层次化理论与智能家居领域语音交互应用相结合,形成多层语义解析架构,实现文本数据到结构化知识的转变,在少量文本数据的基础上,实现训练耗时10秒以内,平均准确率96%以上,并设计了分布式模型训练系统,满足工业级应用需求。

关键词: 语音交互, 口语语义, 少量数据, 层次化

Abstract: With the development of artificial intelligence, speech interaction technology has made great progress in smart home, intelligent assistant and other fields. The semantic understanding technology of oral short text has become the research focus. There are some characteristics in spoken text, such as nonstandard syntax, less key information and small differences between sentences, which affect the accuracy of semantic recognition. In this paper, the hierarchical theory is combined with the voice interaction application in smart home field to form a multi-layer semantic analysis framework, which realizes the transformation from text data to structured knowledge. On the basis of a small amount of text data, the training time is less than 10 seconds, and the average accuracy rate is more than 96%. A distributed model training system is designed to meet the industrial application requirements.

Key words: Voice interaction, Oral semantics, A small amount of data, Hierarchical

中图分类号: