家电科技 ›› 2025, Vol. 0 ›› Issue (zk): 17-23.doi: 10.19784/j.cnki.issn1672-0172.2025.99.004

• 第一部分 优秀论文 • 上一篇    下一篇

智慧家庭语音交互意图理解评测现状

潘悦然1,2, 焦利敏3,4,5, 杨荣祥1,2, 崔庆用1,2, 姚昌松1,2, 徐翼1,2   

  1. 1.美的集团股份有限公司 广东佛山 528311;
    2.美的集团(上海)AI研究院 上海 201702;
    3.中国家用电器研究院 北京 100176;
    4.中家院(北京)检测认证有限公司 北京 100176;
    5.国家智能家居质量检测检验中心 北京 100176
  • 发布日期:2025-12-30
  • 作者简介:潘悦然,博士学位。研究方向:人工智能。地址:上海市青浦区徐民路777号美的上海全球创新园区。E-mail:panyr9@midea.com。

A survey on the evaluation of intent understanding in smart home voice interaction

PAN Yueran1,2, JIAO Limin3,4,5, YANG Rongxiang1,2, CUI Qingyong1,2, YAO Changsong1,2, XU Yi1,2   

  1. 1. Midea Group Co., Ltd. Foshan 528311;
    2. AI Research Center, Midea Group (Shanghai) Co., Ltd. Shanghai 201702;
    3. China Household Electric Appliance Research Institute Beijing 100176;
    4. CHEARI (Beijing) Certification & Testing Co., Ltd. Beijing 100176;
    5. National Smart Home Quality Supervision & Inspection Center Beijing 100176
  • Published:2025-12-30

摘要: 随着人工智能与物联网的融合,智慧家庭语音交互正由功能指令执行迈向需求理解和主动服务。意图理解作为服务的核心,其评测体系是衡量语音助手性能与用户体验的关键。综述了智慧家庭意图理解的评测现状,系统梳理了技术演进、评测数据集和指标体系。针对现有局限,提出评测体系面临的三大挑战:智能程度量化难、复杂交互覆盖不足、语料标准化缺失,并展望未来应构建以用户体验与情感智能为导向的新型评测框架。进一步提出三项建设路径:标准化数据与脚本、统一指标与报告协议、仿真与真实场景的迁移验证,为实现人本位智能评测提供参考。

关键词: 智慧家庭, 语音交互, 意图理解, 评测系统, 大语言模型

Abstract: With the integration of artificial intelligence (AI) and the Internet of Things (IoT), smart home voice interaction is evolving from functional command execution to demand understanding and proactive service. Intent Understanding is the core of this service , and its evaluation system is crucial for measuring voice assistant performance and user experience. Reviews the current status of evaluation for intent understanding in smart home voice interaction, systematically summarizing its technical evolution, datasets, and metric systems. Addressing existing limitations, it identifies three major challenges: difficulty in quantifying intelligence, insufficient coverage of complex interactions, and lack of corpus standardization. Further envisions a human-centered evaluation framework guided by user experience and emotional intelligence. Also puts forward three construction paths: standardizing data and scripts , unifying metrics and reporting protocols , and validating transferability between simulation and real-world scenarios, providing a reference for achieving human-centric intelligence evaluation.

Key words: Smart home, Voice interaction, Intent understanding, Evaluation framework, Large Language Model (LLM)

中图分类号: