家电科技, 2026, Vol. 0, Issue (2): 71-75. doi: 10.19784/j.cnki.issn1672-0172.2026.02.011

• Article

Research and implementation of collaborative wake-up response mechanism for voice-enabled intelligent devices
(智能语音设备协同唤醒响应机制研究与实现)

Liu Yonghong1, Xu Yi1, Wu Yunyun1, Zhang Wenbin1, Wu Yang1, Wu Zehao1,2

  1. Midea Group (Shanghai) Co., Ltd., Shanghai 201700;
  2. Foshan Graduate School of Innovation, Northeastern University, Foshan, Guangdong 528000
  • Online: 2026-04-01 Published: 2026-06-17
  • Corresponding author: Xu Yi, E-mail: xuyi42@midea.com.
  • About the author: Liu Yonghong, master's degree. Research interest: edge intelligence. Address: Midea Global Innovation Park, West Hongqiao Business District, Qingpu District, Shanghai. E-mail: yonghong2.liu@midea.com.

Abstract: As voice-enabled intelligent devices are widely deployed in home environments, interaction conflicts caused by multiple devices responding simultaneously to a user's wake-up command have become increasingly prominent. Existing collaborative wake-up response mechanisms rely on network infrastructure and therefore fail in offline scenarios; moreover, metrics such as signal strength alone cannot accurately identify the target device that the user is directly facing. To address these problems, a network-free collaborative scheme based on IEEE 802.11 management frames is proposed. The scheme lets devices communicate directly with one another and fuses multidimensional wake-up features, including coherent reverberant SNR and two-stage SNR, to reach a distributed decision. Experimental results in a simulated home environment show a unique-response accuracy of 96.65% and a facing-device accuracy of 90.18%, significantly improving the accuracy and user experience of multi-device voice interaction.

Key words: Voice interaction, Collaborative response, Distributed decision-making, Direct device-to-device communication, Coherent reverberant signal-to-noise ratio
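
The distributed decision described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives neither the fusion rule nor the frame layout, so the `WakeReport` fields, the `WEIGHTS` values, the weighted-sum fusion and the tie-break by device ID are all assumptions made for illustration, and the exchange over IEEE 802.11 management frames is replaced by a plain Python list. The point it shows is that when every device applies the same deterministic ranking to the reports it has heard, exactly one device decides to respond without any central coordinator.

```python
from dataclasses import dataclass

# Hypothetical fusion weights; the abstract does not state the actual fusion rule.
WEIGHTS = {"crsnr_db": 0.5, "snr_stage1_db": 0.25, "snr_stage2_db": 0.25}


@dataclass(frozen=True)
class WakeReport:
    """One device's wake-up features, as it might broadcast them to its peers."""
    device_id: str
    crsnr_db: float       # coherent reverberant SNR (assumed to be in dB)
    snr_stage1_db: float  # first-stage SNR (assumed field)
    snr_stage2_db: float  # second-stage SNR (assumed field)

    def score(self) -> float:
        """Weighted sum of the features; an illustrative fusion only."""
        return sum(w * getattr(self, name) for name, w in WEIGHTS.items())


def should_respond(own: WakeReport, heard: list[WakeReport]) -> bool:
    """Local decision: respond only if no heard peer outranks this device.

    The ranking key is (score, device_id), so ties break identically on
    every device and at most one device answers.
    """
    own_key = (own.score(), own.device_id)
    return all(own_key >= (peer.score(), peer.device_id) for peer in heard)


# Simulated exchange: every device hears the reports of all the others.
reports = [
    WakeReport("speaker", crsnr_db=12.0, snr_stage1_db=20.0, snr_stage2_db=18.0),
    WakeReport("tv", crsnr_db=8.0, snr_stage1_db=22.0, snr_stage2_db=16.0),
    WakeReport("fridge", crsnr_db=12.0, snr_stage1_db=20.0, snr_stage2_db=18.0),
]
responders = [
    r.device_id
    for r in reports
    if should_respond(r, [p for p in reports if p is not r])
]
print(responders)  # ['speaker']
```

In this example "speaker" and "fridge" have identical scores, and the device ID decides the tie. A real deployment would also have to handle lost or late frames, for instance with a short collection window before deciding, which this sketch leaves out.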

CLC number: