家电科技 ›› 2025, Vol. 0 ›› Issue (6): 14-19.doi: 10.19784/j.cnki.issn1672-0172.2025.06.001

• 论文 • 上一篇    下一篇

嵌入式蒸烤箱多模态菜谱生成技术研究

屈新芳1, 张立民1, 叶至壮1, 关阳2   

  1. 1.中家院(北京)检测认证有限公司 北京 100176;
    2.中国家用电器研究院 北京 100037
  • 出版日期:2025-12-01 发布日期:2026-04-03
  • 作者简介:屈新芳,学士学位。研究方向:家电检测和技术标准。地址:北京市北京经济技术开发区博兴八路3号。E-mail:quxf@cheari.com。

Research on multi-modal recipe generation technology for embedded steamer oven

QU Xinfang1, ZHANG Limin1, YE Zhizhuang1, GUAN Yang2   

  1. 1. CHEARI (Beijing) Certification & Testing Co., Ltd. Beijing 100176;
    2. China Household Electric Appliances Research Institute Beijing 100037
  • Online:2025-12-01 Published:2026-04-03

摘要: 针对嵌入式蒸烤箱智能化升级需求,解决传统设备操作模式单一、个性化烹饪需求匹配不足的问题,构建一种融合多模态数据的精准菜谱生成系统,为智能厨房家电技术创新提供支撑。该系统通过采集食材图像、重量、用户语音指令、环境传感器数据等多模态信息,结合小波变换去噪、CLAHE图像增强及MaskR-CNN分割等预处理技术,采用早期与晚期融合的多模态数据算法架构,构建基于CNN、LSTM和MLP的深度学习模型,并通过剪枝算法优化模型性能。实验表明,图像噪声(标准差0.1时准确率从91.45%降至68.34%)和语音识别错误会显著影响推荐精度;模型经30%剪枝后计算量减少38.41%,精度损失控制在3%以内。对100名用户调研显示,63.24%年轻用户偏好语音交互,28.48%中老年用户倾向手动操作。实现了多模态数据在菜谱生成中的有效应用,为设备智能化提供技术路径,未来需进一步优化跨模态语义合成与用户个性化适配能力,推动智能厨房家电发展。

关键词: 嵌入式蒸烤箱, 多模态数据, 菜谱生成, 深度学习

Abstract: To address the demand for intelligent upgrading of embedded steam ovens and solve the problems of single operation mode and insufficient matching of personalized cooking needs in traditional equipment, an accurate recipe generation system that integrates multi-modal data was constructed, providing support for technological innovation in intelligent kitchen appliances. By collecting multi-modal information such as food ingredient images, weight, user voice commands, and environmental sensor data, combined with preprocessing techniques such as wavelet transform denoising, CLAHE image enhancement, and MaskR-CNN segmentation, an early and late fusion multi-modal data algorithm architecture is adopted to construct a deep learning model based on CNN, LSTM, and MLP, and the model performance is optimized through pruning algorithm. The experiment showed that image noise (accuracy decreased from 91.45% to 68.34% with a standard deviation of 0.1) and speech recognition errors significantly affect recommendation accuracy. After 30% pruning, the computational complexity of the model decreased by 38.41%, and the accuracy loss was controlled within 3%. According to a survey of 100 users, 63.24% of young users prefer voice interaction, while 28.48% of middle-aged and elderly users prefer manual operation. The system has achieved effective application of multi-modal data in recipe generation, providing a technical path for device intelligence. In the future, it is necessary to further optimize cross-modal semantic synthesis and user personalized adaptation capabilities to promote the development of intelligent kitchen appliances.

Key words: Embedded steam oven, Multi-modal data, Recipe generation, Deep learning

中图分类号: