Research on Multi-Agent Modeling of Smart Home Systems

doi:10.19784/j.cnki.issn1672-0172.2022.05.001

Abstract

This paper takes the smart home system as its research object, builds a model of it based on Multi-Agent theory, and uses Value Decomposition Networks (VDN) as the model algorithm to optimize the Q function. On this basis, it proposes establishing a rule base and a knowledge base for each agent in the smart home and opening up, in a systematic way, the construction of each agent's BDI (Belief-Desire-Intention) set, so that the usage utility of the smart home can be quantitatively analyzed and optimized, from philosophical logic through to payoff modeling.

Key words: Multi-Agent system, Smart home, Scene, Reinforcement learning, Q-learning
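
The value decomposition idea named in the abstract can be illustrated with a small sketch. It is not the paper's model: it is a tabular simplification that assumes two hypothetical appliance agents sharing one discrete state, whereas VDN proper uses neural networks over each agent's local observations. All names here (`make_q_tables`, `q_total`, `greedy_joint_action`, `vdn_update`) and the sizes, learning rate, and discount factor are illustrative assumptions. The sketch shows the core assumption that the team value is the sum of per-agent values, Q_tot = Q_1 + Q_2, trained from a single team reward.

```python
import numpy as np

# Hypothetical setup: two appliance agents observing one shared discrete state.
N_STATES = 3
N_ACTIONS = [2, 3]  # assumed number of actions for agent 0 and agent 1


def make_q_tables(n_states, n_actions):
    """Create one zero-initialised Q-table per agent, shape (n_states, n_actions_i)."""
    return [np.zeros((n_states, n)) for n in n_actions]


def q_total(q_tables, state, actions):
    """Additive decomposition: the team value is the sum of per-agent values."""
    return sum(q[state, a] for q, a in zip(q_tables, actions))


def greedy_joint_action(q_tables, state):
    """Since Q_tot is a sum, each agent can maximise its own term independently."""
    return [int(np.argmax(q[state])) for q in q_tables]


def vdn_update(q_tables, state, actions, team_reward, next_state,
               alpha=0.1, gamma=0.9):
    """Apply one tabular TD step on Q_tot using a single shared team reward."""
    next_actions = greedy_joint_action(q_tables, next_state)
    target = team_reward + gamma * q_total(q_tables, next_state, next_actions)
    td_error = target - q_total(q_tables, state, actions)
    # The derivative of Q_tot with respect to each Q_i is 1,
    # so every agent's table receives the same TD error.
    for q, a in zip(q_tables, actions):
        q[state, a] += alpha * td_error
    return td_error


if __name__ == "__main__":
    tables = make_q_tables(N_STATES, N_ACTIONS)
    err = vdn_update(tables, state=0, actions=[1, 2], team_reward=1.0, next_state=1)
    print("TD error:", err)
    print("Greedy joint action in state 0:", greedy_joint_action(tables, 0))
```

Because the joint value is a plain sum, the greedy joint action falls out of independent per-agent maximisation, which is what makes decentralised execution possible after centralised training.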

CLC number:

TP27
TP368

QU Zongfeng. Research on smart home system of Multi-Agent modeling[J]. Journal of Appliance Science & Technology, 2022(5): 16-21.
