家电科技 ›› 2023, Vol. 0 ›› Issue (1): 74-79.doi: 10.19784/j.cnki.issn1672-0172.2023.01.012

• 论文 • 上一篇    下一篇

基于迁移学习的室内小样本声源定位方法研究

吴爽, 冯涛, 王晶   

  1. 北京工商大学人工智能学院 北京 100048
  • 出版日期:2023-02-01 发布日期:2023-04-24
  • 通讯作者: 冯涛,E-mail:fengt@btbu.edu.cn。
  • 作者简介:吴爽,硕士研究生。研究方向:声源定位、机器学习。地址:北京市海淀区阜成路11号北京工商大学。E-mail:15811457825@163.com。

Indoor small sample sound source localization method based on transfer learning

WU Shuang, FENG Tao, WANG Jing   

  1. Beijing Technology and Business University School of Artificial Intelligence Beijing 100048
  • Online:2023-02-01 Published:2023-04-24

摘要: 针对实测数据样本数量不足,导致声源定位模型定位性能受限的问题,提出一种基于迁移学习的室内声源定位方法。该方法使用卷积神经网络搭建迁移学习模型,对大量的仿真数据进行预训练,在预训练模型的基础上对小样本实测数据进行再训练,从而实现小样本数据的室内声源定位。基于TAU Spatial Sound Events 2019数据集的实验表明:迁移学习模型针对不同小样本实测数据均可实现高准确率方位预测,且定位性能优于传统卷积神经网络模型,对迁移学习理论在室内声源定位中的应用具有一定的价值。

关键词: 室内声源定位, 迁移学习, 小样本, 卷积神经网络

Abstract: Aiming at the problem of limited sound source localization model performance caused by the insufficient number of measured data samples, an indoor sound source localization method based on transfer learning is proposed. The method uses a convolutional neural network to build a transfer learning model, pre-trains a large amount of simulated data, and re-trains a small sample of measured data on the basis of the pre-trained model, so as to achieve indoor sound source localization with small sample data. Experiments based on the TAU Spatial Sound Events 2019 dataset show that the transfer learning model can achieve high-accuracy azimuth prediction for different small sample measured data, and the localization performance is better than the traditional convolutional neural network model, has certain value for the application of transfer learning theory in indoor sound source localization.

Key words: Indoor sound source localization, Transfer learning, Small sample, Convolutional neural network

中图分类号: