基于迁移学习的室内小样本声源定位方法研究

doi:10.19784/j.cnki.issn1672-0172.2023.01.012

家电科技 ›› 2023, Vol. 0 ›› Issue (1): 74-79.doi: 10.19784/j.cnki.issn1672-0172.2023.01.012

基于迁移学习的室内小样本声源定位方法研究

吴爽, 冯涛, 王晶

北京工商大学人工智能学院北京 100048

出版日期:2023-02-01 发布日期:2023-04-24
通讯作者: 冯涛,E-mail：fengt@btbu.edu.cn。
作者简介:吴爽,硕士研究生。研究方向：声源定位、机器学习。地址：北京市海淀区阜成路11号北京工商大学。E-mail：15811457825@163.com。

Indoor small sample sound source localization method based on transfer learning

WU Shuang, FENG Tao, WANG Jing

Beijing Technology and Business University School of Artificial Intelligence Beijing 100048

Online:2023-02-01 Published:2023-04-24

摘要/Abstract

摘要： 针对实测数据样本数量不足,导致声源定位模型定位性能受限的问题,提出一种基于迁移学习的室内声源定位方法。该方法使用卷积神经网络搭建迁移学习模型,对大量的仿真数据进行预训练,在预训练模型的基础上对小样本实测数据进行再训练,从而实现小样本数据的室内声源定位。基于TAU Spatial Sound Events 2019数据集的实验表明：迁移学习模型针对不同小样本实测数据均可实现高准确率方位预测,且定位性能优于传统卷积神经网络模型,对迁移学习理论在室内声源定位中的应用具有一定的价值。

关键词: 室内声源定位, 迁移学习, 小样本, 卷积神经网络

Abstract: Aiming at the problem of limited sound source localization model performance caused by the insufficient number of measured data samples, an indoor sound source localization method based on transfer learning is proposed. The method uses a convolutional neural network to build a transfer learning model, pre-trains a large amount of simulated data, and re-trains a small sample of measured data on the basis of the pre-trained model, so as to achieve indoor sound source localization with small sample data. Experiments based on the TAU Spatial Sound Events 2019 dataset show that the transfer learning model can achieve high-accuracy azimuth prediction for different small sample measured data, and the localization performance is better than the traditional convolutional neural network model, has certain value for the application of transfer learning theory in indoor sound source localization.

Key words: Indoor sound source localization, Transfer learning, Small sample, Convolutional neural network

中图分类号:

TB52+9

吴爽, 冯涛, 王晶. 基于迁移学习的室内小样本声源定位方法研究[J]. 家电科技, 2023, 0(1): 74-79.

WU Shuang, FENG Tao, WANG Jing. Indoor small sample sound source localization method based on transfer learning[J]. Journal of Appliance Science & Technology, 2023, 0(1): 74-79.

参考文献

[1] Zhang X F, Xiao Y G, Deng H L.Noise source localization investigation in high speed train based on microphone array[A]//Applied Mechanics and Materials[C]. Trans Tech Publications Ltd., 2012, 103: 285-291.
[2] 钟泽, 江俊, 李语亭, 等. 基于声学定位测试的冰箱制冷剂噪声的分析与优化[J]. 家电科技, 2021(04): 54-56.
[3] 李昊, 汪超, 王进, 等. 机器人吸尘器环境建模与路径规划研究综述[J]. 家电科技, 2021(02): 30-40.
[4] Torrieri D J. Statistical Theory of Passive Location Systems[J]. Aerospace & Electronic Systems IEEE Transactions on, 1984, AES-20(02): 183-198.
[5] Yook D, Lee T, Cho Y.Fast Sound Source Localization Using Two-Level Search Space Clustering[J]. IEEE Trans Cybern, 2016, 46(01): 20-26.
[6] Takeda R, Komatani K.Performance comparison of MUSIC-based sound localization methods on small humanoid under low SNR conditions[A]//2015 IEEE-RAS 15th International Conference on Humanoid Robots (Humanoids)[C]. IEEE, 2015: 859-865.
[7] Sun Y, Chen J, Yuen C, et al.Indoor sound source localization with probabilistic neural network[J]. IEEE Transactions on Industrial Electronics, 2017, 65(08): 6403-6413.
[8] Ding J, Ren B, Zheng N.Microphone array acoustic source localization system based on deep learning[A]//2018 11th International Symposium on Chinese Spoken Language Processing (ISCSLP)[C]. IEEE, 2018: 409-413.
[9] Liu N, Chen H, Songgong K, et al.Deep learning assisted sound source localization using two orthogonal first-order differential microphone arrays[J]. The Journal of the Acoustical Society of America, 2021, 149(02): 1069-1084.
[10] Tan T-H, Lin Y-T, Chang Y-L, et al.Sound Source Localization Using a Convolutional Neural Network and Regression Model[J]. Sensors, 2021, 21(23): 8031.
[11] Adavanne S, Politis A, Nikunen J, et al.Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks[J]. IEEE Journal of Selected Topics in Signal Processing, 2019,13(01): 34-48.
[12] Pan S J, Yang Q.A survey on transfer learning[J]. IEEE Transactions on knowledge and data engineering, 2009, 22(10): 1345-1359.
[13] Scheibler R, Bezzam E, Dokmanić I.Pyroomacoustics: A python package for audio room simulation and array processing algorithms[A]//2018 IEEE international conference on acoustics, speech and signal processing (ICASSP)[C], 2018: 351-355.
[14] Patil U G, Shirbahadurkar S D.Performance analysis of SS based speech enhancement algorithms for ASR with Non-stationary Noisy Database-NOIZEUS[A]//2018 2nd International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud)(I-SMAC)[C]. IEEE, 2018: 636-641.
[15] Adavanne S, Politis A, Virtanen T.A multi-room reverberant dataset for sound event localization and detection[J]. arXiv preprint arXiv: 1905. 08546, 2019.
[16] Xiao X, Zhao S, Zhong X, et al.A learning-based approach to direction of arrival estimation in noisy and reverberant environments[A]//2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)[C]. IEEE, 2015: 2814-2818.

基于迁移学习的室内小样本声源定位方法研究

Indoor small sample sound source localization method based on transfer learning

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 0

Metrics

本文评价

推荐阅读 0