[1] |
Liu S, Demirel M F, Liang Y.N-gram graph: Simple unsupervised representation for graphs, with applications to molecules[J]. Advances in neural information processing systems, 2019, 32.
|
[2] |
Vaswani A, Shazeer N, Parmar N, et al.Attention is all you need[J]. Advances in neural information processing systems, 2017, 30.
|
[3] |
Peters M, Neumann M, Iyyer M, et al.Deep contextualized word representations[A]// Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)[C], volume 1. 2018: 2227-2237.
|
[4] |
Radford A, Narasimhan K, Salimans T, et al.Improving language understanding by generative pre-training[J].
|
[5] |
Radford A, Wu J, Child R, et al.Language models are unsupervised multitask learners[J]. OpenAI blog, 2019, 1(08): 09.
|
[6] |
Raffel C, Shazeer N, Roberts A, et al.Exploring the limits of transfer learning with a unified text to-text transformer[J]. The Journal of Machine Learning Research, 2020, 21(01): 5485-5551.
|
[7] |
Brown T, Mann B, Ryder N, et al.Language models are few-shot learners[J]. Advances in neural information processing systems, 2020, 33:1877-1901.
|
[8] |
Zhao W X, Zhou K, Li J, et al. A survey of large language models[J]. arXiv preprint arXiv:2303.18223, 2023.
|
[9] |
田云龙, 王统帅, 牛丽. 智能家居领域利用AIGC家电垂直大模型提升洗衣机智能交互体验的系统和方法[J]. 家电科技, 2023(S1): 126-130.
|
[10] |
Paine T L, Khorrami P, Chang S, et al. Fast wavenet generation algorithm[J]. arxiv preprint arxiv:1611.09482, 2016.
|
[11] |
Klejsa J, Hedelin P, Zhou C, et al.High-quality speech coding with sample RNN[A]//ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)[C], IEEE, 2019: 7155-7159.
|
[12] |
Sotelo J, Mehri S, Kumar K, et al.Char2wav: End-to-end speech synthesis[J]. 2017.
|
[13] |
Wang Y, Skerry-Ryan R J, Stanton D, et al. Tacotron: Towards end-to-end speech synthesis[J]. arxiv preprint arxiv:1703.10135, 2017.
|
[14] |
Skerry-Ryan R J, Battenberg E, **ao Y, et al. Towards end-to-end prosody transfer for expressive speech synthesis with tacotron[A]// international conference on machine learning. PMLR[C], 2018: 4693-4702.
|
[15] |
McGilloway S, Cowie R, Douglas-Cowie E, et al. Approaching Automatic Recognition of Emotion from Voice: A Rough Benchmark [A]// ISCA Workshop on Speech & Emotion[C], 2000.
|
[16] |
Sun C, Zhang M, Wu R, et al.A convolutional recurrent neural network with attention framework for speech separation in monaural recordings[J]. Scientific Reports, 2021, 11(01): 1434.
|
[17] |
Wei-Ning Hsu, Benjamin Bolte, Yao-Hung Hubert Tsai, Kushal Lakhotia, Ruslan Salakhutdinov,Abdelrahman Mohamed, “Hubert: Selfsupervised speech representation learning by masked prediction of hidden units,” Trans. of TASLP, 2021[Z].
|