4
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Voice Keyword Retrieval Method Using Attention Mechanism and Multimodal Information Fusion

      1
      Scientific Programming
      Hindawi Limited

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          A cross-modal speech-text retrieval method using interactive learning convolution automatic encoder (CAE) is proposed. First, an interactive learning autoencoder structure is proposed, including two inputs of speech and text, as well as processing links such as encoding, hidden layer interaction, and decoding, to complete the modeling of cross-modal speech-text retrieval. Then, the original audio signal is preprocessed and the Mel frequency cepstrum coefficient (MFCC) feature is extracted. In addition, the word bag model is used to extract the text features, and then the attention mechanism is used to combine the text and speech features. Through interactive learning CAE, the shared features of speech and text modes are obtained and then sent to modal classifier to identify modal information, so as to realize cross-modal voice text retrieval. Finally, experiments show that the performance of the proposed algorithm is better than that of the contrast algorithm in terms of recall rate, accuracy rate, and false recognition rate.

          Related collections

          Most cited references4

          • Record: found
          • Abstract: not found
          • Article: not found

          Research on heterogeneous multimodal data retrieval based on hash algorithm

          F. Chen (2019)
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Label consistent locally linear embedding based cross-modal hashing

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Research progress of cross-modal retrieval method based on adversarial learning research progress of cross-modal retrieval method based on adversarial learning

              L Zhang (2019)
                Bookmark

                Author and article information

                Contributors
                Journal
                Scientific Programming
                Scientific Programming
                Hindawi Limited
                1875-919X
                1058-9244
                January 23 2021
                January 23 2021
                : 2021
                : 1-11
                Affiliations
                [1 ]Department of Educational Technology, Inner Mongolia Normal University, Inner Mongolia, Hohhot 010022, China
                Article
                10.1155/2021/6662841
                7a886ff3-fa77-4fe1-9895-0dfdc6dbb107
                © 2021

                https://creativecommons.org/licenses/by/4.0/

                History

                Comments

                Comment on this article