Search for authorsSearch for similar articles
5
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: not found
      • Article: not found

      A Deep and Autoregressive Approach for Topic Modeling of Multimodal Data

      Read this article at

      ScienceOpenPublisherPubMed
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          <p xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" class="first" dir="auto" id="d4341076e53">Topic modeling based on latent Dirichlet allocation (LDA) has been a framework of choice to deal with multimodal data, such as in image annotation tasks. Another popular approach to model the multimodal data is through deep neural networks, such as the deep Boltzmann machine (DBM). Recently, a new type of topic model called the Document Neural Autoregressive Distribution Estimator (DocNADE) was proposed and demonstrated state-of-the-art performance for text document modeling. In this work, we show how to successfully apply and extend this model to multimodal data, such as simultaneous image classification and annotation. First, we propose SupDocNADE, a supervised extension of DocNADE, that increases the discriminative power of the learned hidden topic features and show how to employ it to learn a joint representation from image visual words, annotation words and class label information. We test our model on the LabelMe and UIUC-Sports data sets and show that it compares favorably to other topic models. Second, we propose a deep extension of our model and provide an efficient way of training the deep model. Experimental results show that our deep model outperforms its shallow version and reaches state-of-the-art performance on the Multimedia Information Retrieval (MIR) Flickr data set. </p>

          Related collections

          Author and article information

          Journal
          IEEE Transactions on Pattern Analysis and Machine Intelligence
          IEEE Trans. Pattern Anal. Mach. Intell.
          Institute of Electrical and Electronics Engineers (IEEE)
          0162-8828
          2160-9292
          June 1 2016
          June 1 2016
          : 38
          : 6
          : 1056-1069
          Article
          10.1109/TPAMI.2015.2476802
          26372202
          240e68d6-e75b-461a-b8c8-b8bd51f1ed55
          © 2016
          History

          Comments

          Comment on this article