      Is Open Access

      MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation

      Preprint


          Abstract

          We introduce a new architecture for personalization of text-to-image diffusion models, coined Mixture-of-Attention (MoA). Inspired by the Mixture-of-Experts mechanism utilized in large language models (LLMs), MoA distributes the generation workload between two attention pathways: a personalized branch and a non-personalized prior branch. MoA is designed to retain the original model's prior by fixing its attention layers in the prior branch, while minimally intervening in the generation process with the personalized branch that learns to embed subjects in the layout and context generated by the prior branch. A novel routing mechanism manages the distribution of pixels in each layer across these branches to optimize the blend of personalized and generic content creation. Once trained, MoA facilitates the creation of high-quality, personalized images featuring multiple subjects with compositions and interactions as diverse as those generated by the original model. Crucially, MoA enhances the distinction between the model's pre-existing capability and the newly augmented personalized intervention, thereby offering a more disentangled subject-context control that was previously unattainable. Project page: https://snap-research.github.io/mixture-of-attention
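The two-branch design described in the abstract can be illustrated with a small sketch. It is not the authors' implementation. The abstract does not say how the router is parameterized, so the code assumes a per-pixel linear router followed by a softmax over the two branches, and it uses plain scaled dot-product attention for both branches. All names (`attention`, `mixture_of_attention`, `router_weights`, `prior_kv`, `personal_kv`) are hypothetical.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention on plain Python lists."""
    scale = 1.0 / math.sqrt(len(keys[0]))
    out = []
    for q in queries:
        scores = [scale * sum(a * b for a, b in zip(q, k)) for k in keys]
        weights = softmax(scores)
        out.append([
            sum(w * v[d] for w, v in zip(weights, values))
            for d in range(len(values[0]))
        ])
    return out


def mixture_of_attention(pixels, prior_kv, personal_kv, router_weights):
    """Blend a prior branch and a personalized branch per pixel.

    pixels:         list of pixel feature vectors (used as queries)
    prior_kv:       (keys, values) of the prior branch (frozen in the paper)
    personal_kv:    (keys, values) of the personalized branch
    router_weights: two rows of weights, one logit per branch (assumed form)
    """
    prior_out = attention(pixels, *prior_kv)
    personal_out = attention(pixels, *personal_kv)
    mixed, gates = [], []
    for x, p, s in zip(pixels, prior_out, personal_out):
        # Assumption: one linear logit per branch, normalized with softmax.
        logits = [sum(w * xi for w, xi in zip(row, x)) for row in router_weights]
        g_prior, g_personal = softmax(logits)
        gates.append((g_prior, g_personal))
        mixed.append([g_prior * a + g_personal * b for a, b in zip(p, s)])
    return mixed, gates
```

Under these assumptions, a pixel whose router logits strongly favor the prior branch reproduces the prior branch's output almost exactly, which is one way to picture the "minimal intervention" the abstract describes.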

          Author and article information

          Date: 17 April 2024
          arXiv ID: 2404.11565
          8289d5bd-132c-4bc7-9bc2-5aa2f0d165c0

          License: CC BY-NC-SA 4.0 (http://creativecommons.org/licenses/by-nc-sa/4.0/)

          Custom metadata
          Project Website: https://snap-research.github.io/mixture-of-attention
          arXiv categories: cs.CV, cs.AI, cs.GR

          Computer vision & Pattern recognition, Artificial intelligence, Graphics & Multimedia design
