9
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Visual Tuning

      Read this article at

      ScienceOpenPublisher
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Fine-tuning visual models has been widely shown promising performance on many downstream visual tasks. With the surprising development of pre-trained visual foundation models, visual tuning jumped out of the standard modus operandi that fine-tunes the whole pre-trained model or just the fully connected layer. Instead, recent advances can achieve superior performance than full-tuning the whole pre-trained parameters by updating far fewer parameters, enabling edge devices and downstream applications to reuse the increasingly large foundation models deployed on the cloud. With the aim of helping researchers get the full picture and future directions of visual tuning, this survey characterizes a large and thoughtful selection of recent works, providing a systematic and comprehensive overview of existing work and models. Specifically, it provides a detailed background of visual tuning and categorizes recent visual tuning techniques into five groups: fine-tuning, prompt tuning, adapter tuning, parameter tuning, and remapping tuning. Meanwhile, it offers some exciting research directions for prospective pre-training and various interactions in visual tuning.

          Related collections

          Most cited references214

          • Record: found
          • Abstract: found
          • Article: not found

          Deep learning.

          Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction. These methods have dramatically improved the state-of-the-art in speech recognition, visual object recognition, object detection and many other domains such as drug discovery and genomics. Deep learning discovers intricate structure in large data sets by using the backpropagation algorithm to indicate how a machine should change its internal parameters that are used to compute the representation in each layer from the representation in the previous layer. Deep convolutional nets have brought about breakthroughs in processing images, video, speech and audio, whereas recurrent nets have shone light on sequential data such as text and speech.
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            Deep Residual Learning for Image Recognition

              • Record: found
              • Abstract: not found
              • Article: not found

              ImageNet classification with deep convolutional neural networks

                Author and article information

                Contributors
                Journal
                ACM Computing Surveys
                ACM Comput. Surv.
                Association for Computing Machinery (ACM)
                0360-0300
                1557-7341
                December 31 2024
                July 25 2024
                December 31 2024
                : 56
                : 12
                : 1-38
                Affiliations
                [1 ]Zhejiang University-University of Illinois Urbana-Champaign Institute, Zhejiang University, Haining, China and Zhejiang Provincial Engineering Research Center for Multimodal Transport Logistics Large Models, Haining China
                [2 ]Huawei, Shenzhen China
                [3 ]Peking University, National Engineering Research Center for Software Engineering, Beijing China
                [4 ]Peng Cheng Laboratory, Shenzhen China
                [5 ]Shandong University of Science and Technology, College of Computer Science and Engineering, Qingdao China
                [6 ]National Key Lab of General AI, Peking University, School of Intelligence Science and Technology, Peking University, Beijing, China and Pazhou Laboratory (Huangpu), Guangzhou China
                [7 ]The Hong Kong Polytechnic University, Department of Computing, Hong Kong, Hong Kong
                Article
                10.1145/3657632
                bfae1fe3-6e90-40f5-87cc-a89703b1a65e
                © 2024
                History

                Comments

                Comment on this article

                Related Documents Log