UNeXt: MLP-based Rapid Medical Image Segmentation Network

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

UNet and its latest extensions like TransUNet have been the leading medical image segmentation methods in recent years. However, these networks cannot be effectively adopted for rapid image segmentation in point-of-care applications as they are parameter-heavy, computationally complex and slow to use. To this end, we propose UNeXt which is a Convolutional multilayer perceptron (MLP) based network for image segmentation. We design UNeXt in an effective way with an early convolutional stage and a MLP stage in the latent stage. We propose a tokenized MLP block where we efficiently tokenize and project the convolutional features and use MLPs to model the representation. To further boost the performance, we propose shifting the channels of the inputs while feeding in to MLPs so as to focus on learning local dependencies. Using tokenized MLPs in latent space reduces the number of parameters and computational complexity while being able to result in a better representation to help segmentation. The network also consists of skip connections between various levels of encoder and decoder. We test UNeXt on multiple medical image segmentation datasets and show that we reduce the number of parameters by 72x, decrease the computational complexity by 68x, and improve the inference speed by 10x while also obtaining better segmentation performance over the state-of-the-art medical image segmentation architectures. Code is available at https://github.com/jeya-maria-jose/UNeXt-pytorch

Abstract

Tech Report

Related collections

Author and article information

Journal

Publisher: arXiv

Publication date (Electronic): 2022

Publication date Submitted: 09 March 2022

Publication date Updated: 11 March 2022

Publication date Available: March 2022

Article

DOI: 10.48550/ARXIV.2203.04967

SO-VID: 839453af-909f-4815-887d-4c95496775e9

License:

arXiv.org perpetual, non-exclusive license

History

Keywords: FOS: Computer and information sciences,FOS: Electrical engineering, electronic engineering, information engineering,Computer Vision and Pattern Recognition (cs.CV),Image and Video Processing (eess.IV)

Data availability:

Keywords: FOS: Computer and information sciences, FOS: Electrical engineering, electronic engineering, information engineering, Computer Vision and Pattern Recognition (cs.CV), Image and Video Processing (eess.IV)

UNeXt: MLP-based Rapid Medical Image Segmentation Network

Read this article at

Abstract

Abstract

Related collections

Recursive Rule based Visual Categorization

Author and article information

Journal

Article

History

Comments

Comment on this article

Similar content 240

Cited by 8