AI-SAM: Automatic and Interactive Segment Anything Model

Yimu Pan (1), Sitao Zhang (1), Alison D. Gernand (1), Jeffery A. Goldstein (2), and James Z. Wang (1)

(1) The Pennsylvania State University, University Park, Pennsylvania, USA
(2) Northwestern University, Chicago, Illinois, USA
Abstract:

Semantic segmentation is a core task in computer vision, traditionally approached as either automatic or interactive. Interactive approaches, exemplified by the Segment Anything Model, have shown promise as pre-trained models, but current adaptation strategies tend to favor either automatic or interactive methods. Interactive approaches rely on user prompts, whereas automatic methods bypass interactive promptability entirely. We introduce the Automatic and Interactive Segment Anything Model (AI-SAM), a novel paradigm that addresses these limitations. At its core is the Automatic and Interactive Prompter (AI-Prompter), which automatically generates initial prompts while allowing user input. AI-SAM achieves state-of-the-art performance in both medical and non-medical applications and offers flexibility to further enhance results through user interaction. Code is available at https://github.com/ymp5078/AI-SAM.


Full Paper
(PDF, 1.6MB)

NIH project information

More information


Citation: Yimu Pan, Sitao Zhang, Alison D. Gernand, Jeffery A. Goldstein and James Z. Wang, ``AI-SAM: Automatic and Interactive Segment Anything Model,'' Proceedings of the 9th International Workshop on Health Intelligence, in conjunction with the Annual AAAI Conference on Artificial Intelligence, pp. -, Philadelphia, Pennsylvania, March 2025.

© 2025 AAAI. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the AAAI.

Last Modified: February 10, 2025
© 2025