1. Hosseinzadeh Taher MR, Haghighi F, Gotway MB, Liang J. Large-scale benchmarking and boosting transfer learning for medical image analysis. Med Image Anal 2025;102:103487. PMID: 40117988; DOI: 10.1016/j.media.2025.103487.
Abstract
Transfer learning, particularly fine-tuning models pretrained on photographic images to medical images, has proven indispensable for medical image analysis. There are numerous models with distinct architectures pretrained on various datasets using different strategies. However, there is a lack of up-to-date large-scale evaluations of their transferability to medical imaging, posing a challenge for practitioners in selecting the most appropriate pretrained models for their tasks at hand. To fill this gap, we conduct a comprehensive systematic study, focusing on (i) benchmarking numerous conventional and modern convolutional neural network (ConvNet) and vision transformer architectures across various medical tasks; (ii) investigating the impact of fine-tuning data size on the performance of ConvNets compared with vision transformers in medical imaging; (iii) examining the impact of pretraining data granularity on transfer learning performance; (iv) evaluating the transferability of a wide range of recent self-supervised methods with diverse training objectives to a variety of medical tasks across different modalities; and (v) delving into the efficacy of domain-adaptive pretraining on both photographic and medical datasets to develop high-performance models for medical tasks. Our large-scale study (∼5,000 experiments) yields impactful insights: (1) ConvNets demonstrate higher transferability than vision transformers when fine-tuning for medical tasks; (2) ConvNets prove to be more annotation-efficient than vision transformers when fine-tuning for medical tasks; (3) Fine-grained representations, rather than high-level semantic features, prove pivotal for fine-grained medical tasks; (4) Self-supervised models excel in learning holistic features compared with supervised models; and (5) Domain-adaptive pretraining leads to performant models by harnessing knowledge acquired from ImageNet and enhancing it with the readily accessible expert annotations associated with medical datasets. In the spirit of open science, all code and pretrained models are available at GitHub.com/JLiangLab/BenchmarkTransferLearning (Version 2).
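To make the benchmarking setup concrete, the following is a minimal sketch (not the authors' released code) of how an ImageNet-pretrained ConvNet or vision transformer from torchvision is re-headed and fine-tuned for a medical classification task; the label count, hyperparameters, and dummy batch are placeholders.

```python
import torch
import torch.nn as nn
from torchvision import models

def build_finetune_model(num_classes: int, backbone: str = "resnet50") -> nn.Module:
    """Initialize an ImageNet-pretrained backbone and replace its classification
    head for a downstream medical imaging task."""
    if backbone == "resnet50":
        model = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V2)
        model.fc = nn.Linear(model.fc.in_features, num_classes)
    elif backbone == "vit_b_16":
        model = models.vit_b_16(weights=models.ViT_B_16_Weights.IMAGENET1K_V1)
        model.heads.head = nn.Linear(model.heads.head.in_features, num_classes)
    else:
        raise ValueError(f"unknown backbone: {backbone}")
    return model

model = build_finetune_model(num_classes=14)   # e.g., 14 thorax disease labels
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
criterion = nn.BCEWithLogitsLoss()             # multi-label chest X-ray setting

# One illustrative fine-tuning step on a dummy batch (replace with a real DataLoader).
images = torch.randn(8, 3, 224, 224)
labels = torch.randint(0, 2, (8, 14)).float()
loss = criterion(model(images), labels)
loss.backward()
optimizer.step()
```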
Affiliation(s)
- Fatemeh Haghighi, School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ 85281, USA
- Jianming Liang, School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ 85281, USA

2. Zou L, Cao Y, Nie Z, Mao L, Qiu Y, Wang Z, Cai Z, Yang X. Segment Like A Doctor: Learning reliable clinical thinking and experience for pancreas and pancreatic cancer segmentation. Med Image Anal 2025;102:103539. PMID: 40112510; DOI: 10.1016/j.media.2025.103539.
Abstract
Pancreatic cancer is a lethal invasive tumor with one of the worst prognoses. Accurate and reliable segmentation of the pancreas and pancreatic cancer on computed tomography (CT) images is vital in clinical diagnosis and treatment. Although certain deep learning-based techniques have been tentatively applied to this task, the current performance of pancreatic cancer segmentation is far from meeting clinical needs due to the tiny size, irregular shape and extremely uncertain boundary of the cancer. Moreover, most existing studies are built on black-box models that only learn the annotation distribution rather than the logical thinking and diagnostic experience of senior medical experts, which is more credible and interpretable. To alleviate the above issues, we propose a novel Segment-Like-A-Doctor (SLAD) framework to learn the reliable clinical thinking and experience for pancreas and pancreatic cancer segmentation on CT images. Specifically, SLAD aims to simulate the essential logical thinking and experience of doctors in the progressive diagnostic stages of pancreatic cancer: the organ, lesion and boundary stages. Firstly, in the organ stage, an Anatomy-aware Masked AutoEncoder (AMAE) is introduced to model the doctors' overall cognition of the anatomical distribution of abdominal organs on CT images by self-supervised pretraining. Secondly, in the lesion stage, a Causality-driven Graph Reasoning Module (CGRM) is designed to learn the global judgment of doctors for lesion detection by exploring topological feature differences between the causal lesion and the non-causal organ. Finally, in the boundary stage, a Diffusion-based Discrepancy Calibration Module (DDCM) is developed to fit the refined understanding of doctors for the uncertain boundary of pancreatic cancer by inferring the ambiguous segmentation discrepancy based on the trustworthy lesion core. Experimental results on three independent datasets demonstrate that our approach boosts pancreatic cancer segmentation accuracy by 4%-9% compared with state-of-the-art methods. Additionally, a tumor-vascular involvement analysis is conducted to verify the superiority of our method in clinical applications. Our source codes will be publicly available at https://github.com/ZouLiwen-1999/SLAD.
Affiliation(s)
- Liwen Zou, School of Mathematics, Nanjing University, Nanjing, 210093, China
- Yingying Cao, Department of Radiology, Affiliated Hospital of Nanjing University of Chinese Medicine, Nanjing, 210000, China
- Ziwei Nie, School of Mathematics, Nanjing University, Nanjing, 210093, China
- Liang Mao, Department of Pancreatic Surgery, Nanjing Drum Tower Hospital, Nanjing, 210008, China
- Yudong Qiu, Department of Pancreatic Surgery, Nanjing Drum Tower Hospital, Nanjing, 210008, China
- Zhongqiu Wang, Department of Radiology, Affiliated Hospital of Nanjing University of Chinese Medicine, Nanjing, 210000, China
- Zhenghua Cai, Department of Pancreatic Surgery, Nanjing Drum Tower Hospital, Nanjing, 210008, China; Medical School, Nanjing University, Nanjing, 210007, China
- Xiaoping Yang, School of Mathematics, Nanjing University, Nanjing, 210093, China

3. Zhang X, Xiao Z, Wu X, Chen Y, Zhao J, Hu Y, Liu J. Pyramid Pixel Context Adaption Network for Medical Image Classification With Supervised Contrastive Learning. IEEE Trans Neural Netw Learn Syst 2025;36:6802-6815. PMID: 38829749; DOI: 10.1109/tnnls.2024.3399164.
Abstract
Spatial attention (SA) mechanism has been widely incorporated into deep neural networks (DNNs), significantly lifting the performance in computer vision tasks via long-range dependency modeling. However, it may perform poorly in medical image analysis. Unfortunately, the existing efforts are often unaware that long-range dependency modeling has limitations in highlighting subtle lesion regions. To overcome this limitation, we propose a practical yet lightweight architectural unit, pyramid pixel context adaption (PPCA) module, which exploits multiscale pixel context information to recalibrate pixel position in a pixel-independent manner dynamically. PPCA first applies a well-designed cross-channel pyramid pooling (CCPP) to aggregate multiscale pixel context information, then eliminates the inconsistency among them by the well-designed pixel normalization (PN), and finally estimates per pixel attention weight via a pixel context integration. By embedding PPCA into a DNN with negligible overhead, the PPCA network (PPCANet) is developed for medical image classification. In addition, we introduce supervised contrastive learning to enhance feature representation by exploiting the potential of label information via supervised contrastive loss (CL). The extensive experiments on six medical image datasets show that the PPCANet outperforms state-of-the-art (SOTA) attention-based networks and recent DNNs. We also provide visual analysis and ablation study to explain the behavior of PPCANet in the decision-making process.
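The supervised contrastive loss mentioned above follows the general SupCon idea of pulling together embeddings that share a class label; a simplified, single-view sketch (not the paper's exact implementation) is shown below.

```python
import torch
import torch.nn.functional as F

def supervised_contrastive_loss(features: torch.Tensor,
                                labels: torch.Tensor,
                                temperature: float = 0.07) -> torch.Tensor:
    """Simplified supervised contrastive loss: samples sharing a label are
    treated as positives for one another."""
    z = F.normalize(features, dim=1)                      # (N, D) unit-norm embeddings
    sim = z @ z.t() / temperature                         # pairwise similarities
    n = z.size(0)
    self_mask = torch.eye(n, dtype=torch.bool, device=z.device)
    pos_mask = (labels.unsqueeze(0) == labels.unsqueeze(1)) & ~self_mask

    sim = sim.masked_fill(self_mask, float("-inf"))       # exclude self-pairs
    log_prob = sim - torch.logsumexp(sim, dim=1, keepdim=True)
    pos_counts = pos_mask.sum(1).clamp(min=1)
    loss = -(log_prob.masked_fill(~pos_mask, 0.0)).sum(1) / pos_counts
    return loss[pos_mask.sum(1) > 0].mean()               # skip anchors without positives

# Example: embeddings from a classification backbone, 4 classes in the batch.
feats = torch.randn(16, 128)
labels = torch.randint(0, 4, (16,))
print(supervised_contrastive_loss(feats, labels))
```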

4. He Y, An C, Dong K, Lyu Z, Qin S, Tan K, Hao X, Zhu C, Xiu W, Hu B, Xia N, Wang C, Dong Q. A Novel Visual Model for Predicting Prognosis of Resected Hepatoblastoma: A Multicenter Study. Acad Radiol 2025:S1076-6332(25)00197-7. PMID: 40140274; DOI: 10.1016/j.acra.2025.03.004.
Abstract
RATIONALE AND OBJECTIVES: This study aimed to evaluate the application of a contrast-enhanced CT-based visual model in predicting postoperative prognosis in patients with hepatoblastoma (HB).
MATERIALS AND METHODS: We analyzed data from 224 patients across three centers (178 in the training cohort, 46 in the validation cohort). Visual features were extracted from contrast-enhanced CT images, and key features, along with clinicopathological data, were identified using LASSO Cox regression. Visual (DINOv2_score) and clinical (Clinical_score) models were developed, and a combined model integrating DINOv2_score and clinical risk factors was constructed. Nomograms were created for personalized risk assessment, with calibration curves and decision curve analysis (DCA) used to evaluate model performance.
RESULTS: The DINOv2_score was recognized as a key prognostic indicator for HB. In both the training and validation cohorts, the combined model demonstrated superior performance in predicting disease-free survival (DFS) [C-index (95% CI): 0.886 (0.879-0.895) and 0.873 (0.837-0.909), respectively] and overall survival (OS) [C-index (95% CI): 0.887 (0.877-0.897) and 0.882 (0.858-0.906), respectively]. Calibration curves showed strong alignment between predicted and observed outcomes, while DCA demonstrated that the combined model provided greater clinical net benefit than the clinical or visual models alone across a range of threshold probabilities.
CONCLUSION: The contrast-enhanced CT-based visual model serves as an effective tool for predicting postoperative prognosis in HB patients. The combined model, integrating the DINOv2_score and clinical risk factors, demonstrated superior performance in survival prediction, offering more precise guidance for personalized treatment strategies.
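The pipeline summarized above (a DINOv2-derived visual score combined with clinical covariates in a Cox model, evaluated by the C-index) can be approximated as in the sketch below; the DINOv2 backbone is loaded from the public torch.hub entry point (which downloads a checkpoint), the survival table is synthetic, and the column names are placeholders rather than the study's actual variables.

```python
import numpy as np
import pandas as pd
import torch
from lifelines import CoxPHFitter
from lifelines.utils import concordance_index

# Frozen DINOv2 backbone (public torch.hub entry point) as a visual feature extractor.
dino = torch.hub.load("facebookresearch/dinov2", "dinov2_vits14").eval()
with torch.no_grad():
    feats = dino(torch.randn(1, 3, 224, 224))   # (1, 384) embedding of a CT slice/montage
# In a study like this, such embeddings would be reduced to a per-patient visual score.

# Synthetic patient table: a visual score plus a clinical covariate fed to a
# Cox proportional-hazards model (all names and values are made up).
rng = np.random.default_rng(0)
n = 60
score = rng.uniform(0, 1, n)
df = pd.DataFrame({
    "months":       rng.exponential(scale=36 / (1 + 2 * score)),  # follow-up time
    "event":        rng.integers(0, 2, n),                         # 1 = event observed
    "dinov2_score": score,
    "afp_level":    rng.normal(500, 150, n),
})

cph = CoxPHFitter()
cph.fit(df, duration_col="months", event_col="event")
# Concordance index: higher predicted hazard should correspond to shorter survival.
cindex = concordance_index(df["months"], -cph.predict_partial_hazard(df), df["event"])
print(f"C-index: {cindex:.3f}")
```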
Affiliation(s)
- Ying He, Department of Pediatric Surgery, The Affiliated Hospital of Qingdao University, No. 16 Jiangsu Road, Qingdao 266003, China
- Chaohui An, Department of General Surgery, Shanghai Children's Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
- Kuiran Dong, Department of Pediatric Surgery, Children's Hospital of Fudan University, 399 Wanyuan Road, Shanghai 201102, China
- Zhibao Lyu, Department of General Surgery, Shanghai Children's Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
- Shanlu Qin, Department of Pediatric Surgery, Children's Hospital of Fudan University, 399 Wanyuan Road, Shanghai 201102, China
- Kezhe Tan, Department of General Surgery, Shanghai Children's Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
- Xiwei Hao, Department of Pediatric Surgery, The Affiliated Hospital of Qingdao University, No. 16 Jiangsu Road, Qingdao 266003, China
- Chengzhan Zhu, Department of Hepatobiliary and Pancreatic Surgery, The Affiliated Hospital of Qingdao University, No. 16 Jiangsu Road, Qingdao 266003, China
- Wenli Xiu, Department of Pediatric Surgery, The Affiliated Hospital of Qingdao University, No. 16 Jiangsu Road, Qingdao 266003, China
- Bin Hu, Department of Radiology, The Affiliated Hospital of Qingdao University, No. 16 Jiangsu Road, Qingdao 266003, China
- Nan Xia, Shandong Key Laboratory of Digital Medicine and Computer-Assisted Surgery, The Affiliated Hospital of Qingdao University, No. 16 Jiangsu Road, Qingdao 266003, China
- Chaojin Wang, Department of Pediatric Surgery, The Affiliated Hospital of Qingdao University, No. 16 Jiangsu Road, Qingdao 266003, China
- Qian Dong, Department of Pediatric Surgery, The Affiliated Hospital of Qingdao University, No. 16 Jiangsu Road, Qingdao 266003, China

5. Lyu J, Bartlett PF, Nasrallah FA, Tang X. Masked Deformation Modeling for Volumetric Brain MRI Self-Supervised Pre-Training. IEEE Trans Med Imaging 2025;44:1596-1607. PMID: 40030579; DOI: 10.1109/tmi.2024.3510922.
Abstract
Self-supervised learning (SSL) has been proposed to alleviate neural networks' reliance on annotated data and to improve downstream tasks' performance, which has obtained substantial success in several volumetric medical image segmentation tasks. However, most existing approaches are designed and pre-trained on CT or MRI datasets of non-brain organs. The lack of brain prior limits those methods' performance on brain segmentation, especially on fine-grained brain parcellation. To overcome this limitation, we here propose a novel SSL strategy for MRI of the human brain, named Masked Deformation Modeling (MDM). MDM first conducts atlas-guided patch sampling on individual brain MRI scans (moving volumes) and an MNI152 template (a fixed volume). The sampled moving volumes are randomly masked in a feature-aligned manner, and then sent into a U-Net-based network to extract latent features. An intensity head and a deformation field head are used to decode the latent features, respectively restoring the masked volume and predicting the deformation field from the moving volume to the fixed volume. The proposed MDM is fine-tuned and evaluated on three brain parcellation datasets with different granularities (JHU, Mindboggle-101, CANDI), a brain lesion segmentation dataset (ATLAS2), and a brain tumor segmentation dataset (BraTS21). Results demonstrate that MDM outperforms various state-of-the-art medical SSL methods by considerable margins, and can effectively reduce the annotation effort by at least 40%. Codes and pre-trained weights will be released at https://github.com/CRazorback/MDM.
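A schematic of the dual-head objective described above (restoring masked intensities while predicting a deformation field from the moving volume toward a fixed template) is given below; the network, shapes, and loss weighting are toy stand-ins, not the released MDM code.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyMDM(nn.Module):
    """Toy dual-head network: a shared 3D encoder feeds an intensity head
    (masked-volume restoration) and a deformation head (moving -> fixed)."""
    def __init__(self, ch: int = 16):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv3d(1, ch, 3, padding=1), nn.ReLU(),
            nn.Conv3d(ch, ch, 3, padding=1), nn.ReLU(),
        )
        self.intensity_head = nn.Conv3d(ch, 1, 3, padding=1)   # restore masked voxels
        self.flow_head = nn.Conv3d(ch, 3, 3, padding=1)        # per-voxel displacement

    def forward(self, masked_moving):
        feats = self.encoder(masked_moving)
        return self.intensity_head(feats), self.flow_head(feats)

def warp(volume, flow):
    """Warp a volume with a dense displacement field via grid_sample."""
    b, _, d, h, w = volume.shape
    zs, ys, xs = torch.meshgrid(
        torch.linspace(-1, 1, d), torch.linspace(-1, 1, h), torch.linspace(-1, 1, w),
        indexing="ij")
    grid = torch.stack((xs, ys, zs), dim=-1).unsqueeze(0).expand(b, -1, -1, -1, -1)
    grid = grid + flow.permute(0, 2, 3, 4, 1)     # displacements in normalized coords
    return F.grid_sample(volume, grid, align_corners=True)

moving = torch.rand(1, 1, 32, 32, 32)             # brain MRI patch (moving volume)
fixed = torch.rand(1, 1, 32, 32, 32)              # template patch (fixed volume)
mask = (torch.rand_like(moving) > 0.6).float()    # random mask stands in for the real scheme
model = TinyMDM()

restored, flow = model(moving * mask)
loss = F.mse_loss(restored, moving) + F.mse_loss(warp(moving, flow), fixed)
loss.backward()
```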

6. Chu S, Ren X, Ji G, Zhao J, Shi J, Wei Y, Pei B, Qiang Y. Learning Consistent Semantic Representation for Chest X-ray via Anatomical Localization in Self-Supervised Pre-Training. IEEE J Biomed Health Inform 2025;29:2100-2112. PMID: 40030350; DOI: 10.1109/jbhi.2024.3505303.
Abstract
Despite the similar global structures in Chest X-ray (CXR) images, the same anatomy exhibits varying appearances across images, including differences in local textures, shapes, colors, etc. Learning consistent representations for anatomical semantics through these diverse appearances poses a great challenge for self-supervised pre-training in CXR images. To address this challenge, we propose two new pre-training tasks: inner-image anatomy localization (IIAL) and cross-image anatomy localization (CIAL). Leveraging the relatively stable positions of identical anatomy across images, we utilize position information directly as supervision to learn consistent semantic representations. Specifically, IIAL adopts a coarse-to-fine heatmap localization approach to correlate anatomical semantics with positions, while CIAL leverages feature affine alignment and heatmap localization to establish a correspondence between identical anatomical semantics across varying images, despite their appearance diversity. Furthermore, we introduce a unified end-to-end pre-training framework, anatomy-aware representation learning (AARL), integrating IIAL, CIAL, and a pixel restoration task. The advantages of AARL are: 1) preserving the appearance diversity and 2) training in a simple end-to-end way avoiding complicated preprocessing. Extensive experiments on six downstream tasks, including classification and segmentation tasks in various application scenarios, demonstrate that our AARL: 1) has more powerful representation and transferring ability; 2) is annotation-efficient, reducing the demand for labeled data and 3) improves the sensitivity to detecting various pathological and anatomical patterns.
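The anatomy-localization pretext tasks amount to regressing heatmaps centred on anatomical positions that are roughly stable across chest X-rays; below is a toy sketch of that supervision signal (coordinates and sizes are arbitrary, not the paper's settings).

```python
import torch
import torch.nn.functional as F

def gaussian_heatmap(size: int, center_xy, sigma: float = 4.0) -> torch.Tensor:
    """Build a (size, size) Gaussian heatmap centred on an anatomical position."""
    ys, xs = torch.meshgrid(torch.arange(size), torch.arange(size), indexing="ij")
    cx, cy = center_xy
    return torch.exp(-((xs - cx) ** 2 + (ys - cy) ** 2) / (2 * sigma ** 2))

# Pretext supervision: the network must localize the same anatomy (e.g., the carina),
# whose position is relatively stable across chest radiographs.
target = gaussian_heatmap(64, center_xy=(30, 22))             # (64, 64) target heatmap
predicted = torch.rand(1, 1, 64, 64, requires_grad=True)       # stand-in model output
loss = F.mse_loss(predicted.squeeze(), target)
loss.backward()
print(float(loss))
```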

7. Shi C, Zhang X, Zhao R, Zhang W, Chen F. Semantic structure preservation for accurate multi-modal glioma diagnosis. Sci Rep 2025;15:7185. PMID: 40021688; PMCID: PMC11871068; DOI: 10.1038/s41598-025-88458-7.
Abstract
Pretraining has laid the foundation for the recent success of deep learning in multimodal medical image analysis. However, existing methods often overlook the semantic structure embedded in modality-specific representations, and supervised pretraining requires a carefully designed, time-consuming two-stage annotation process. To address this, we propose a novel semantic structure-preserving consistency method, named "Review of Free-Text Reports for Preserving Multimodal Semantic Structure" (RFPMSS). During the semantic structure training phase, we learn multiple anchors to capture the semantic structure of each modality, and sample-sample relationships are represented by associating samples with these anchors, forming modality-specific semantic relationships. For comprehensive modality alignment, RFPMSS extracts supervision signals from patient examination reports, establishing global alignment between images and text. Evaluations on datasets collected from Shanxi Provincial Cancer Hospital and Shanxi Provincial People's Hospital demonstrate that our proposed cross-modal supervision using free-text image reports and multi-anchor allocation achieves state-of-the-art performance under highly limited supervision. Code: https://github.com/shichaoyu1/RFPMSS.
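The global image-report alignment described here is commonly realized as a symmetric InfoNCE objective over paired image and text embeddings; a minimal sketch with placeholder encoder outputs (not the RFPMSS implementation) follows.

```python
import torch
import torch.nn.functional as F

def clip_style_alignment_loss(img_emb: torch.Tensor,
                              txt_emb: torch.Tensor,
                              temperature: float = 0.07) -> torch.Tensor:
    """Symmetric InfoNCE: each image should match its own report and vice versa."""
    img = F.normalize(img_emb, dim=1)
    txt = F.normalize(txt_emb, dim=1)
    logits = img @ txt.t() / temperature
    targets = torch.arange(img.size(0), device=img.device)
    return 0.5 * (F.cross_entropy(logits, targets) +
                  F.cross_entropy(logits.t(), targets))

# Placeholder embeddings standing in for image features and free-text report features
# produced by the respective encoders.
image_features = torch.randn(8, 256)
report_features = torch.randn(8, 256)
print(clip_style_alignment_loss(image_features, report_features))
```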
Affiliation(s)
- Chaoyu Shi, School of Computer Information Engineering, Shanxi Technology and Business University, Taiyuan City, Shanxi Province, China
- Xia Zhang, Respiratory Medicine Department, Shanxi Province Cancer Hospital / Shanxi Hospital Affiliated to Cancer Hospital, Chinese Academy of Medical Sciences / Cancer Hospital Affiliated to Shanxi Medical University, Taiyuan City, Shanxi Province, China
- Runzhen Zhao, Pathology Department, Fenyang Hospital of Shanxi Province, Fenyang City, Shanxi Province, China
- Wen Zhang, Neurosurgery Department, Shanxi Provincial People's Hospital, Taiyuan City, Shanxi Province, China
- Fei Chen, Department of Radiotherapy, Shanxi Province Cancer Hospital / Shanxi Hospital Affiliated to Cancer Hospital, Chinese Academy of Medical Sciences / Cancer Hospital Affiliated to Shanxi Medical University, Taiyuan City, Shanxi Province, China

8. He Y, Huang F, Jiang X, Nie Y, Wang M, Wang J, Chen H. Foundation Model for Advancing Healthcare: Challenges, Opportunities and Future Directions. IEEE Rev Biomed Eng 2025;18:172-191. PMID: 39531565; DOI: 10.1109/rbme.2024.3496744.
Abstract
Foundation model, trained on a diverse range of data and adaptable to a myriad of tasks, is advancing healthcare. It fosters the development of healthcare artificial intelligence (AI) models tailored to the intricacies of the medical field, bridging the gap between limited AI models and the varied nature of healthcare practices. The advancement of a healthcare foundation model (HFM) brings forth tremendous potential to augment intelligent healthcare services across a broad spectrum of scenarios. However, despite the imminent widespread deployment of HFMs, there is currently a lack of clear understanding regarding their operation in the healthcare field, their existing challenges, and their future trajectory. To answer these critical inquiries, we present a comprehensive and in-depth examination that delves into the landscape of HFMs. It begins with a comprehensive overview of HFMs, encompassing their methods, data, and applications, to provide a quick understanding of the current progress. Subsequently, it delves into a thorough exploration of the challenges associated with data, algorithms, and computing infrastructures in constructing and widely applying foundation models in healthcare. Furthermore, this survey identifies promising directions for future development in this field. We believe that this survey will enhance the community's understanding of the current progress of HFMs and serve as a valuable source of guidance for future advancements in this domain.

9. Cai Z, Zhong Z, Lin H, Huang B, Xu Z, Huang B, Deng W, Wu Q, Lei K, Lyu J, Ye Y, Chen H, Zhang J. Self-supervised learning on dual-sequence magnetic resonance imaging for automatic segmentation of nasopharyngeal carcinoma. Comput Med Imaging Graph 2024;118:102471. PMID: 39608271; DOI: 10.1016/j.compmedimag.2024.102471.
Abstract
Automating the segmentation of nasopharyngeal carcinoma (NPC) is crucial for therapeutic procedures but presents challenges given the hurdles in amassing extensively annotated datasets. Although previous studies have applied self-supervised learning to capitalize on unlabeled data to improve segmentation performance, these methods have often overlooked the benefits of dual-sequence magnetic resonance imaging (MRI). In the present study, we incorporated self-supervised learning with a saliency transformation module using unlabeled dual-sequence MRI for accurate NPC segmentation. Data from 44 labeled and 72 unlabeled patients were collected to develop and evaluate our network. Impressively, our network achieved a mean Dice similarity coefficient (DSC) of 0.77, which is consistent with a previous study that relied on a training set of 4,100 annotated cases. The results further revealed that our approach required only minimal adjustments, primarily a tweak of less than 20% in the DSC, to meet clinical standards. By enhancing the automatic segmentation of NPC, our method alleviates the annotation burden on oncologists, curbs subjectivity, and ensures reliable NPC delineation.
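For reference, the Dice similarity coefficient reported above is the standard overlap metric between a predicted and a reference mask; a simple implementation for binary masks is shown below.

```python
import torch

def dice_similarity(pred: torch.Tensor, target: torch.Tensor, eps: float = 1e-6) -> float:
    """Dice similarity coefficient between two binary segmentation masks."""
    pred = pred.bool()
    target = target.bool()
    intersection = (pred & target).sum().float()
    return float((2 * intersection + eps) / (pred.sum() + target.sum() + eps))

# Example on a toy 3D mask pair (1 = tumour voxel).
prediction = torch.zeros(32, 64, 64, dtype=torch.int64)
reference = torch.zeros(32, 64, 64, dtype=torch.int64)
prediction[10:20, 20:40, 20:40] = 1
reference[12:20, 22:40, 20:44] = 1
print(f"DSC = {dice_similarity(prediction, reference):.3f}")
```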
Affiliation(s)
- Zongyou Cai, Medical AI Lab, School of Biomedical Engineering, Medical School, Shenzhen University, Shenzhen, China
- Zhangnan Zhong, Medical AI Lab, School of Biomedical Engineering, Medical School, Shenzhen University, Shenzhen, China
- Haiwei Lin, Medical AI Lab, School of Biomedical Engineering, Medical School, Shenzhen University, Shenzhen, China
- Bingsheng Huang, Medical AI Lab, School of Biomedical Engineering, Medical School, Shenzhen University, Shenzhen, China
- Ziyue Xu, NVIDIA Corporation, Bethesda, MD, USA
- Bin Huang, Medical AI Lab, School of Biomedical Engineering, Medical School, Shenzhen University, Shenzhen, China
- Wei Deng, Department of Radiology, Panyu Central Hospital, Guangzhou, China; Medical Imaging Institute of Panyu, Guangzhou, China
- Qiting Wu, Medical AI Lab, School of Biomedical Engineering, Medical School, Shenzhen University, Shenzhen, China
- Kaixin Lei, Medical AI Lab, School of Biomedical Engineering, Medical School, Shenzhen University, Shenzhen, China
- Jiegeng Lyu, Medical AI Lab, School of Biomedical Engineering, Medical School, Shenzhen University, Shenzhen, China
- Yufeng Ye, Department of Radiology, Panyu Central Hospital, Guangzhou, China; Medical Imaging Institute of Panyu, Guangzhou, China
- Hanwei Chen, Panyu Health Management Center (Panyu Rehabilitation Hospital), Guangzhou, China
- Jian Zhang, Shenzhen-Hong Kong Institute of Brain Science-Shenzhen Fundamental Research Institutions, Shenzhen, China; Shenzhen University Medical School, Shenzhen University, Shenzhen, 518055, Guangdong, China

10. Liu J, Zhang Y, Wang K, Yavuz MC, Chen X, Yuan Y, Li H, Yang Y, Yuille A, Tang Y, Zhou Z. Universal and extensible language-vision models for organ segmentation and tumor detection from abdominal computed tomography. Med Image Anal 2024;97:103226. PMID: 38852215; DOI: 10.1016/j.media.2024.103226.
Abstract
The advancement of artificial intelligence (AI) for organ segmentation and tumor detection is propelled by the growing availability of computed tomography (CT) datasets with detailed, per-voxel annotations. However, these AI models often struggle with flexibility for partially annotated datasets and extensibility for new classes due to limitations in the one-hot encoding, architectural design, and learning scheme. To overcome these limitations, we propose a universal, extensible framework enabling a single model, termed Universal Model, to deal with multiple public datasets and adapt to new classes (e.g., organs/tumors). Firstly, we introduce a novel language-driven parameter generator that leverages language embeddings from large language models, enriching semantic encoding compared with one-hot encoding. Secondly, the conventional output layers are replaced with lightweight, class-specific heads, allowing Universal Model to simultaneously segment 25 organs and six types of tumors and ease the addition of new classes. We train our Universal Model on 3410 CT volumes assembled from 14 publicly available datasets and then test it on 6173 CT volumes from four external datasets. Universal Model achieves first place on six CT tasks in the Medical Segmentation Decathlon (MSD) public leaderboard and leading performance on the Beyond The Cranial Vault (BTCV) dataset. In summary, Universal Model exhibits remarkable computational efficiency (6× faster than other dataset-specific models), demonstrates strong generalization across different hospitals, transfers well to numerous downstream tasks, and more importantly, facilitates the extensibility to new classes while alleviating the catastrophic forgetting of previously learned classes. Codes, models, and datasets are available at https://github.com/ljwztc/CLIP-Driven-Universal-Model.
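The language-driven parameter generator can be pictured as a small network that turns a text embedding of each class name into the weights of a lightweight class-specific head; the toy sketch below uses random placeholder vectors in place of actual CLIP text embeddings and is not the released CLIP-Driven Universal Model code.

```python
import torch
import torch.nn as nn

class LanguageDrivenHead(nn.Module):
    """Generate per-class 1x1x1 conv parameters from a text embedding and apply
    them to shared voxel features, yielding one binary mask per class."""
    def __init__(self, text_dim: int = 512, feat_dim: int = 32):
        super().__init__()
        self.feat_dim = feat_dim
        self.param_gen = nn.Linear(text_dim, feat_dim + 1)   # conv weights + bias per class

    def forward(self, voxel_feats: torch.Tensor, text_emb: torch.Tensor) -> torch.Tensor:
        # voxel_feats: (B, C, D, H, W); text_emb: (num_classes, text_dim)
        params = self.param_gen(text_emb)                    # (K, C + 1)
        weight = params[:, :self.feat_dim].view(-1, self.feat_dim, 1, 1, 1)
        bias = params[:, self.feat_dim]
        logits = nn.functional.conv3d(voxel_feats, weight, bias)   # (B, K, D, H, W)
        return torch.sigmoid(logits)                         # independent mask per class

# Placeholder text embeddings stand in for CLIP embeddings of prompts such as
# "a computed tomography of a liver tumor".
text_embeddings = torch.randn(31, 512)                       # e.g., 25 organs + 6 tumour types
features = torch.randn(2, 32, 8, 16, 16)                     # decoder voxel features
masks = LanguageDrivenHead()(features, text_embeddings)
print(masks.shape)                                           # torch.Size([2, 31, 8, 16, 16])
```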
Affiliation(s)
- Jie Liu, City University of Hong Kong, Hong Kong
- Yixiao Zhang, Johns Hopkins University, United States of America
- Kang Wang, University of California, San Francisco, United States of America
- Mehmet Can Yavuz, University of California, San Francisco, United States of America
- Xiaoxi Chen, University of Illinois Urbana-Champaign, United States of America
- Yang Yang, University of California, San Francisco, United States of America
- Alan Yuille, Johns Hopkins University, United States of America
- Zongwei Zhou, Johns Hopkins University, United States of America

11. Huang W, Li C, Yang H, Liu J, Liang Y, Zheng H, Wang S. Enhancing the vision-language foundation model with key semantic knowledge-emphasized report refinement. Med Image Anal 2024;97:103299. PMID: 39146702; DOI: 10.1016/j.media.2024.103299.
Abstract
Recently, vision-language representation learning has made remarkable advancements in building medical foundation models, holding immense potential for transforming the landscape of clinical research and medical care. The underlying hypothesis is that the rich knowledge embedded in radiology reports can effectively assist and guide the learning process, reducing the need for additional labels. However, these reports tend to be complex and sometimes even contain redundant descriptions that make it difficult for representation learning to capture the key semantic information. This paper develops a novel iterative vision-language representation learning framework by proposing a key semantic knowledge-emphasized report refinement method. Particularly, raw radiology reports are refined to highlight the key information according to a constructed clinical dictionary and two model-optimized knowledge-enhancement metrics. The iterative framework is designed to learn progressively, starting from a general understanding of the patient's condition based on raw reports and gradually refining and extracting critical information essential to fine-grained analysis tasks. The effectiveness of the proposed framework is validated on various downstream medical image analysis tasks, including disease classification, region-of-interest segmentation, and phrase grounding. Our framework surpasses seven state-of-the-art methods in both fine-tuning and zero-shot settings, demonstrating its encouraging potential for different clinical applications.
Affiliation(s)
- Weijian Huang, Paul C. Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China; Peng Cheng Laboratory, Shenzhen 518066, China; University of Chinese Academy of Sciences, Beijing 100049, China
- Cheng Li, Paul C. Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
- Hao Yang, Paul C. Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China; Peng Cheng Laboratory, Shenzhen 518066, China; University of Chinese Academy of Sciences, Beijing 100049, China
- Jiarun Liu, Paul C. Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China; Peng Cheng Laboratory, Shenzhen 518066, China; University of Chinese Academy of Sciences, Beijing 100049, China
- Yong Liang, Peng Cheng Laboratory, Shenzhen 518066, China
- Hairong Zheng, Paul C. Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
- Shanshan Wang, Paul C. Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China

12. Hu Z, Liu J, Shen S, Wu W, Yuan J, Shen W, Ma L, Wang G, Yang S, Xu X, Cui Y, Li Z, Shen L, Li L, Bian J, Zhang X, Han H, Lin J. Large-volume fully automated cell reconstruction generates a cell atlas of plant tissues. Plant Cell 2024;36:koae250. PMID: 39283506; PMCID: PMC11852339; DOI: 10.1093/plcell/koae250.
Abstract
The geometric shape and arrangement of individual cells play a role in shaping organ functions. However, analyzing multicellular features and exploring their connectomes in centimeter-scale plant organs remain challenging. Here, we established a set of frameworks named Large-Volume Fully Automated Cell Reconstruction (LVACR), enabling the exploration of three-dimensional (3D) cytological features and cellular connectivity in plant tissues. Through benchmark testing, our framework demonstrated superior efficiency in cell segmentation and aggregation, successfully addressing the inherent challenges posed by light sheet fluorescence microscopy (LSFM) imaging. Using LVACR, we successfully established a cell atlas of different plant tissues. Cellular morphology analysis revealed differences in cell clusters and shapes between different poplar (P. simonii Carr. and P. canadensis Moench.) seeds, whereas topological analysis revealed that they maintained conserved cellular connectivity. Furthermore, LVACR spatiotemporally demonstrated an initial burst of cell proliferation, accompanied by morphological transformations, at an early stage of shoot apical meristem development. During subsequent development, cell differentiation produced anisotropic features, thereby resulting in various cell shapes. Overall, our findings provided valuable insights into the precise spatial arrangement and cellular behavior of multicellular organisms, thus enhancing our understanding of the complex processes underlying plant growth and differentiation.
Affiliation(s)
- Zijian Hu, State Key Laboratory of Tree Genetics and Breeding, State Key Laboratory of Efficient Production of Forest Resources, National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
- Jiazheng Liu, Key Laboratory of Brain Cognition and Brain-inspired Intelligence Technology, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China; School of Future Technology, School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 101408, China
- Shiya Shen, State Key Laboratory of Tree Genetics and Breeding, State Key Laboratory of Efficient Production of Forest Resources, National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
- Weiqian Wu, State Key Laboratory of Tree Genetics and Breeding, State Key Laboratory of Efficient Production of Forest Resources, National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
- Jingbin Yuan, Key Laboratory of Brain Cognition and Brain-inspired Intelligence Technology, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
- Weiwei Shen, State Key Laboratory of Tree Genetics and Breeding, State Key Laboratory of Efficient Production of Forest Resources, National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
- Lingyu Ma, Research Institute of Wood Industry, Chinese Academy of Forestry, Beijing 100091, China
- Guangchao Wang, State Key Laboratory of Tree Genetics and Breeding, State Key Laboratory of Efficient Production of Forest Resources, National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
- Shunyao Yang, State Key Laboratory of Tree Genetics and Breeding, State Key Laboratory of Efficient Production of Forest Resources, National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
- Xiuping Xu, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- Yaning Cui, State Key Laboratory of Tree Genetics and Breeding, State Key Laboratory of Efficient Production of Forest Resources, National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
- Zhenchen Li, Key Laboratory of Brain Cognition and Brain-inspired Intelligence Technology, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China; School of Future Technology, School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 101408, China
- Lijun Shen, Key Laboratory of Brain Cognition and Brain-inspired Intelligence Technology, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
- Linlin Li, Key Laboratory of Brain Cognition and Brain-inspired Intelligence Technology, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
- Jiahui Bian, State Key Laboratory of Tree Genetics and Breeding, State Key Laboratory of Efficient Production of Forest Resources, National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
- Xi Zhang, State Key Laboratory of Tree Genetics and Breeding, State Key Laboratory of Efficient Production of Forest Resources, National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
- Hua Han, Key Laboratory of Brain Cognition and Brain-inspired Intelligence Technology, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China; School of Future Technology, School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 101408, China
- Jinxing Lin, State Key Laboratory of Tree Genetics and Breeding, State Key Laboratory of Efficient Production of Forest Resources, National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China

13. Huang W, Li C, Zhou HY, Yang H, Liu J, Liang Y, Zheng H, Zhang S, Wang S. Enhancing representation in radiography-reports foundation model: a granular alignment algorithm using masked contrastive learning. Nat Commun 2024;15:7620. PMID: 39223122; PMCID: PMC11369198; DOI: 10.1038/s41467-024-51749-0.
Abstract
Recently, multi-modal vision-language foundation models have gained significant attention in the medical field. While these models offer great opportunities, they still face crucial challenges, such as the requirement for fine-grained knowledge understanding in computer-aided diagnosis and the capability of utilizing very limited or even no task-specific labeled data in real-world clinical applications. In this study, we present MaCo, a masked contrastive chest X-ray foundation model that tackles these challenges. MaCo explores masked contrastive learning to simultaneously achieve fine-grained image understanding and zero-shot learning for a variety of medical imaging tasks. It designs a correlation weighting mechanism to adjust the correlation between masked chest X-ray image patches and their corresponding reports, thereby enhancing the model's representation learning capabilities. To evaluate the performance of MaCo, we conducted extensive experiments using 6 well-known open-source X-ray datasets. The experimental results demonstrate the superiority of MaCo over 10 state-of-the-art approaches across tasks such as classification, segmentation, detection, and phrase grounding. These findings highlight the significant potential of MaCo in advancing a wide range of medical image analysis tasks.
Affiliation(s)
- Weijian Huang, Paul C. Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China; Pengcheng Laboratory, Shenzhen, China; University of Chinese Academy of Sciences, Beijing, China
- Cheng Li, Paul C. Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- Hong-Yu Zhou, Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Hao Yang, Paul C. Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China; Pengcheng Laboratory, Shenzhen, China; University of Chinese Academy of Sciences, Beijing, China
- Jiarun Liu, Paul C. Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China; Pengcheng Laboratory, Shenzhen, China; University of Chinese Academy of Sciences, Beijing, China
- Hairong Zheng, Paul C. Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- Shaoting Zhang, Qingyuan Research Institute, Shanghai Jiao Tong University, Shanghai, China
- Shanshan Wang, Paul C. Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China

14. Jiang N, Wang G, Ye C, Liu T, Yan T. Multi-Task Collaborative Pre-Training and Adaptive Token Selection: A Unified Framework for Brain Representation Learning. IEEE J Biomed Health Inform 2024;28:5528-5539. PMID: 38889024; DOI: 10.1109/jbhi.2024.3416038.
Abstract
Structural magnetic resonance imaging (sMRI) reveals the structural organization of the brain. Learning general brain representations from sMRI is an enduring topic in neuroscience. Previous deep learning models neglect that the brain, as the core of cognition, is distinct from other organs whose primary attribute is anatomy. Capturing the high-level representation associated with inter-individual cognitive variability is key to appropriately representing the brain. Given that this cognition-related information is subtle, mixed, and distributed in the brain structure, sMRI-based models need to both capture fine-grained details and understand how they relate to the overall global structure. Additionally, it is necessary to explicitly express the cognitive information that is implicitly embedded in local-global image features. Therefore, we propose MCPATS, a brain representation learning framework that combines Multi-task Collaborative Pre-training (MCP) and Adaptive Token Selection (ATS). First, we develop MCP, including mask-reconstruction to understand global context, distort-restoration to capture fine-grained local details, adversarial learning to integrate features at different granularities, and age-prediction, using age as a surrogate for cognition to explicitly encode cognition-related information from local-global image features. This co-training allows progressive learning of implicit and explicit cognition-related representations. Then, we develop ATS based on mutual attention for downstream use of the learned representation. During fine-tuning, the ATS highlights discriminative features and reduces the impact of irrelevant information. MCPATS was validated on three different public datasets for brain disease diagnosis, outperforming competing methods and achieving accurate diagnosis. Further, we performed a detailed analysis to confirm that the MCPATS-learned representation captures cognition-related information.

15. Choopong P, Kusakunniran W. Selection of pre-trained weights for transfer learning in automated cytomegalovirus retinitis classification. Sci Rep 2024;14:15899. PMID: 38987446; PMCID: PMC11237151; DOI: 10.1038/s41598-024-67121-7.
Abstract
Cytomegalovirus retinitis (CMVR) is a significant cause of vision loss. Regular screening is crucial but challenging in resource-limited settings. A convolutional neural network is a state-of-the-art deep learning technique to generate automatic diagnoses from retinal images. However, there are limited numbers of CMVR images to train the model properly. Transfer learning (TL) is a strategy to train a model with a scarce dataset. This study explores the efficacy of TL with different pre-trained weights for automated CMVR classification using retinal images. We utilised a dataset of 955 retinal images (524 CMVR and 431 normal) from Siriraj Hospital, Mahidol University, collected between 2005 and 2015. Images were processed using Kowa VX-10i or VX-20 fundus cameras and augmented for training. We employed DenseNet121 as a backbone model, comparing the performance of TL with weights pre-trained on ImageNet, APTOS2019, and CheXNet datasets. The models were evaluated based on accuracy, loss, and other performance metrics, with the depth of fine-tuning varied across different pre-trained weights. The study found that TL significantly enhances model performance in CMVR classification. The best results were achieved with weights sequentially transferred from ImageNet to APTOS2019 dataset before application to our CMVR dataset. This approach yielded the highest mean accuracy (0.99) and lowest mean loss (0.04), outperforming other methods. The class activation heatmaps provided insights into the model's decision-making process. The model with APTOS2019 pre-trained weights offered the best explanation and highlighted the pathologic lesions resembling human interpretation. Our findings demonstrate the potential of sequential TL in improving the accuracy and efficiency of CMVR diagnosis, particularly in settings with limited data availability. They highlight the importance of domain-specific pre-training in medical image classification. This approach streamlines the diagnostic process and paves the way for broader applications in automated medical image analysis, offering a scalable solution for early disease detection.
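The sequential transfer strategy (ImageNet → APTOS2019 → CMVR) reduces to initializing DenseNet121 with ImageNet weights, optionally overwriting the feature extractor with an APTOS-fine-tuned checkpoint, and re-heading the network for the binary CMVR task; the sketch below is illustrative only, the checkpoint path is hypothetical, and the training step uses dummy data.

```python
import torch
import torch.nn as nn
from torchvision import models

# Step 1: start from an ImageNet-pretrained DenseNet121.
model = models.densenet121(weights=models.DenseNet121_Weights.IMAGENET1K_V1)

# Step 2 (optional intermediate domain): overwrite the feature extractor with weights
# previously fine-tuned on a fundus-image task such as APTOS2019. Hypothetical file name.
try:
    state = torch.load("aptos2019_densenet121.pth", map_location="cpu")
    state = {k: v for k, v in state.items() if not k.startswith("classifier")}
    model.load_state_dict(state, strict=False)        # keep only the feature weights
except FileNotFoundError:
    print("APTOS checkpoint not found; continuing from ImageNet weights only.")

# Step 3: replace the classifier for the binary CMVR-vs-normal task and fine-tune
# only the deeper layers (one common partial-fine-tuning choice).
model.classifier = nn.Linear(model.classifier.in_features, 2)
for name, param in model.named_parameters():
    param.requires_grad = name.startswith(("features.denseblock4", "classifier"))

optimizer = torch.optim.Adam((p for p in model.parameters() if p.requires_grad), lr=1e-4)
criterion = nn.CrossEntropyLoss()

# One illustrative training step on a dummy batch of fundus images.
images, labels = torch.randn(4, 3, 224, 224), torch.randint(0, 2, (4,))
loss = criterion(model(images), labels)
loss.backward()
optimizer.step()
```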
Affiliation(s)
- Pitipol Choopong, Department of Ophthalmology, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand; Faculty of Information and Communication Technology, Mahidol University, Nakhon Pathom, Thailand
- Worapan Kusakunniran, Faculty of Information and Communication Technology, Mahidol University, Nakhon Pathom, Thailand

16. Taher MRH, Gotway MB, Liang J. Representing Part-Whole Hierarchies in Foundation Models by Learning Localizability, Composability, and Decomposability from Anatomy via Self-Supervision. Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit 2024:11269-11281. PMID: 39670210; PMCID: PMC11636527; DOI: 10.1109/cvpr52733.2024.01071.
Abstract
Humans effortlessly interpret images by parsing them into part-whole hierarchies; deep learning excels in learning multi-level feature spaces, but they often lack explicit coding of part-whole relations, a prominent property of medical imaging. To overcome this limitation, we introduce Adam-v2, a new self-supervised learning framework extending Adam [79] by explicitly incorporating part-whole hierarchies into its learning objectives through three key branches: (1) Localizability, acquiring discriminative representations to distinguish different anatomical patterns; (2) Composability, learning each anatomical structure in a parts-to-whole manner; and (3) Decomposability, comprehending each anatomical structure in a whole-to-parts manner. Experimental results across 10 tasks, compared to 11 baselines in zero-shot, few-shot transfer, and full fine-tuning settings, showcase Adam-v2's superior performance over large-scale medical models and existing SSL methods across diverse downstream tasks. The higher generality and robustness of Adam-v2's representations originate from its explicit construction of hierarchies for distinct anatomical structures from unlabeled medical images. Adam-v2 preserves a semantic balance of anatomical diversity and harmony in its embedding, yielding representations that are both generic and semantically meaningful, yet overlooked in existing SSL methods. All code and pretrained models are available at GitHub.com/JLiangLab/Eden.

17. Zeng M, Wang X, Chen W. Worldwide research landscape of artificial intelligence in lung disease: A scientometric study. Heliyon 2024;10:e31129. PMID: 38826704; PMCID: PMC11141367; DOI: 10.1016/j.heliyon.2024.e31129.
Abstract
Purpose: To perform a comprehensive bibliometric analysis of the application of artificial intelligence (AI) in lung disease to understand the current status and emerging trends of this field.
Materials and methods: AI-based lung disease research publications were selected from the Web of Science Core Collection. CiteSpace, VOSviewer and Excel were used to perform and visualize co-authorship, co-citation, and co-occurrence analyses of authors, keywords, countries/regions, references and institutions in this field.
Results: Our study included a total of 5210 papers. The number of publications on AI in lung disease has shown explosive growth since 2017. China and the United States lead in publication numbers. The most productive authors were Li Weimin and Qian Wei, with Shanghai Jiaotong University as the most productive institution. Radiology was the most co-cited journal. Lung cancer and COVID-19 emerged as the most studied diseases. Deep learning, convolutional neural networks, lung cancer, and radiomics will be the focus of future research.
Conclusions: AI-based diagnosis and treatment of lung disease has become a research hotspot in recent years, yielding significant results. Future work should focus on establishing multimodal AI models that incorporate clinical, imaging and laboratory information. Enhanced visualization of deep learning, AI-driven differential diagnosis models for lung disease and the creation of international large-scale lung disease databases should also be considered.
Affiliation(s)
- Wei Chen, Department of Radiology, Southwest Hospital, Third Military Medical University, Chongqing, China

18. Haghighi F, Hosseinzadeh Taher MR, Gotway MB, Liang J. Self-supervised learning for medical image analysis: Discriminative, restorative, or adversarial? Med Image Anal 2024;94:103086. PMID: 38537414; PMCID: PMC11044023; DOI: 10.1016/j.media.2024.103086.
Abstract
Discriminative, restorative, and adversarial learning have proven beneficial for self-supervised learning schemes in computer vision and medical imaging. Existing efforts, however, fail to capitalize on the potentially synergistic effects these methods may offer in a ternary setup, which, we envision can significantly benefit deep semantic representation learning. Towards this end, we developed DiRA, the first framework that unites discriminative, restorative, and adversarial learning in a unified manner to collaboratively glean complementary visual information from unlabeled medical images for fine-grained semantic representation learning. Our extensive experiments demonstrate that DiRA: (1) encourages collaborative learning among three learning ingredients, resulting in more generalizable representation across organs, diseases, and modalities; (2) outperforms fully supervised ImageNet models and increases robustness in small data regimes, reducing annotation cost across multiple medical imaging applications; (3) learns fine-grained semantic representation, facilitating accurate lesion localization with only image-level annotation; (4) improves reusability of low/mid-level features; and (5) enhances restorative self-supervised approaches, revealing that DiRA is a general framework for united representation learning. Code and pretrained models are available at https://github.com/JLiangLab/DiRA.
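The ternary objective unites an instance-discrimination term, a restoration term, and an adversarial term; the skeleton below shows how such losses can be combined, with the encoder, decoder, and discriminator reduced to stubs and the loss weights chosen arbitrarily (not the released DiRA code).

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

encoder = nn.Sequential(nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(),
                        nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(16, 64))
decoder = nn.Sequential(nn.Linear(64, 32 * 32), nn.Unflatten(1, (1, 32, 32)))
discriminator = nn.Sequential(nn.Flatten(), nn.Linear(32 * 32, 1))

x = torch.rand(8, 1, 32, 32)                       # unlabeled medical images
x_aug1 = x + 0.05 * torch.randn_like(x)            # two lightly augmented views
x_aug2 = x + 0.05 * torch.randn_like(x)

# (D)iscriminative: two augmented views of the same image should agree.
z1 = F.normalize(encoder(x_aug1), dim=1)
z2 = F.normalize(encoder(x_aug2), dim=1)
logits = z1 @ z2.t() / 0.1
loss_dis = F.cross_entropy(logits, torch.arange(x.size(0)))

# (R)estorative: reconstruct the original image from the embedding.
x_rec = decoder(encoder(x_aug1))
loss_res = F.mse_loss(x_rec, x)

# (A)dversarial: a discriminator judges real vs. restored images
# (generator side shown; the discriminator would be trained with opposite labels).
loss_adv = F.binary_cross_entropy_with_logits(
    discriminator(x_rec), torch.ones(x.size(0), 1))

total = loss_dis + loss_res + 0.1 * loss_adv        # weights are illustrative only
total.backward()
```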
Affiliation(s)
- Fatemeh Haghighi, School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ 85281, USA
- Jianming Liang, School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ 85281, USA

19. Chen J, Li M, Han H, Zhao Z, Chen X. SurgNet: Self-Supervised Pretraining With Semantic Consistency for Vessel and Instrument Segmentation in Surgical Images. IEEE Trans Med Imaging 2024;43:1513-1525. PMID: 38090838; DOI: 10.1109/tmi.2023.3341948.
Abstract
Blood vessel and surgical instrument segmentation is a fundamental technique for robot-assisted surgical navigation. Despite the significant progress in natural image segmentation, surgical image-based vessel and instrument segmentation are rarely studied. In this work, we propose a novel self-supervised pretraining method (SurgNet) that can effectively learn representative vessel and instrument features from unlabeled surgical images. As a result, it allows for precise and efficient segmentation of vessels and instruments with only a small amount of labeled data. Specifically, we first construct a region adjacency graph (RAG) based on local semantic consistency in unlabeled surgical images and use it as a self-supervision signal for pseudo-mask segmentation. We then use the pseudo-mask to perform guided masked image modeling (GMIM) to learn representations that integrate structural information of intraoperative objectives more effectively. Our pretrained model, paired with various segmentation methods, can be applied to perform vessel and instrument segmentation accurately using limited labeled data for fine-tuning. We build an Intraoperative Vessel and Instrument Segmentation (IVIS) dataset, comprised of ~3 million unlabeled images and over 4,000 labeled images with manual vessel and instrument annotations to evaluate the effectiveness of our self-supervised pretraining method. We also evaluated the generalizability of our method to similar tasks using two public datasets. The results demonstrate that our approach outperforms the current state-of-the-art (SOTA) self-supervised representation learning methods in various surgical image segmentation tasks.
20
Xing Z, Zhu L, Yu L, Xing Z, Wan L. Hybrid Masked Image Modeling for 3D Medical Image Segmentation. IEEE J Biomed Health Inform 2024; 28:2115-2125. [PMID: 38289846 DOI: 10.1109/jbhi.2024.3360239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2024]
Abstract
Masked image modeling (MIM) with transformer backbones has recently been exploited as a powerful self-supervised pre-training technique. The existing MIM methods adopt the strategy of masking random patches of the image and reconstructing the missing pixels, which only considers semantic information at a lower level, and causes a long pre-training time. This paper presents HybridMIM, a novel hybrid self-supervised learning method based on masked image modeling for 3D medical image segmentation. Specifically, we design a two-level masking hierarchy to specify which patches in sub-volumes are masked and how, effectively providing the constraints of higher level semantic information. Then we learn the semantic information of medical images at three levels, including: 1) partial region prediction to reconstruct key contents of the 3D image, which largely reduces the pre-training time burden (pixel-level); 2) patch-masking perception to learn the spatial relationship between the patches in each sub-volume (region-level); and 3) drop-out-based contrastive learning between samples within a mini-batch, which further improves the generalization ability of the framework (sample-level). The proposed framework is versatile, supporting both CNN and transformer encoder backbones, and also enables pre-training decoders for image segmentation. We conduct comprehensive experiments on five widely-used public medical image segmentation datasets, including BraTS2020, BTCV, MSD Liver, MSD Spleen, and BraTS2023. The experimental results show the clear superiority of HybridMIM against competing supervised methods, masked pre-training approaches, and other self-supervised methods, in terms of quantitative metrics, speed performance and qualitative observations.
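The following toy function illustrates a two-level masking hierarchy of the kind described above: sub-volumes are selected first, then patches inside them are masked. Shapes, ratios, and names are illustrative assumptions rather than the HybridMIM implementation.

```python
# Toy sketch of hierarchical masking for a 3D volume: level 1 picks sub-volumes,
# level 2 masks patches inside them; all sizes and ratios are placeholders.
import numpy as np

def two_level_mask(vol_shape=(128, 128, 128), sub=32, patch=8,
                   sub_ratio=0.5, patch_ratio=0.6, seed=0):
    rng = np.random.default_rng(seed)
    gs = tuple(s // sub for s in vol_shape)    # sub-volume grid
    ps = sub // patch                          # patches per sub-volume edge
    mask = np.zeros(vol_shape, dtype=bool)
    # Level 1: select which sub-volumes participate in masking.
    for (i, j, k) in np.ndindex(*gs):
        if rng.random() >= sub_ratio:
            continue
        # Level 2: mask a fraction of the patches inside the chosen sub-volume.
        for p in np.ndindex(ps, ps, ps):
            if rng.random() < patch_ratio:
                z, y, x = i * sub + p[0] * patch, j * sub + p[1] * patch, k * sub + p[2] * patch
                mask[z:z + patch, y:y + patch, x:x + patch] = True
    return mask
```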
21
Kyung S, Jang M, Park S, Yoon HM, Hong GS, Kim N. Supervised representation learning based on various levels of pediatric radiographic views for transfer learning. Sci Rep 2024; 14:7551. [PMID: 38555414 PMCID: PMC10981659 DOI: 10.1038/s41598-024-58163-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2023] [Accepted: 03/26/2024] [Indexed: 04/02/2024] Open
Abstract
Transfer learning plays a pivotal role in addressing the paucity of data, expediting training processes, and enhancing model performance. Nonetheless, the prevailing practice of transfer learning predominantly relies on pre-trained models designed for the natural image domain, which may not be well-suited for the medical image domain in grayscale. Recognizing the significance of leveraging transfer learning in medical research, we undertook the construction of class-balanced pediatric radiograph datasets collectively referred to as PedXnets, grounded in radiographic views using the pediatric radiographs collected over 24 years at Asan Medical Center. For PedXnets pre-training, approximately 70,000 X-ray images were utilized. Three different pre-training weights of PedXnet were constructed using Inception V3 for various radiation perspective classifications: Model-PedXnet-7C, Model-PedXnet-30C, and Model-PedXnet-68C. We validated the transferability and positive effects of transfer learning of PedXnets through pediatric downstream tasks including fracture classification and bone age assessment (BAA). The evaluation of transfer learning effects through classification and regression metrics showed superior performance of Model-PedXnets in quantitative assessments. Additionally, visual analyses confirmed that the Model-PedXnets were more focused on meaningful regions of interest.
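For readers who want to reuse such pre-trained weights, the sketch below shows the generic transfer-learning recipe with a torchvision-style Inception V3: load a pretraining checkpoint, replace the classification heads, and fine-tune on the downstream task. The checkpoint path and class count are placeholders; this is not the released PedXnet code.

```python
# Generic fine-tuning recipe for an Inception V3 backbone; the checkpoint is a
# placeholder for any pretraining weights (e.g. a PedXnet-style checkpoint).
import torch
import torch.nn as nn
from torchvision.models import inception_v3

def build_downstream_model(num_classes, checkpoint_path=None):
    model = inception_v3(weights=None)                     # aux_logits=True by default
    if checkpoint_path is not None:
        state = torch.load(checkpoint_path, map_location="cpu")
        sd = model.state_dict()
        # Keep only weights whose names and shapes match the freshly built model,
        # so pretraining heads with a different class count are simply skipped.
        compatible = {k: v for k, v in state.items()
                      if k in sd and v.shape == sd[k].shape}
        model.load_state_dict(compatible, strict=False)
    # Replace both classification heads for the downstream task
    # (e.g. fracture classification or bone age bins).
    model.fc = nn.Linear(model.fc.in_features, num_classes)
    model.AuxLogits.fc = nn.Linear(model.AuxLogits.fc.in_features, num_classes)
    return model
```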
Affiliation(s)
- Sunggu Kyung
- Department of Biomedical Engineering, Asan Medical Institute of Convergence Science and Technology, Asan Medical Center, College of Medicine, University of Ulsan, Seoul, Republic of Korea
| | - Miso Jang
- Department of Convergence Medicine, Asan Medical Center, University of Ulsan College of Medicine, 88 Olympic-Ro 43-Gil, Songpa-gu, Seoul, 05505, Republic of Korea
| | - Seungju Park
- Department of Convergence Medicine, Asan Medical Center, University of Ulsan College of Medicine, 88 Olympic-Ro 43-Gil, Songpa-gu, Seoul, 05505, Republic of Korea
| | - Hee Mang Yoon
- Department of Radiology and Research Institute of Radiology, Asan Medical Center, University of Ulsan College of Medicine, 88 Olympic-Ro 43-Gil, Songpa-gu, Seoul, 05505, Republic of Korea
| | - Gil-Sun Hong
- Department of Radiology and Research Institute of Radiology, Asan Medical Center, University of Ulsan College of Medicine, 88 Olympic-Ro 43-Gil, Songpa-gu, Seoul, 05505, Republic of Korea
| | - Namkug Kim
- Department of Convergence Medicine, Asan Medical Center, University of Ulsan College of Medicine, 88 Olympic-Ro 43-Gil, Songpa-gu, Seoul, 05505, Republic of Korea.
- Department of Radiology and Research Institute of Radiology, Asan Medical Center, University of Ulsan College of Medicine, 88 Olympic-Ro 43-Gil, Songpa-gu, Seoul, 05505, Republic of Korea.
22
Zhao L, Fong TC, Bell MAL. Detection of COVID-19 features in lung ultrasound images using deep neural networks. COMMUNICATIONS MEDICINE 2024; 4:41. [PMID: 38467808 PMCID: PMC10928066 DOI: 10.1038/s43856-024-00463-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Accepted: 02/16/2024] [Indexed: 03/13/2024] Open
Abstract
BACKGROUND Deep neural networks (DNNs) to detect COVID-19 features in lung ultrasound B-mode images have primarily relied on either in vivo or simulated images as training data. However, in vivo images suffer from limited access to required manual labeling of thousands of training image examples, and simulated images can suffer from poor generalizability to in vivo images due to domain differences. We address these limitations and identify the best training strategy. METHODS We investigated in vivo COVID-19 feature detection with DNNs trained on our carefully simulated datasets (40,000 images), publicly available in vivo datasets (174 images), in vivo datasets curated by our team (958 images), and a combination of simulated and internal or external in vivo datasets. Seven DNN training strategies were tested on in vivo B-mode images from COVID-19 patients. RESULTS Here, we show that Dice similarity coefficients (DSCs) between ground truth and DNN predictions are maximized when simulated data are mixed with external in vivo data and tested on internal in vivo data (i.e., 0.482 ± 0.211), compared with using only simulated B-mode image training data (i.e., 0.464 ± 0.230) or only external in vivo B-mode training data (i.e., 0.407 ± 0.177). Additional maximization is achieved when a separate subset of the internal in vivo B-mode images are included in the training dataset, with the greatest maximization of DSC (and minimization of required training time, or epochs) obtained after mixing simulated data with internal and external in vivo data during training, then testing on the held-out subset of the internal in vivo dataset (i.e., 0.735 ± 0.187). CONCLUSIONS DNNs trained with simulated and in vivo data are promising alternatives to training with only real or only simulated data when segmenting in vivo COVID-19 lung ultrasound features.
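For reference, the Dice similarity coefficient (DSC) reported above can be computed for a pair of binary masks as follows; this is the standard definition, shown as a minimal NumPy function.

```python
# Standard Dice similarity coefficient between two binary segmentation masks.
import numpy as np

def dice(pred, truth, eps=1e-7):
    pred, truth = pred.astype(bool), truth.astype(bool)
    inter = np.logical_and(pred, truth).sum()
    # 2*|A∩B| / (|A| + |B|); eps guards against two empty masks.
    return (2.0 * inter + eps) / (pred.sum() + truth.sum() + eps)
```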
Affiliation(s)
- Lingyi Zhao
- Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Tiffany Clair Fong
- Department of Emergency Medicine, Johns Hopkins Medicine, Baltimore, MD, USA
| | - Muyinatu A Lediju Bell
- Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD, USA.
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA.
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA.
23
Pang Y, Liang J, Huang T, Chen H, Li Y, Li D, Huang L, Wang Q. Slim UNETR: Scale Hybrid Transformers to Efficient 3D Medical Image Segmentation Under Limited Computational Resources. IEEE TRANSACTIONS ON MEDICAL IMAGING 2024; 43:994-1005. [PMID: 37862274 DOI: 10.1109/tmi.2023.3326188] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/22/2023]
Abstract
Hybrid transformer-based segmentation approaches have shown great promise in medical image analysis. However, they typically require considerable computational power and resources during both training and inference stages, posing a challenge for resource-limited medical applications common in the field. To address this issue, we present an innovative framework called Slim UNETR, designed to achieve a balance between accuracy and efficiency by leveraging the advantages of both convolutional neural networks and transformers. Our method features the Slim UNETR Block as a core component, which effectively enables information exchange through self-attention mechanism decomposition and cost-effective representation aggregation. Additionally, we utilize the throughput metric as an efficiency indicator to provide feedback on model resource consumption. Our experiments demonstrate that Slim UNETR outperforms state-of-the-art models in terms of accuracy, model size, and efficiency when deployed on resource-constrained devices. Remarkably, Slim UNETR achieves 92.44% dice accuracy on BraTS2021 while being 34.6x smaller and 13.4x faster during inference compared to Swin UNETR. Code: https://github.com/aigzhusmart/Slim-UNETR.
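The throughput metric used above as an efficiency indicator is typically measured as images processed per second; below is a minimal, device-agnostic sketch in which the model and batch are placeholders (on a GPU one would additionally call torch.cuda.synchronize() before reading the timer).

```python
# Simple inference-throughput measurement (images per second).
import time
import torch

@torch.no_grad()
def throughput(model, batch, warmup=5, iters=20):
    model.eval()
    for _ in range(warmup):        # warm-up runs are excluded from timing
        model(batch)
    start = time.perf_counter()
    for _ in range(iters):
        model(batch)
    elapsed = time.perf_counter() - start
    return iters * batch.shape[0] / elapsed
```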
24
Yu K, Sun L, Chen J, Reynolds M, Chaudhary T, Batmanghelich K. DrasCLR: A self-supervised framework of learning disease-related and anatomy-specific representation for 3D lung CT images. Med Image Anal 2024; 92:103062. [PMID: 38086236 PMCID: PMC10872608 DOI: 10.1016/j.media.2023.103062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Revised: 08/24/2023] [Accepted: 12/05/2023] [Indexed: 01/12/2024]
Abstract
Large-scale volumetric medical images with annotation are rare, costly, and time prohibitive to acquire. Self-supervised learning (SSL) offers a promising pre-training and feature extraction solution for many downstream tasks, as it only uses unlabeled data. Recently, SSL methods based on instance discrimination have gained popularity in the medical imaging domain. However, SSL pre-trained encoders may use many clues in the image to discriminate an instance that are not necessarily disease-related. Moreover, pathological patterns are often subtle and heterogeneous, requiring the ability of the desired method to represent anatomy-specific features that are sensitive to abnormal changes in different body parts. In this work, we present a novel SSL framework, named DrasCLR, for 3D lung CT images to overcome these challenges. We propose two domain-specific contrastive learning strategies: one aims to capture subtle disease patterns inside a local anatomical region, and the other aims to represent severe disease patterns that span larger regions. We formulate the encoder using a conditional hyper-parameterized network, in which the parameters are dependent on the anatomical location, to extract anatomically sensitive features. Extensive experiments on large-scale datasets of lung CT scans show that our method improves the performance of many downstream prediction and segmentation tasks. The patient-level representation improves the performance of the patient survival prediction task. We show how our method can detect emphysema subtypes via dense prediction. We demonstrate that fine-tuning the pre-trained model can significantly reduce annotation efforts without sacrificing emphysema detection accuracy. Our ablation study highlights the importance of incorporating anatomical context into the SSL framework. Our codes are available at https://github.com/batmanlab/DrasCLR.
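As background, contrastive instance-discrimination methods of this family are commonly built on an InfoNCE-style loss; the sketch below shows only that generic loss and omits DrasCLR's anatomy-conditioned encoder and region-specific strategies.

```python
# Generic InfoNCE/contrastive loss: matching view pairs sit on the diagonal of
# the similarity matrix; temperature and shapes are illustrative.
import torch
import torch.nn.functional as F

def info_nce(q, k, temperature=0.1):
    """q, k: (N, D) embeddings of two views of the same N instances/patches."""
    q, k = F.normalize(q, dim=1), F.normalize(k, dim=1)
    logits = q @ k.t() / temperature              # (N, N) cosine similarities
    labels = torch.arange(q.shape[0], device=q.device)
    return F.cross_entropy(logits, labels)        # positives are on the diagonal
```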
Affiliation(s)
- Ke Yu
- School of Computing and Information, University of Pittsburgh, Pittsburgh, USA.
| | - Li Sun
- Department of Electrical and Computer Engineering, Boston University, Boston, USA
| | - Junxiang Chen
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, USA
| | - Maxwell Reynolds
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, USA
| | - Tigmanshu Chaudhary
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, USA
| | - Kayhan Batmanghelich
- Department of Electrical and Computer Engineering, Boston University, Boston, USA
25
Fischer M, Bartler A, Yang B. Prompt tuning for parameter-efficient medical image segmentation. Med Image Anal 2024; 91:103024. [PMID: 37976866 DOI: 10.1016/j.media.2023.103024] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Revised: 07/16/2023] [Accepted: 11/03/2023] [Indexed: 11/19/2023]
Abstract
Neural networks pre-trained on a self-supervision scheme have become the standard when operating in data rich environments with scarce annotations. As such, fine-tuning a model to a downstream task in a parameter-efficient but effective way, e.g. for a new set of classes in the case of semantic segmentation, is of increasing importance. In this work, we propose and investigate several contributions to achieve a parameter-efficient but effective adaptation for semantic segmentation on two medical imaging datasets. Relying on the recently popularized prompt tuning approach, we provide a prompt-able UNETR (PUNETR) architecture, that is frozen after pre-training, but adaptable throughout the network by class-dependent learnable prompt tokens. We pre-train this architecture with a dedicated dense self-supervision scheme based on assignments to online generated prototypes (contrastive prototype assignment, CPA) of a student teacher combination. Concurrently, an additional segmentation loss is applied for a subset of classes during pre-training, further increasing the effectiveness of leveraged prompts in the fine-tuning phase. We demonstrate that the resulting method is able to attenuate the gap between fully fine-tuned and parameter-efficiently adapted models on CT imaging datasets. To this end, the difference between fully fine-tuned and prompt-tuned variants amounts to 7.81 pp for the TCIA/BTCV dataset as well as 5.37 and 6.57 pp for subsets of the TotalSegmentator dataset in the mean Dice Similarity Coefficient (DSC, in %) while only adjusting prompt tokens, corresponding to 0.51% of the pre-trained backbone model with 24.4M frozen parameters. The code for this work is available on https://github.com/marcdcfischer/PUNETR.
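To make the parameter-efficiency argument concrete, the sketch below freezes a pretrained token-based backbone and trains only a small set of learnable prompt tokens, then reports the trainable-parameter fraction. Class names and shapes are illustrative assumptions, not the PUNETR code.

```python
# Prompt tuning as parameter-efficient adaptation: the backbone stays frozen,
# only the prepended prompt tokens receive gradients.
import torch
import torch.nn as nn

class PromptedEncoder(nn.Module):
    def __init__(self, backbone, embed_dim, num_prompts):
        super().__init__()
        self.backbone = backbone
        for p in self.backbone.parameters():        # keep pretrained weights frozen
            p.requires_grad = False
        # The only trainable parameters: learnable prompt tokens.
        self.prompts = nn.Parameter(torch.randn(1, num_prompts, embed_dim) * 0.02)

    def forward(self, tokens):                      # tokens: (B, N, D) patch embeddings
        prompts = self.prompts.expand(tokens.shape[0], -1, -1)
        return self.backbone(torch.cat([prompts, tokens], dim=1))

def trainable_fraction(model):
    trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
    total = sum(p.numel() for p in model.parameters())
    return trainable / total                        # e.g. on the order of 0.5% as reported above
```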
Affiliation(s)
- Marc Fischer
- Institute of Signal Processing and System Theory, University of Stuttgart, 70550 Stuttgart, Germany.
| | - Alexander Bartler
- Institute of Signal Processing and System Theory, University of Stuttgart, 70550 Stuttgart, Germany
| | - Bin Yang
- Institute of Signal Processing and System Theory, University of Stuttgart, 70550 Stuttgart, Germany
26
Zhou J, Zhao M, Yang Z, Chen L, Liu X. Exploring the Value of MRI Measurement of Hippocampal Volume for Predicting the Occurrence and Progression of Alzheimer's Disease Based on Artificial Intelligence Deep Learning Technology and Evidence-Based Medicine Meta-Analysis. J Alzheimers Dis 2024; 97:1275-1288. [PMID: 38277290 DOI: 10.3233/jad-230733] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2024]
Abstract
BACKGROUND Alzheimer's disease (AD), a major dementia cause, lacks effective treatment. MRI-based hippocampal volume measurement using artificial intelligence offers new insights into early diagnosis and intervention in AD progression. OBJECTIVE This study, involving 483 AD patients, 756 patients with mild cognitive impairment (MCI), and 968 normal controls (NC), investigated the predictive capability of MRI-based hippocampus volume measurements for AD risk using artificial intelligence and evidence-based medicine. METHODS Utilizing data from ADNI and OASIS-brains databases, three convolutional neural networks (InceptionResNetv2, Densenet169, and SEResNet50) were employed for automated AD classification based on structural MRI imaging. A multitask deep learning model and a densely connected 3D convolutional network were utilized. Additionally, a systematic meta-analysis explored the value of MRI-based hippocampal volume measurement in predicting AD occurrence and progression, drawing on 23 eligible articles from PubMed and Embase databases. RESULTS InceptionResNetv2 outperformed other networks, achieving 99.75% accuracy and 100% AUC for AD-NC classification and 99.16% accuracy and 100% AUC for MCI-NC classification. Notably, at a 512×512 size, InceptionResNetv2 demonstrated a classification accuracy of 94.29% and an AUC of 98% for AD-NC and 97.31% accuracy and 98% AUC for MCI-NC. CONCLUSIONS The study concludes that MRI-based hippocampal volume changes effectively predict AD onset and progression, facilitating early intervention and prevention.
Affiliation(s)
- Jianguo Zhou
- Department of Radiology, Lianyungang TCM Hospital Affiliated to Nanjing University of Chinese Medicine, Lianyungang, China
| | - Mingli Zhao
- Department of Radiology, The Fourth People's Hospital of Lianyungang Affiliated to Nanjing Medical University Kangda, Lianyungang, China
| | - Zhou Yang
- Department of Rehabilitation, Lianyungang TCM Hospital Affiliated to Nanjing University of Chinese Medicine, Lianyungang, China
| | - Liping Chen
- Department of Rehabilitation, Lianyungang TCM Hospital Affiliated to Nanjing University of Chinese Medicine, Lianyungang, China
| | - Xiaoli Liu
- Department of Rehabilitation, Lianyungang TCM Hospital Affiliated to Nanjing University of Chinese Medicine, Lianyungang, China
27
Liu F, Zhu T, Wu X, Yang B, You C, Wang C, Lu L, Liu Z, Zheng Y, Sun X, Yang Y, Clifton L, Clifton DA. A medical multimodal large language model for future pandemics. NPJ Digit Med 2023; 6:226. [PMID: 38042919 PMCID: PMC10693607 DOI: 10.1038/s41746-023-00952-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Accepted: 10/24/2023] [Indexed: 12/04/2023] Open
Abstract
Deep neural networks have been integrated into the whole clinical decision procedure which can improve the efficiency of diagnosis and alleviate the heavy workload of physicians. Since most neural networks are supervised, their performance heavily depends on the volume and quality of available labels. However, few such labels exist for rare diseases (e.g., new pandemics). Here we report a medical multimodal large language model (Med-MLLM) for radiograph representation learning, which can learn broad medical knowledge (e.g., image understanding, text semantics, and clinical phenotypes) from unlabelled data. As a result, when encountering a rare disease, our Med-MLLM can be rapidly deployed and easily adapted to them with limited labels. Furthermore, our model supports medical data across visual modality (e.g., chest X-ray and CT) and textual modality (e.g., medical report and free-text clinical note); therefore, it can be used for clinical tasks that involve both visual and textual data. We demonstrate the effectiveness of our Med-MLLM by showing how it would perform using the COVID-19 pandemic "in replay". In the retrospective setting, we test the model on the early COVID-19 datasets; and in the prospective setting, we test the model on the new variant COVID-19-Omicron. The experiments are conducted on 1) three kinds of input data; 2) three kinds of downstream tasks, including disease reporting, diagnosis, and prognosis; 3) five COVID-19 datasets; and 4) three different languages, including English, Chinese, and Spanish. All experiments show that our model can make accurate and robust COVID-19 decision-support with little labelled data.
Affiliation(s)
- Fenglin Liu
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK.
| | - Tingting Zhu
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK
| | - Xian Wu
- Jarvis Research Center, Tencent YouTu Lab, Beijing, China
| | - Bang Yang
- School of Computer Science, Peking University, Beijing, China
| | | | - Chenyang Wang
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK
| | - Lei Lu
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK
| | - Zhangdaihong Liu
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK
- Oxford-Suzhou Centre for Advanced Research, Suzhou, China
| | - Yefeng Zheng
- Jarvis Research Center, Tencent YouTu Lab, Beijing, China
| | - Xu Sun
- School of Computer Science, Peking University, Beijing, China
| | - Yang Yang
- School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Lei Clifton
- Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - David A Clifton
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK.
- Oxford-Suzhou Centre for Advanced Research, Suzhou, China.
28
Lin G, Zhang Z, Long K, Zhang Y, Lu Y, Geng J, Zhou Z, Feng Q, Lu L, Cao L. GCLR: A self-supervised representation learning pretext task for glomerular filtration barrier segmentation in TEM images. Artif Intell Med 2023; 146:102720. [PMID: 38042604 DOI: 10.1016/j.artmed.2023.102720] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2022] [Revised: 10/04/2023] [Accepted: 11/14/2023] [Indexed: 12/04/2023]
Abstract
Automatic segmentation of the three substructures of glomerular filtration barrier (GFB) in transmission electron microscopy (TEM) images holds immense potential for aiding pathologists in renal disease diagnosis. However, the labor-intensive nature of manual annotations limits the training data for a fully-supervised deep learning model. Addressing this, our study harnesses self-supervised representation learning (SSRL) to utilize vast unlabeled data and mitigate annotation scarcity. Our innovation, GCLR, is a hybrid pixel-level pretext task tailored for GFB segmentation, integrating two subtasks: global clustering (GC) and local restoration (LR). GC captures the overall GFB by learning global context representations, while LR refines three substructures by learning local detail representations. Experiments on 18,928 unlabeled glomerular TEM images for self-supervised pre-training and 311 labeled ones for fine-tuning demonstrate that our proposed GCLR obtains the state-of-the-art segmentation results for all three substructures of GFB with the Dice similarity coefficient of 86.56 ± 0.16%, 75.56 ± 0.36%, and 79.41 ± 0.16%, respectively, compared with other representative self-supervised pretext tasks. Our proposed GCLR also outperforms the fully-supervised pre-training methods based on the three large-scale public datasets - MitoEM, COCO, and ImageNet - with less training data and time.
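Schematically, a hybrid pixel-level pretext task of this kind can be written as a weighted sum of a global clustering term and a local restoration term; the sketch below is only that schematic, with placeholder prototypes and pseudo-labels, and is not the GCLR implementation.

```python
# Schematic two-term pretext loss: global clustering (cross-entropy against
# cluster assignments) plus local restoration (pixel reconstruction).
import torch
import torch.nn.functional as F

def clustering_plus_restoration_loss(features, prototypes, assignments,
                                     restored, original,
                                     temperature=0.1, w_gc=1.0, w_lr=1.0):
    """features: (N, D) embeddings; prototypes: (K, D) cluster centers;
    assignments: (N,) integer pseudo-labels; restored/original: image tensors."""
    feats = F.normalize(features, dim=1)
    protos = F.normalize(prototypes, dim=1)
    gc = F.cross_entropy(feats @ protos.t() / temperature, assignments)  # global clustering
    lr = F.mse_loss(restored, original)                                  # local restoration
    return w_gc * gc + w_lr * lr
```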
Affiliation(s)
- Guoyu Lin
- School of Biomedical Engineering, Southern Medical University, Guangzhou, 510515, China; Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, Guangzhou, 510515, China; Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, Guangzhou, 510515, China
| | - Zhentai Zhang
- School of Biomedical Engineering, Southern Medical University, Guangzhou, 510515, China; Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, Guangzhou, 510515, China; Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, Guangzhou, 510515, China
| | - Kaixing Long
- School of Biomedical Engineering, Southern Medical University, Guangzhou, 510515, China; Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, Guangzhou, 510515, China; Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, Guangzhou, 510515, China
| | - Yiwen Zhang
- School of Biomedical Engineering, Southern Medical University, Guangzhou, 510515, China; Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, Guangzhou, 510515, China; Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, Guangzhou, 510515, China
| | - Yanmeng Lu
- Central Laboratory, Southern Medical University, Guangzhou, 510515, China
| | - Jian Geng
- Department of Pathology, School of Basic Medical Sciences, Southern Medical University, Guangzhou, 510515, China; Guangzhou Huayin Medical Laboratory Center, Guangzhou, 510515, China
| | - Zhitao Zhou
- Central Laboratory, Southern Medical University, Guangzhou, 510515, China
| | - Qianjin Feng
- School of Biomedical Engineering, Southern Medical University, Guangzhou, 510515, China; Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, Guangzhou, 510515, China; Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, Guangzhou, 510515, China
| | - Lijun Lu
- School of Biomedical Engineering, Southern Medical University, Guangzhou, 510515, China; Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, Guangzhou, 510515, China; Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, Guangzhou, 510515, China.
| | - Lei Cao
- School of Biomedical Engineering, Southern Medical University, Guangzhou, 510515, China; Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, Guangzhou, 510515, China; Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, Guangzhou, 510515, China.
29
Taher MRH, Gotway MB, Liang J. Towards Foundation Models Learned from Anatomy in Medical Imaging via Self-supervision. DOMAIN ADAPTATION AND REPRESENTATION TRANSFER: 5TH MICCAI WORKSHOP, DART 2023, HELD IN CONJUNCTION WITH MICCAI 2023, VANCOUVER, BC, CANADA, OCTOBER 12, 2023, PROCEEDINGS 2023; 14293:94-104. [PMID: 38752223 PMCID: PMC11095552 DOI: 10.1007/978-3-031-45857-6_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2024]
Abstract
Human anatomy is the foundation of medical imaging and boasts one striking characteristic: its hierarchy in nature, exhibiting two intrinsic properties: (1) locality: each anatomical structure is morphologically distinct from the others; and (2) compositionality: each anatomical structure is an integrated part of a larger whole. We envision a foundation model for medical imaging that is consciously and purposefully developed upon this foundation to gain the capability of "understanding" human anatomy and to possess the fundamental properties of medical imaging. As our first step in realizing this vision towards foundation models in medical imaging, we devise a novel self-supervised learning (SSL) strategy that exploits the hierarchical nature of human anatomy. Our extensive experiments demonstrate that the SSL pretrained model, derived from our training strategy, not only outperforms state-of-the-art (SOTA) fully/self-supervised baselines but also enhances annotation efficiency, offering potential few-shot segmentation capabilities with performance improvements ranging from 9% to 30% for segmentation tasks compared to SSL baselines. This performance is attributed to the significance of anatomy comprehension via our learning strategy, which encapsulates the intrinsic attributes of anatomical structures-locality and compositionality-within the embedding space, yet overlooked in existing SSL methods. All code and pretrained models are available at GitHub.com/JLiangLab/Eden.
30
Kazemimoghadam M, Yang Z, Chen M, Ma L, Lu W, Gu X. Leveraging global binary masks for structure segmentation in medical images. Phys Med Biol 2023; 68:10.1088/1361-6560/acf2e2. [PMID: 37607564 PMCID: PMC10511220 DOI: 10.1088/1361-6560/acf2e2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2022] [Accepted: 08/22/2023] [Indexed: 08/24/2023]
Abstract
Deep learning (DL) models for medical image segmentation are highly influenced by intensity variations of input images and lack generalization due to primarily utilizing pixels' intensity information for inference. Acquiring sufficient training data is another challenge limiting models' applications. Here, we proposed to leverage the consistency of organs' anatomical position and shape information in medical images. We introduced a framework leveraging recurring anatomical patterns through global binary masks for organ segmentation. Two scenarios were studied: (1) global binary masks were the only input for the U-Net based model, forcing exclusively encoding organs' position and shape information for rough segmentation or localization. (2) Global binary masks were incorporated as an additional channel providing position/shape clues to mitigate training data scarcity. Two datasets of the brain and heart computed tomography (CT) images with their ground-truth were split into (26:10:10) and (12:3:5) for training, validation, and test respectively. The two scenarios were evaluated using the full training split as well as reduced subsets of training data. In scenario (1), training exclusively on global binary masks led to Dice scores of 0.77 ± 0.06 and 0.85 ± 0.04 for the brain and heart structures respectively. Average Euclidean distances of 3.12 ± 1.43 mm and 2.5 ± 0.93 mm were obtained relative to the center of mass of the ground truth for the brain and heart structures respectively. The outcomes indicated that global binary masks encode a surprising degree of position and shape information. In scenario (2), incorporating global binary masks led to significantly higher accuracy relative to the model trained on only CT images in small subsets of training data; the performance improved by 4.3%-125.3% and 1.3%-48.1% for 1-8 training cases of the brain and heart datasets respectively. The findings imply the advantages of utilizing global binary masks for building models that are robust to image intensity variations as well as an effective approach to boost performance when access to labeled training data is highly limited.
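Scenario (2) above amounts to concatenating the global binary mask to the image as an extra input channel; a minimal wrapper is sketched below, assuming a segmentation network whose first layer accepts two input channels (the wrapper and names are illustrative, not the authors' code).

```python
# Feed a global binary mask as a second input channel alongside the CT image.
import torch
import torch.nn as nn

class MaskAugmentedInput(nn.Module):
    """Wraps any segmentation network whose first layer expects 2 channels."""
    def __init__(self, seg_net):
        super().__init__()
        self.seg_net = seg_net

    def forward(self, ct, global_mask):
        # ct: (B, 1, H, W) intensities; global_mask: (B, 1, H, W) in {0, 1}
        x = torch.cat([ct, global_mask], dim=1)   # position/shape clue as 2nd channel
        return self.seg_net(x)
```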
Affiliation(s)
- Mahdieh Kazemimoghadam
- Department of Radiation Oncology, the University of Texas Southwestern Medical Center, Dallas TX, 75390 USA
| | - Zi Yang
- Department of Radiation Oncology, the University of Texas Southwestern Medical Center, Dallas TX, 75390 USA
| | - Mingli Chen
- Department of Radiation Oncology, the University of Texas Southwestern Medical Center, Dallas TX, 75390 USA
| | - Lin Ma
- Department of Radiation Oncology, the University of Texas Southwestern Medical Center, Dallas TX, 75390 USA
| | - Weiguo Lu
- Department of Radiation Oncology, the University of Texas Southwestern Medical Center, Dallas TX, 75390 USA
| | - Xuejun Gu
- Department of Radiation Oncology, the University of Texas Southwestern Medical Center, Dallas TX, 75390 USA
- Department of Radiation Oncology, Stanford University, Stanford, CA 94305
31
Chen Y, Lu X, Xie Q. Collaborative networks of transformers and convolutional neural networks are powerful and versatile learners for accurate 3D medical image segmentation. Comput Biol Med 2023; 164:107228. [PMID: 37473563 DOI: 10.1016/j.compbiomed.2023.107228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 06/13/2023] [Accepted: 07/01/2023] [Indexed: 07/22/2023]
Abstract
Integrating transformers and convolutional neural networks represents a crucial and cutting-edge approach for tackling medical image segmentation problems. Nonetheless, the existing hybrid methods fail to fully leverage the strengths of both operators. During the Patch Embedding, the patch projection method ignores the two-dimensional structure and local spatial information within each patch, while the fixed patch size cannot capture features with rich representation effectively. Moreover, the calculation of self-attention results in attention diffusion, hindering the provision of precise details to the decoder while maintaining feature consistency. Lastly, none of the existing methods establish an efficient multi-scale modeling concept. To address these issues, we design the Collaborative Networks of Transformers and Convolutional neural networks (TC-CoNet), which is generally used for accurate 3D medical image segmentation. First, we elaborately design precise patch embedding to generate 3D features with accurate spatial position information, laying a solid foundation for subsequent learning. The encoder-decoder backbone network is then constructed by TC-CoNet in an interlaced combination to properly incorporate long-range dependencies and hierarchical object concepts at various scales. Furthermore, we employ the constricted attention bridge to constrict attention to local features, allowing us to accurately guide the recovery of detailed information while maintaining feature consistency. Finally, atrous spatial pyramid pooling is applied to high-level feature map to establish the concept of multi-scale objects. On five challenging datasets, including Synapse, ACDC, brain tumor segmentation, cardiac left atrium segmentation, and lung tumor segmentation, the extensive experiments demonstrate that TC-CoNet outperforms state-of-the-art approaches in terms of superiority, migration, and strong generalization. These illustrate in full the efficacy of the proposed transformers and convolutional neural networks combination for medical image segmentation. Our code is freely available at: https://github.com/YongChen-Exact/TC-CoNet.
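One of the standard components mentioned above, atrous spatial pyramid pooling (ASPP), is sketched below in a compact 2D form for illustration (TC-CoNet itself operates on 3D volumes); channel sizes and dilation rates are placeholders rather than the authors' configuration.

```python
# Compact 2D atrous spatial pyramid pooling: parallel dilated convolutions
# capture multi-scale context, then a 1x1 projection fuses the branches.
import torch
import torch.nn as nn

class ASPP(nn.Module):
    def __init__(self, in_ch, out_ch, rates=(1, 6, 12, 18)):
        super().__init__()
        self.branches = nn.ModuleList(
            nn.Sequential(
                nn.Conv2d(in_ch, out_ch, 3, padding=r, dilation=r, bias=False),
                nn.BatchNorm2d(out_ch),
                nn.ReLU(inplace=True),
            )
            for r in rates
        )
        self.project = nn.Conv2d(out_ch * len(rates), out_ch, 1)

    def forward(self, x):
        return self.project(torch.cat([b(x) for b in self.branches], dim=1))
```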
Affiliation(s)
- Yong Chen
- School of Biomedical Engineering, South-Central Minzu University, Wuhan, 430074, Hubei, China
| | - Xuesong Lu
- School of Biomedical Engineering, South-Central Minzu University, Wuhan, 430074, Hubei, China
| | - Qinlan Xie
- School of Biomedical Engineering, South-Central Minzu University, Wuhan, 430074, Hubei, China.
32
Huang SC, Pareek A, Jensen M, Lungren MP, Yeung S, Chaudhari AS. Self-supervised learning for medical image classification: a systematic review and implementation guidelines. NPJ Digit Med 2023; 6:74. [PMID: 37100953 PMCID: PMC10131505 DOI: 10.1038/s41746-023-00811-0] [Citation(s) in RCA: 71] [Impact Index Per Article: 35.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2022] [Accepted: 03/30/2023] [Indexed: 04/28/2023] Open
Abstract
Advancements in deep learning and computer vision provide promising solutions for medical image analysis, potentially improving healthcare and patient outcomes. However, the prevailing paradigm of training deep learning models requires large quantities of labeled training data, which is both time-consuming and cost-prohibitive to curate for medical images. Self-supervised learning has the potential to make significant contributions to the development of robust medical imaging models through its ability to learn useful insights from copious medical datasets without labels. In this review, we provide consistent descriptions of different self-supervised learning strategies and compose a systematic review of papers published between 2012 and 2022 on PubMed, Scopus, and ArXiv that applied self-supervised learning to medical imaging classification. We screened a total of 412 relevant studies and included 79 papers for data extraction and analysis. With this comprehensive effort, we synthesize the collective knowledge of prior work and provide implementation guidelines for future researchers interested in applying self-supervised learning to their development of medical imaging classification models.
Affiliation(s)
- Shih-Cheng Huang
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA.
- Center for Artificial Intelligence in Medicine & Imaging, Stanford University, Stanford, CA, USA.
| | - Anuj Pareek
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA
- Center for Artificial Intelligence in Medicine & Imaging, Stanford University, Stanford, CA, USA
| | - Malte Jensen
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA
| | - Matthew P Lungren
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA
- Center for Artificial Intelligence in Medicine & Imaging, Stanford University, Stanford, CA, USA
- Department of Radiology, Stanford University, Stanford, CA, USA
| | - Serena Yeung
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA
- Center for Artificial Intelligence in Medicine & Imaging, Stanford University, Stanford, CA, USA
- Department of Computer Science, Stanford University, Stanford, CA, USA
- Department of Electrical Engineering, Stanford University, Stanford, CA, USA
- Clinical Excellence Research Center, Stanford University School of Medicine, Stanford, CA, USA
| | - Akshay S Chaudhari
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA
- Center for Artificial Intelligence in Medicine & Imaging, Stanford University, Stanford, CA, USA
- Department of Radiology, Stanford University, Stanford, CA, USA
- Stanford Cardiovascular Institute, Stanford University, Stanford, CA, USA
33
Yang Z, Xie L, Zhou W, Huo X, Wei L, Lu J, Tian Q, Tang S. VoxSeP: semi-positive voxels assist self-supervised 3D medical segmentation. MULTIMEDIA SYSTEMS 2023; 29:33-48. [DOI: 10.1007/s00530-022-00977-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Accepted: 06/28/2022] [Indexed: 01/23/2025]
34
Bilic P, Christ P, Li HB, Vorontsov E, Ben-Cohen A, Kaissis G, Szeskin A, Jacobs C, Mamani GEH, Chartrand G, Lohöfer F, Holch JW, Sommer W, Hofmann F, Hostettler A, Lev-Cohain N, Drozdzal M, Amitai MM, Vivanti R, Sosna J, Ezhov I, Sekuboyina A, Navarro F, Kofler F, Paetzold JC, Shit S, Hu X, Lipková J, Rempfler M, Piraud M, Kirschke J, Wiestler B, Zhang Z, Hülsemeyer C, Beetz M, Ettlinger F, Antonelli M, Bae W, Bellver M, Bi L, Chen H, Chlebus G, Dam EB, Dou Q, Fu CW, Georgescu B, Giró-I-Nieto X, Gruen F, Han X, Heng PA, Hesser J, Moltz JH, Igel C, Isensee F, Jäger P, Jia F, Kaluva KC, Khened M, Kim I, Kim JH, Kim S, Kohl S, Konopczynski T, Kori A, Krishnamurthi G, Li F, Li H, Li J, Li X, Lowengrub J, Ma J, Maier-Hein K, Maninis KK, Meine H, Merhof D, Pai A, Perslev M, Petersen J, Pont-Tuset J, Qi J, Qi X, Rippel O, Roth K, Sarasua I, Schenk A, Shen Z, Torres J, Wachinger C, Wang C, Weninger L, Wu J, Xu D, Yang X, Yu SCH, Yuan Y, Yue M, Zhang L, Cardoso J, Bakas S, Braren R, Heinemann V, Pal C, Tang A, Kadoury S, Soler L, van Ginneken B, Greenspan H, Joskowicz L, Menze B. The Liver Tumor Segmentation Benchmark (LiTS). Med Image Anal 2023; 84:102680. [PMID: 36481607 PMCID: PMC10631490 DOI: 10.1016/j.media.2022.102680] [Citation(s) in RCA: 175] [Impact Index Per Article: 87.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2021] [Revised: 09/27/2022] [Accepted: 10/29/2022] [Indexed: 11/18/2022]
Abstract
In this work, we report the set-up and results of the Liver Tumor Segmentation Benchmark (LiTS), which was organized in conjunction with the IEEE International Symposium on Biomedical Imaging (ISBI) 2017 and the International Conferences on Medical Image Computing and Computer-Assisted Intervention (MICCAI) 2017 and 2018. The image dataset is diverse and contains primary and secondary tumors with varied sizes and appearances with various lesion-to-background levels (hyper-/hypo-dense), created in collaboration with seven hospitals and research institutions. Seventy-five submitted liver and liver tumor segmentation algorithms were trained on a set of 131 computed tomography (CT) volumes and were tested on 70 unseen test images acquired from different patients. We found that not a single algorithm performed best for both liver and liver tumors in the three events. The best liver segmentation algorithm achieved a Dice score of 0.963, whereas, for tumor segmentation, the best algorithms achieved Dices scores of 0.674 (ISBI 2017), 0.702 (MICCAI 2017), and 0.739 (MICCAI 2018). Retrospectively, we performed additional analysis on liver tumor detection and revealed that not all top-performing segmentation algorithms worked well for tumor detection. The best liver tumor detection method achieved a lesion-wise recall of 0.458 (ISBI 2017), 0.515 (MICCAI 2017), and 0.554 (MICCAI 2018), indicating the need for further research. LiTS remains an active benchmark and resource for research, e.g., contributing the liver-related segmentation tasks in http://medicaldecathlon.com/. In addition, both data and online evaluation are accessible via https://competitions.codalab.org/competitions/17094.
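The lesion-wise recall reported above counts a ground-truth lesion as detected when any predicted voxel overlaps its connected component; a minimal version of that evaluation is sketched below using SciPy's connected-component labelling (the exact LiTS evaluation protocol may differ in details such as overlap thresholds).

```python
# Lesion-wise recall: fraction of ground-truth connected components touched by
# the prediction.
import numpy as np
from scipy import ndimage

def lesion_wise_recall(pred, truth):
    labeled, n_lesions = ndimage.label(truth.astype(bool))
    if n_lesions == 0:
        return float("nan")                    # undefined when there is no lesion
    detected = sum(
        1 for i in range(1, n_lesions + 1)
        if np.logical_and(labeled == i, pred.astype(bool)).any()
    )
    return detected / n_lesions
```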
Affiliation(s)
- Patrick Bilic
- Department of Informatics, Technical University of Munich, Germany
| | - Patrick Christ
- Department of Informatics, Technical University of Munich, Germany
| | - Hongwei Bran Li
- Department of Informatics, Technical University of Munich, Germany; Department of Quantitative Biomedicine, University of Zurich, Switzerland.
| | | | - Avi Ben-Cohen
- Department of Biomedical Engineering, Tel-Aviv University, Israel
| | - Georgios Kaissis
- Institute for AI in Medicine, Technical University of Munich, Germany; Institute for diagnostic and interventional radiology, Klinikum rechts der Isar, Technical University of Munich, Germany; Department of Computing, Imperial College London, London, United Kingdom
| | - Adi Szeskin
- School of Computer Science and Engineering, the Hebrew University of Jerusalem, Israel
| | - Colin Jacobs
- Department of Medical Imaging, Radboud University Medical Center, Nijmegen, The Netherlands
| | | | - Gabriel Chartrand
- The University of Montréal Hospital Research Centre (CRCHUM) Montréal, Québec, Canada
| | - Fabian Lohöfer
- Institute for diagnostic and interventional radiology, Klinikum rechts der Isar, Technical University of Munich, Germany
| | - Julian Walter Holch
- Department of Medicine III, University Hospital, LMU Munich, Munich, Germany; Comprehensive Cancer Center Munich, Munich, Germany; Division of Medical Image Computing, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Wieland Sommer
- Department of Radiology, University Hospital, LMU Munich, Germany
| | - Felix Hofmann
- Department of General, Visceral and Transplantation Surgery, University Hospital, LMU Munich, Germany; Department of Radiology, University Hospital, LMU Munich, Germany
| | - Alexandre Hostettler
- Department of Surgical Data Science, Institut de Recherche contre les Cancers de l'Appareil Digestif (IRCAD), France
| | - Naama Lev-Cohain
- Department of Radiology, Hadassah University Medical Center, Jerusalem, Israel
| | | | | | | | - Jacob Sosna
- Department of Radiology, Hadassah University Medical Center, Jerusalem, Israel
| | - Ivan Ezhov
- Department of Informatics, Technical University of Munich, Germany
| | - Anjany Sekuboyina
- Department of Informatics, Technical University of Munich, Germany; Department of Quantitative Biomedicine, University of Zurich, Switzerland
| | - Fernando Navarro
- Department of Informatics, Technical University of Munich, Germany; Department of Radiation Oncology and Radiotherapy, Klinikum rechts der Isar, Technical University of Munich, Germany; TranslaTUM - Central Institute for Translational Cancer Research, Technical University of Munich, Germany
| | - Florian Kofler
- Department of Informatics, Technical University of Munich, Germany; Institute for diagnostic and interventional neuroradiology, Klinikum rechts der Isar,Technical University of Munich, Germany; Helmholtz AI, Helmholtz Zentrum München, Neuherberg, Germany; TranslaTUM - Central Institute for Translational Cancer Research, Technical University of Munich, Germany
| | - Johannes C Paetzold
- Department of Computing, Imperial College London, London, United Kingdom; Institute for Tissue Engineering and Regenerative Medicine, Helmholtz Zentrum München, Neuherberg, Germany
| | - Suprosanna Shit
- Department of Informatics, Technical University of Munich, Germany
| | - Xiaobin Hu
- Department of Informatics, Technical University of Munich, Germany
| | - Jana Lipková
- Brigham and Women's Hospital, Harvard Medical School, USA
| | - Markus Rempfler
- Department of Informatics, Technical University of Munich, Germany
| | - Marie Piraud
- Department of Informatics, Technical University of Munich, Germany; Helmholtz AI, Helmholtz Zentrum München, Neuherberg, Germany
| | - Jan Kirschke
- Institute for diagnostic and interventional neuroradiology, Klinikum rechts der Isar,Technical University of Munich, Germany
| | - Benedikt Wiestler
- Institute for diagnostic and interventional neuroradiology, Klinikum rechts der Isar,Technical University of Munich, Germany
| | - Zhiheng Zhang
- Department of Hepatobiliary Surgery, the Affiliated Drum Tower Hospital of Nanjing University Medical School, China
| | | | - Marcel Beetz
- Department of Informatics, Technical University of Munich, Germany
| | | | - Michela Antonelli
- School of Biomedical Engineering & Imaging Sciences, King's College London, London, UK
| | | | | | - Lei Bi
- School of Computer Science, the University of Sydney, Australia
| | - Hao Chen
- Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, China
| | - Grzegorz Chlebus
- Fraunhofer MEVIS, Bremen, Germany; Diagnostic Image Analysis Group, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Erik B Dam
- Department of Computer Science, University of Copenhagen, Denmark
| | - Qi Dou
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, China
| | - Chi-Wing Fu
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, China
| | | | - Xavier Giró-I-Nieto
- Signal Theory and Communications Department, Universitat Politecnica de Catalunya, Catalonia, Spain
| | - Felix Gruen
- Institute of Control Engineering, Technische Universität Braunschweig, Germany
| | - Xu Han
- Department of computer science, UNC Chapel Hill, USA
| | - Pheng-Ann Heng
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, China
| | - Jürgen Hesser
- Mannheim Institute for Intelligent Systems in Medicine, department of Medicine Mannheim, Heidelberg University, Germany; Interdisciplinary Center for Scientific Computing (IWR), Heidelberg University, Germany; Central Institute for Computer Engineering (ZITI), Heidelberg University, Germany
| | | | - Christian Igel
- Department of Computer Science, University of Copenhagen, Denmark
| | - Fabian Isensee
- Division of Medical Image Computing, German Cancer Research Center (DKFZ), Heidelberg, Germany; Helmholtz Imaging, Germany
| | - Paul Jäger
- Division of Medical Image Computing, German Cancer Research Center (DKFZ), Heidelberg, Germany; Helmholtz Imaging, Germany
| | - Fucang Jia
- Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, China
| | - Krishna Chaitanya Kaluva
- Medical Imaging and Reconstruction Lab, Department of Engineering Design, Indian Institute of Technology Madras, India
| | - Mahendra Khened
- Medical Imaging and Reconstruction Lab, Department of Engineering Design, Indian Institute of Technology Madras, India
| | | | - Jae-Hun Kim
- Department of Radiology, Samsung Medical Center, Sungkyunkwan University School of Medicine, South Korea
| | | | - Simon Kohl
- Division of Medical Image Computing, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Tomasz Konopczynski
- Interdisciplinary Center for Scientific Computing (IWR), Heidelberg University, Germany
| | - Avinash Kori
- Medical Imaging and Reconstruction Lab, Department of Engineering Design, Indian Institute of Technology Madras, India
| | - Ganapathy Krishnamurthi
- Medical Imaging and Reconstruction Lab, Department of Engineering Design, Indian Institute of Technology Madras, India
| | - Fan Li
- Sensetime, Shanghai, China
| | - Hongchao Li
- Department of Computer Science, Guangdong University of Foreign Studies, China
| | - Junbo Li
- Philips Research China, Philips China Innovation Campus, Shanghai, China
| | - Xiaomeng Li
- Department of Electrical and Electronic Engineering, The University of Hong Kong, China
| | - John Lowengrub
- Departments of Mathematics, Biomedical Engineering, University of California, Irvine, USA; Center for Complex Biological Systems, University of California, Irvine, USA; Chao Family Comprehensive Cancer Center, University of California, Irvine, USA
| | - Jun Ma
- Department of Mathematics, Nanjing University of Science and Technology, China
| | - Klaus Maier-Hein
- Pattern Analysis and Learning Group, Department of Radiation Oncology, Heidelberg University Hospital, Heidelberg, Germany; Division of Medical Image Computing, German Cancer Research Center (DKFZ), Heidelberg, Germany; Helmholtz Imaging, Germany
| | | | - Hans Meine
- Fraunhofer MEVIS, Bremen, Germany; Medical Image Computing Group, FB3, University of Bremen, Germany
| | - Dorit Merhof
- Institute of Imaging & Computer Vision, RWTH Aachen University, Germany
| | - Akshay Pai
- Department of Computer Science, University of Copenhagen, Denmark
| | - Mathias Perslev
- Department of Computer Science, University of Copenhagen, Denmark
| | - Jens Petersen
- Division of Medical Image Computing, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Jordi Pont-Tuset
- Eidgenössische Technische Hochschule Zurich (ETHZ), Zurich, Switzerland
| | - Jin Qi
- School of Information and Communication Engineering, University of Electronic Science and Technology of China, China
| | - Xiaojuan Qi
- Department of Electrical and Electronic Engineering, The University of Hong Kong, China
| | - Oliver Rippel
- Institute of Imaging & Computer Vision, RWTH Aachen University, Germany
| | | | - Ignacio Sarasua
- Institute for diagnostic and interventional radiology, Klinikum rechts der Isar, Technical University of Munich, Germany; Department of Child and Adolescent Psychiatry, Ludwig-Maximilians-Universität, Munich, Germany
| | - Andrea Schenk
- Fraunhofer MEVIS, Bremen, Germany; Institute for Diagnostic and Interventional Radiology, Hannover Medical School, Hannover, Germany
| | - Zengming Shen
- Beckman Institute, University of Illinois at Urbana-Champaign, USA; Siemens Healthineers, USA
| | - Jordi Torres
- Barcelona Supercomputing Center, Barcelona, Spain; Universitat Politecnica de Catalunya, Catalonia, Spain
| | - Christian Wachinger
- Department of Informatics, Technical University of Munich, Germany; Institute for diagnostic and interventional radiology, Klinikum rechts der Isar, Technical University of Munich, Germany; Department of Child and Adolescent Psychiatry, Ludwig-Maximilians-Universität, Munich, Germany
| | - Chunliang Wang
- Department of Biomedical Engineering and Health Systems, KTH Royal Institute of Technology, Sweden
| | - Leon Weninger
- Institute of Imaging & Computer Vision, RWTH Aachen University, Germany
| | - Jianrong Wu
- Tencent Healthcare (Shenzhen) Co., Ltd, China
| | | | - Xiaoping Yang
- Department of Mathematics, Nanjing University, China
| | - Simon Chun-Ho Yu
- Department of Imaging and Interventional Radiology, Chinese University of Hong Kong, Hong Kong, China
| | - Yading Yuan
- Department of Radiation Oncology, Icahn School of Medicine at Mount Sinai, NY, USA
| | - Miao Yue
- CGG Services (Singapore) Pte. Ltd., Singapore
| | - Liping Zhang
- Department of Imaging and Interventional Radiology, Chinese University of Hong Kong, Hong Kong, China
| | - Jorge Cardoso
- School of Biomedical Engineering & Imaging Sciences, King's College London, London, UK
| | - Spyridon Bakas
- Center for Biomedical Image Computing and Analytics (CBICA), University of Pennsylvania, PA, USA; Department of Radiology, Perelman School of Medicine, University of Pennsylvania, USA; Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, PA, USA
| | - Rickmer Braren
- German Cancer Consortium (DKTK), Germany; Institute for diagnostic and interventional radiology, Klinikum rechts der Isar, Technical University of Munich, Germany; Comprehensive Cancer Center Munich, Munich, Germany
| | - Volker Heinemann
- Department of Hematology/Oncology & Comprehensive Cancer Center Munich, LMU Klinikum Munich, Germany
| | | | - An Tang
- Department of Radiology, Radiation Oncology and Nuclear Medicine, University of Montréal, Canada
| | | | - Luc Soler
- Department of Surgical Data Science, Institut de Recherche contre les Cancers de l'Appareil Digestif (IRCAD), France
| | - Bram van Ginneken
- Department of Medical Imaging, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Hayit Greenspan
- Department of Biomedical Engineering, Tel-Aviv University, Israel
| | - Leo Joskowicz
- School of Computer Science and Engineering, the Hebrew University of Jerusalem, Israel
| | - Bjoern Menze
- Department of Informatics, Technical University of Munich, Germany; Department of Quantitative Biomedicine, University of Zurich, Switzerland
35
Zhou HY, Lu C, Wang L, Yu Y. GraVIS: Grouping Augmented Views From Independent Sources for Dermatology Analysis. IEEE TRANSACTIONS ON MEDICAL IMAGING 2022; 41:3498-3508. [PMID: 36260573 DOI: 10.1109/tmi.2022.3216005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Self-supervised representation learning has been extremely successful in medical image analysis, as it requires no human annotations to provide transferable representations for downstream tasks. Recent self-supervised learning methods are dominated by noise-contrastive estimation (NCE, also known as contrastive learning), which aims to learn invariant visual representations by contrasting one homogeneous image pair with a large number of heterogeneous image pairs in each training step. Nonetheless, NCE-based approaches still suffer from one major problem: a single homogeneous pair is not enough to extract robust and invariant semantic information. Inspired by the archetypical triplet loss, we propose GraVIS, which is specifically optimized for learning self-supervised features from dermatology images, to group homogeneous dermatology images while separating heterogeneous ones. In addition, a hardness-aware attention mechanism is introduced to emphasize homogeneous image views with similar appearance over dissimilar homogeneous ones. GraVIS significantly outperforms its transfer learning and self-supervised learning counterparts in both lesion segmentation and disease classification tasks, sometimes by 5 percentage points under extremely limited supervision. More importantly, when equipped with the pre-trained weights provided by GraVIS, a single model achieves better results than the winners of the well-known ISIC 2017 challenge, who relied heavily on ensemble strategies. Code is available at https://bit.ly/3xiFyjx.
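As an illustration of the grouping idea sketched in this abstract, the snippet below is a minimal, hedged reconstruction of such an objective: it pulls several augmented views of the same dermatology image together, pushes views of other images apart, and applies a hardness-aware weight over homogeneous pairs. The view count, temperature, and softmax-based weighting are illustrative assumptions, not the authors' exact formulation.

```python
# Hedged sketch (not the authors' code): a grouping objective over M augmented
# views per image with a hardness-aware weight on homogeneous pairs.
import torch
import torch.nn.functional as F

def grouping_loss(embeddings: torch.Tensor, temperature: float = 0.1) -> torch.Tensor:
    """embeddings: (N, M, D) -- N images, M homogeneous views each, D-dim features."""
    n, m, d = embeddings.shape
    z = F.normalize(embeddings.reshape(n * m, d), dim=1)       # unit-norm features
    sim = z @ z.t() / temperature                              # pairwise similarities
    group = torch.arange(n).repeat_interleave(m)               # group id of each view
    pos_mask = (group[:, None] == group[None, :])              # homogeneous pairs
    self_mask = torch.eye(n * m, dtype=torch.bool)
    pos_mask = pos_mask & ~self_mask

    # Assumed hardness-aware weighting: upweight homogeneous pairs that already
    # look similar, matching the emphasis described in the abstract.
    with torch.no_grad():
        weights = torch.softmax(sim.masked_fill(~pos_mask, float("-inf")), dim=1)

    log_prob = sim - torch.logsumexp(sim.masked_fill(self_mask, float("-inf")),
                                     dim=1, keepdim=True)
    return -(weights * log_prob).masked_fill(~pos_mask, 0.0).sum(dim=1).mean()

# Toy usage: 8 images, 3 augmented views each, 128-dim embeddings.
loss = grouping_loss(torch.randn(8, 3, 128))
```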
Collapse
|
36
|
Pang J, Haghighi F, Ma D, Islam NU, Taher MRH, Gotway MB, Liang J. POPAR: Patch Order Prediction and Appearance Recovery for Self-supervised Medical Image Analysis. DOMAIN ADAPTATION AND REPRESENTATION TRANSFER : 4TH MICCAI WORKSHOP, DART 2022, HELD IN CONJUNCTION WITH MICCAI 2022, SINGAPORE, SEPTEMBER 22, 2022, PROCEEDINGS 2022; 13542:77-87. [PMID: 36507898 PMCID: PMC9728135 DOI: 10.1007/978-3-031-16852-9_8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
Vision transformer-based self-supervised learning (SSL) approaches have recently shown substantial success in learning visual representations from unannotated photographic images. However, their acceptance in medical imaging is still lukewarm, due to the significant discrepancy between medical and photographic images. Consequently, we propose POPAR (patch order prediction and appearance recovery), a novel vision transformer-based self-supervised learning framework for chest X-ray images. POPAR leverages the benefits of vision transformers and unique properties of medical imaging, aiming to simultaneously learn patch-wise high-level contextual features by correcting shuffled patch orders and fine-grained features by recovering patch appearance. We transfer POPAR pretrained models to diverse downstream tasks. The experimental results suggest that (1) POPAR outperforms state-of-the-art (SoTA) self-supervised models with a vision transformer backbone; (2) POPAR achieves significantly better performance than all three SoTA contrastive learning methods; and (3) POPAR also outperforms fully-supervised pretrained models across architectures. In addition, our ablation study suggests that to achieve better performance on medical imaging tasks, both fine-grained and global contextual features are preferred. All code and models are available at GitHub.com/JLiangLab/POPAR.
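To make the two pretext tasks concrete, here is a toy, hedged sketch of a transformer over flattened patches with an order-prediction head and an appearance-recovery head; the tiny encoder, the noise-based appearance distortion, and all dimensions are assumptions for illustration and do not reproduce the released POPAR models.

```python
# Hedged sketch (assumptions throughout, not the released POPAR models): a toy
# transformer over flattened patches with an order-prediction head and an
# appearance-recovery head.
import torch
import torch.nn as nn

class ToyPOPAR(nn.Module):
    def __init__(self, num_patches: int, patch_dim: int, embed_dim: int = 256):
        super().__init__()
        self.embed = nn.Linear(patch_dim, embed_dim)
        layer = nn.TransformerEncoderLayer(embed_dim, nhead=8, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=4)
        self.order_head = nn.Linear(embed_dim, num_patches)    # which position was this patch?
        self.recover_head = nn.Linear(embed_dim, patch_dim)    # what did this patch look like?

    def forward(self, patches):                                # patches: (B, N, patch_dim)
        feats = self.encoder(self.embed(patches))
        return self.order_head(feats), self.recover_head(feats)

def popar_style_losses(model, patches, noise_std=0.1):
    """patches: (B, N, patch_dim) flattened image patches in their true order."""
    b, n, _ = patches.shape
    perm = torch.stack([torch.randperm(n) for _ in range(b)])  # per-image shuffle
    idx = perm.unsqueeze(-1).expand_as(patches)
    shuffled = torch.gather(patches, 1, idx)                   # shuffle patch order
    corrupted = shuffled + noise_std * torch.randn_like(shuffled)   # distort appearance
    order_logits, recovered = model(corrupted)
    order_loss = nn.functional.cross_entropy(order_logits.reshape(b * n, n),
                                             perm.reshape(b * n))
    recover_loss = nn.functional.mse_loss(recovered, shuffled)      # recover clean patches
    return order_loss, recover_loss

# Toy usage: 4 images, 49 patches of dimension 192 each.
model = ToyPOPAR(num_patches=49, patch_dim=192)
order_loss, recover_loss = popar_style_losses(model, torch.randn(4, 49, 192))
```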
Collapse
Affiliation(s)
| | | | - DongAo Ma
- Arizona State University, Tempe, AZ 85281, USA
| | | | | | | | | |
Collapse
|
37
|
Ma D, Taher MRH, Pang J, Islam NU, Haghighi F, Gotway MB, Liang J. Benchmarking and Boosting Transformers for Medical Image Classification. DOMAIN ADAPTATION AND REPRESENTATION TRANSFER : 4TH MICCAI WORKSHOP, DART 2022, HELD IN CONJUNCTION WITH MICCAI 2022, SINGAPORE, SEPTEMBER 22, 2022, PROCEEDINGS 2022; 13542:12-22. [PMID: 36383492 PMCID: PMC9646404 DOI: 10.1007/978-3-031-16852-9_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Visual transformers have recently gained popularity in the computer vision community as they began to outrank convolutional neural networks (CNNs) in one representative visual benchmark after another. However, the competition between visual transformers and CNNs in medical imaging is rarely studied, leaving many important questions unanswered. As the first step, we benchmark how well existing transformer variants that use various (supervised and self-supervised) pre-training methods perform against CNNs on a variety of medical classification tasks. Furthermore, given the data-hungry nature of transformers and the annotation-deficiency challenge of medical imaging, we present a practical approach for bridging the domain gap between photographic and medical images by utilizing unlabeled large-scale in-domain data. Our extensive empirical evaluations reveal the following insights in medical imaging: (1) good initialization is more crucial for transformer-based models than for CNNs, (2) self-supervised learning based on masked image modeling captures more generalizable representations than supervised models, and (3) assembling a larger-scale domain-specific dataset can better bridge the domain gap between photographic and medical images via self-supervised continual pre-training. We hope this benchmark study can direct future research on applying transformers to medical image analysis. All codes and pre-trained models are available on our GitHub page https://github.com/JLiangLab/BenchmarkTransformers.
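The domain-adaptive recipe described here can be pictured roughly as follows: initialize from ImageNet weights, then keep pre-training on unlabeled in-domain medical images with a self-supervised objective before fine-tuning. The sketch below uses torchvision's vit_b_16 as the backbone and a crude mask-and-reconstruct objective with a low-resolution target purely as a stand-in; the paper's actual pretext task, datasets, and schedules are not reproduced.

```python
# Hedged sketch of domain-adaptive continual pre-training: start from ImageNet
# weights and keep pre-training on unlabeled in-domain medical images with a
# stand-in self-supervised objective (mask half the image, reconstruct a
# low-resolution target). Backbone, masking, and target size are assumptions.
import torch
import torch.nn as nn
import torchvision

backbone = torchvision.models.vit_b_16(weights="DEFAULT")   # supervised ImageNet init
backbone.heads = nn.Identity()                               # expose the 768-d [CLS] feature
decoder = nn.Sequential(nn.Linear(768, 1024), nn.ReLU(), nn.Linear(1024, 3 * 64 * 64))
optimizer = torch.optim.AdamW(list(backbone.parameters()) + list(decoder.parameters()), lr=1e-4)

def continual_pretrain_step(images):
    """images: (B, 3, 224, 224) unlabeled in-domain medical images."""
    masked = images.clone()
    masked[:, :, :, 112:] = 0.0                               # crude mask of the right half
    feats = backbone(masked)                                  # (B, 768)
    recon = decoder(feats).view(-1, 3, 64, 64)
    target = nn.functional.interpolate(images, size=(64, 64)) # low-res reconstruction target
    loss = nn.functional.mse_loss(recon, target)
    optimizer.zero_grad(); loss.backward(); optimizer.step()
    return loss.item()
```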
Collapse
Affiliation(s)
- DongAo Ma
- Arizona State University, Tempe, AZ 85281, USA
| | | | | | | | | | | | | |
Collapse
|
38
|
Guo Z, Islam NU, Gotway MB, Liang J. Discriminative, Restorative, and Adversarial Learning: Stepwise Incremental Pretraining. DOMAIN ADAPTATION AND REPRESENTATION TRANSFER : 4TH MICCAI WORKSHOP, DART 2022, HELD IN CONJUNCTION WITH MICCAI 2022, SINGAPORE, SEPTEMBER 22, 2022, PROCEEDINGS 2022; 13542:66-76. [PMID: 36507899 PMCID: PMC9728134 DOI: 10.1007/978-3-031-16852-9_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Uniting three self-supervised learning (SSL) ingredients (discriminative, restorative, and adversarial learning) enables collaborative representation learning and yields three transferable components: a discriminative encoder, a restorative decoder, and an adversary encoder. To leverage this advantage, we have redesigned five prominent SSL methods, including Rotation, Jigsaw, Rubik's Cube, Deep Clustering, and TransVW, and formulated each in a United framework for 3D medical imaging. However, such a United framework increases model complexity and pretraining difficulty. To overcome this difficulty, we develop a stepwise incremental pretraining strategy: a discriminative encoder is first trained via discriminative learning; the pretrained discriminative encoder is then attached to a restorative decoder, forming a skip-connected encoder-decoder, for joint discriminative and restorative learning; and finally, the pretrained encoder-decoder is associated with an adversarial encoder for full discriminative, restorative, and adversarial learning. Our extensive experiments demonstrate that the stepwise incremental pretraining stabilizes the training of United models, resulting in significant performance gains and annotation cost reduction via transfer learning for five target tasks, encompassing both classification and segmentation, across diseases, organs, datasets, and modalities. This performance is attributed to the synergy of the three SSL ingredients in our United framework unleashed via stepwise incremental pretraining. All codes and pretrained models are available at GitHub.com/JLiangLab/StepwisePretraining.
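The three-stage schedule reads naturally as pseudocode; the following hedged sketch shows only its control flow with toy modules and placeholder losses (a quadratic "discriminative" term, plain reconstruction, and a generator-side adversarial term), not the released StepwisePretraining code.

```python
# Hedged sketch of the three-stage schedule; modules and losses are placeholders.
import torch
import torch.nn as nn

encoder = nn.Sequential(nn.Conv2d(1, 32, 3, 2, 1), nn.ReLU(),
                        nn.Conv2d(32, 64, 3, 2, 1), nn.ReLU(),
                        nn.AdaptiveAvgPool2d(1), nn.Flatten())
proj = nn.Linear(64, 128)                                       # discriminative head
decoder = nn.Sequential(nn.Linear(64, 28 * 28), nn.Unflatten(1, (1, 28, 28)))
adversary = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 1))  # real/fake critic

def disc_loss(z):      return z.pow(2).mean()                   # placeholder objective
def rest_loss(x, xr):  return nn.functional.mse_loss(xr, x)
def adv_loss(logits):                                           # generator-side term only
    return nn.functional.binary_cross_entropy_with_logits(logits, torch.ones_like(logits))

x = torch.randn(8, 1, 28, 28)                                   # toy unlabeled batch

# Stage 1: discriminative learning only.
opt1 = torch.optim.Adam(list(encoder.parameters()) + list(proj.parameters()), lr=1e-3)
loss = disc_loss(proj(encoder(x)))
opt1.zero_grad(); loss.backward(); opt1.step()

# Stage 2: attach the restorative decoder; joint discriminative + restorative learning.
opt2 = torch.optim.Adam(list(encoder.parameters()) + list(decoder.parameters()), lr=1e-3)
z = encoder(x)
loss = disc_loss(proj(z)) + rest_loss(x, decoder(z))
opt2.zero_grad(); loss.backward(); opt2.step()

# Stage 3: add the adversary; full discriminative + restorative + adversarial learning
# (the critic's own update is omitted for brevity).
opt3 = torch.optim.Adam(list(encoder.parameters()) + list(decoder.parameters()), lr=1e-3)
z = encoder(x); xr = decoder(z)
loss = disc_loss(proj(z)) + rest_loss(x, xr) + adv_loss(adversary(xr))
opt3.zero_grad(); loss.backward(); opt3.step()
```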
Collapse
Affiliation(s)
- Zuwei Guo
- Arizona State University, Tempe, AZ 85281, USA
| | | | | | | |
Collapse
|
39
|
Taher MRH, Haghighi F, Gotway MB, Liang J. CAiD: Context-Aware Instance Discrimination for Self-supervised Learning in Medical Imaging. PROCEEDINGS OF MACHINE LEARNING RESEARCH 2022; 172:535-551. [PMID: 36579134 PMCID: PMC9793869] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Recently, self-supervised instance discrimination methods have achieved significant success in learning visual representations from unlabeled photographic images. However, given the marked differences between photographic and medical images, the efficacy of instance-based objectives, focusing on learning the most discriminative global features in the image (e.g., the wheels of a bicycle), remains unknown in medical imaging. Our preliminary analysis showed that high global similarity of medical images in terms of anatomy hampers instance discrimination methods for capturing a set of distinct features, negatively impacting their performance on medical downstream tasks. To alleviate this limitation, we have developed a simple yet effective self-supervised framework, called Context-Aware instance Discrimination (CAiD). CAiD aims to improve instance discrimination learning by providing finer and more discriminative information encoded from a diverse local context of unlabeled medical images. We conduct a systematic analysis to investigate the utility of the learned features from a three-pronged perspective: (i) generalizability and transferability, (ii) separability in the embedding space, and (iii) reusability. Our extensive experiments demonstrate that CAiD (1) enriches representations learned from existing instance discrimination methods; (2) delivers more discriminative features by adequately capturing finer contextual information from individual medical images; and (3) improves reusability of low/mid-level features compared to standard instance discrimination methods. As open science, all codes and pre-trained models are available on our GitHub page: https://github.com/JLiangLab/CAiD.
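One way to picture the idea of enriching instance discrimination with local context is the hedged sketch below, which adds a reconstruction term to an InfoNCE loss over two augmented views; the toy encoder and decoder, the image size, and the weighting factor alpha are illustrative assumptions rather than the CAiD architecture.

```python
# Hedged sketch (not the CAiD architecture): couple instance discrimination
# (InfoNCE over two augmented views) with a restorative term that forces the
# embedding to retain finer local context.
import torch
import torch.nn as nn
import torch.nn.functional as F

encoder = nn.Sequential(nn.Conv2d(1, 32, 3, 2, 1), nn.ReLU(),
                        nn.Conv2d(32, 64, 3, 2, 1), nn.ReLU(),
                        nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(64, 128))
decoder = nn.Sequential(nn.Linear(128, 32 * 32), nn.Unflatten(1, (1, 32, 32)))

def info_nce(z1, z2, temperature=0.2):
    z1, z2 = F.normalize(z1, dim=1), F.normalize(z2, dim=1)
    logits = z1 @ z2.t() / temperature          # (B, B) cross-view similarities
    labels = torch.arange(z1.size(0))           # matching index = positive pair
    return F.cross_entropy(logits, labels)

def context_aware_loss(view1, view2, alpha=1.0):
    """view1, view2: (B, 1, 32, 32) two augmentations of the same images."""
    z1, z2 = encoder(view1), encoder(view2)
    discrimination = info_nce(z1, z2)
    restoration = F.mse_loss(decoder(z1), view1)    # keep finer local context
    return discrimination + alpha * restoration

# Toy usage with random "views".
loss = context_aware_loss(torch.randn(8, 1, 32, 32), torch.randn(8, 1, 32, 32))
```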
Collapse
|
40
|
Haghighi F, Taher MRH, Gotway MB, Liang J. DiRA: Discriminative, Restorative, and Adversarial Learning for Self-supervised Medical Image Analysis. PROCEEDINGS. IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION 2022; 2022:20792-20802. [PMID: 36313959 PMCID: PMC9615927 DOI: 10.1109/cvpr52688.2022.02016] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Discriminative learning, restorative learning, and adversarial learning have proven beneficial for self-supervised learning schemes in computer vision and medical imaging. Existing efforts, however, omit their synergistic effects on each other in a ternary setup, which, we envision, can significantly benefit deep semantic representation learning. To realize this vision, we have developed DiRA, the first framework that unites discriminative, restorative, and adversarial learning in a unified manner to collaboratively glean complementary visual information from unlabeled medical images for fine-grained semantic representation learning. Our extensive experiments demonstrate that DiRA (1) encourages collaborative learning among three learning ingredients, resulting in more generalizable representation across organs, diseases, and modalities; (2) outperforms fully supervised ImageNet models and increases robustness in small data regimes, reducing annotation cost across multiple medical imaging applications; (3) learns fine-grained semantic representation, facilitating accurate lesion localization with only image-level annotation; and (4) enhances state-of-the-art restorative approaches, revealing that DiRA is a general mechanism for united representation learning. All code and pretrained models are available at https://github.com/JLiangLab/DiRA.
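Read as a single objective, the ternary setup described in this abstract amounts to a weighted combination of the three learning ingredients; the display below is a generic, hedged summary with placeholder weights, not the paper's exact notation.

```latex
\mathcal{L}_{\mathrm{united}}
  = \lambda_{d}\,\mathcal{L}_{\mathrm{discriminative}}
  + \lambda_{r}\,\mathcal{L}_{\mathrm{restorative}}
  + \lambda_{a}\,\mathcal{L}_{\mathrm{adversarial}},
\qquad \lambda_{d},\lambda_{r},\lambda_{a}\ge 0.
```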
Collapse
|
41
|
Torrent TT, Matos EEDS, Belcavello F, Viridiano M, Gamonal MA, da Costa AD, Marim MC. Representing Context in FrameNet: A Multidimensional, Multimodal Approach. Front Psychol 2022; 13:838441. [PMID: 35444591 PMCID: PMC9014903 DOI: 10.3389/fpsyg.2022.838441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Accepted: 01/31/2022] [Indexed: 11/13/2022] Open
Abstract
Frame Semantics includes context as a central aspect of the theory. Frames themselves can be regarded as a representation of the immediate context against which meaning is to be construed. Moreover, the notion of frame invocation includes context as one possible source of information comprehenders use to construe meaning. As the original implementation of Frame Semantics, Berkeley FrameNet is capable of providing computational representations of some aspects of context, but not all of them. In this article, we present FrameNet Brasil: a framenet enriched with qualia relations and capable of taking other semiotic modes as input data, namely pictures and videos. We claim that such an enriched model is capable of addressing other types of contextual information in a framenet, namely sentence-level cotext and commonsense knowledge. We demonstrate how the FrameNet Brasil software infrastructure addresses contextual information in both database construction and corpora annotation. We present the guidelines for the construction of two multimodal datasets whose annotations represent contextual information and also report on two experiments: (i) the identification of frame-evoking lexical units in sentences and (ii) a methodology for domain adaptation in Neural Machine Translation that leverages frames and qualia for representing sentence-level context. Experimental results emphasize the importance of computationally representing contextual information in a principled structured fashion as opposed to trying to derive it from the manipulation of linguistic form alone.
Collapse
Affiliation(s)
- Tiago Timponi Torrent
- FrameNet Brasil, Graduate Program in Linguistics, Faculty of Letters, Federal University of Juiz de Fora, Juiz de Fora, Brazil
| | - Ely Edison da Silva Matos
- FrameNet Brasil, Graduate Program in Linguistics, Faculty of Letters, Federal University of Juiz de Fora, Juiz de Fora, Brazil
| | - Frederico Belcavello
- FrameNet Brasil, Graduate Program in Linguistics, Faculty of Letters, Federal University of Juiz de Fora, Juiz de Fora, Brazil
| | - Marcelo Viridiano
- FrameNet Brasil, Graduate Program in Linguistics, Faculty of Letters, Federal University of Juiz de Fora, Juiz de Fora, Brazil
| | - Maucha Andrade Gamonal
- FrameNet Brasil, Graduate Program in Linguistics, Faculty of Letters, Federal University of Juiz de Fora, Juiz de Fora, Brazil.,Laboratório Experimental de Tradução, Graduate Program in Linguistics, Faculty of Letters, Federal University of Minas Gerais, Belo Horizonte, Brazil
| | - Alexandre Diniz da Costa
- FrameNet Brasil, Graduate Program in Linguistics, Faculty of Letters, Federal University of Juiz de Fora, Juiz de Fora, Brazil
| | - Mateus Coutinho Marim
- FrameNet Brasil, Graduate Program in Linguistics, Faculty of Letters, Federal University of Juiz de Fora, Juiz de Fora, Brazil
| |
Collapse
|
42
|
Liu Z, Lu H, Pan X, Xu M, Lan R, Luo X. Diagnosis of Alzheimer’s disease via an attention-based multi-scale convolutional neural network. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2021.107942] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
|
43
|
Generalized radiograph representation learning via cross-supervision between images and free-text radiology reports. NAT MACH INTELL 2022. [DOI: 10.1038/s42256-021-00425-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
44
|
Tajbakhsh N, Roth H, Terzopoulos D, Liang J. Guest Editorial Annotation-Efficient Deep Learning: The Holy Grail of Medical Imaging. IEEE TRANSACTIONS ON MEDICAL IMAGING 2021; 40:2526-2533. [PMID: 34795461 PMCID: PMC8594751 DOI: 10.1109/tmi.2021.3089292] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Affiliation(s)
| | | | - Demetri Terzopoulos
- University of California, Los Angeles, and VoxelCloud, Inc., Los Angeles, CA, USA
| | | |
Collapse
|
45
|
Hosseinzadeh Taher MR, Haghighi F, Feng R, Gotway MB, Liang J. A Systematic Benchmarking Analysis of Transfer Learning for Medical Image Analysis. DOMAIN ADAPTATION AND REPRESENTATION TRANSFER, AND AFFORDABLE HEALTHCARE AND AI FOR RESOURCE DIVERSE GLOBAL HEALTH : THIRD MICCAI WORKSHOP, DART 2021 AND FIRST MICCAI WORKSHOP, FAIR 2021 : HELD IN CONJUNCTION WITH MICCAI 2021 : STRASBOU... 2021; 12968:3-13. [PMID: 35713581 PMCID: PMC9197759 DOI: 10.1007/978-3-030-87722-4_1] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
Transfer learning from supervised ImageNet models has been frequently used in medical image analysis. Yet, no large-scale evaluation has been conducted to benchmark the efficacy of newly-developed pre-training techniques for medical image analysis, leaving several important questions unanswered. As the first step in this direction, we conduct a systematic study on the transferability of models pre-trained on iNat2021, the most recent large-scale fine-grained dataset, and 14 top self-supervised ImageNet models on 7 diverse medical tasks in comparison with the supervised ImageNet model. Furthermore, we present a practical approach to bridge the domain gap between natural and medical images by continually (pre-)training supervised ImageNet models on medical images. Our comprehensive evaluation yields new insights: (1) pre-trained models on fine-grained data yield distinctive local representations that are more suitable for medical segmentation tasks, (2) self-supervised ImageNet models learn holistic features more effectively than supervised ImageNet models, and (3) continual pre-training can bridge the domain gap between natural and medical images. We hope that this large-scale open evaluation of transfer learning can direct the future research of deep learning for medical imaging. As open science, all codes and pre-trained models are available on our GitHub page https://github.com/JLiangLab/BenchmarkTransferLearning.
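The transfer protocol benchmarked in this study can be summarized, under assumptions, as: take a pre-trained backbone, swap its head for the target medical task, and fine-tune end to end. The snippet below sketches this with torchvision's resnet50 and a hypothetical 14-class multi-label chest X-ray task; the backbone choice, class count, optimizer, and loss are placeholders, not the paper's exact protocol.

```python
# Hedged sketch of the transfer protocol: pre-trained backbone, new task head,
# end-to-end fine-tuning. Backbone, task, optimizer, and loss are placeholders.
import torch
import torch.nn as nn
import torchvision

NUM_CLASSES = 14                                                # assumed target task
model = torchvision.models.resnet50(weights="DEFAULT")          # pre-trained init
model.fc = nn.Linear(model.fc.in_features, NUM_CLASSES)         # replace the head
optimizer = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)
criterion = nn.BCEWithLogitsLoss()                              # multi-label objective

def finetune_step(images, labels):
    """images: (B, 3, 224, 224); labels: (B, NUM_CLASSES) multi-hot targets."""
    loss = criterion(model(images), labels)
    optimizer.zero_grad(); loss.backward(); optimizer.step()
    return loss.item()
```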
Collapse
Affiliation(s)
| | | | - Ruibin Feng
- Stanford University, Stanford, California 94305, USA
| | | | | |
Collapse
|
46
|
Islam NU, Gehlot S, Zhou Z, Gotway MB, Liang J. Seeking an Optimal Approach for Computer-Aided Pulmonary Embolism Detection. MACHINE LEARNING IN MEDICAL IMAGING. MLMI (WORKSHOP) 2021; 12966:692-702. [PMID: 35695860 PMCID: PMC9184235 DOI: 10.1007/978-3-030-87589-3_71] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Pulmonary embolism (PE) represents a thrombus ("blood clot"), usually originating from a lower extremity vein, that travels to the blood vessels in the lung, causing vascular obstruction and, in some patients, death. This disorder is commonly diagnosed using CT pulmonary angiography (CTPA). Deep learning holds great promise for the computer-aided CTPA diagnosis (CAD) of PE. However, numerous competing methods exist for a given task in the deep learning literature, causing great confusion regarding the development of a CAD PE system. To address this confusion, we present a comprehensive analysis of competing deep learning methods applicable to PE diagnosis using CTPA at both the image and exam levels. At the image level, we compare convolutional neural networks (CNNs) with vision transformers, and contrast self-supervised learning (SSL) with supervised learning, followed by an evaluation of transfer learning compared with training from scratch. At the exam level, we focus on comparing conventional classification (CC) with multiple instance learning (MIL). Our extensive experiments consistently show: (1) transfer learning boosts performance despite differences between natural images and CT scans; (2) transfer learning with SSL surpasses its supervised counterparts; (3) CNNs outperform vision transformers, which otherwise show satisfactory performance; and (4) CC is, surprisingly, superior to MIL. Compared with the state of the art, our optimal approach provides an AUC gain of 0.2% and 1.05% at the image and exam levels, respectively.
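The exam-level comparison between CC and MIL can be pictured with the hedged sketch below: CC pools per-slice features and classifies the exam once, whereas MIL scores each slice and pools the scores. The feature dimension, mean/max pooling choices, and module names are illustrative assumptions, not the paper's implementation.

```python
# Hedged sketch contrasting two exam-level strategies: conventional
# classification (pool features, classify once) versus multiple instance
# learning (score each slice, pool the scores).
import torch
import torch.nn as nn

class ExamLevelCC(nn.Module):
    def __init__(self, feat_dim=512):
        super().__init__()
        self.classifier = nn.Linear(feat_dim, 1)
    def forward(self, slice_feats):                 # (B, S, D): S slices per exam
        pooled = slice_feats.mean(dim=1)            # exam-level feature
        return self.classifier(pooled).squeeze(-1)  # one exam-level logit

class ExamLevelMIL(nn.Module):
    def __init__(self, feat_dim=512):
        super().__init__()
        self.instance_scorer = nn.Linear(feat_dim, 1)
    def forward(self, slice_feats):                 # (B, S, D)
        scores = self.instance_scorer(slice_feats).squeeze(-1)   # per-slice logits
        return scores.max(dim=1).values             # max-pooled exam-level logit

# Toy usage: 2 exams, 40 slices each, 512-dim slice features.
feats = torch.randn(2, 40, 512)
cc_logit, mil_logit = ExamLevelCC()(feats), ExamLevelMIL()(feats)
```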
Collapse
Affiliation(s)
| | - Shiv Gehlot
- Arizona State University, Tempe, AZ 85281, USA
| | | | | | | |
Collapse
|