Effect of Benign Biopsy Findings on an Artificial Intelligence-Based Cancer Detector in Screening Mammography: Retrospective Case-Control Study.

AI artificial intelligence benign biopsy breast cancer cancer screening detection system diagnostic mammography radiology screening

Journal

JMIR AI
ISSN: 2817-1705
Titre abrégé: JMIR AI
Pays: Canada
ID NLM: 9918645789006676

Informations de publication

Date de publication:
31 Aug 2023
Historique:
received: 12 04 2023
accepted: 03 08 2023
revised: 17 06 2023
medline: 14 6 2024
pubmed: 14 6 2024
entrez: 14 6 2024
Statut: epublish

Résumé

Artificial intelligence (AI)-based cancer detectors (CAD) for mammography are starting to be used for breast cancer screening in radiology departments. It is important to understand how AI CAD systems react to benign lesions, especially those that have been subjected to biopsy. Our goal was to corroborate the hypothesis that women with previous benign biopsy and cytology assessments would subsequently present increased AI CAD abnormality scores even though they remained healthy. This is a retrospective study applying a commercial AI CAD system (Insight MMG, version 1.1.4.3; Lunit Inc) to a cancer-enriched mammography screening data set of 10,889 women (median age 56, range 40-74 years). The AI CAD generated a continuous prediction score for tumor suspicion between 0.00 and 1.00, where 1.00 represented the highest level of suspicion. A binary read (flagged or not flagged) was defined on the basis of a predetermined cutoff threshold (0.40). The flagged median and proportion of AI scores were calculated for women who were healthy, those who had a benign biopsy finding, and those who were diagnosed with breast cancer. For women with a benign biopsy finding, the interval between mammography and the biopsy was used for stratification of AI scores. The effect of increasing age was examined using subgroup analysis and regression modeling. Of a total of 10,889 women, 234 had a benign biopsy finding before or after screening. The proportions of flagged healthy women were 3.5%, 11%, and 84% for healthy women without a benign biopsy finding, those with a benign biopsy finding, and women with breast cancer, respectively (P<.001). For the 8307 women with complete information, radiologist 1, radiologist 2, and the AI CAD system flagged 8.5%, 6.8%, and 8.5% of examinations of women who had a prior benign biopsy finding. The AI score correlated only with increasing age of the women in the cancer group (P=.01). Compared to healthy women without a biopsy, the examined AI CAD system flagged a much larger proportion of women who had or would have a benign biopsy finding based on a radiologist's decision. However, the flagging rate was not higher than that for radiologists. Further research should be focused on training the AI CAD system taking prior biopsy information into account.

Sections du résumé

BACKGROUND BACKGROUND
Artificial intelligence (AI)-based cancer detectors (CAD) for mammography are starting to be used for breast cancer screening in radiology departments. It is important to understand how AI CAD systems react to benign lesions, especially those that have been subjected to biopsy.
OBJECTIVE OBJECTIVE
Our goal was to corroborate the hypothesis that women with previous benign biopsy and cytology assessments would subsequently present increased AI CAD abnormality scores even though they remained healthy.
METHODS METHODS
This is a retrospective study applying a commercial AI CAD system (Insight MMG, version 1.1.4.3; Lunit Inc) to a cancer-enriched mammography screening data set of 10,889 women (median age 56, range 40-74 years). The AI CAD generated a continuous prediction score for tumor suspicion between 0.00 and 1.00, where 1.00 represented the highest level of suspicion. A binary read (flagged or not flagged) was defined on the basis of a predetermined cutoff threshold (0.40). The flagged median and proportion of AI scores were calculated for women who were healthy, those who had a benign biopsy finding, and those who were diagnosed with breast cancer. For women with a benign biopsy finding, the interval between mammography and the biopsy was used for stratification of AI scores. The effect of increasing age was examined using subgroup analysis and regression modeling.
RESULTS RESULTS
Of a total of 10,889 women, 234 had a benign biopsy finding before or after screening. The proportions of flagged healthy women were 3.5%, 11%, and 84% for healthy women without a benign biopsy finding, those with a benign biopsy finding, and women with breast cancer, respectively (P<.001). For the 8307 women with complete information, radiologist 1, radiologist 2, and the AI CAD system flagged 8.5%, 6.8%, and 8.5% of examinations of women who had a prior benign biopsy finding. The AI score correlated only with increasing age of the women in the cancer group (P=.01).
CONCLUSIONS CONCLUSIONS
Compared to healthy women without a biopsy, the examined AI CAD system flagged a much larger proportion of women who had or would have a benign biopsy finding based on a radiologist's decision. However, the flagging rate was not higher than that for radiologists. Further research should be focused on training the AI CAD system taking prior biopsy information into account.

Identifiants

pubmed: 38875554
pii: v2i1e48123
doi: 10.2196/48123
doi:

Types de publication

Journal Article

Langues

eng

Pagination

e48123

Informations de copyright

©Athanasios Zouzos, Aleksandra Milovanovic, Karin Dembrower, Fredrik Strand. Originally published in JMIR AI (https://ai.jmir.org), 31.08.2023.

Auteurs

Athanasios Zouzos (A)

Department of Oncology and Pathology, Karolinska Institute, Stockholm, Sweden.

Aleksandra Milovanovic (A)

Department of Oncology and Pathology, Karolinska Institute, Stockholm, Sweden.

Karin Dembrower (K)

Department of Oncology and Pathology, Karolinska Institute, Stockholm, Sweden.

Fredrik Strand (F)

Department of Oncology and Pathology, Karolinska Institute, Stockholm, Sweden.

Classifications MeSH