Session: PD27: Prostate Cancer: Localized: Surgical Therapy I
PD27-11: Comparison of Prostate Biopsy Grade Obtained by Human Pathologists Versus an Artificial Intelligence Algorithm for Predicting Biochemical Recurrence After Radical Prostatectomy
Introduction: Several artificial intelligence (AI) algorithms have been developed to streamline biopsy Gleason grading, but most have not been challenged with predicting oncologic outcomes. In this study, we leveraged a cohort of patients with biopsy-detected Grade Group (GG) 2 prostate cancer (PCa) and examined the association of biopsy grade by contemporary pathologist consensus grade or AI algorithm with time to biochemical recurrence (BCR) after radical prostatectomy (RP). Methods: We identified 286 patients originally diagnosed with GG2 PCa on biopsy who underwent RP at our institution from 2000 to 2014. All biopsies were re-graded by two expert genitourinary pathologists (pathologist 1 and 2). For cases with discrepant reads, a third expert genitourinary pathologist determined the consensus pathology read. GG was also obtained using a previously validated and published AI algorithm. Concordance between pathologist and AI GG was quantified using the quadratic kappa. BCR was defined as two consecutive PSAs =0.2 ng/mL. Kaplan-Meier BCR free survival estimates by GG were compared using the log-rank test. Results: After RP median follow-up was 4 years (range 1-14). To date, 16% of the men have had BCR and median time to BCR was 2 years (range 1-10). Biopsy grading between pathologist 1 and 2 generated a kappa of 0.17. Grading between the consensus pathology read and the AI algorithm generated a kappa of 0.33. Grading by consensus pathology read was associated with time to BCR-free survival (log-rank p=0.003) as was the grading by the AI algorithm (log-rank p=0.004; Figure). Conclusions: The AI algorithm Gleason grading demonstrated poor agreement with the consensus contemporary pathology read, but the agreement was also poor between the two pathologists. Yet, our cohort was only comprised of men originally diagnosed with GG2 PCa, a GG usually associated with poor inter-pathologist agreement. Remarkably, both the consensus pathology and the AI algorithm grades on the biopsy stratified patients well for subsequent BCR after RP, suggesting that AI algorithms could be developed to stratify for oncologic outcomes instead of attempting to replicate somewhat subjective human grading. SOURCE OF Funding: Research funding from Deep Bio