Acessibilidade / Reportar erro

Multi-Feature Classification of Breast Cancer Histopathology Images: An Experimental Investigation in Machine Learning and Deep Learning Paradigm

Abstract

The existing practice for Breast Cancer (BC) characterization includes histopathological analysis, which is tedious and time-consuming due to massive data analysis. Further, such techniques are subjected to inter-and intra-observer variability due to the non-availability of skilled pathologists, particularly in low resource settings. Thus, we propose a multi-feature classification technique for risk stratification of BC in Histopathology Images (HI) using machine learning strategies and a Long Short-Term Memory (LSTM) based deep learning approach. Experiments are performed on a publicly available HI database from which a total of 658 image features are extracted, while 192 relevant features are obtained after feature selection using genetic algorithm. The highest accuracy of 99.85% using 192 features under the 5-fold data division protocol is obtained with the LSTM approach. The proposed framework for analyzing HI using multiple grayscale and color features showed promising results and can be an effective tool in the histopathology laboratory.

Keywords:
Intelligent laboratory analysis; Breast cancer; Feature fusion; Machine learning; Deep learning

HIGHLIGHTS

• The proposed approach performs the classification of breast tumors in histopathology images.

• The proposed approach evaluates a multi-feature classifier for risk stratification.

• The performance of different classifiers is compared under different data division protocols.

• The highest classification accuracy of 99.85% after feature selection is reported.

Instituto de Tecnologia do Paraná - Tecpar Rua Prof. Algacyr Munhoz Mader, 3775 - CIC, 81350-010 Curitiba PR Brazil, Tel.: +55 41 3316-3052/3054, Fax: +55 41 3346-2872 - Curitiba - PR - Brazil
E-mail: babt@tecpar.br