To read this content please select one of the options below:

A new approach for histological classification of breast cancer using deep hybrid heterogenous ensemble

Hasnae Zerouaoui (MSDA, Mohammed VI Polytechnic University, Ben Guerir, Morocco)
Ali Idri (Modeling, Simulation and Data Analysis, Mohammed VI Polytechnic University, Ben Guerir, Morocco and Software Project Management Research Team, ENSIAS, Mohammed V University, Rabat, Morocco)
Omar El Alaoui (UM5R ENSIAS, Rabat, Morocco)

Data Technologies and Applications

ISSN: 2514-9288

Article publication date: 18 October 2022

Issue publication date: 25 April 2023

129

Abstract

Purpose

Hundreds of thousands of deaths each year in the world are caused by breast cancer (BC). An early-stage diagnosis of this disease can positively reduce the morbidity and mortality rate by helping to select the most appropriate treatment options, especially by using histological BC images for the diagnosis.

Design/methodology/approach

The present study proposes and evaluates a novel approach which consists of 24 deep hybrid heterogenous ensembles that combine the strength of seven deep learning techniques (DenseNet 201, Inception V3, VGG16, VGG19, Inception-ResNet-V3, MobileNet V2 and ResNet 50) for feature extraction and four well-known classifiers (multi-layer perceptron, support vector machines, K-nearest neighbors and decision tree) by means of hard and weighted voting combination methods for histological classification of BC medical image. Furthermore, the best deep hybrid heterogenous ensembles were compared to the deep stacked ensembles to determine the best strategy to design the deep ensemble methods. The empirical evaluations used four classification performance criteria (accuracy, sensitivity, precision and F1-score), fivefold cross-validation, Scott–Knott (SK) statistical test and Borda count voting method. All empirical evaluations were assessed using four performance measures, including accuracy, precision, recall and F1-score, and were over the histological BreakHis public dataset with four magnification factors (40×, 100×, 200× and 400×). SK statistical test and Borda count were also used to cluster the designed techniques and rank the techniques belonging to the best SK cluster, respectively.

Findings

Results showed that the deep hybrid heterogenous ensembles outperformed both their singles and the deep stacked ensembles and reached the accuracy values of 96.3, 95.6, 96.3 and 94 per cent across the four magnification factors 40×, 100×, 200× and 400×, respectively.

Originality/value

The proposed deep hybrid heterogenous ensembles can be applied for the BC diagnosis to assist pathologists in reducing the missed diagnoses and proposing adequate treatments for the patients.

Keywords

Acknowledgements

This work was conducted under the research project “Machine Learning based Breast Cancer Diagnosis and Treatment”, 2020–2023. The authors would like to thank the Moroccan Ministry of Higher Education and Scientific Research, Digital Development Agency (ADD), CNRST and UM6P for their support.

Funding: This study was funded by Mohammed VI polytechnic university at Ben Guerir Morocco.

Conflicts of interest/competing interests: The authors report no conflicts of interest.

Citation

Zerouaoui, H., Idri, A. and El Alaoui, O. (2023), "A new approach for histological classification of breast cancer using deep hybrid heterogenous ensemble", Data Technologies and Applications, Vol. 57 No. 2, pp. 245-278. https://doi.org/10.1108/DTA-05-2022-0210

Publisher

:

Emerald Publishing Limited

Copyright © 2022, Emerald Publishing Limited

Related articles