To read this content please select one of the options below:

A hybrid learning method for distinguishing lung adenocarcinoma and squamous cell carcinoma

Anil Kumar Swain (Computer Science and Engineering, National Institute of Technology Meghalaya, Shillong, India) (School of Computer Engineering, KIIT University, Bhubaneswar, India)
Aleena Swetapadma (School of Computer Engineering, KIIT University, Bhubaneswar, India)
Jitendra Kumar Rout (Computer Science and Engineering, National Institute of Technology Raipur, Raipur, India)
Bunil Kumar Balabantaray (Computer Science and Engineering, National Institute of Technology Meghalaya, Shillong, India)

Data Technologies and Applications

ISSN: 2514-9288

Article publication date: 19 May 2023

Issue publication date: 29 January 2024

66

Abstract

Purpose

The objective of the proposed work is to identify the most commonly occurring non–small cell carcinoma types, such as adenocarcinoma and squamous cell carcinoma, within the human population. Another objective of the work is to reduce the false positive rate during the classification.

Design/methodology/approach

In this work, a hybrid method using convolutional neural networks (CNNs), extreme gradient boosting (XGBoost) and long-short-term memory networks (LSTMs) has been proposed to distinguish between lung adenocarcinoma and squamous cell carcinoma. To extract features from non–small cell lung carcinoma images, a three-layer convolution and three-layer max-pooling-based CNN is used. A few important features have been selected from the extracted features using the XGBoost algorithm as the optimal feature. Finally, LSTM has been used for the classification of carcinoma types. The accuracy of the proposed method is 99.57 per cent, and the false positive rate is 0.427 per cent.

Findings

The proposed CNN–XGBoost–LSTM hybrid method has significantly improved the results in distinguishing between adenocarcinoma and squamous cell carcinoma. The importance of the method can be outlined as follows: It has a very low false positive rate of 0.427 per cent. It has very high accuracy, i.e. 99.57 per cent. CNN-based features are providing accurate results in classifying lung carcinoma. It has the potential to serve as an assisting aid for doctors.

Practical implications

It can be used by doctors as a secondary tool for the analysis of non–small cell lung cancers.

Social implications

It can help rural doctors by sending the patients to specialized doctors for more analysis of lung cancer.

Originality/value

In this work, a hybrid method using CNN, XGBoost and LSTM has been proposed to distinguish between lung adenocarcinoma and squamous cell carcinoma. A three-layer convolution and three-layer max-pooling-based CNN is used to extract features from the non–small cell lung carcinoma images. A few important features have been selected from the extracted features using the XGBoost algorithm as the optimal feature. Finally, LSTM has been used for the classification of carcinoma types.

Keywords

Acknowledgements

Funding: There is no funding for the work.

Competing Interests: Authors do not have any competing interest.

Citation

Swain, A.K., Swetapadma, A., Rout, J.K. and Balabantaray, B.K. (2024), "A hybrid learning method for distinguishing lung adenocarcinoma and squamous cell carcinoma", Data Technologies and Applications, Vol. 58 No. 1, pp. 113-131. https://doi.org/10.1108/DTA-10-2022-0384

Publisher

:

Emerald Publishing Limited

Copyright © 2023, Emerald Publishing Limited

Related articles