IoT malware detection architecture using a novel channel boosted and squeezed CNN

Asam, Muhammad; Khan, Saddam Hussain; Akbar, Altaf; Bibi, Sameena; Jamal, Tauseef; Khan, Asifullah; Ghafoor, Usman; Bhutta, Muhammad Raheel

doi:10.1038/s41598-022-18936-9

Download PDF

Article
Open access
Published: 15 September 2022

IoT malware detection architecture using a novel channel boosted and squeezed CNN

Muhammad Asam^1,2,
Saddam Hussain Khan^1,2,3,
Altaf Akbar⁴,
Sameena Bibi⁵,
Tauseef Jamal^6,7,
Asifullah Khan^1,2,6,7,
Usman Ghafoor^8,9 &
…
Muhammad Raheel Bhutta¹⁰

Scientific Reports volume 12, Article number: 15498 (2022) Cite this article

4359 Accesses
33 Citations
6 Altmetric
Metrics details

Subjects

Abstract

Interaction between devices, people, and the Internet has given birth to a new digital communication model, the internet of things (IoT). The integration of smart devices to constitute a network introduces many security challenges. These connected devices have created a security blind spot, where cybercriminals can easily launch attacks to compromise the devices using malware proliferation techniques. Therefore, malware detection is a lifeline for securing IoT devices against cyberattacks. This study addresses the challenge of malware detection in IoT devices by proposing a new CNN-based IoT malware detection architecture (iMDA). The proposed iMDA is modular in design that incorporates multiple feature learning schemes in blocks including (1) edge exploration and smoothing, (2) multi-path dilated convolutional operations, and (3) channel squeezing and boosting in CNN to learn a diverse set of features. The local structural variations within malware classes are learned by Edge and smoothing operations implemented in the split-transform-merge (STM) block. The multi-path dilated convolutional operation is used to recognize the global structure of malware patterns. At the same time, channel squeezing and merging helped to regulate complexity and get diverse feature maps. The performance of the proposed iMDA is evaluated on a benchmark IoT dataset and compared with several state-of-the CNN architectures. The proposed iMDA shows promising malware detection capacity by achieving accuracy: 97.93%, F1-Score: 0.9394, precision: 0.9864, MCC: 0. 8796, recall: 0.8873, AUC-PR: 0.9689 and AUC-ROC: 0.9938. The strong discrimination capacity suggests that iMDA may be extended for the android-based malware detection and IoT Elf files compositely in the future.

Deep learning hybridization for improved malware detection in smart Internet of Things

Article Open access 03 April 2024

Image copy-move forgery detection and localization based on super-BPD segmentation and DCNN

Article Open access 02 September 2022

An efficient intrusion detection model based on convolutional spiking neural network

Article Open access 25 March 2024

Introduction

The concept of transforming real-world objects into virtual objects emerged as the Internet of Things (IoT). Under this concept, intelligent objects and devices can share data and resources according to the situation and environment¹. This web of interconnected devices plays a vital role in our daily lives, ranging from health, smart homes, education, and, especially, industry. Masses are becoming familiar with the deployment of these devices in the field of agriculture, for soil condition monitoring², healthcare and e-health applications^3,4,5, and military domains⁶, as well. Deployments of these gadgets range from operational areas to critical infrastructure services. Industry 4.0 exploited this concept to build the link between the supply chain, industrial production, and end-users⁷. The IoT ecosystem used in industry, Industrial IoT (IIoT), undoubtedly contributes to the productivity and the quality of the industrial infrastructures.

IoTs lack secure design rules; hence, these have become an accessible playground for cybercriminals⁸. IoT devices are resource-constrained. These devices are usually installed with a default username password. Due to the embedded nature of the IoT devices, they are not patched regularly⁹. Network vulnerabilities for communicating with these devices can also be exploited easily at IoT touchpoints. The security protocol cannot be uniformly implemented on all the devices. The manufacturing of the devices does not conform to some consistent standards. These security challenges are depicted in Fig. 1.

There are many heterogenic device structures and network protocols. They also possess a unique characteristic of processor heterogeneity¹⁰. So, the IoT industry lags unified security protocols for design and implementation. These weaknesses of IoT design enlarge the attack surface area and lead to security breaches. Cybercriminals utilize these attack surfaces for their illegal actions and exploit the vulnerabilities. Major cyber security concerns are host and Network Intrusions, malware attacks, compromised nodes, botnets, rootkits, ransomware, and DDoS. Therefore, a robust mechanism for detecting such activities is needed to quickly detect and mitigate these digital security exploits.

Research on this security aspect of IoTs has attracted increased academic, industrial, and state-level attention. Several research efforts have discovered potential cyber threats and provided countermeasures against cyberattacks. Cyber security experts believe most cyber exploits are carried out through malware attacks. Many research studies in the literature have attempted this challenge of malware detection. Static, dynamic, hybrid, and image-based malware analysis comes under this challenge's broad categories¹¹.

Machine learning techniques have been extensively used for malware detection as they are more robust and give promising performance^12,13,14. Anti-malware tools have achieved improved performance with the help of machine learning tools. Several machine learning algorithms have been employed for mining the vulnerabilities in the IoT firmware and IoT applications that can infect and corrupt the edge devices and the whole network of the connected devices. Recent machine learning advancements have proved their capabilities in detecting and classifying IoT malware¹⁵. Research studies for anti-malware applications have increased the inclination towards machine learning tools and techniques. Computational power improvement has also enhanced the performance of machine learning strategies for malware detection and classification. Application of the machine learning needs the features of the IoT malware to make their verdict.

As the malware databases are increased, deep learning techniques suited more pertinent for the detection and analysis. Recent research has been molded towards applying neural networks in the field of malware analysis. Neural networks, especially deep convolution neural networks (CNNs), have proven their competencies for feature extraction and feature identification in IoT malware. Deep CNNs build the malware detection systems by defining the discriminative features in IoT malware. Deep CNNs show enhanced performance as these models learn the complicated features of the IoT malware at different abstraction levels. Features learned in the lower layers are enriched in the upper lawyers. These features are extracted from the visual images of the problem domain.

The IoT Malware dataset exploited in the current study has not been addressed previously to the best of our knowledge. This study utilized the image representation of IoTs malware and benign files. It is observed that deep CNN has shown promising performance for the visual challenges¹⁶. We have proposed applying deep learning techniques for the malware detection challenge. The main contributions in the current study are described below:

A novel IoT Malware detection architecture (iMDA), using squeezing and boosting dilated CNN, is proposed for IoT Malware analysis using a new benchmark dataset.
The proposed iMDA incorporates the edge and smoothing, multi-path dilated convolutional, channel squeezing, and boosting operations in CNN. Edge and smoothing operations are employed within split-transform-merge (STM) blocks to extract local structure and minor contrast variation in the malware images.
STM blocks performed multi-path dilated convolutional operations, which helped to recognize the global structure of malware patterns. Additionally, channel squeezing and boosting are applied at different granular levels to get the reduced but prominent and diverse feature maps for capturing texture variations.
The proposed iMDA has shown significant performance compared with existing CNNs via TL in terms of standard performance metrics using MCC, F1-Score, AUC, accuracy, precision, and recall.

The rest of the paper is structured as follows: the next section specifies related work in IoT malware analysis. “Methodology” section explains the proposed novel malware detection methodology. “Experimental setup” section describes the experimental setup. “Results and discussion” section discusses the results of our work. The is described in “conclusion” section.

Related work

IoT Malware analysis is carried out using static, dynamic, and hybrid analysis techniques. Nataraj et al.¹⁷ were the first to perform the malware analysis based on greyscale images in 2011. Malware visual images are created by transcribing the eight-bit code value of the executable files to the corresponding greyscale value. Image texture features are extracted from these images¹⁸. The idea of texture-based analysis for IoT malware is emerging in context with deep learning. Evanson et al.¹⁹ proposed an approach for malware analysis using texture images of malware files and machine learning in IoTPOT²⁰ for Bashlite and Mirai. They came up with the Haralick image texture features from the grey-level co-occurrence matrix and used machine learning classifiers. Carrilo et al.²¹ explored the malware forensic and reverse engineering capabilities for malware characterization. They first used machine learning to detect Linux-based system malware of IoT. They also discovered new malware detection by using clustering techniques. They exploited the dataset provided by E. Cozzi et al.²². Ganesh et al.²³ exploited machine learning capabilities to detect Mirai botnet attacks in IoTs. They applied ANN to evaluate their approach to the N-BaIoT dataset. Shudong Li et al.²⁴ used ensemble learning for mining the malicious code in the cloud computing environment. Bendiab et al.²⁵ applied deep learning for malware analysis traffic IoT. They applied ResNet50 for the experimental verification of their concept using a 1000 network (pcap) file.

Kyushu et al.²⁶ proposed a lightweight approach for IoT malware detection. They targeted the DDoS malware for their study and extracted the malware images from malware binaries in IoTPOT²⁰. Their experimental setup showed performance for detecting the DDoS malware and good-ware. Ren et al.²⁷ gave an end-to-end malware detection mechanism for Android IoT devices. They collected 8000 benign and 8000 malicious APK files from the Google Play store and VirusShare, respectively. They used the significance of deep learning for the evaluation of their concept. Hussain et al.²⁸ used application intent along with a supervised learning-based approach for the intelligent identification of Android-based malware. Naeem et al.²⁹ detected the malware in Industrial IoT by proposing deep CNN-based traffic, behavior, and log databases analysis. They utilized the color images of the targeted malware for detection in the Leopard Mobile dataset.

Shafiq et al.³⁰ used the Bot-IoT dataset for the correct malware feature selection and showed that their proposed method reached the accuracy of 965 for accurate detection of IoT malware over the network. They used the bijective soft set selection approach³¹ for the effective ML algorithm selection for the Bot-IoT network traffic dataset. They also used wrapper-based feature filtering and selection techniques³².

However, the evaluation of the reported work is presented in Accuracy and Precision. Practically, malware datasets are imbalanced. Therefore, other evaluation metrics must be considered. In this regard, our proposed research work exploited the benchmark Kaggle IoT dataset. Performance evaluation metrics F1-Score, MCC, AUC-PR, and AUC-ROC are also evaluated, along with Accuracy and Precision. The comprehensive workflow is presented in Fig. 2.

Methodology

Data augmentation

CNN models give better generalization upon large labeled data. Sometimes, the data points for the model training are not adequate. The data augmentation technique produces the artificial sample points by applying image transformation operations^33,34,35. These operation includes rotation (0–360 degree), scaling (0.5–1), shearing (− 0.5, + 0.5) and reflection (in left and right direction). The augmentation process helped improve generalization and made the dataset more robust for detecting IoT malware.

Proposed IoT malware detection architecture (iMDA)

This study proposes a novel image-based IoT malware detection architecture, iMDA. The suggested architecture discriminates the malware image sample from benign images. Spit-Transform-Merge (STM) is the main building block of this architecture. Three STM-based blocks concept is systematically implemented using region and edge detection operations.

The concept of channel boosting is imparted for high precision, improving the detection rate. Implementation details of the proposed architecture are highlighted in Fig. 3A. The performance of the proposed architecture is compared with the existing CNN models using TL-based implementation, as shown in Fig. 3B.

Proposed channel squeezing and boosting blocks

Deep CNN models are powerful and robust for their texture feature mining abilities. These models use convolutional operations for exploiting structural information in the image data. These operations are used to extract the dataset's features according to the target domain. This innovative feature of the deep CNN is utilized in the current architecture for IoT malware detection. This architecture is tailored by proposing a concatenated STM-based channel boosting approach³⁶, Fig. 4A.

The proposed STM block comprises a stack of four blocks, as shown in Fig. 4B. Details of the operations performed in each block are shown in Fig. 4C. Block B and Block C employ the same convolutions, batch normalization, and Relu operation with max and average pooling operation. Two convolution operations employed in each block are used to extract the feature information at the detailed and abstract levels, respectively. Block D and Block E employ the three-convolution operation. Two operations are used for the detailed features extraction, while one is used for the abstract level feature information extraction.

The STM block splits the input IoT malware image data into four branches to feed the four blocks of the STM. These blocks learn the region and edge-based informative features at a different level of abstraction from the input dataset. This learning helps to gather the highly discriminating features of the IoT malware at a high and detailed level. This info is imparted into different channels from each block. Information infused in other channels is concatenated at the exit of the STM block. This channel-boosted feature space is rich in diverse levels of textural feature information about the malware.

Equation (1) shows the convolution operation of filter f and input channel x of size p × q and A × B, respectively. The dimensions of the convolved output range from 1 to A − p + 1 and B—q + 1, respectively. ‘s’ denotes the dimension for average and max-pooling operations, shown in Eq. (2) and Eq. (3). In Eq. (4), CB, CC, CD, and CE show the channels extracted in the Block-B, -C, -D, and -E, respectively. The ‘merge’ function is used for concatenating these extracted channels. ud shows the number of neurons in Eq. (5).

$$x_{a,b} = \mathop \sum \limits_{i = 1}^{p} \mathop \sum \limits_{j = 1}^{q} x_{a + i - 1,b + j - 1} * f_{i,j}$$

(1)

$$\left( {xavg} \right)_{a,b} = \frac{1}{{s^{2} }}\mathop \sum \limits_{i = 1}^{s} \mathop \sum \limits_{j = 1}^{s} x_{a + i - 1,b + j - 1}$$

(2)

$$\left( {x\max } \right)_{a,b} = \max_{i = 1, \ldots s,j = 1, \ldots s} x_{a + i - 1, b + j - 1}$$

(3)

$$C_{Boost} = merge(C_{B} \left| {\left| {C_{C} } \right|} \right|C_{D} ||C_{E} )$$

(4)

$$x = \mathop \sum \limits_{d = 1}^{D} \mathop \sum \limits_{c = 1}^{C} u_{d} x_{c}$$

(5)

Implementation of customized existing CNNs

CNN architectures AlexNet, VGG16, inceptionv3, VGG19, Resnet50, Shufflenet, DenseNet201, Xception, and GoolgeNet are selected for a fair comparison with the proposed architecture. To achieve substantial performance, these models are initially trained on the ImageNet. These trained models are fine-tuned according to the target IoT malware dataset. Then these models are trained and tested using the target dataset using an 80-20 train-test split.

Experimental setup

Dataset

Linux operating system (OS) is becoming the dominant for IoT devices³⁷. Hence, this operating system has become a prospecting target for the malware developer community. Linux uses ELF file format for the deployment of applications or firmware. ELF files are cross plate form in nature and come in two binary formats, packed and unpacked binaries³⁸. The IOT_Malware dataset used in this study is the image representation of unpacked ELF binary files for malware and benign applications³⁹. This dataset is a standard Kaggle benchmark dataset for IoT malware detection challenges. There are 14,733 greyscale images of malware application ELF binaries and 2486 greyscale images of legitimate application ELF binaries. Visualization of benign and malicious files is shown in Fig. 5.

Implementation details

The implementation of the proposed iMDA is simulated using MATLAB-2021a on Nvidia® GTX 1060-T, GPU-enabled Dell Core I i5-7500. It took ~ 1–2 h to train a model on the said settings. One epoch took 7–10 min on Nvidia-Tesla K-80, while a single IoT malware image took approximately 2 s for detection.

Performance evaluation metrics

In the current study, we have employed performance evaluation metrics Accuracy, Precision, Recall, F1-Score, and MCC, as shown in Eqs. (6–10). The details of these performance metrics are described in Table 1. AUC-PR and AUC-ROC is also formulated for the proposed model. True Positive (TP), False Positive (FP), True Negative (TN), and False Negative (TN) are also calculated for the performance comparison.

$${\text{Acc}} = \frac{{{\text{Predicted}}\;{\text{malware}}\;{\text{samples }} + {\text{predicted}}\;{\text{benign}}\;{\text{samples}}}}{{{\text{Total}}\;{\text{samples}}}} \times 100$$

(6)

$${\text{MCC}} = \frac{{\left( {{\text{TP}}*{\text{TN}}} \right) - \left( {{\text{FP}}*{\text{FN}}} \right)}}{{\sqrt {\left( {{\text{TP}} + {\text{FP}}} \right)*\left( {{\text{FP}} + {\text{FN}}} \right)*\left( {{\text{TN}} + {\text{FP}}} \right)*\left( {{\text{TN}} + {\text{FN}}} \right)} }}$$

(7)

$${\text{P}} = \frac{{{\text{Predicted}}\;{\text{malware}}\;{\text{samples}}}}{{{\text{Predicted}}\;{\text{malware}}\;{\text{samples}} + {\text{ Incorrectly}}\;{\text{predicted}}\;{\text{Malware}}\;{\text{samples}}}} \times 100$$

(8)

$${\text{R}} = \frac{{{\text{Predicted}}\;{\text{malware}}\;{\text{samples}}}}{{{\text{Total}}\;{\text{malware}}\;{\text{samples}}}} \times 100$$

(9)

$${\text{F}}1 - {\text{Score}} = 2 \times \frac{{{\text{P}} \times {\text{R}}}}{{{\text{P}} + {\text{R}}}}$$

(10)

Table 1 Details of performance metrics.

Full size table

Several statistical measures are used as performance metrics for binary classification using four quadrants confusion matrix, i.e., TP, TN, FP, and FN. These metrics are selected according to the problem under investigation. There is no agreed-upon performance metrics for two or multi-class problem. The severity of the problem gives direction toward the selection of performance metrics. For an imbalanced dataset, some performance metrics show over-optimistic results. The Matthews correlation coefficient (MCC) is considered an attested statistical measure. It gives a high score for prediction only if all four quadrants are proportionally high for both positive and negative classes⁴⁰.

Results and discussion

Performance analysis of the proposed iMDA

The performance of the proposed iMDA is assessed on a standard IoT Malware dataset. F1-Score and MCC are considered standards for performance evaluation for an imbalanced dataset. F1-Score and MCC are used for assigning weightage to both the precision and sensitivity. The proposed architecture converged smoothly and reached the optimal value quickly, as shown in the training plots of the model. Misclassification occurred due to the intrinsic code similarity between the malicious and benign files. This similarity refers to the identical attack pattern in the malware images. This phenomenon occurred substantially with the implementation of other CNN models for malware detection. The iMDA is carried out using data augmentation techniques that improve the generalization and robustness of the trained model during testing.

Performance comparison with existing CNNs

The performance of the IoT malware detection architecture, iMDA, is also compared with existing models, AlexNet, VGG16, inceptionv3, VGG19, Resnet50, Shufflenet, DenseNet201, Xception, and GoolgeNet. Improved performance is shown in Table 2.

Table 2 Comparison of proposed framework with the existing CNN models.

Full size table

CNN models seek to find the identifiable textures and patterns in the image dataset. Our proposed malware detection architecture, iMDA, better explored textural variation in the malware images by systematically using region and boundary information through the Avg and Max-pooling operations. Channel split-transform-merge technique helped to extract the features at different granularity. Incorporating the concepts mentioned earlier in CNN improved the performance of the proposed architecture over the existing models. This study reported the significance of performance using deep learning architecture and quantified it using MCC, F1-Score, AUC-ROC, Accuracy, Precision, and Recall.

Detection capability of the proposed iMDA

The effectiveness of a malware detection framework is mainly assessed through precision rate and detection rate. Accurately detecting infused malware in a system is the first parameter to secure and control the spread. False alarms may be increased if only the precision of the proposed detection technique is improved. Decreasing the false alarm may degrade the detection rate. Keeping in mind this intuition, the proposed model leveraged the difference by comparing F1-Score, the harmonic mean of both parameters. Minimum and maximum performance gains against the existing CNN models are shown in Fig. 6. Results of the proposed iMDA are summarized in Table 3. A comparison of detection performance of our proposed model using F1-Score, Accuracy, and MCC with the existing model is shown in Fig. 7. In contrast, customized existing CNNs are compared and found that few models showed considerably good precision with poor recall.

Table 3 Performance of the proposed model.

Full size table

Feature space-based analsis of the proposed iMDA

The decision-making of the proposed architecture is better analyzed with the help of feature space visualization. Better discrimination factor of the model is associated with the prominent visual features. This distinction helps to improve the learning and lower the variance of the model. The feature space visualization for the principal components of our proposed iMDA is shown in Fig. 8. Channel squeezing and channel boosting used in STM blocks helped to capture the discriminative features of the IoT malware images at a multi-level. Additionally, STM extended the reduced prominent feature with the help of channel concatenation. The feature space visualization for the proposed iMDA showed an improvement in identifying the distinct and diverse features, hence improving the detection of the IoT malware files.

Auc-roc and auc-pr based analysis

The optimal performance of the model is also best understood by the ROC and PR plots, Fig. 9. These plots show the bifurcation capability of the models at an optimal threshold value. Our proposed iMDA showed high sensitivity along with a decreased false positive rate.

Conclusion

Analysis of malware in IoT is an early line of defense in securing this world of connected devices from cyberattacks. Malware analysis help to identify and designate the malicious code segments hidden in the legitimate files. This malicious code snippet is present according to the malware signature or obfuscated otherwise. The obfuscation techniques hide the malicious code lines from pattern/signature matching. These lines may be distributed or intermixed with the legitimate line over the complete file. The IoT-specific malicious patterns are detected in this study by developing iMDA, new CNN architecture: iMDA based on the ideas of dilated convolutional operations, channel squeezing, and boosting. The proposed architecture discriminates the malware from benign based on textural, contrast, and pattern variations. The proposed iMDA outperformed existing CNN and achieved the best result for Accuracy (97.33%), MCC (0.8796), F1-Score (93.94), AUC-ROC (0.9938), and AUC-PR (0.9689). In the future, the proposed iMDA may be extended for the android-based malware detection and IoT Elf files compositely.

Data availability

The datasets used and/or analyzed during the current study available from the corresponding author on reasonable request.

References

Madakam, S., Ramaswamy, R. & Tripathi, S. Internet of things (IoT): A literature review. J. Comput. Commun. 30, 164. https://doi.org/10.4236/jcc.2015.35021 (2015).
Article Google Scholar
Vuran, M. C., Salam, A., Wong, R. & Irmak, S. Internet of underground things in precision agriculture: Architecture and technology aspects. Ad Hoc Netw. 81, 160–173. https://doi.org/10.1016/j.adhoc.2018.07.017 (2018).
Article Google Scholar
Zafar, M. M. et al. Detection of tumour infiltrating lymphocytes in CD3 and CD8 stained histopathological images using a two-phase deep CNN. Photodiagnosis Photodyn. Ther. 37, 102676. https://doi.org/10.1016/j.pdpdt.2021.102676 (2022).
Article CAS PubMed Google Scholar
Islam, S. M. R., Kwak, D., Kabir, M. H., Hossain, M. & Kwak, K. S. The internet of things for health care: A comprehensive survey. IEEE Access 3, 678–708. https://doi.org/10.1109/ACCESS.2015.2437951 (2015).
Article Google Scholar
Zahoor, M. M., Qureshi, S. A., Khan, S. H. & Khan, A. A New Deep Hybrid Boosted and Ensemble Learning-based Brain Tumor Analysis using MRI (2022). https://arxiv.org/abs/2201.05373
Iyer, B. & Patil, N. IoT enabled tracking and monitoring sensor for military applications. Int. J. Syst. Assur. Eng. Manag. 9(6), 1294–1301. https://doi.org/10.1007/s13198-018-0727-8 (2018).
Article Google Scholar
Mikhalevich, I. F. & Trapeznikov, V. A. Critical infrastructure security: alignment of views. In 2019 Systems of Signals Generating Processing in the Field of on Board Communication. SOSG 2019 1–5 (2019). https://doi.org/10.1109/SOSG.2019.8706821
Shao, Z., Yuan, S. & Wang, Y. Adaptive online learning for IoT botnet detection. Inf. Sci. (Ny) 574, 84–95. https://doi.org/10.1016/j.ins.2021.05.076 (2021).
Article Google Scholar
Ngo, Q. D., Nguyen, H. T., Le, V. H. & Nguyen, D. H. A survey of IoT malware and detection methods based on static features. ICT Express 6(4), 280–286. https://doi.org/10.1016/j.icte.2020.04.005 (2020).
Article Google Scholar
Vignau, B., Khoury, R., Hallé, S. & Hamou-Lhadj, A. The evolution of IoT malwares, from 2008 to 2019: Survey, taxonomy, process simulator and perspectives. J. Syst. Archit. 116, 102143. https://doi.org/10.1016/j.sysarc.2021.102143 (2021).
Article Google Scholar
Asam, M. et al. Detection of exceptional malware variants using deep boosted feature spaces and machine learning. Appl. Sci. 11, 21. https://doi.org/10.3390/app112110464 (2021).
Article CAS Google Scholar
Or-Meir, O., Cohen, A., Elovici, Y., Rokach, L. & Nissim, N. Pay attention: Improving classification of PE malware using attention mechanisms based on system call analysis. Proc. Int. Jt. Conf. Neural Netw. https://doi.org/10.1109/IJCNN52387.2021.9533481 (2021).
Article Google Scholar
Asam, M., Hussain Khan, S., Jamal, T., Zahoora, U. & Khan, A. Malware Classification Using Deep Boosted Learning.
Rafique, M. F., Ali, M., Qureshi, A. S., Khan, A. & Mirza, A. M. Malware Classification using Deep Learning based Feature Extraction and Wrapper based Feature Selection Technique, Oct. 2019, Accessed: Jun. 20, 2021. [Online]. Available: http://arxiv.org/abs/1910.10958
Li, S., Zhang, Q., Wu, X., Han, W. & Tian, Z. Attribution classification method of APT malware in IoT using machine learning techniques. Secur. Commun. Netw. https://doi.org/10.1155/2021/9396141 (2021).
Article Google Scholar
Khan, A., Sohail, A., Zahoora, U. & Qureshi, A. S. A survey of the recent architectures of deep convolutional neural networks. Artif. Intell. Rev. 53(8), 5455–5516. https://doi.org/10.1007/s10462-020-09825-6 (2020).
Article Google Scholar
Nataraj, L., Karthikeyan, S., Jacob, G. & Manjunath, B. S. Malware images: Visualization and automatic classification. ACM Int. Conf. Proc. Ser. https://doi.org/10.1145/2016904.2016908 (2011).
Article Google Scholar
Ma, Y., Liu, S., Jiang, J., Chen, G. & Li, K. A Comprehensive Study on Learning-Based PE Malware Family Classification Methods, vol. 1, 1. Association for Computing Machinery (2021).
Karanja, E. M., Masupe, S. & Jeffrey, M. G. Analysis of internet of things malware using image texture features and machine learning techniques. Internet Things (Netherlands) 9, 100153. https://doi.org/10.1016/j.iot.2019.100153 (2020).
Article Google Scholar
Pa, Y. M., Suzuki, S., Yoshioka, K., Matsumoto, T., Kasama, T. & Rossow, C. IoTPOT: Analysing the rise of IoT compromises. In 9th USENIX Work. Offensive Technology WOOT 2015 (2015).
Carrillo-Mondéjar, J., Martínez, J. L. & Suarez-Tangil, G. Characterizing Linux-based malware: Findings and recent trends. Futur. Gen. Comput. Syst. 110, 267–281. https://doi.org/10.1016/j.future.2020.04.031 (2020).
Article Google Scholar
Cozzi, E., Graziano, M., Fratantonio, Y. & Balzarotti, D. Understanding Linux malware. In Proceedings of IEEE Symposium Secure Privacy, vol. 2018-May, 161–175 (2018). https://doi.org/10.1109/SP.2018.00054
Palla, T. G. & Tayeb S. Intelligent Mirai Malware Detection in IoT Devices. In 2021 IEEE World AI IoT Congress AIIoT 2021, 420–426 (2021). https://doi.org/10.1109/AIIoT52608.2021.9454215
Li, S. et al. Malicious mining code detection based on ensemble learning in cloud computing environment. Simul. Model. Pract. Theory 113, 102391. https://doi.org/10.1016/j.simpat.2021.102391 (2021).
Article Google Scholar
Bendiab, G., Shiaeles, S., Alruban, A. & Kolokotronis, N. IoT malware network traffic classification using visual representation and deep learning. In Proceedings of 2020 IEEE Conference on Network Softwarization Bridge Gap Between AI Network Softwarization, NetSoft 2020 444–449 (2020). https://doi.org/10.1109/NetSoft48620.2020.9165381.
Su, J. et al. Lightweight classification of IoT malware based on image recognition. Proc. Int. Comput. Softw. Appl. Conf. 2, 664–669. https://doi.org/10.1109/COMPSAC.2018.10315 (2018).
Article Google Scholar
Ren, Z., Wu, H., Ning, Q., Hussain, I. & Chen, B. End-to-end malware detection for android IoT devices using deep learning. Ad Hoc Netw. 101, 102098. https://doi.org/10.1016/j.adhoc.2020.102098 (2020).
Article Google Scholar
Hussain, S. J. et al. IMIAD: Intelligent malware identification for android platform. Int. Conf. Comput. Inf. Sci. ICCIS 2019, 1–6. https://doi.org/10.1109/ICCISci.2019.8716471 (2019).
Article Google Scholar
Naeem, H. et al. Malware detection in industrial internet of things based on hybrid image visualization and deep learning model. Ad Hoc Netw. 105, 102154. https://doi.org/10.1016/j.adhoc.2020.102154 (2020).
Article Google Scholar
Shafiq, M., Tian, Z., Bashir, A. K., Du, X. & Guizani, M. CorrAUC: A malicious Bot-IoT traffic detection method in IoT network using machine-learning techniques. IEEE Internet Things J. 8(5), 3242–3254. https://doi.org/10.1109/JIOT.2020.3002255 (2021).
Article Google Scholar
Shafiq, M., Tian, Z., Sun, Y., Du, X. & Guizani, M. Selection of effective machine learning algorithm and Bot-IoT attacks traffic identification for internet of things in smart city. Futur. Gen. Comput. Syst. 107, 433–442. https://doi.org/10.1016/j.future.2020.02.017 (2020).
Article Google Scholar
Shafiq, M., Tian, Z., Bashir, A. K., Du, X. & Guizani, M. IoT malicious traffic identification using wrapper-based feature selection mechanisms. Comput. Secur. https://doi.org/10.1016/j.cose.2020.101863 (2020).
Article Google Scholar
Shorten, C. & Khoshgoftaar, T. M. A survey on image data augmentation for deep learning. J. Big Data 6, 1. https://doi.org/10.1186/s40537-019-0197-0 (2019).
Article Google Scholar
Wang, J. & Perez, L. The Effectiveness of Data Augmentation in Image Classification using Deep Learning (2017).
Hussain Khan, S., Khan, A., Soo Lee, Y., Hassan, M. & Kyo Jeong, W. Segmentation of Shoulder Muscle MRI Using a New Region and Edge Based Deep Auto-Encoder.
Khan, S. H., Sohail, A., Khan, A. & Lee, Y.-S. COVID-19 detection in chest X-ray images using a new channel boosted CNN. Diagnostics 12(2), 267. https://doi.org/10.3390/diagnostics12020267 (2022).
Article CAS PubMed PubMed Central Google Scholar
E. Foundation. Iot-Comm-Adoption-Survey-2019 (2020).
Wan, T. L. et al., IoT-malware detection based on byte sequences of executable files. In 2020 15th Asia Joint Conference on Information Security (AsiaJCIS 2020) 143–150 (2020). https://doi.org/10.1109/AsiaJCIS50894.2020.00033
Elmasry, A. IOT_Malware, https://www.kaggle.com/anaselmasry/iot-malware (accessed Aug. 08, 2021).
Chicco, D. & Jurman, G. The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation 1–13 (2020).

Download references

Acknowledgements

This work was supported by the National Research Foundation (NRF) of Korea through the Auspices of the Ministry of Science and ICT, Republic of Korea, under Grant NRF-2020R1G1A1012741.

Author information

Authors and Affiliations

Pattern Recognition Lab, Department of Computer and Information Sciences, Pakistan Institute of Engineering and Applied Sciences, Nilore, Islamabad, 45650, Pakistan
Muhammad Asam, Saddam Hussain Khan & Asifullah Khan
PIEAS Artificial Intelligence Center (PAIC), Pakistan Institute of Engineering and Applied Sciences, Nilore, Islamabad, 45650, Pakistan
Muhammad Asam, Saddam Hussain Khan & Asifullah Khan
Department of Computer Systems Engineering, University of Engineering and Applied Sciences, Swat, 19060, Pakistan
Saddam Hussain Khan
Department of Economics, Management, Industrial Engineering and Tourism (DEGEIT), University of Aveiro, Aveiro, Portugal
Altaf Akbar
Department of Mathematics, Air University, Islamabad, 44000, Pakistan
Sameena Bibi
Department of Computer and Information Sciences, Pakistan Institute of Engineering and Applied Sciences, Nilore, Islamabad, 45650, Pakistan
Tauseef Jamal & Asifullah Khan
Center for Mathematical Sciences, Pakistan Institute of Engineering and Applied Sciences, Nilore, Islamabad, 45650, Pakistan
Tauseef Jamal & Asifullah Khan
Department of Mechanical Engineering, Institute of Space Technology, Islamabad, 44000, Pakistan
Usman Ghafoor
School of Mechanical Engineering, Pusan National University, Busan, 46241, South Korea
Usman Ghafoor
Department of Electrical and Computer Engineering, University of UTAH Asia Campus, Incheon, 21985, South Korea
Muhammad Raheel Bhutta

Authors

Muhammad Asam
View author publications
You can also search for this author in PubMed Google Scholar
Saddam Hussain Khan
View author publications
You can also search for this author in PubMed Google Scholar
Altaf Akbar
View author publications
You can also search for this author in PubMed Google Scholar
Sameena Bibi
View author publications
You can also search for this author in PubMed Google Scholar
Tauseef Jamal
View author publications
You can also search for this author in PubMed Google Scholar
Asifullah Khan
View author publications
You can also search for this author in PubMed Google Scholar
Usman Ghafoor
View author publications
You can also search for this author in PubMed Google Scholar
Muhammad Raheel Bhutta
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization, writing—original draft preparation, M.A.; Supervision, methodology, re-sources, Software, S.H.K, A.A., and S.B.; Formal analysis, visualization, T.J.; Supervision, writing—review and editing, A.K.; Validation; funding acquisition, data curation, U.G.; Project administration, validation, funding acquisition, M.R.B.; All authors have read and agreed to the published version of the manuscript.

Corresponding authors

Correspondence to Usman Ghafoor or Muhammad Raheel Bhutta.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Asam, M., Khan, S.H., Akbar, A. et al. IoT malware detection architecture using a novel channel boosted and squeezed CNN. Sci Rep 12, 15498 (2022). https://doi.org/10.1038/s41598-022-18936-9

Download citation

Received: 17 April 2022
Accepted: 19 August 2022
Published: 15 September 2022
DOI: https://doi.org/10.1038/s41598-022-18936-9

This article is cited by

Deep learning hybridization for improved malware detection in smart Internet of Things
- Abdulwahab Ali Almazroi
- Nasir Ayub
Scientific Reports (2024)
Malaria parasitic detection using a new Deep Boosted and Ensemble Learning framework
- Hafiz M. Asif
- Saddam Hussain Khan
- Amena Mahmoud
Complex & Intelligent Systems (2024)
COVID-19 infection analysis framework using novel boosted CNNs and radiological images
- Saddam Hussain Khan
- Tahani Jaser Alahmadi
- Hesham A. Sakr
Scientific Reports (2023)
Zero-day attack detection: a systematic literature review
- Rasheed Ahmad
- Izzat Alsmadi
- Lo’ai Tawalbeh
Artificial Intelligence Review (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Deep learning hybridization for improved malware detection in smart Internet of Things

Image copy-move forgery detection and localization based on super-BPD segmentation and DCNN

An efficient intrusion detection model based on convolutional spiking neural network

Introduction

Related work

Methodology

Data augmentation

Proposed IoT malware detection architecture (iMDA)

Proposed channel squeezing and boosting blocks

Implementation of customized existing CNNs

Experimental setup

Dataset

Implementation details

Performance evaluation metrics

Results and discussion

Performance analysis of the proposed iMDA

Performance comparison with existing CNNs

Detection capability of the proposed iMDA

Feature space-based analsis of the proposed iMDA

Auc-roc and auc-pr based analysis

Conclusion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Deep learning hybridization for improved malware detection in smart Internet of Things

Malaria parasitic detection using a new Deep Boosted and Ensemble Learning framework

COVID-19 infection analysis framework using novel boosted CNNs and radiological images

Zero-day attack detection: a systematic literature review

Comments

Search

Quick links