Machine Learning and AI: Cybersecurity and Threat Hunting

Behavioral Malware Detection using a Language Model Classifier Trained on sys2vec Embeddings

Location

Hilton Hawaiian Village, Honolulu, Hawaii

Event Website

https://hicss.hawaii.edu/

Start Date

3-1-2024 12:00 AM

End Date

6-1-2024 12:00 AM

Description

Behavioral malware detection is an effective way to detect ever-changing malware. Often, kernel-level system calls are collected on device and then processed and fed to machine learning models. In this work, we show that using simple natural language processing (NLP) techniques on system calls, such as a bag-of-n-grams model, coupled with shallow machine learning classifiers, are not as useful for stealthier malware. In contrast, training a Word2Vec-like model, which we call sys2vec, on the system call traces and feeding the resulting embeddings to a language model classifier provides consistently better results. We evaluate and compare the two classifiers using Area Under the Receiver Operating Characteristic Curve (AUC) and the True Positive Rate (TPR) at an acceptable False Positive Rate (FPR). We then discuss how this work can be further expanded in the language model space going forward.

Recommended Citation

Carter, John; Mancoridis, Spiros; Protopapas, Pavlos; and Galinkin, Erick, "Behavioral Malware Detection using a Language Model Classifier Trained on sys2vec Embeddings" (2024). Hawaii International Conference on System Sciences 2024 (HICSS-57). 3.
https://aisel.aisnet.org/hicss-57/st/threat_hunting/3

Download

COinS

Jan 3rd, 12:00 AM Jan 6th, 12:00 AM

Behavioral Malware Detection using a Language Model Classifier Trained on sys2vec Embeddings

Hilton Hawaiian Village, Honolulu, Hawaii

https://aisel.aisnet.org/hicss-57/st/threat_hunting/3

Machine Learning and AI: Cybersecurity and Threat Hunting

Behavioral Malware Detection using a Language Model Classifier Trained on sys2vec Embeddings

Location

Event Website

Start Date

End Date

Description

Recommended Citation

Search

Browse

Author Corner

Machine Learning and AI: Cybersecurity and Threat Hunting

Behavioral Malware Detection using a Language Model Classifier Trained on sys2vec Embeddings

Presenter Information

Location

Event Website

Start Date

End Date

Description

Recommended Citation

Share

Search

Browse

Author Corner