Machine Learning and Cyber Threat Intelligence and Analytics

Interpretability of API Call Topic Models: An Exploratory Study

Location

Grand Wailea, Hawaii

Event Website

https://hicss.hawaii.edu/

Start Date

7-1-2020 12:00 AM

End Date

10-1-2020 12:00 AM

Description

Topic modeling is an unsupervised method for discovering semantically coherent combinations of words, called topics, in unstructured text. However, the human interpretability of topics discovered from non-natural language corpora, specifically Windows API call logs, is unknown. Our objective is to explore the coherence of topics and their ability to represent the themes of API calls from malware analysts’ perspective. Three Latent Dirichlet Allocation (LDA) models were fit to a collection of dynamic API call logs. Topics, or behavioral themes, were manually evaluated by malware analysts. The results were compared to existing automated quality measures. Participants were able to accurately determine API calls that did not belong in behavioral themes learned by the 20 topic model. Our results agree with topic coherence measures in terms of highest interpretable topics. The results are not compatible with log-perplexity, which concur with the findings of topic evaluation literature on natural language corpora.

Download

COinS

Jan 7th, 12:00 AM Jan 10th, 12:00 AM

Interpretability of API Call Topic Models: An Exploratory Study

Grand Wailea, Hawaii

https://aisel.aisnet.org/hicss-53/st/cyber_threat_intelligence/6

Machine Learning and Cyber Threat Intelligence and Analytics

Interpretability of API Call Topic Models: An Exploratory Study

Location

Event Website

Start Date

End Date

Description

Search

Browse

Author Corner

Machine Learning and Cyber Threat Intelligence and Analytics

Interpretability of API Call Topic Models: An Exploratory Study

Presenter Information

Location

Event Website

Start Date

End Date

Description

Share

Search

Browse

Author Corner