Information extraction of cybersecurity concepts: An LSTM approach

Gasmi H.; Laval J.; Bouras A.

View/Open

applsci-09-03945-v2.pdf (2.048Mb)

Date

2019

Author

Gasmi H.
Laval J.
Bouras A.

Metadata

Show full item record

Abstract

Extracting cybersecurity entities and the relationships between them from online textual resources such as articles, bulletins, and blogs and converting these resources into more structured and formal representations has important applications in cybersecurity research and is valuable for professional practitioners. Previous works to accomplish this task were mainly based on utilizing feature-based models. Feature-based models are time-consuming and need labor-intensive feature engineering to describe the properties of entities, domain knowledge, entity context, and linguistic characteristics. Therefore, to alleviate the need for feature engineering, we propose the usage of neural network models, specifically the long short-term memory (LSTM) models to accomplish the tasks of Named Entity Recognition (NER) and Relation Extraction (RE).We evaluated the proposed models on two tasks. The first task is performing NER and evaluating the results against the state-of-the-art Conditional Random Fields (CRFs) method. The second task is performing RE using three LSTM models and comparing their results to assess which model is more suitable for the domain of cybersecurity. The proposed models achieved competitive performance with less feature-engineering work. We demonstrate that exploiting neural network models in cybersecurity text mining is effective and practical. - 2019 by the authors.

DOI/handle

http://dx.doi.org/10.3390/app9193945
http://hdl.handle.net/10576/13961

Collections

Computer Science & Engineering [‎2489‎ items ]