AUTOMATED COLLECTION AND ANALYSIS OF OPEN-SOURCE CYBER THREAT INTELLIGENCE

Overview

In collaboration with the Knowledge Discovery in Databases lab at Kansas State University, this project aims to develop machine learning tools and techniques for collection, analysis, and generation of cybersecurity threat intelligence from Open-Source Intelligence (OSINT) sources (e.g., social media, web forums, dark web). The main components of this research are as follows:

Collecting relevant intelligence (documents and data) from multiple sources / media
Validating the trustworthiness / reliability of sources using the historical “big picture”
Fusing heterogeneous sources into a consistent and comprehensible whole
Processing data at high volume and rate to find indicators of emerging threat

Current Team Members:

Shreya Gopal Sundari
Cytisus Eurydice
Avishek Bose (KDD)
PIs: Dr.Vahid Behzadan, Prof William Hsu

Affiliate Research Groups:

Knowledge Discovery in Databases lab (Kansas State University)

Tools and Datasets:

Our initial dataset of ~21000 manually annotated tweets for their relevance to cyber-threat intelligence and the type of threat is available in the project’s Git Repository. For more information on the collection, annotation, and structure of the dataset, please refer to the relevant paper.

Publications:

SAIL Lab

AUTOMATED COLLECTION AND ANALYSIS OF OPEN-SOURCE CYBER THREAT INTELLIGENCE

CYBERSECURITY