Skip to content

Data quality

Marcus Bakker edited this page Mar 29, 2019 · 10 revisions

What is the quality of that data? If the quality of your data is poor, it is either hard to find what you are looking for or the data may even not be there. That is where scoring your data quality comes in. Blue ATT&CK offers a standardised scoring table to score your data's quality based on different aspects. Use the score that fits the best. In the end, when you have determined all your data source's quality, you can start improving.

Data quality dimension

Blue ATT&CK describes five different data quality dimensions: device completeness, data completeness, timeliness, consistency and retention. Scoring your data quality means scoring each of these dimensions for every data source you have. The scorings table will guide you in scoring.

The administration of your data sources can be done in YAML files and is explained in YAML administration data sources.

Excel output

You can generate an Excel sheet containing all your data sources, attributes, notes and data quality scoring:

python blue_attack.py ds -f sample-data/data-sources-endpoints.yaml -e
Blue ATT&CK - Data quality
Clone this wiki locally