Measuring Data Quality in Information Systems research

2019 
Abstract Although contemporary research relies to a large extent on data, data quality in Information Systems research is a subject that has not received much attention until now. In this paper, a framework is presented for the measurement of scientific data quality using the principles of rule-based measurement. The proposed framework is capable of handling data quality problems due to both incorrect execution and incorrect description of data collection and validation processes. It is then argued that uncertainty can arise during the measurement, which complicates data quality assessment. The framework is therefore extended to handle uncertainty about the truth value of predicates. Instead of a numerical quality level, data quality is then expressed as either a probability distribution or a possibility distribution over the ordinal quality scale. Finally, it is also shown how quality thresholds can be formulated based on the results of the quality measurement. The usefulness of the proposed framework is illustrated throughout the paper with an example of the construction of a possible survey data quality measurement system and, subsequently, the application of that system on a realistic example.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    29
    References
    12
    Citations
    NaN
    KQI
    []