PlumX Metrics
Embed PlumX Metrics

Scalable Data Profiling for Quality Analytics Extraction

IFIP Advances in Information and Communication Technology, ISSN: 1868-422X, Vol: 715 IFIPAICT, Page: 177-189
2024
  • 0
    Citations
  • 0
    Usage
  • 0
    Captures
  • 0
    Mentions
  • 0
    Social Media
Metric Options:   Counts1 Year3 Year

Conference Paper Description

In today’s modern society, data play an integral role in the development global industry, since they have become a valuable asset for companies, institutions, governments, and others. At the same time, data generated daily, at a global scale, require significant resources to pre-process, filter and store. When it comes to acquiring such stored data, it is essential to understand which dataset fits to the needs of the user beforehand. One particularly important factor is the quality of a dataset, which could be determined based on a series of quality related attributes generated by it. Such attributes constitute “Profiling”, the process of obtaining information from a data sample, related to the complete dataset’s quality. However, in the era of Big Data, the ability to apply profiling techniques in complete large datasets should also be considered, in order to obtain complete quality insights. This paper attempts to provide a solution for this consideration by presenting “DaQuE”, a scalable framework for efficient profiling and quality analytics extraction in complete datasets of all volumes.

Bibliographic Details

Anastasios Nikolakopoulos; Efthymios Chondrogiannis; Efstathios Karanastasis; Theodora Varvarigou; María José López Osa; Jordi Arjona Aroca; Michalis Kefalogiannis; Vasiliki Apostolopoulou; Efstathia Deligeorgi; Vasileios Siopidis

Springer Science and Business Media LLC

Decision Sciences

Provide Feedback

Have ideas for a new metric? Would you like to see something else here?Let us know