Data Lake Management Based on DLDS Approach

2022 
Over the past few years, big data has become a central concern for actors in all fields of activity. The rapid growth of this massive data raises the question of how to store it. Data lakes meet this need by storing data without a predefined schema. In this context, a strategy for building a clear data catalog is fundamental for any organization that stores big data, because it helps ensure that information is used effectively and efficiently. Setting up a data catalog in a data lake remains a complicated task and a major issue for data managers, yet the catalog is essential. This article presents the use of XML and JAXB technologies to model the data catalog. It proposes an approach called DLDS (Data Lake Description Service) that builds a central catalog file allowing users to search, locate, understand, and query the different data sources stored in the lake.
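To make the idea of a central catalog file concrete, the following is a minimal sketch and not the DLDS implementation. It assumes a hypothetical catalog layout (a `catalog` root with `source` elements carrying `name` and `format` attributes), which is not taken from the article. It also uses the DOM and transformer APIs bundled with the JDK in place of JAXB, since JAXB is no longer shipped with recent JDKs; a JAXB version would map an annotated class to the same XML.

```java
import java.io.StringReader;
import java.io.StringWriter;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

public class CatalogSketch {

    /** Hypothetical catalog entry describing one data source in the lake. */
    public static final class SourceEntry {
        public final String name, format, location, description;

        public SourceEntry(String name, String format, String location, String description) {
            this.name = name;
            this.format = format;
            this.location = location;
            this.description = description;
        }
    }

    /** Serializes all entries into one XML catalog document (assumed layout). */
    public static String writeCatalog(List<SourceEntry> entries) {
        try {
            Document doc = DocumentBuilderFactory.newInstance().newDocumentBuilder().newDocument();
            Element root = doc.createElement("catalog");
            doc.appendChild(root);
            for (SourceEntry e : entries) {
                Element source = doc.createElement("source");
                source.setAttribute("name", e.name);
                source.setAttribute("format", e.format);
                Element location = doc.createElement("location");
                location.setTextContent(e.location);
                source.appendChild(location);
                Element description = doc.createElement("description");
                description.setTextContent(e.description);
                source.appendChild(description);
                root.appendChild(source);
            }
            Transformer transformer = TransformerFactory.newInstance().newTransformer();
            transformer.setOutputProperty(OutputKeys.INDENT, "yes");
            StringWriter out = new StringWriter();
            transformer.transform(new DOMSource(doc), new StreamResult(out));
            return out.toString();
        } catch (Exception ex) {
            throw new IllegalStateException("Could not write catalog", ex);
        }
    }

    /** Parses the catalog and returns the names of sources stored in the given format. */
    public static List<String> findByFormat(String catalogXml, String format) {
        try {
            Document doc = DocumentBuilderFactory.newInstance().newDocumentBuilder()
                    .parse(new InputSource(new StringReader(catalogXml)));
            NodeList sources = doc.getElementsByTagName("source");
            List<String> names = new ArrayList<>();
            for (int i = 0; i < sources.getLength(); i++) {
                Element source = (Element) sources.item(i);
                if (format.equals(source.getAttribute("format"))) {
                    names.add(source.getAttribute("name"));
                }
            }
            return names;
        } catch (Exception ex) {
            throw new IllegalStateException("Could not read catalog", ex);
        }
    }

    public static void main(String[] args) {
        // Illustrative entries only; names and paths are made up for the sketch.
        String xml = writeCatalog(List.of(
                new SourceEntry("sales_2021", "csv", "/lake/raw/sales_2021.csv", "Yearly sales export"),
                new SourceEntry("sensor_log", "json", "/lake/raw/sensor_log.json", "Device readings")));
        System.out.println(xml);
        System.out.println(findByFormat(xml, "csv"));
    }
}
```

Keeping the whole catalog in one file means a user can locate a source by its metadata without scanning the lake itself, which is the role the abstract assigns to the catalog.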