Data Integration without Unification

Beckett Sterner,Steve Elliott,Edward Gilbert,Nico M. Franz

Data Integration without Unification

2020

Beckett Sterner
Steve Elliott
Edward Gilbert
Nico M. Franz

Life scientists generate big data by pooling many smaller datasets and by ensuring that those datasets combine to form a trustworthy body of information with a net increase in use value. Most proceed by constructing a maximally comprehensive dataset based on universal standards for representing the data’s empirical content and fit for different uses. We argue that this approach rests on an regulative ideal to create unified datasets, but following this ideal isn’t necessary: there are alternatives that enable the benefits of data pooling to be realized through infrastructure supporting lateral exchange and customization of data among multiple sources. We illustrate data integration without unification in the context of big data for biodiversity, which aims to address rapid biodiversity losses across the globe.

Keywords:

ideal
Pooling
context
Big data
Data science
Globe
Data integration
Unification
Personalization
Computer science

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations