2016-06-22 Beek, MSc W.G.J. (VU University Amsterdam); Rietveld, MSc L. (VU University Amsterdam); Schlobach, Dr. S. (VU University Amsterdam) 10.17026/dans-znh-bcg3
The LOD Laundromat provides access to a large subset of Linked Open Data (LOD) that is published in today's LOD Cloud. It automatically scrapes and cleans 650 thousand datasets containing over 38 billion statements.
The cleaned data is converted to a standards-compliant format that is optimized for reuse. The cleaning process removes hundreds of millions of 'stains' from the data, including syntax errors, duplicate statements, and unnamed nodes. The resulting datasets are republished in a uniform format and indexed for fast search and retrieval via a publicly accessible web service. The LOD Laundromat data collection gives researchers access to large quantities of data from different domains, including humanities, health, and social sciences.
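
To make the idea of removing 'stains' concrete, the following is a minimal sketch of two of the cleaning steps named above: rejecting syntactically invalid statements and dropping duplicates. It is not the LOD Laundromat's actual pipeline. The function name `clean_statements`, the simplified line pattern, and the `example.org` sample data are all assumptions made for illustration, and the pattern covers only a small part of the real N-Triples grammar.

```python
import re

# Simplified pattern for one N-Triples-style statement (an assumption for
# illustration; the real grammar also allows escapes, language tags, datatypes).
TRIPLE_RE = re.compile(
    r'^\s*(<[^<>\s]+>|_:\w+)\s+'      # subject: IRI or blank node label
    r'(<[^<>\s]+>)\s+'                # predicate: IRI
    r'(<[^<>\s]+>|_:\w+|"[^"]*")\s*'  # object: IRI, blank node label, or plain literal
    r'\.\s*$'                         # terminating dot
)


def clean_statements(lines):
    """Return (sorted unique statements, number of rejected lines).

    Hypothetical helper: lines that do not match TRIPLE_RE are counted as
    syntax errors, and whitespace is normalized so duplicates collapse.
    """
    unique = set()
    rejected = 0
    for line in lines:
        if not line.strip() or line.lstrip().startswith('#'):
            continue  # skip blank lines and comments
        match = TRIPLE_RE.match(line)
        if match is None:
            rejected += 1  # treat as a syntax error
            continue
        # Re-serialize with single spaces so equal statements compare equal.
        unique.add(' '.join(match.groups()) + ' .')
    return sorted(unique), rejected


# Made-up sample input: one duplicate (differing only in whitespace)
# and one malformed line with an unterminated literal.
raw = [
    '<http://example.org/a> <http://example.org/p> "x" .',
    '<http://example.org/a>   <http://example.org/p>   "x" .',
    '<http://example.org/a> <http://example.org/p> <http://example.org/b> .',
    '<http://example.org/a> <http://example.org/p> "unterminated .',
]
cleaned, rejected = clean_statements(raw)
print(len(cleaned), "unique statements,", rejected, "rejected")
```

Sorting the output is a design choice in this sketch: it gives a deterministic, uniform serialization, which makes cleaned files easier to compare and index.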