Thursday, February 2, 2012

LODStats

LODStats looks interesting and is a potentially useful tool if you are working with linked
open data resources.
One of the major obstacles for a wider usage of Web Data is the difficulty to obtain a clear picture of the available datasets. In order to reuse, link, revise or query a dataset published on the Web it is important to know the structure, coverage and coherence of the data.

LODStats is a statement-stream-based approach for gathering comprehensive statistics about datasets adhering to the Resource Description Framework (RDF). LODStats is based on the declarative description of statistical dataset characteristics. Its main advantages over other approaches are a smaller memory footprint and significantly better performance and scalability. We integrated LODStats into the CKAN dataset metadata registry and obtained a comprehensive picture of the current state of the Data Web.

No comments: