Museums and the Web

An annual conference exploring the social, cultural, design, technological, economic, and organizational issues of culture, science and heritage on-line.

You are hereMW / hoard.it: Aggregating, displaying and mining object-data without consent

hoard.it: Aggregating, displaying and mining object-data without consent


Titlehoard.it: Aggregating, displaying and mining object-data without consent
Publication TypeConference Paper
Year of Publication2009
AuthorsEllis, M., & Zambonini D.
Secondary TitleMuseums and the Web 2009. Proceedings
Conference Start DateApril 15-18, 200
PublisherArchives & Museum Informatics
Place PublishedIndianapolis, Indiana, USA
EditorTrant, J., & Bearman D.
Keywordsaggregation, API, collections, data, scraping, Semantic Web, top down
Abstract

A prototype system that allows the aggregation of data from museum and related Web sites, including object and event records, was rapidly developed. By screen-scraping the existing pages of 17 Web sites, tens of thousands of data records were collected without any technical agreement, investment or consent from the participating institutions. In this paper, we examine the reasons and benefits for aggregating this type of data, how our approach differs to other funded projects that have similar aspirations, and the relative strengths and weaknesses of each. An analysis of the data is presented, showing how the aggregate data set varies by assorted parameters, including location and date. Our work is related to the bigger picture of on-line data publishing, such as Semantic Web technologies, and some suggestions are presented as to how the grand vision of the Semantic Web may be achievable without the complexity.

URLhttp://www.archimuse.com/mw2009/papers/ellis/ellis.html