A concept of Generic Workspace for Big Data Processing in Humanities

TitleA concept of Generic Workspace for Big Data Processing in Humanities
Publication TypeConference Paper
AuthorsRybicki, J., B. von St Vieth, and D. Mallmann
Abstract

Big Data challenges often require application of new data processing paradigms (like MapReduce), and corresponding software solutions (e. g. Hadoop). This trend causes a pressure on both cyber-infrastructure providers (to quickly integrate new services) and infrastructure users (to quickly learn to use new tools). In this paper we present the concept of DARIAH Generic Workspace for Big Data Processing in eHumanities which alleviates the aforementioned problems. It establishes a common integration layer, thus enables a quick integration of new services, and by providing unified interfaces, allows the users to start using new tools without learning their internal details. We describe the overall architecture and implementation details of the working prototype. The presented concept is generic enough to be applied in other emerging cyber-infrastructures for humanities.

DOI10.1109/BigData.2013.6691672