Title | Integrating Full-Text Search and Linguistic Analyses on Disperse Data for Humanities and Social Sciences Research Projects |
Publication Type | Conference Paper |
Authors | Villegas, M., and C. Parra |
Abstract | The research reported in this paper is part of the activities carried out within the CLARIN (Common Language Resources and Technology Infrastructure) project, a large-scale pan-European project to create, coordinate, and make language resources and technologies (LRT) available and readily usable. CLARIN is devoted to creating a persistent and stable infrastructure that serves the needs of the European humanities and social sciences (HSS) research community. HSS researchers will be able to efficiently access distributed resources and apply the analysis and exploitation tools relevant to their research. Here we present a real use case addressed as a CLARIN scenario, together with the implementation of a demonstrator that helps us anticipate potential problems and contributes to planning the implementation phase. The use case concerns how to support researchers interested in harvesting and analyzing data from historical press archives. To that end, we address the integration and interoperability of distributed and heterogeneous research data and analysis tools. |
DOI | 10.1109/e-Science.2009.12 |