Harvesting large-scale grids for software resources

Asterios Katsifodimos*, George Pallis, Marios D. Dikaiakos

*Corresponding author for this work

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

4 Citations (Scopus)

Abstract

Grid infrastructures are in operation around the world, federating an impressive collection of computational resources and a wide variety of application software. In this context, it is important to establish advanced software discovery services that could help end-users locate software components suitable to their needs. In this paper, we present the design, architecture and implementation of an open-source keywordbased paradigm for the search of software resources in Grid infrastructures, called Minersoft. A key goal of Minersoft is to annotate automatically all the software resources with keywordrich metadata. Using advanced Information Retrieval techniques, we locate software resources with respect to users queries. Experiments were conducted in EGEE, one of the largest Grid production services currently in operation. Results showed that Minersoft successfully crawled 12.3 million valid files (620 GB size) and sustained, in most sites, high crawling rates.

Original languageEnglish
Title of host publication2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, CCGRID 2009
Pages252-259
Number of pages8
DOIs
Publication statusPublished - 2009
Externally publishedYes
EventCCGRID 2009: 9th IEEE/ACM International Symposium on Cluster Computing and the Grid - Shanghai, China
Duration: 18 May 200921 May 2009

Conference

ConferenceCCGRID 2009
Country/TerritoryChina
CityShanghai
Period18/05/0921/05/09

Fingerprint

Dive into the research topics of 'Harvesting large-scale grids for software resources'. Together they form a unique fingerprint.

Cite this