Searching for software on the EGEE infrastructure

George Pallis*, Asterios Katsifodimos, Marios D. Dikaiakos

*Corresponding author for this work

Research output: Contribution to journal, Article, Scientific, peer-review

2 Citations (Scopus)


Several large-scale Grid infrastructures are currently in operation around the world, federating an impressive collection of computational resources, a wide variety of application software, and hundreds of user communities. To better serve the current and prospective users of Grid infrastructures, it is important to develop advanced software retrieval services that help users locate software components suited to their needs. In this paper, we present the design and implementation of Minersoft, a distributed, multi-threaded harvester for application software located in large-scale Grid infrastructures. Minersoft crawls the sites of a Grid infrastructure, discovers installed software resources, annotates them with keyword-rich metadata, and creates inverted indexes that can be used to support full-text software retrieval. We present insights derived from the implementation and deployment of Minersoft on EGEE, one of the largest Grid production services currently in operation. Experimental results show that Minersoft achieves high performance in crawling EGEE sites and discovering software-related files, and high efficiency in supporting software retrieval.
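
To make the indexing step in the abstract concrete, the following is a minimal sketch of an inverted index built over keyword-annotated software files. It is not Minersoft's implementation: the function names (`tokenize`, `build_inverted_index`, `search`), the file paths, and the metadata strings are all hypothetical, and real crawling, annotation, and ranking are omitted.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric keywords."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_inverted_index(documents):
    """Map each keyword to the set of file paths whose metadata contains it."""
    index = defaultdict(set)
    for path, metadata in documents.items():
        for term in tokenize(metadata):
            index[term].add(path)
    return index


def search(index, query):
    """Return the paths that match every keyword in the query (AND semantics)."""
    terms = tokenize(query)
    if not terms:
        return set()
    results = set(index.get(terms[0], set()))
    for term in terms[1:]:
        results &= index.get(term, set())
    return results


# Hypothetical crawl output: file path -> keyword-rich description.
# These paths and descriptions are illustrative assumptions, not EGEE data.
crawled = {
    "/opt/exp_soft/blast/bin/blastall": "BLAST sequence alignment tool for bioinformatics",
    "/opt/exp_soft/gromacs/bin/mdrun": "GROMACS molecular dynamics simulation engine",
    "/opt/exp_soft/clustalw/clustalw2": "ClustalW multiple sequence alignment program",
}

index = build_inverted_index(crawled)
```

With this toy index, `search(index, "sequence alignment")` returns the two alignment tools, because both keywords appear in their metadata. A production harvester would also need to handle distributed crawling and result ranking, which this sketch leaves out.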

Original language: English
Pages (from-to): 281-304
Number of pages: 24
Journal: Journal of Grid Computing
Issue number: 2
Publication status: Published - Jun 2010
Externally published: Yes


  • Knowledge Grids
  • Management resources
  • Software retrieval

