Indicators on firm level innovation activities from web scraped data

Sajad Ashouri, Arho Suominen*, Arash Hajikhani, Lukas Pukelis, Torben Schubert, Serdar Türkeli, Cees Van Beers, Scott Cunningham

*Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

4 Citations (Scopus)
63 Downloads (Pure)


This article presents data on companies' innovative behavior measured at the firm-level based on web scraped firm-level data derived from medium-high and high-technology companies in the European Union and the United Kingdom. The data are retrieved from individual company websites and contains in total data on 96,921 companies. The data provide information on various aspects of innovation, most significantly the research and development orientation of the company at the company and product level, the company's collaborative activities, company's products, and use of standards. In addition to the web scraped data, the dataset aggregates a variety firm-level indicators including patenting activities. In total, the dataset includes 21 variables with unique identifiers which enables connecting to other databases such as financial data.

Original languageEnglish
Article number108246
Number of pages14
JournalData in Brief
Publication statusPublished - 2022


  • Big data
  • Firm-level data
  • Innovation
  • Text data
  • Web scraped data


Dive into the research topics of 'Indicators on firm level innovation activities from web scraped data'. Together they form a unique fingerprint.

Cite this