Deterministic bibliometric disambiguation challenges in company names

Andrea Belz*, Alexandra Graddy-Reed, F. N.U. Shweta, Aleksandar Giga, Shivesh Meenakshi Murali

*Corresponding author for this work

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

2 Citations (Scopus)
6 Downloads (Pure)


Peer-reviewed publications and patents serve as important signatures of knowledge generation, and therefore the authors and their organizations can represent agents of intellectual transformation. Accurate tracking of these players enables scholars to follow knowledge evolution. However, while author name disambiguation has been discussed extensively, less is known about the impact of organization name on bibliometric studies. We expand here on the recently defined phenomenon of onomastic profusion, high-frequency words used in organization names for semantic reasons, and thus contributing a non-random source of error to bibliographic studies. We use the Small Business Innovation Research (SBIR) Phase I awardees of the National Aeronautics and Space Administration (NASA) as a use case in the field of engineering innovation. We find that firms in California or Massachusetts experience a six percent decrease in the likelihood of using the word Technologies in their names. Furthermore, use of the words Research and Science is linked to doubling the number of awards. We illustrate that, in aggregate, firms executing rational strategic naming decisions can create deterministic bibliometric challenges.
Original languageEnglish
Title of host publicationProceedings - 17th IEEE International Conference on Semantic Computing, ICSC 2023
PublisherInstitute of Electrical and Electronics Engineers (IEEE)
Number of pages5
ISBN (Electronic)9781665482639
Publication statusPublished - 2023
Event17th IEEE International Conference on Semantic Computing, ICSC 2023 - Virtual, Online, United States
Duration: 1 Feb 20233 Feb 2023

Publication series

NameProceedings - 17th IEEE International Conference on Semantic Computing, ICSC 2023


Conference17th IEEE International Conference on Semantic Computing, ICSC 2023
Country/TerritoryUnited States
CityVirtual, Online

Bibliographical note

Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.


  • bibliometric
  • disambiguation
  • names
  • NASA
  • NLP
  • patents
  • SBIR


Dive into the research topics of 'Deterministic bibliometric disambiguation challenges in company names'. Together they form a unique fingerprint.

Cite this