TY - GEN
T1 - Deterministic bibliometric disambiguation challenges in company names
AU - Belz, Andrea
AU - Graddy-Reed, Alexandra
AU - Shweta, F. N.U.
AU - Giga, Aleksandar
AU - Murali, Shivesh Meenakshi
N1 - Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.
PY - 2023
Y1 - 2023
N2 - Peer-reviewed publications and patents serve as important signatures of knowledge generation, and therefore the authors and their organizations can represent agents of intellectual transformation. Accurate tracking of these players enables scholars to follow knowledge evolution. However, while author name disambiguation has been discussed extensively, less is known about the impact of organization name on bibliometric studies. We expand here on the recently defined phenomenon of onomastic profusion, high-frequency words used in organization names for semantic reasons, and thus contributing a non-random source of error to bibliographic studies. We use the Small Business Innovation Research (SBIR) Phase I awardees of the National Aeronautics and Space Administration (NASA) as a use case in the field of engineering innovation. We find that firms in California or Massachusetts experience a six percent decrease in the likelihood of using the word Technologies in their names. Furthermore, use of the words Research and Science is linked to doubling the number of awards. We illustrate that, in aggregate, firms executing rational strategic naming decisions can create deterministic bibliometric challenges.
AB - Peer-reviewed publications and patents serve as important signatures of knowledge generation, and therefore the authors and their organizations can represent agents of intellectual transformation. Accurate tracking of these players enables scholars to follow knowledge evolution. However, while author name disambiguation has been discussed extensively, less is known about the impact of organization name on bibliometric studies. We expand here on the recently defined phenomenon of onomastic profusion, high-frequency words used in organization names for semantic reasons, and thus contributing a non-random source of error to bibliographic studies. We use the Small Business Innovation Research (SBIR) Phase I awardees of the National Aeronautics and Space Administration (NASA) as a use case in the field of engineering innovation. We find that firms in California or Massachusetts experience a six percent decrease in the likelihood of using the word Technologies in their names. Furthermore, use of the words Research and Science is linked to doubling the number of awards. We illustrate that, in aggregate, firms executing rational strategic naming decisions can create deterministic bibliometric challenges.
KW - bibliometric
KW - disambiguation
KW - names
KW - NASA
KW - NLP
KW - patents
KW - SBIR
UR - http://www.scopus.com/inward/record.url?scp=85151503165&partnerID=8YFLogxK
U2 - 10.1109/ICSC56153.2023.00047
DO - 10.1109/ICSC56153.2023.00047
M3 - Conference contribution
AN - SCOPUS:85151503165
T3 - Proceedings - 17th IEEE International Conference on Semantic Computing, ICSC 2023
SP - 239
EP - 243
BT - Proceedings - 17th IEEE International Conference on Semantic Computing, ICSC 2023
PB - IEEE
T2 - 17th IEEE International Conference on Semantic Computing, ICSC 2023
Y2 - 1 February 2023 through 3 February 2023
ER -