Document Type : Original Article

Authors

1 Associate professor, Department of Information Science and Knowledge Studies, Payame Noor University, Tehran, Iran

2 MA in Scientometrics, Department of Information Science and Epistemology, Yazd University, Yazd, Iran

Abstract

Background and Objectives: Citation is an important element in scientific writing and has a prominent role in the production and dissemination of information. Citation analysis is one of the applications of citation that examines the relationship between citing and cited document and studies the rules governing this relation. Citation analysis of patents is nowadays frequently observed in studies and is an important tool for identifying and analyzing the technical knowledge within patents. This research tries to identify determinants of patent citations by using a survival analysis.
Methodology: Research method was patent citation analysis. Research population consisted of 25,392 patents in the USPTO database in the Data Processing: Artificial Intelligence (AI) (Category: 706). There were 25392 AI patents in the USPTO database, among them, 13644 patents remained after deleting duplicates. Therefore, the new list, containing 13,644 patents, was saved in a plain text file by patents number, and these file was used to download the by Ravar Premep software. By using this software and using data mining technique, patents were saved in a separated location whit HTML format. Information needed for this analysis was: patent number, filed year, issue date, title, abstract, inventors, assignee, citations, and categories. This information was extract from patents in a comprehensive file. There were about 80 patent applications in the field at the USPTO, with the manually extracted information from the database added to this comprehensive file. To classify patent, first, patent that did not receive any citation in the field of AI were grouped into separate group and then the rest of the patents were divided into 3 groups using Bradford law based on number of citations. Group 1: Patents that have received at least 22 citations (412 patents).
Group 2: Patent applications that have received at least 9 and at most 21 citations (1123 patents). Group 3: Patents that have received at least 1 and up to 8 citations (4975 patents). Group 4: Patent Letters That Did Not Receive Any Citation (7134 patents). In this study 8922 inventors were retrieved as the first inventors for 13,644 patents. Since the classification of the first inventors did not fit well with any of the conventional classification methods in scientometrics, the inventors were divided into three groups according to the number of inventions. Inventors with more than 20 patents in the first group, inventors with 11 to 20 inventions in the second group and inventors with 10 inventions and fewer in the third group. In the field of AI, 2898 assignees hold 13644 patents. Bradford law was used to classify assignees in terms of frequency of inventions because of their high fit to the data. According to this method, assignees were classified into three groups in terms of their ability to produce patents. First Class (Strong): assignees who have produced at least 87 patents (19 assignees). Second Class (Medium): assignees who have produced at least 7 patents (233 assignees). Third Class (Poor): Proprietors who have produced at least 1 patent (2646 assignees). Bradford law was also used to classify assignees in terms of citation. Since in this method the minimum number is equal to one, assignees whose patents did not receive any citations were placed in a separate group (Group 4) and patent holders whose patents received at least one patent. They were classified into 3 groups according to Bradford method. Cox and Kaplan-Meier regression were used to analyze the data.
Findings: The research findings showed that of the 6749 patents that received first citation, in the first two years, the probability of citing each patent was less than 50 percent, and the probability of being cited increased over time. So that probability has risen to over 90% after 84 months. Kaplan-Meier test results showed that it takes between 38 and 40 months on average to a patent obtain first citation in the field of AI. The Kaplan-Meier test results in survival analysis showed that one year after a patent first was cited, the probability of being cited for the ninth time was 2.2%, and that probability increased over time. The findings also showed that on average, a patent may be cited for the ninth time between 69 and 74 months after receiving its first citation. Kaplan-Meier test showed that in the first 4 months after receiving the ninth citation, the probability of receiving the 22th citation for patent applications was zero, and then the probability increased. Also the finding showed that of the 6749 patents reviewed  citation for the first time, in the first 2 years, the probability of citing each patent was less than 50%, and the probability of being cited increased over time. So the probability has risen to over 90% after 84 months.
Discussion: The results of the Cox test showed that, at the significant level of 0.05, inventors and assignees were influenced the productivity and receiving citation of patents. The results showed that the frequency distribution of citations received by patents based on the first inventor and proprietor complies with Bradford’s scattering law. The results showed that the percentage of cited patents increased logarithmically over time. In other words, after a few years, the chances of citing a patent are reduced to a fixed amount. The results of the relationship between the power of first inventors both in terms of productivity and receiving citation showed that, the more the first inventors had a patent or citation, the shorter length of time need to be cited for the first time. The results of Cox test showed that at the significant level of 0.05 percent, inventors and assignees were influential in productivity and receiving citation. In other words, the chance of getting the first citation of assignees whose first inventor is in the first group (strong group) is 1.926 times higher than the patent of their first inventor in the third group. The results also showed that the probability of receiving the first citation for patents whose first inventors were in the first group was 1.925 times higher than that of the third group. Also those patent that the inventors where in second groups 1.44 times higher than of the third group. The results also showed that assignees were influenced by both the time they produced the invention and the citation received. The more assignees have had more or patents or citations, the less time it takes for them to be cited for the first time. The results also showed that the strongest assignees groups were more likely to receive their first citation at the same time interval than the weaker assignees. It may be argued that the role of countries in citing to patent applications is not very influential, but other factors are influential, one of which being the relevance of other patents that the inventor may cited or the evaluator of that role. And the type of citations in patent differs from those based on articles and other scientific documents. 

Keywords

Main Subjects

Abdekhoda, M. H., Noruzi, A., & Ravand, S. (2012). Mapping Iranian patents from 1976 to 2011 based on international patent classification (IPC). Payavard, 5(5), 46-56. https://payavard.tums.ac.ir/browse.php?a_id=53&sid=1&slc_lang=fa
Akrami, F. (2017). Studying the technical knowledge flow in the field of purification and recovery of hydrocarbon compounds through the analysis of citation relationships between patents (M.A. thesis). Department of Knowledge and Information Science, Faculty of Social Sciences, Yazd University.
Alaee Arani, M., Naghshineh, N., & Taheri, S. M. (2012). Science and technology output indicators in the Islamic Republic of Iran: A case study on the relevance between patents and scientific products of Iranian inventors. Iranian Journal of Information Processing and Management, 27(4), 1033-1052. https://api.semanticscholar.org/CorpusID:178064626
Amir Hosseini, M. (1993). Bibliometrics & informetrics. National Studies on Librarianship and Information Organization, 3(1-4), 183-209. https://ensani.ir/file/download/article/20120325202704-1144-56.pdf
Bigdeli, Z., & Serati, M. (2015). Investigating the link between science and technology through citation analysis of Iranian patents during 2009-2013. National Studies on Librarianship and Information Organization, 26(2), 65-76. https://www.sid.ir/paper/484508/fa
Borgman, C. L. (1990). Scholarly communication and bibliometrics. Newbury Park, CA: Sage. https://doi.org/10.1002/aris.1440360102
Bornmann, L., & Daniel, H. D. (2008). What do citation counts measure? A review of studies on citing behavior. Journal of Documentation, 64(1), 45-80. https://doi.org/10.1108/00220410810844150
Chen, C. (2013). Mapping scientific frontiers: The quest for knowledge visualization (2nd ed.). London, UK: Springer-Verlag. https://doi.org/10.1007/978-1-4471-5128-9
Ding, C. G., Hung, W. C., Lee, M. C., & Wang, H. J. (2017). Exploring paper characteristics that facilitate the knowledge flow from science to technology. Journal of Informetrics, 11(1), 244-256. https://doi.org/10.1016/j.joi.2017.01.001
Eom, S. B. (2015). Mining cocitation data with SAS Enterprise Guide. New Castle upon Tyne: Cambridge Scholars Publishing. https://www.cambridgescholars.com/product/978-1-4438-7422-9
Ernst, H., Leptien, C., & Vitt, J. (2000). Inventors are not alike: The distribution of patenting output among industrial R&D personnel. IEEE Transactions on Engineering Management, 47(2), 184–199. https://doi.org/10.1109/17.846787
Falagas, M. E., Zarkali, A., Karageorgopoulos, D. E., Bardakas, V., & Mavros, M. N. (2013). The impact of article length on the number of future citations: A bibliometric analysis of general medicine journals. PLoS ONE, 8, e49476. https://doi.org/10.1371/journal.pone.0049476
Garfield, E. (1980). Bradford law and related statistical patterns. Current Contents, 1(19), 5–12. https://garfield.library.upenn.edu/essays/v4p476y1979-80.pdf
Gay, C., Le Bas, C., Patel, P., & Touach, K. (2005). The determinants of patent citations: An empirical analysis of French and British patents in the US. Economics of Innovation and New Technology, 14(5), 339-350. https://doi.org/10.1080/1043859042000265931
Haghighi, M. (2002). The application of citation in scientific writing. Psychology and Educational Science Journal of Tehran University, 32(2), 215-232. https://journals.ut.ac.ir/article_10618_73b93039876ccb58b1fa091485f86ff8.pdf
Hassanzadeh, M., & Teymori Tabieh, M. (2015). Survey of knowledge flow in the knowledge-based organization of Science and Technology Park of the University of Tehran and presentation of a conceptual model. Knowledge Studies, 1(2), 23-40. https://doi.org/10.22054/jks.2015.1088
Haupt, R., Kloyer, M., & Lange, M. (2007). Patent indicators for the technology life cycle development. Research Policy, 36(3), 387–398. https://doi.org/10.1016/j.respol.2006.12.004
Hazeri, A., Tavakholizadeh-Ravari, M., & Shahbazi Manshadi, E. (2017). A study of the patent citation intensity in Iranian chemistry journal papers. Journal of Scientometrics, 3(5), 1-14. https://doi.org/10.22070/rsci.2017.790
Hou, J. (2017). Exploration into the evolution and historical roots of citation analysis by referenced publication year spectroscopy. Scientometrics, 110(3), 1437-1452. https://doi.org/10.1007/s11192-016-2206-9
Jaffe, A. B., & Trajtenberg, M. (1996). Flows of knowledge from universities and federal laboratories: Modeling the flow of patent citations over time and across institutional and geographic boundaries. Proceedings of the National Academy of Sciences, 93(23), 12671-12677. https://doi.org/10.1073/pnas.93.23.12671
Kermani, A., & Neshat, N. (2012). An evaluation of bibliographic couplings in clustering patents. National Studies on Librarianship and Information Organization, 23(3), 22-37. https://ensani.ir/file/download/article/20121209091125-9556-66.pdf
Khasseh, A. A. (2016). Knowledge structure in metric studies: Analysis of co-citations, co-authorships, and co-words of records using network analysis and science visualization (Ph.D. dissertation). Department of Knowledge and Information Science, Faculty of Educational Sciences and Psychology, Payame Noor University.
Kiehne, D. O., & Krill, B. (2017). The influence of the amount of inventors on patent quality. Management, 47, 300–308. https://www.researchgate.net/publication/315809224
Long, H., Plucker, J. A., Yu, Q., Ding, Y., & Kaufman, J. C. (2014). Research productivity and performance of journals in the creativity sciences: A bibliometric analysis. Creativity Research Journal, 26(3), 353-360. https://doi.org/10.1080/10400419.2014.929424
makes the first forward citation of a patent occur earlier? Scientometrics, 113(1), 279-298. https://doi.org/10.1007/s11192-017-2487-0
Mansoori, A., & Soheli, F. (2017). A survey on lag time in forming knowledge flow in Islamic countries patents. Journal of Library and Information Science Studies, 24(21), 49-70. https://doi.org/10.22055/slis.2018.12267
Mariani, M. S., Medo, M., & Lafond, F. (2019). Early identification of important patents: Design and validation of citation network metrics. Technological Forecasting and Social Change, 146, 644-654. https://doi.org/10.1016/j.techfore.2019.01.008
Osareh, F., & Mansoori, A. (2014). Knowledge flow among a network of inventors in electricity and electronics. Library and Information Science, 16(262), 143-166. https://lis.aqr-libjournal.ir/article_42631.html
Pao, M. L. (1989). Concepts of information retrieval. Englewood, CO: Libraries Unlimited. https://openlibrary.org/books/OL2059950M/Concepts_of_information_retrieval
Podolny, J. M., & Stuart, T. E. (1995). A role-based ecology of technological change. American Journal of Sociology, 100(5), 1224–1260. http://dx.doi.org/10.1086/230637
Tavakolizadeh-Ravari, M., & Soheili, F. (2013). Study of citation studies of patent licenses. Rahyaft, 23(55), 13-31. https://rahyaft.nrisp.ac.ir/article_13546.html
Tavakolizadeh-Ravari, M., Soheili, F., & Khasseh, A. A. (2020). An introduction to scientometrics foundation. Tehran: Payame Noor University.
Tavakolizadeh-Ravari, M., Soheili, F., Makkizadeh, F., & Akrami, F. (2020). A study on first citations of patents through a combination of Bradford's distribution, Cox-regression, and life tables' method. Journal of Information Science (forthcoming).