AUTHOR=Wang Kuansan , Shen Zhihong , Huang Chiyuan , Wu Chieh-Han , Eide Darrin , Dong Yuxiao , Qian Junjie , Kanakia Anshul , Chen Alvin , Rogahn Richard TITLE=A Review of Microsoft Academic Services for Science of Science Studies JOURNAL=Frontiers in Big Data VOLUME=2 YEAR=2019 URL=https://www.frontiersin.org/journals/big-data/articles/10.3389/fdata.2019.00045 DOI=10.3389/fdata.2019.00045 ISSN=2624-909X ABSTRACT=

Since the relaunch of Microsoft Academic Services (MAS) 4 years ago, scholarly communications have undergone dramatic changes: more ideas are being exchanged online, more authors are sharing their data, and more software tools used to make discoveries and reproduce the results are being distributed openly. The sheer amount of information available is overwhelming for individual humans to keep up and digest. In the meantime, artificial intelligence (AI) technologies have made great strides and the cost of computing has plummeted to the extent that it has become practical to employ intelligent agents to comprehensively collect and analyze scholarly communications. MAS is one such effort and this paper describes its recent progresses since the last disclosure. As there are plenty of independent studies affirming the effectiveness of MAS, this paper focuses on the use of three key AI technologies that underlies its prowess in capturing scholarly communications with adequate quality and broad coverage: (1) natural language understanding in extracting factoids from individual articles at the web scale, (2) knowledge assisted inference and reasoning in assembling the factoids into a knowledge graph, and (3) a reinforcement learning approach to assessing scholarly importance for entities participating in scholarly communications, called the saliency, that serves both as an analytic and a predictive metric in MAS. These elements enhance the capabilities of MAS in supporting the studies of science of science based on the GOTO principle, i.e., good and open data with transparent and objective methodologies. The current direction of development and how to access the regularly updated data and tools from MAS, including the knowledge graph, a REST API and a website, are also described.