• Research on Developing Regulations of Big Data Application Technology Based on Bibliometrics Laws

    Subjects: Library Science,Information Science >> Library Science submitted time 2023-10-08 Cooperative journals: 《知识管理论坛》

    Abstract: [Purpose/significance] This paper aims to study the trend over the past and the present situation of big data application, and indicate the development regulation and trend in the future. [Method/process] This paper analyzed literature year distribution, journal distribution and author distribution of big data application field, and was the first to verify whether or not its development corresponds to the three fundamental bibliometric laws, though there are lots of factors that have not been taken into consideration. [Result/ conclusion] Results shows that starting in 1990, literatures related to big data application field went through a period of stable development, and appears to develop rapidly from 2012, whose development corresponds to Price law of scientific literature growth. Development of the literature sample in the study is in line with Bradford’s law, and forms a group of core journals, including Bmc bioinformatics, Sensors and so on. In the view of the author distribution, distribution in this filed is far different from that in Lotka’s law, and there’s no doubt that it has not yet formed the core author group.

  • Multi-expertise Researcher Identification: A Case Study of the Big Data

    Subjects: Library Science,Information Science >> Library Science submitted time 2023-08-26 Cooperative journals: 《图书情报工作》

    Abstract: [Purpose/significance]In response to the rapid shifting of knowledge needs, how to choose the appropriate researchers for a given problem is an important issue for the government, companies, as well as research institutions. When we face a real complex problem, it is essential to find multi-expertise researchers. This research aims to find a proper way to identify multi-expertise researchers. [Method/process]This paper used a Term Frequency-Inverse Document Frequency (TFIDF) weighted overlapping K-means clustering method. Based on the researchers' co-authorship network built up from the publication data, the TFIDF weighted overlapping K-means clustering method was applied to cluster researchers into overlapping clusters and identify the multi-expertise researchers. [Result/conclusion]Results show that the TFIDF weighted overlapping K-means method has an advantage over the previous work in terms of the precision ratio, the recall ratio and the F-value, so such a method can be beneficial to identify multi-expertise researchers.

  • Research on Selection of Innovative Solutions Based on SAO Structure: A Case Study on Air Purification Technology

    Subjects: Library Science,Information Science >> Library Science submitted time 2023-07-26 Cooperative journals: 《图书情报工作》

    Abstract: [Purpose/significance] Facing the increasingly competitive social environment, innovation is the foundation and the way of existence for enterprises. The scope of the innovation not only includes creating new products, technologies, etc. in the target research field, but also includes introducing new technologies, products, etc. from other research filed into the target field. The latter one is much easier to be accomplished. However, with the increasingly high degree of specialization in every disciplinary field, researchers have little time to grasp the knowledge besides their own research field. So it is needed to use scientific method and technology to explore the deep relationships between knowledge from different research fields. [Method/process] Using the analytical process of LRDI methodology, the paper proposes the research on selection of innovative solutions based on SAO structure, seeking for the potential solutions in whole research field based on the specific problems from target research field. The paper evaluates these potential solutions from the aspects of the technical feasibility and expected results and gives priority to recommend the solutions as innovative solutions for target research field. An exploratory study is conducted on air purification technology for this systematic process. [Result/conclusion] The research shows that some of the selected innovative solutions have been effectively used in air purification field, which also verify the proposed research method is feasible and valid.

  • Research on Drug Combination Recommendation Based on Link Prediction for Concurrent Diseases Treatment

    Subjects: Library Science,Information Science >> Information Science submitted time 2023-04-01 Cooperative journals: 《图书情报工作》

    Abstract: [Purpose/significance] Compared with single drug, drug combination has many advantages in clinical treatment. But the growth of drug quantity brings difficulties to drug combination screening experiment. Therefore, it is of great significance to design an effective prediction method to recommend drug combination which is more likely to produce synergistic effect for pharmaceutical staff, so as to improve the screening efficiency.[Method/process] For the need of concurrent diseases treatment, proposed a drug combination recommendation model based on link prediction, and used the SAO semantic mining to identify the complications in medical literature. On this basis, we used the medical database to build the heterogeneous "disease-drug-target" network, and introduced link prediction to evaluate the similarity of drug action mechanism, and predicted which drug combinations were more likely to have synergistic effect. Based on the prediction results, recommended a combination of drugs for a certain disease or a pair of complications.[Result/conclusion] The empirical analysis of intestinal disease data verified the practicality and efficiency of the model.

  • Radical Innovative Topic Identification from a Perspective of Dynamic Topic Network:Taking the Field of Blockchain as an Example

    Subjects: Library Science,Information Science >> Information Science submitted time 2023-04-01 Cooperative journals: 《图书情报工作》

    Abstract: [Purpose/Significance]Radical innovation plays a key role in the development of science and technology. In the big data environment, the complex, multidimensional, and continuous evolutionary characteristics of science and technology development itself is becoming more observable than ever before. It is important to identify these topics from a dynamic perspective to provide solutions for countries, enterprises and universities to analyze radical innovation areas, allocate innovation resources rationally and seek innovation upgrades.[Method/Process] This paper integrated methods of topic modeling, word embedding algorithm, and complex network analysis to construct dynamic topic networks, and evaluate the structural characteristics of the topics within different time windows and the topic evolution states between these time windows. Based on dynamic topic networks, this paper then combined the novelty, mutation, impact and interdisciplinary characteristics of radical innovation to identify topics of radical innovation.[Result/Conclusion] Through the empirical study on blockchain, this paper recognizes that two topics with the most significant radical innovative characteristics are Neural Network and Edge Computing. With existing research of blockchain and the list of critical and emerging technologies issued by the National Science and Technology Council (NSTC) of the United States, this paper finally verifies the feasibility and effectiveness of the proposed method. However, further quantitative verification of the result of this paper, and identification of radical innovative topics by fusing multi-source data, require further research in the future.

  • Research on the Method of Technology Opportunity Discovery Promoted by Science

    Subjects: Library Science,Information Science >> Information Science submitted time 2023-04-01 Cooperative journals: 《图书情报工作》

    Abstract: [Purpose/Significance] The close relationship between science and technology makes it more reasonable and efficient to analyze technology opportunities by combining papers and patents rather than using single data. This paper makes the generation of science-technology relationship more automatic, reduces the dependence on subjective judgment, and makes the technology units smaller. The purpose is to provide R&D suggestions for technology researchers, and help apply theories and ideas from scientific research to technological innovation.[Method/Process] The abstract texts of papers and patents were represented by Doc2vec vector, which were associated into a network through text similarity, and then science-technology clusters were generated based on Louvain algorithm to identify technology opportunities promoted by scientific research. Finally, 3D printing technology was taken as a case for empirical research.[Result/Conclusion] Several technology opportunities promoted by scientific research are identified, and it is verified that the identified opportunities have technological potential, which proves the feasibility and effectiveness of the method.

  • Research Status, Trends and Future Thinking of Technology Forecasting: From the Perspective of Data Analytics

    Subjects: Library Science,Information Science >> Information Science submitted time 2023-04-01 Cooperative journals: 《图书情报工作》

    Abstract: [Purpose/Significance] From the change of the research data and research methods, this paper makes a systematic analysis of technology forecasting research from the perspective of data analytics.[Method/Process] In order to clarify its development process, this research divided the technology forecasting research based on data analytics into four stages of nascent phase (1981-1991), growth phase (1992-2010), expansion phase (2011-2017) and bottleneck phase (2018-present), and made an in-depth analysis of the research fronts under each stage through the comprehensive use of bibliometrics and knowledge map analysis tools.[Result/Conclusion] The results show that technology forecasting has been moving towards a multi-level and systematic direction, but it has not yet completed the leap from 'how technology may develop' to 'how technology should develop' in complex environments. Building a scientific data sharing platform and intelligent analysis software and giving full play to the role of government macro-control will be the focus of future attention.

  • 微博城市投诉文本中地理位置实体的完整性研究

    Subjects: Library Science,Information Science >> Information Science submitted time 2017-10-11 Cooperative journals: 《数据分析与知识发现》

    Abstract: [Objective] This study aims to utilize the knowledge sharing and constantly updating advantages of the Question Answering Community - Baidu Zhidao, which helps us reduce the cost of maintaining large geographical relationship resource, and find the complete location information. [Methods] First, we changed the incomplete location information to the approximate area names retrieved from Baidu Zhidao. Second, extracted each area’s features and calculated scores of related geographic entities. Finally, we constructed the feature vectors for the areas with those geographic entities, which help us identify the geographic locations of these posts. [Results] The proposed method could retrieve accurate geographic information from 92.51% of City Complaints from the Micro-blog platform. [Limitations] The proposed method could not analyze posts without any geographic location information. [Conclusions] Our study found an effective and feasible way to locate the missing geographic information.