Recent efforts have focused on identifying multidisciplinary teams and detecting co-Authorship Networks based on exploring topic modeling to identify researchers’ expertise. Though promising, none of these efforts perform a real-life evaluation of the quality of the built topics. This paper proposes a Semantic Academic Profiler (SAP) framework that allows summarizing articles written by researchers to automatically build research profiles and perform online evaluations regarding these built profiles. SAP exploits and extends state-of-the-art Topic Modeling strategies based on Cluwords considering n-grams and introduces a new visual interface able to highlight the main topics related to articles, researchers and institutions. To evaluate SAP’s capability of summarizing the profile of such entities as well as its usefulness for supporting online assessments of the topics’ quality, we perform and contrast two types of evaluation, considering an extensive repository of Brazilian curricula vitae: (1) an offline evaluation, in which we exploit a traditional metric (NPMI) to measure the quality of several data representations strategies including (i) TFIDF, (ii) TFIDF with Bi-grams, (iii) Cluwords, and (iv) CluWords with Bi-grams; and (2) an online evaluation through an A/B test where researchers evaluate their own built profiles. We also perform an online assessment of SAP user interface through a usability test following the SUS methodology. Our experiments indicate that the CluWords with Bi-grams is the best solution and the SAP interface is very useful. We also observed essential differences in the online and offline assessments, indicating that using both together is very important for a comprehensive quality evaluation. Such type of study is scarce in the literature and our findings open space for new lines of investigation in the Topic Modeling area.
相似文献As one kind of gas bearings, foil bearings that are commonly made of one top foil and at least one bottom foil, have many attractive advantages such as low power loss, wide work temperature range, high rotating speed, low maintenance cost, simple system construct, etc. Nevertheless, foil bearings also have some disadvantages including low carrying load at low speed, wear at start-stop procedure, not easy to predict bearing performance due to the complex bearing structure and so on.
For the sake of better understanding and further utilization of foil bearings, many researchers, institutions and countries have paid great attention to study these special rolling elements in recent decades. The purpose of this paper is to explore the research status of foil bearings performance based on the Web of Science datasets and related tools, meanwhile, the cooperative relationship among individuals or organizations is further studied. Moreover, to figure out influential countries, institutions and authors, the analysis of cited frequency is adopted. In addition, keywords frequency analysis and co-occurrence analysis are applied to explore the future trend of foil bearing. The results identify influential countries, institutions and authors in the research of foil bearings performance and show that whether in quantity or in quality, research of foil bearings performance in developed countries holds a significant lead over that in developing countries. Moreover, the current hot topics are figured out as well as future direction of development is predicted. This study presents intuitive state of research of foil bearings performance and proposes a fresh perspective for relevant researchers to perform foil bearings research in the future.
相似文献Increased collaboration between researchers working in university, industry, and governmental settings is changing the landscape of academic science. Traditional models of the interaction between these sectors, such as the triple helix concept, draw clear distinctions between academic and non-academic settings and actors. This study surveyed scientists (n = 469) working outside of university settings who published articles indexed in the Web of Science about their modes of collaboration, perceptions about publishing, workplace characteristics, and information sources. We study the association between these variables, and use text analysis to examine the roles, duties, sites, topics, and workplace missions among non-university based authors. Our analysis shows that 72% of authors working in non-university settings who collaborate and publish with other scientists self-identify as academics. Furthermore, their work life resembles that of those working in university settings in that the majority report doing fundamental research in government research organizations and laboratories. Contrary to our initial hypothesis, this research suggests that peer-reviewed publications are much more dominated by non-university academics than we previously thought and that collaboration as co-authors on academic publications is not likely to be a primary conduit for the transfer of scientific knowledge between academe and industry.
相似文献Citations play a pivotal role in indicating various aspects of scientific literature. Quantitative citation analysis approaches have been used over the decades to measure the impact factor of journals, to rank researchers or institutions, to discover evolving research topics etc. Researchers doubted the pure quantitative citation analysis approaches and argued that all citations are not equally important; citation reasons must be considered while counting. In the recent past, researchers have focused on identifying important citation reasons by classifying them into important and non-important classes rather than individually classifying each reason. Most of contemporary citation classification techniques either rely on full content of articles, or they are dominated by content based features. However, most of the time content is not freely available as various journal publishers do not provide open access to articles. This paper presents a binary citation classification scheme, which is dominated by metadata based parameters. The study demonstrates the significance of metadata and content based parameters in varying scenarios. The experiments are performed on two annotated data sets, which are evaluated by employing SVM, KLR, Random Forest machine learning classifiers. The results are compared with the contemporary study that has performed similar classification employing rich list of content-based features. The results of comparisons revealed that the proposed model has attained improved value of precision (i.e., 0.68) just by relying on freely available metadata. We claim that the proposed approach can serve as the best alternative in the scenarios wherein content in unavailable.
相似文献Identifying the most relevant scientific publications on a given topic is a well-known research problem. The Author-Topic Model (ATM) is a generative model that represents the relationships between research topics and publication authors. It allows us to identify the most influential authors on a particular topic. However, since most research works are co-authored by many researchers the information provided by ATM can be complemented by the study of the most fruitful collaborations among multiple authors. This paper addresses the discovery of research collaborations among multiple authors on single or multiple topics. Specifically, it exploits an exploratory data mining technique, i.e., weighted association rule mining, to analyze publication data and to discover correlations between ATM topics and combinations of authors. The mined rules characterize groups of researchers with fairly high scientific productivity by indicating (1) the research topics covered by their most cited publications and the relevance of their scientific production separately for each topic, (2) the nature of the collaboration (topic-specific or cross-topic), (3) the name of the external authors who have (occasionally) collaborated with the group either on a specific topic or on multiple topics, and (4) the underlying correlations between the addressed topics. The applicability of the proposed approach was validated on real data acquired from the Online Mendelian Inheritance in Man catalog of genetic disorders and from the PubMed digital library. The results confirm the effectiveness of the proposed strategy.
相似文献Many empirical sciences, including the social sciences and life sciences, aim to study causal relationships. Researchers in these fields need computational methods for analyzing observed data and identifying causal structures among a set of variables. Such computational methods enable researchers to draw conclusions on the basis of both their assumptions and the observed data. Moreover, these methods are useful for developing hypotheses on causal relations, designing future observational studies, and planning future experimental studies that can potentially provide stronger evidence of estimated causal relations.
The objective of this special issue is to present an up-to-date overview of causal discovery methods, which have witnessed rapid advancements in recent years. The chief editor and guest editors invited the following three survey papers on various hot topics related to causal discovery:
相似文献Research topics rise and fall in popularity over time, some more swiftly than others. The fastest rising topics are typically called bursts; for example “deep learning”, “internet of things” and “big data”. Being able to automatically detect and track bursty terms in the literature could give insight into how scientific thought evolves over time. In this paper, we take a trend detection algorithm from stock market analysis and apply it to over 30 years of computer science research abstracts, treating the prevalence of each term in the dataset like the price of a stock. Unlike previous work in this domain, we use the free text of abstracts and titles, resulting in a finer-grained analysis. We report a list of bursty terms, and then use historical data to build a classifier to predict whether they will rise or fall in popularity in the future, obtaining accuracy in the region of 80%. The proposed methodology can be applied to any time-ordered collection of text to yield past and present bursty terms and predict their probable fate.
相似文献The analysis of vibrations and acoustic emissions (AE) are two recognized non-destructive techniques used for machine fault diagnosis. In recent years, the two techniques have been comparatively evaluated by different researchers with experimental tests. Several evaluations have shown that the AE analysis has a higher potential than the vibration analysis for fault diagnosis of mechanical components for certain cases. However, the distance between the AE sensor and the fault is an important factor that can considerably decrease the potential to detect damage and that has not been sufficiently investigated. Moreover, the comparisons have not yet addressed conditions of slow speed that for example are usual for wind turbine gearboxes. Therefore, in this paper we present two comparative case studies that address both topics. Both case studies consider planetary gearboxes with faults in their ring gears. The first case study corresponds to a small planetary gearbox in which the AE and vibration sensors were installed together at two different positions. The second case study corresponds to a full-size wind turbine gearbox in which three pairs of AE and vibration sensors were installed on the outside of the ring gear from a low-speed planetary stage. The results of the evaluations demonstrate the important influence of the distance between sensors and fault. Despite this, the good results from the AE analysis indicate that this technique should be considered as an important complement to the traditional vibration analysis. The main contribution of this paper is comparing AE and vibration analysis by using not only experimental data from a small planetary gearbox but also from a full-size wind turbine gearbox. The comparison addresses the topics of proximity of the sensor to the fault and low-speed conditions.
相似文献Bibliometric analysis is growing research filed supported in different tools. Some of these tools are based on network representation or thematic analysis. Despite years of tools development, still, there is the need to support merging information from different sources and enhancing longitudinal temporal analysis as part of trending topic evolution. We carried out a new scientometric open-source tool called ScientoPy and demonstrated it in a use case for the Internet of things topic. This tool contributes to merging problems from Scopus and Clarivate Web of Science sources, extracts and represents h-index for the analysis topic, and offers a set of possibilities for temporal analysis for authors, institutions, wildcards, and trending topics using four different visualizations options. This tool enables future bibliometric analysis in different emerging fields.
相似文献Mapping bi-regional scientific collaboration demands multiple approaches to obtain a picture as complete as possible. Usually, the first approach is the measuring of the number and typology of scientific co-publications in the most visible indexes of journals and publications covered by databases like Web of Science or Scopus, among others. This paper analyzes scientific publications listed by Web of Science (WoS), which comprises authors from the 28 EU countries and Latin American and Caribbean countries (EULAC) between 2005 and 2016. The following questions have been addressed: How are bi-regional scientific relations between EULAC countries reflected by international collaboration? What effects does this scientific collaboration have in smaller or emerging countries? Which area of knowledge has more international collaborations? The study highlights the existence of a growing global network of researchers from several countries that collaborate on their research. EULAC scientific collaboration cannot be understood in isolation from this global network.
相似文献