Probabilistic topic modeling algorithms like Latent Dirichlet Allocation (LDA) have become powerful tools for the analysis of large collections of documents (such as papers, projects, or funding applications) in science, technology an innovation (STI) policy design and monitoring. However, selecting an appropriate and stable topic model for a specific application (by adjusting the hyperparameters of the algorithm) is not a trivial problem. Common validation metrics like coherence or perplexity, which are focused on the quality of topics, are not a good fit in applications where the quality of the document similarity relations inferred from the topic model is especially relevant. Relying on graph analysis techniques, the aim of our work is to state a new methodology for the selection of hyperparameters which is specifically oriented to optimize the similarity metrics emanating from the topic model. In order to do this, we propose two graph metrics: the first measures the variability of the similarity graphs that result from different runs of the algorithm for a fixed value of the hyperparameters, while the second metric measures the alignment between the graph derived from the LDA model and another obtained using metadata available for the corresponding corpus. Through experiments on various corpora related to STI, it is shown that the proposed metrics provide relevant indicators to select the number of topics and build persistent topic models that are consistent with the metadata. Their use, which can be extended to other topic models beyond LDA, could facilitate the systematic adoption of this kind of techniques in STI policy analysis and design.
Journal of Materials Science: Materials in Electronics - In this work, the very rapid one-step mechanochemical synthesis of nanocrystalline ternary chalcogenide chalcostibite CuSbS2 prepared from... 相似文献
In this study, the behavior of carbon steel and galvanized steel in nontropical coastal marine environments was evaluated. Evaluation was carried out with specimens with dimensions of 10 cm × 10 cm × 0.3 cm. These specimens were exposed to four testing stations (Iquique, Mejillones, Los Vilos, and San Vicente), where racks were installed both at ground level (ground), as well as in the upper zone of electrical transmission towers (tower). In each station, 24 specimens of A36 carbon steel and galvanized steel were placed (12 each). The corrosivity of the environment was measured using the ISO 9223, 9225, and 9226 standards. The specimens were evaluated on-site, monthly, through visual inspection and photographic record. Once withdrawn, the corrosion rate was determined and the corrosion products were analyzed through Raman and Fourier-transform infrared. The results show that, in all cases, the corrosion rate is greater in the tower than on the ground. However, even though the Los Vilos station is located farther from the sea (3,500 vs. ≈500 m), the corrosion rate of steel in the tower is the highest. This is caused by the generation of HCl from the transformation of lepidocrocite into goethite, in the presence of low chloride content, which acidifies the steel/corrosion product interface. In the case of galvanized steel, the corrosion rate is a function of the chloride content in the atmosphere, obtaining an excellent correlation between both parameters. 相似文献
Carbohydrates play a pivotal role in intercellular communication processes. In particular, glycan antigens are key for sustaining homeostasis, helping leukocytes to distinguish damaged tissues and invading pathogens from healthy tissues. From a structural perspective, this cross-talk is fairly complex, and multiple membrane proteins guide these recognition processes, including lectins and Toll-like receptors. Since the beginning of this century, lectins have become potential targets for therapeutics for controlling and/or avoiding the progression of pathologies derived from an incorrect immune outcome, including infectious processes, cancer, or autoimmune diseases. Therefore, a detailed knowledge of these receptors is mandatory for the development of specific treatments. In this review, we summarize the current knowledge about four key C-type lectins whose importance has been steadily growing in recent years, focusing in particular on how glycan recognition takes place at the molecular level, but also looking at recent progresses in the quest for therapeutics. 相似文献