共查询到20条相似文献,搜索用时 187 毫秒
1.
2.
3.
为解决搜索引擎返回结果数量上的限制,扩展了元搜索技术,提出链接群落、链接繁殖的概念,并与生物群落进行了对比。链接繁殖的思想是首先将多个搜索引擎返回的结果作为起始信息源,利用预定义的繁殖规则,优化并整合搜索结果,对链接所指网页的链接进行分析,繁殖出更多的相关信息源。在分析不同的搜索引擎结果集时,系统根据不同搜索引擎直接与繁殖发现信息源的能力与质量,动态调整繁殖的链接的优先次序。经过实验验证,链接繁殖可以大大扩展通过搜索引擎发现主题信息源的数量。 相似文献
4.
卫星系统综合效能分析研究 总被引:6,自引:0,他引:6
从卫星系统应用角度提出了基于信息支援保障的卫星系统综合效能评估指标体系,并应用层次分析法建立了效能分析的多属性综合评价模型与方法。为量化指标值,提出了基于领域专家知识的定性与定量相结合的量化指标方法。定量分析表明了多属性综合评价方法对复杂系统效能评估是有效的。 相似文献
5.
6.
基于LDA模型的博客垃圾评论发现 总被引:1,自引:0,他引:1
Blog(博客)作为一种新兴的网络媒体,在很大程度上增强了互联网的开放性,Blog已经成为互联网上的主要信息源之一,这也使得Blog空间中的垃圾评论成倍增长,因此如何识别垃圾评论成为面临的重要问题。该文首先借鉴处理垃圾邮件的方法,针对Blog本身的特点,使用规则初步过滤垃圾评论,然后对剩余评论,利用Latent Dirichlet Allocation(LDA) 这种能够提取文本隐含主题的产生式模型,对博客中的博文进行主题提取,并结合主题信息进行判断,从而识别Blog空间的垃圾评论。通过实验验证,该方法可以发现大多数垃圾评论,实验取得了较好的结果,使Blog信息更加准确、有效的为用户使用。 相似文献
7.
如何发现主题信息源是主题Web信息整合的前提。提出了一种主题信息源发现方法,将主题信息源发现转化为网站主题分类问题,并利用站外链接发现新的信息源。从网站中提取出能反映网站主题的内容特征词和结构特征词,建立描述网站主题的改进的向量空间模型。以该模型为基础,通过类中心向量法与SVM相结合对网站主题进行分类。提出一种能尽量少爬取网页的网络搜索策略,在发现站外链接的同时爬取最能代表网站主题的页面。将该主题信息源发现方法应用于林业商务信息源,通过实验验证了该方法的有效性。 相似文献
8.
9.
Blog(博客)可以称为在线个人日志。作为一种新兴的媒体,Blog目前已经成为一种在Web上表达个人观点和情感的一种非常流行的方式。那么如何从Blog中快速准确地抽取有用的信息(话题发布时间、话题题目、话题内容、评论内容等)就成为了Blog应用中一个非常重要的步骤。提出了一种基于模板化的Blog信息抽取方法,该方法通过分析Blog网站的HTML源代码,然后提取出网站的模板,并根据该模板对Blog网页进行信息抽取。对来自国内10个著名博客网站进行模板的提取,并对这10个网站中的7 374个Blog网页进行了实验,实验结果表明,该方法能根据提取出的模板快速、准确地对Blog网页进行信息抽取。 相似文献
10.
11.
A blog acts as a platform of virtual communication to share comments or views about products, events and social issues. Like other social web activities, blogging actions spread to a large number of people. Users influence others in many ways, such as buying a product, having a particular political or social opinion or initiating new activity. Finding the top influential bloggers is an active research domain as it helps us in various fields, such as online marketing, e-commerce, product search and e-advertisements. There exist various models to find the influential bloggers, but they consider limited features using non-modular approach. This paper proposes a new model, Popularity and Productivity Model (PPM), based on a modular approach to find the top influential bloggers. It consists of popularity and productivity modules which exploit various features. We discuss the role of each proposed and existing features and evaluate the proposed model against the standard baseline models using datasets from the real-world blogs. The analysis using standard performance evaluation measures verifies that both productivity and popularity modules play a vital role to find influential bloggers in blogging community in an effective manner. 相似文献
12.
文章针对传统网络社区挖掘的注重用户之间链接关系而忽略网络中内容信息的不足,引入了主题社区的概念,利用LatentDirichlet allocation主题模型,通过计算博主所发表博文内容的主题相关性计算博主之间的主题相关性,并重新定义博主之间的链接关系,最后进行主题社区划分。实验结果表明,引入主题空间的社区发现聚类性更好。 相似文献
13.
Eric P. S. Baumer Mark Sueyoshi Bill Tomlinson 《Computer Supported Cooperative Work (CSCW)》2011,20(1-2):1-36
A significant amount of research has focused on blogs, bloggers, and blogging. However, relatively little work has examined blog readers, their interactions with bloggers, or their impact on blogging. This paper presents a qualitative study focusing specifically on readers of political blogs to develop a better understanding of readers?? interactions with blogs and bloggers. This is the first such study to examine the same blogging activity from both readers?? and bloggers?? perspectives. Readers?? significance and contributions to blogs are examined through a number of themes, including: community membership and participation; the relationship between political ideology, reading habits, and political participation; and differences and similarities between mainstream media (MSM) and blogs. Based on these analyses, this paper argues that blogging is not only a social activity, but is a collaborative process of co-creation in which both bloggers and readers engage. Implications of this finding contribute to the study and understanding of reader participation, to the design of technologies for bloggers and blog readers, and to the development of theoretical understandings of social media. 相似文献
14.
Filter Blogs vs. Personal Journals: Understanding the Knowledge Production Gap on the Internet 总被引:1,自引:0,他引:1
Blog is currently a popular form of knowledge production on the Internet. Research shows that relatively few bloggers produce "filter blogs," which focus on political knowledge. Most bloggers produce personal journals. This study uses national survey data to determine whether and why people have differential knowledge production in the form of different blogs. As knowledge is a critical indicator of social power, this research also examines whether blogs with different levels of political knowledge demonstrate unequal social power. Results show that bloggers with higher socioeconomic status contribute more filter blogs than lower-status segments. Even among filter bloggers, socioeconomic status, gender, and print-media use are associated with social influence. These findings suggest that a knowledge production gap merits more scholarly attention. 相似文献
15.
Web advertising (online advertising), a form of advertising that uses the World Wide Web to attract customers, has become one of the most commonly-used marketing channels. This paper addresses the concept of Blogger-Centric Contextual Advertising, which refers to the assignment of personal ads to any blog page, chosen in according to bloggers’ interests. As blogs become a platform for expressing personal opinions, they naturally contain various kinds of statements, including facts, comments and statements about personal interests, of both a positive and negative nature. To extend the concept behind the Long Tail theory in contextual advertising, we argue that web bloggers, as the constant visitors of their own blog-sites, could be potential consumers who will respond to ads on their own blogs. Hence, in this paper, we propose using text mining techniques to discover bloggers’ immediate personal interests in order to improve online contextual advertising. The proposed Blogger-Centric Contextual Advertising (BCCA) framework aims to combine contextual advertising matching with text mining in order to select ads that are related to personal interests as revealed in a blog and rank them according to their relevance. We validate our approach experimentally using a set of data that includes both real ads and actual blog pages. The results indicate that our proposed method could effectively identify those ads that are positively-correlated with a blogger’s personal interests. 相似文献
16.
提出了基于传统的PageRank算法的改进模型评估微博社区博主的影响力。微博社区中博主的影响力反映其话语权的大小,是研究微博社区的核心概念之一。通过对平均度、聚类系数和平均路径长度等网络特征指标的统计,验证了微博社区网络具有"小世界"的显著特性。从用户活跃度和博文质量两个角度出发,构建了博主影响力的评价指标,引入了博主传播能力这个因子,利用PageRank算法的思想设计了新的影响力排名(Influence Rank)算法模型来评估博主影响力。通过实验对比发现Influence Rank算法在考虑节点间的关系之外还考虑了节点本身的特性,能够更加准确客观地反映博主的影响力排名。 相似文献
17.
Seok-Ho Yoon Jung-Hwan Shin Sang-Wook Kim Sunju Park Jae Bum Lee 《Information Sciences》2012,184(1):215-229
In the blogosphere, there exist posts relevant to a particular subject and blogs that show interest in the subject. In this paper, we define a set of such posts and blogs as a blog community and propose a method for extracting the blog community associated with a particular subject. The proposed method is based on the idea that the blogs who have performed actions (e.g., read, comment, trackback, scrap) to the posts of a particular subject are the ones with interest in the subject, and that the posts that have received actions from such blogs are the ones that contain the subject. The proposed method starts with a small number of manually-selected seed posts containing the subject. Then, the method selects the blogs that have performed actions to the seed posts over some threshold and the posts that have received actions over some threshold. Repeating these two steps gradually expands the blog community. This paper presents various techniques to improve the accuracy of the proposed method. The experimental results show that the proposed method exhibits a higher level of accuracy than the methods proposed in prior research. This paper also discusses business applications of the extracted community, such as target marketing, market monitoring, improving search results, finding power bloggers, and revitalization of the blogosphere. 相似文献
18.
This study investigated use and management of blogs in academic health sciences libraries. Quantitative and qualitative data were collected with a questionnaire distributed to bloggers and administrators representing 22 libraries self-identified as using blogs to publish library newsletters. Perceived barriers to use of blogs included lack of patron awareness and technical issues. Blogs were viewed as easy to use by librarians and blogs are used to promote licensed resources. Blogs do not replace other interactions with librarians, but in the future, could be used to develop a community, provide tools for learning, and increase visibility of the library. 相似文献
19.
An increasing number of individuals of all ages maintain important interpersonal relationships through blogs. Wide variation exists in how people disclose and manage their privacy on these blogs, particularly concerning the choices made about leaving information permanently visible on blogs or retrieving it sometime after an initial posting. This study applies Communication Privacy Management (CPM) theory to explore the process of privacy rule adaptation for blogging by examining situations that have triggered bloggers to change their privacy rules to enact blog post deletion practices (“blog scrubbing”). Overall, open-ended responses from 356 bloggers were content analyzed. Chi-square analysis revealed differences in the frequency of triggers that changed the blogging post privacy rules and the proactive versus reactive nature of blogging privacy management deletion practices. Bloggers’ critical incidents that activate privacy rule changes demonstrate that impression management triggers, personal safety identity triggers, relational triggers, and legal/disciplinary triggers resulted in greater alteration of individual privacy rules used to protect these bloggers from the privileged online community of individuals granted access to an individual’s blog. Thus, bloggers essentially “scrubbed” their blog site and adapted their typical privacy rules with new ones that better protected them from the online community regarding that particular blogged information. 相似文献
20.
RUOK? Blogging Communication Technologies During Crises 总被引:1,自引:0,他引:1
This article compares communication technologies within and across crises, using evidence from contemporary postings in 68,022 blogs and news feeds and using a semi-automatic method to detect words that increase in usage during a crisis. Three case studies from 2005 are used: the July 7 London attacks, the New Orleans hurricane, and the Pakistan-Kashmir earthquake. The results highlight the information provision importance for bloggers of Web 2.0 resources such as Wikinews, the Wikipedia, and the Flickr picture sharing site, although these still play a minor role in comparison to the mass media. Some personal communication methods were also mentioned significantly, including SMS and cellphones, but the newest technologies of those mentioned were all Web 2.0. The importance of electronic communication for bloggers was found to depend on the nature of the crisis: For example, despite the heavy Pakistan-Kashmir earthquake death toll, there was relatively little interest in related communication issues from English language bloggers and news sources. 相似文献