
Báo cáo khoa học: WISDOM: A Web Information Credibility Analysis System
Thông tin tài liệu:
Nội dung trích xuất từ tài liệu:
Báo cáo khoa học: "WISDOM: A Web Information Credibility Analysis System" WISDOM: A Web Information Credibility Analysis System Susumu Akamine† Daisuke Kawahara† Yoshikiyo Kato† Tetsuji Nakagawa† Kentaro Inui† Sadao Kurohashi†‡ Yutaka Kidawara† † National Institute of Information and Communications Technology ‡ Graduate School of Informatics, Kyoto University {akamine, dk, ykato, tnaka, inui, kidawara}@nict.go.jp, kuro@i.kyoto-u.ac.jp Abstract distribution for a given topic. For this purpose, syntactic and discourse structures must be ana- We demonstrate an information credibility lyzed, their types and relations must be extracted, analysis system called WISDOM. The purpose and synonymous and ambiguous expressions of WISDOM is to evaluate the credibility of in- should be handled properly. formation available on the Web from multiple Furthermore, it is important to determine the viewpoints. WISDOM considers the following identity of the information sender and his/her to be the source of information credibility: in- specialty as criteria for credibility, which require formation contents, information senders, and named entity recognition and total analysis of information appearances. We aim at analyzing documents. and organizing these measures on the basis of In this paper, we describe an information cre- semantics-oriented natural language processing dibility analysis system called WISDOM, which (NLP) techniques. automatically analyzes and organizes the above aspects on the basis of semantically oriented1. Introduction NLP techniques. WISDOM currently operates As computers and computer networks become over 100 million Japanese Web pages.increasingly sophisticated, a vast amount of in- 2. Overview of WISDOMformation and knowledge has been accumulated We consider the following three criteria for theand circulated on the Web. They provide people judgment of information credibility.with options regarding their daily lives and arestarting to have a strong influence on govern- (1) Credibility of information contents,mental policies and business management. How- (2) Credibility of the information sender, andever, a crucial problem is that the information (3) Credibility estimated from the documentavailable on the Web is not necessarily credible. style and superficial characteristics.It is actually very difficult for human beings to In order to help people judge the credibility ofjudge the credibility of the information and even information from these viewpoints, we have beenmore difficult for computers. However, comput- developing an information analysis system calleders can be used to develop a system that collects, WISDOM. Figure 1 shows the analysis result oforganizes, and relativises information and helps WISDOM on the analysis topic “Is bio-ethanolhuman beings view information from several good for the environment?” Figure 2 shows theviewpoints and judge the credibility of the in- system architecture of WISDOM.formation. Given an analysis topic (query), WISDOM Information organization is a promising en- sends the query to the search engine TSUBAKIdeavor in the area of next-generation Web search. (Shinzato et al., 2008), and TSUBAKI returns aThe search engine Clusty provides a search result list of the top N relevant Web pages (N is usuallyclustering 1 , and Cuil classifies a search result on set to 1000).the basis of query-related terms2. The persuasive Then, those pages are automatically analyzed,technology research project at Stanford Universi- and major and contradictory expressions and eva-ty discussed how websites can be designed to luative expressions are extracted. Furthermore,influence people’s perceptions (B. J. Fogg, 2003). the information senders of the Web pages, whichHowever, as per our knowledge, no research has were analyzed beforehand, are collected and thebeen carried out for supporting the human judg- distribution is calculated.ment on information credibility and information The WISDOM analysis results can be viewedorganization systems for this purpose. from several viewpoints by changing the tabs In order to support the judgment of informa- using a Web browser. The leftmost tab, “Sum-tion credibility, it is necessary to extract the mary,” shows the summary of the analysis, withbackground, facts, and various opinions and their major phrases and major/contradictory state- ments first.1 http://clusty.com/, http://clusty.jp/ 1 Proceedings of the ACL-IJCNLP 2009 Software Demons ...
Tìm kiếm theo từ khóa liên quan:
Web Information Credibility Analysis System Susumu Akamine Long Papers báo cáo khoa học báo cáo ngôn ngữ xử lý ngôn ngữ tự nhiênTài liệu có liên quan:
-
63 trang 353 0 0
-
12 trang 337 0 0
-
Phương pháp tạo ra văn bản tiếng Việt có đề tài xác định
7 trang 284 0 0 -
13 trang 271 0 0
-
Báo cáo khoa học Bước đầu tìm hiểu văn hóa ẩm thực Trà Vinh
61 trang 256 0 0 -
Tóm tắt luận án tiến sỹ Một số vấn đề tối ưu hóa và nâng cao hiệu quả trong xử lý thông tin hình ảnh
28 trang 233 0 0 -
NGHIÊN CỨU CHỌN TẠO CÁC GIỐNG LÚA CHẤT LƯỢNG CAO CHO VÙNG ĐỒNG BẰNG SÔNG CỬU LONG
9 trang 230 0 0 -
Giáo trình Lập trình logic trong prolog: Phần 1
114 trang 224 0 0 -
Đề tài nghiên cứu khoa học và công nghệ cấp trường: Hệ thống giám sát báo trộm cho xe máy
63 trang 217 0 0 -
Đề tài nghiên cứu khoa học: Tội ác và hình phạt của Dostoevsky qua góc nhìn tâm lý học tội phạm
70 trang 198 0 0 -
22 trang 196 0 0
-
Xây dựng ontology trợ giúp ra quyết định về đào tạo cho các trường Đại học ở Việt Nam
10 trang 180 0 0 -
98 trang 180 0 0
-
96 trang 175 0 0
-
SỨC MẠNH CHÍNH TRỊ CỦA LIÊN MINH CHÂU ÂU TRÊN TRƯỜNG QUỐC TẾ
4 trang 174 0 0 -
26 trang 174 0 0
-
7 trang 174 0 0
-
48 trang 172 0 0
-
209 trang 168 0 0
-
Tích hợp DSM và ảnh chụp UAV với mô hình nơ-ron tích chập trong phân loại lớp phủ mặt đất
8 trang 167 0 0