案件要素句子关联图卷积的案件舆情摘要方法
作者:
作者单位:

作者简介:

韩鹏宇(1995-),男,硕士,主要研究领域为自然语言处理,文本摘要.
余正涛(1970-),男,博士,教授,博士生导师,CCF高级会员,主要研究领域为自然语言处理,机器翻译,信息检索.
高盛祥(1977-),女,博士,副教授,CCF专业会员,主要研究领域为自然语言处理,机器翻译,信息检索.
黄于欣(1983-),男,博士,CCF学生会员,主要研究领域为自然语言处理,文本摘要,文本生成.
郭军军(1987-),男,博士,讲师,CCF专业会员,主要研究领域为自然语言处理,信息检索,机器翻译.

通讯作者:

余正涛,E-mail:ztyu@hotmail.com

中图分类号:

TP18

基金项目:

国家重点研发计划(2018YFC0830105,2018YFC0830101,2018YFC0830100);国家自然科学基金(61761026,61972186,61762056);云南省自然科学基金(2018FB104)


Case-related Public Opinion Summarization Method Based on Graph Convolution of Sentence Association Graph with Case Elements
Author:
Affiliation:

Fund Project:

National Key Research and Development Project (2018YFC0830105, 2018YFC0830101, 2018YFC0830100); National Natural Science Foundation of China (61761026, 61972186, 61762056); Natural Science Foundation of Yunnan Province (2018FB104)

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    案件舆情摘要是从涉及特定案件的新闻文本簇中,抽取能够概括其主题信息的几个句子作为摘要.案件舆情摘要可以看作特定领域的多文档摘要,与一般的摘要任务相比,可以通过一些贯穿于整个文本簇的案件要素来表征其主题信息.在文本簇中,由于句子与句子之间存在关联关系,案件要素与句子亦存在着不同程度的关联关系,这些关联关系对摘要句的抽取有着重要的作用.提出了基于案件要素句子关联图卷积的案件文本摘要方法,采用图的结构来对多文本簇进行建模,句子作为主节点,词和案件要素作为辅助节点来增强句子之间的关联关系,利用多种特征计算不同节点间的关联关系.然后,使用图卷积神经网络学习句子关联图,并对句子进行分类得到候选摘要句.最后,通过去重和排序得到案件舆情摘要.在收集到的案件舆情摘要数据集上进行实验,结果表明:提出的方法相比基准模型取得了更好的效果,引入要素及句子关联图对案件多文档摘要有很好的效果.

    Abstract:

    The case-related public opinion summarization is the task of extracting a few sentences that can summarize the subject information from some case-related news documents. The case-related public opinion summarization can be regarded as a multi-document summarization in a specific field. Compared with the general multi-document summarization, the topic information can be characterized by some case elements that run through the entire text cluster. In text clusters, sentences and sentences are associated with each other, case elements also have associations of varying degree with sentences. These associations play an important role in extracting abstract sentences. A case-related public opinion summarization method based on graph convolution of sentence association graph with case elements is proposed, which uses graph structure to model all text clusters, with sentences as the main node, words and case elements as auxiliary nodes to enhance the relationship between sentences. Multiple features are used to calculate the relationship between different nodes. Then, graph convolutional neural network is used to learn this sentence association graph, and the sentence is classified to obtain the candidate summary sentence. Finally, the sentence is deduplicated and ranked to obtain the case-related public opinion summarization. Experiments are performed on the case-related public opinion summary dataset. The results show that the method achieves better results than the benchmark model, indicating that both the composition method and the graph convolution learning method are effective.

    参考文献
    相似文献
    引证文献
引用本文

韩鹏宇,余正涛,高盛祥,黄于欣,郭军军.案件要素句子关联图卷积的案件舆情摘要方法.软件学报,2021,32(12):3829-3838

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2020-02-10
  • 最后修改日期:2020-05-26
  • 录用日期:
  • 在线发布日期: 2021-12-02
  • 出版日期: 2021-12-06
您是第位访问者
版权所有:中国科学院软件研究所 京ICP备05046678号-3
地址:北京市海淀区中关村南四街4号,邮政编码:100190
电话:010-62562563 传真:010-62562533 Email:jos@iscas.ac.cn
技术支持:北京勤云科技发展有限公司

京公网安备 11040202500063号