融入案件辅助句的低频和易混淆罪名预测
作者:
作者单位:

作者简介:

郭军军(1987-),男,博士,讲师,CCF专业会员,主要研究领域为自然语言处理,信息检索,机器翻译.
刘真丞(1997-),男,学士,主要研究领域为自然语言处理,事件抽取.
余正涛(1970-),男,博士,教授,博士生导师,CCF高级会员,主要研究领域为自然语言处理,机器翻译,信息检索.
黄于欣(1983-),男,博士,CCF专业会员,主要研究领域为自然语言处理,文本摘要,文本生成.
相艳(1979-),女,讲师,主要研究领域为自然语言处理,情感分析,信息抽取.

通讯作者:

余正涛,E-mail:ztyu@hotmail.com

中图分类号:

TP18

基金项目:

国家重点研发计划(2018YFC0830105,2018YFC0830101,2018YFC0830100);国家自然科学基金(61972186,61762056,61472168,61866020);云南省科技厅省级人培项目(KKSY201703015);云南省基础研究专项面上项目(2019FB082,202001AT070047)


Few Shot and Confusing Charges Prediction with the Auxiliary Sentences of Case
Author:
Affiliation:

Fund Project:

National Key Research and Development Program of China (2018YFC0830105, 2018YFC0830101, 2018YFC0830100); National Natural Science Foundation of China (61972186, 61762056, 61472168, 61866020); Provincial Personnel Training Project of Yunnan Science and Technology Department (KKSY201703015); Natural Science Foundation Project of Yunnan Science and Technology Department (2019FB082, 202001AT070047)

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    由于低频罪名数据量较少和易混淆罪名案情描述相似等原因,导致低频和易混淆罪名预测效果不佳.为了解决此类问题,通过构建案件辅助句,提出一种基于双向互注意力机制的案件辅助句融合方法,实现罪名预测.主要包括以下3部分:首先,基于司法领域知识构建案件辅助句,将案件辅助句作为案情描述和罪名之间的映射知识;然后,基于词级和字符级表征分别提取案情描述与案件辅助句多粒度特征;同时,借助案件辅助句与案情描述双向注意机制,获得具有辅助句倾向性的案情描述表征,并最终实现低频和易混淆罪名的预测.基于中国刑事案件公共数据集的实验结果表明:所提方法在F1值最大提升13.2%,准确率最大提升4.5%,低频罪名预测F1值提升4.3%,易混淆罪名预测F1值提升8.2%,所提算法显著地提升了低频和易混淆罪名的预测性能.

    Abstract:

    Due to the insufficiency of few shot charges and the similarity of case descriptions for the confusing charges, the prediction performance of the existing methods for few shot charges and confusing charges is not promising. To address the forementioned drawbacks, a novel few shot and confusing charges prediction method is proposed, which is based on bi-direction mutual attention mechanism with the auxiliary sentences of case. For the proposed model, firstly, the auxiliary sentence of case via the judicial field is constructed, where the auxiliary sentence of case is considered as external knowledge for mapping the description of the case to the corresponding charge. Secondly, the multi-granularity characteristics of case description and the auxiliary sentence of case are extracted at the level of both word and character, respectively. At the same time, the auxiliary sentence of case and case description are used to build bi-direction mutual attention. Finally, the tendency representation of the case description with the guidance of the auxiliary sentence of case are derived, which improve the prediction accuracy of few shot and confusing charges. The experimental results conducted on the benchmark data of criminal cases show that the proposed model increases the F1 value and prediction accuracy by 13.2% and 4.5%, respectively, and increases the F1 values for the few shot charges and confusing charges by 4.3% and 8.2%, respectively, which significantly enhance the prediction performance for few shot and confusing charges.

    参考文献
    相似文献
    引证文献
引用本文

郭军军,刘真丞,余正涛,黄于欣,相艳.融入案件辅助句的低频和易混淆罪名预测.软件学报,2021,32(10):3139-3150

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2019-12-06
  • 最后修改日期:2020-02-09
  • 录用日期:
  • 在线发布日期: 2021-10-09
  • 出版日期: 2021-10-06
您是第位访问者
版权所有:中国科学院软件研究所 京ICP备05046678号-3
地址:北京市海淀区中关村南四街4号,邮政编码:100190
电话:010-62562563 传真:010-62562533 Email:jos@iscas.ac.cn
技术支持:北京勤云科技发展有限公司

京公网安备 11040202500063号