Indels Detection Algorithm Based on Optimal Split-Read Matching
Author:
Affiliation:

Clc Number:

Fund Project:

National Natural Science Foundation of China (61402132, 61571163, 61532014)

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    The development of next-generation high-throughput DNA sequencing techniques has greatly promoted the research of structural variations (SVs) detection.Current genetic structure variation detection methods are mainly base on depth of coverage, pair-end mapping clusters, or sequence assembly, some of them are known to be not accurate or too sensitive.What's more, some methods are not able to recognize the specific position and sequence of structural variation.Insertions and deletions (indels) are the most common forms of genome structure variations.This paper puts forward an optimal split-read matching algorithm (OSRM) using dynamic programming.OSRM breaks an abnormal read into several reads in a least quantity.First, a score matrix of the abnormal read and the corresponding referenced sequence is created.Then a matrix of backtracking path is established.Next, a formula designed according to the characteristics of structural variation is used to elect the optimal backtracking path matrix.And finally the split-read and referenced sequence are matched in an optimal arrangement by which the accurate position and sequence of found indels are outputted.Experiments prove that the performance of algorithm is excellent.In addition, compared with Pindel which is the best in split-read methods, OSRM can offset its defection in detecting small and medium indels while also be able to detect more complex situation.

    Reference
    Related
    Cited by
Get Citation

王春宇,潘俊,郭茂祖,刘晓燕,刘扬,刘国军.基于读分割最优匹配的indels识别算法.软件学报,2017,28(10):2640-2653

Copy
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:June 13,2016
  • Revised:September 02,2016
  • Adopted:
  • Online: October 19,2016
  • Published:
You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address:4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code:100190
Phone:010-62562563 Fax:010-62562533 Email:jos@iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063