Predicting the results of molecular specific hybridization using boosted tree algorithm

Weijun Zhu, Yingjie Han, Huanmei Wu, Yang Liu, Xiaofei Nan, Qinglei Zhou

Research output: Contribution to journalArticle

3 Scopus citations

Abstract

In the field of bioinformatics and DNA computing, simulated hybridization experiments can replace real molecular hybridization experiments to some extent, avoiding some disadvantages of the actual experimental design. However, the core techniques, which are employed by the popular DNA simulation software, are limited to the exponential computational complexity of the combinatorial problems. As a result, it is impossible to decide whether a specific hybridization among complex DNA molecules is effective or not within acceptable time. To address this common problem, we hereby introduce a new method based on the machine learning technique. First, a sample set is employed to train the boosted tree algorithm, which resulted in a corresponding machine learning model. Second, this model is applied to predict the classification results of molecular hybridization for a given group of DNA molecular coding. The experiment results showed that the new method had an average accuracy level of 94.2% and an average efficiency level 90 839 times higher than that of the existing representative approaches. Especially for the case study in this paper, the efficiency of the new method is 235 000, 250 000, and 990 000 times higher than that of the three existing methods, respectively. These experimental results indicate that our new approach can quickly and accurately determine the biological effectiveness of molecular hybridization for a given DNA design.

Original languageEnglish (US)
Article numbere4982
JournalConcurrency Computation
Volume32
Issue number1
DOIs
StatePublished - Jan 10 2020

Keywords

  • DNA design
  • biological effectiveness
  • boosted tree algorithm
  • specific hybridization

ASJC Scopus subject areas

  • Software
  • Theoretical Computer Science
  • Computer Science Applications
  • Computer Networks and Communications
  • Computational Theory and Mathematics

Fingerprint Dive into the research topics of 'Predicting the results of molecular specific hybridization using boosted tree algorithm'. Together they form a unique fingerprint.

  • Cite this