Abstract
Discontinuity in long DNA sequences creates harmful diseases like Tuberculosis (TB). Given the 21th centurys exponential growth of big-data environments, knowing the precise breaks position of DNA sequences is essential for many reasons including advanced medical intervention. This study designs an automated framework to assess the breaks positions in long DNA sequences which are responsible for TB and then empirically tests it by analyzing a big DNA dataset from the National Center for Biotechnology Information (NCBI) database. The method consists of a range of data cleansing and deep neural network tools for big data situation. Findings reveal that the proposed approach is better than other methods in detecting DNA sequence breaks for TB via resolving a sample size issue of the training dataset and recursively divide the whole dataset into certain length to detect the breaks. It also provides a faster predictive analysis with more accurate and reliable outcomes.
Original language | English |
---|---|
Title of host publication | Proceedings of the Twelfth International Conference on Management Science and Engineering Management |
Editors | J. Paulo Davim |
Place of Publication | Cham, Switzerland |
Publisher | Springer |
Chapter | 23 |
Pages | 277-286 |
Number of pages | 10 |
ISBN (Electronic) | 9783319933511 |
ISBN (Print) | 9783319933504 |
DOIs | |
Publication status | Published - 2019 |
Event | 12th International Conference on Management Science and Engineering Management: ICMSEM 2018 - http://www.icmsem.org/index.html#past, Melbourne, Australia Duration: 01 Aug 2018 → 04 Aug 2018 http://www.icmsem.org/index.html#past |
Conference
Conference | 12th International Conference on Management Science and Engineering Management |
---|---|
Country/Territory | Australia |
City | Melbourne |
Period | 01/08/18 → 04/08/18 |
Internet address |