A robust method for finding the automated best matched genes based on grouping similar fragments of large-scale references for genome assembly

Jaehee Jung, Jong Im Kim, Young Sik Jeong, Gangman Yi

Research output: Contribution to journalArticlepeer-review

8 Scopus citations

Abstract

Big data research on genomic sequence analysis has accelerated considerably with the development of next-generation sequencing. Currently, research on genomic sequencing has been conducted using various methods, ranging from the assembly of reads consisting of fragments to the annotation of genetic information using a database that contains known genome information. According to the development, most tools to analyze the new organelles' genetic information requires different input formats such as FASTA, GeneBank (GB) and tab separated files. The various data formats should be modified to satisfy the requirements of the gene annotation system after genome assembly. In addition, the currently available tools for the analysis of organelles are usually developed only for specific organisms, thus the need for gene prediction tools, which are useful for any organism, has been increased. The proposed method-termed the genome_search_plotter-is designed for the easy analysis of genome information from the related references without any file format modification. Anyone who is interested in intracellular organelles such as the nucleus, chloroplast, and mitochondria can analyze the genetic information using the assembled contig of an unknown genome and a reference model without any modification of the data from the assembled contig.

Original languageEnglish
Article number192
JournalSymmetry
Volume9
Issue number9
DOIs
StatePublished - 1 Sep 2017

Keywords

  • Contig
  • Gene
  • Gene annotation
  • Gene search
  • NGS
  • Organelle genome

Fingerprint

Dive into the research topics of 'A robust method for finding the automated best matched genes based on grouping similar fragments of large-scale references for genome assembly'. Together they form a unique fingerprint.

Cite this