High performance clustering algorithm for analysis of protein family clusters

Seok Hyeon Han, Gangman Yi

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Techniques for analyzing genome sequences in high performance environments to predict the function and structure of a protein have been developing. The function of a protein is determined by its characteristics and the sequence pattern, and a protein is classified as belonging to a family according to its genealogy and structure. This study determines the protein family of unknown proteins by analyzing the sequence database of the proteins, which is classified using a clustering algorithm. The analysis of the experimental clustering results verified that, by applying the proposed pf_cluster algorithm, the protein family of new proteins can be found using their sequence information.

Original languageEnglish
Pages (from-to)1878-1896
Number of pages19
JournalJournal of Supercomputing
Volume72
Issue number5
DOIs
StatePublished - 1 May 2016

Keywords

  • Clustering algorithm
  • High performance
  • Protein clustering
  • Protein family

Fingerprint

Dive into the research topics of 'High performance clustering algorithm for analysis of protein family clusters'. Together they form a unique fingerprint.

Cite this