An efficient algorithm for matching protein binding sites for protein function prediction

Leif Ellingson; Jinfeng Zhang

doi:10.1145/2147805.2147837

Mathematics and Statistics

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

4 Scopus citations

Abstract

Comparing the binding sites of proteins is effective for predicting protein functions based on their structure information. However, it is still very challenging to predict the binding ligands from the atomic structures of protein binding sites. In this study, we designed a new algorithm based on the iterative closest point (ICP) algorithm. Our algorithm aims to find the maximum number of atoms that can be superposed between two protein binding sites, where any pair of matched superposed atoms has a distance smaller than a given threshold. The search starts from similar tetrahedra between two binding sites obtained from 3D Delaunay triangulation and uses the Hungarian algorithm to find additional matched atoms. We show that our method finds more matched atoms than a leading method. For benchmark data, we use the Tanimoto Index as a similarity measure and the nearest neighbor classifier to achieve a classification performance comparable to the best methods in the literature among those that provide both the common atom set and atom correspondences.
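The classification step mentioned in the abstract (Tanimoto Index as the similarity measure, then a nearest neighbor classifier) can be illustrated with a minimal sketch. It assumes the common set-based form of the Tanimoto index, with the number of matched atoms standing in for the size of the intersection; the paper's exact definition may differ. The function names, ligand labels, and atom counts below are hypothetical and are not taken from the paper, and the matching step itself (ICP, Delaunay tetrahedra, Hungarian algorithm) is not implemented here.

```python
def tanimoto(n_matched, n_a, n_b):
    """Set-style Tanimoto index: matched / (size_a + size_b - matched).

    Assumption: n_matched (atoms matched between the two sites) is treated
    as the size of the intersection of the two atom sets.
    """
    union = n_a + n_b - n_matched
    return n_matched / union if union > 0 else 0.0


def nearest_neighbor_label(query_size, references):
    """Return (label, score) of the most similar reference site.

    references: list of (label, site_size, n_matched_with_query) tuples,
    where n_matched_with_query would come from a site-matching algorithm.
    """
    best_label, best_score = None, -1.0
    for label, size, n_matched in references:
        score = tanimoto(n_matched, query_size, size)
        if score > best_score:
            best_label, best_score = label, score
    return best_label, best_score


# Hypothetical example: a 40-atom query site compared with three labeled sites.
references = [("ATP", 45, 30), ("NAD", 50, 20), ("HEM", 38, 12)]
label, score = nearest_neighbor_label(40, references)
print(label, round(score, 3))
```

Under these made-up counts, the query is assigned the label of the reference site with the highest Tanimoto score, which is how a 1-nearest-neighbor classifier behaves when similarity is used in place of distance.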

Original language	English
Title of host publication	2011 ACM Conference on Bioinformatics, Computational Biology and Biomedicine, BCB 2011
Pages	289-293
Number of pages	5
DOIs	https://doi.org/10.1145/2147805.2147837
State	Published - 2011
Event	2011 ACM Conference on Bioinformatics, Computational Biology and Biomedicine, ACM-BCB 2011 - Chicago, IL, United States
Duration	Aug 1 2011 → Aug 3 2011

Publication series

Name	2011 ACM Conference on Bioinformatics, Computational Biology and Biomedicine, BCB 2011

Conference

Conference	2011 ACM Conference on Bioinformatics, Computational Biology and Biomedicine, ACM-BCB 2011
Country/Territory	United States
City	Chicago, IL
Period	08/1/11 → 08/3/11

Keywords

Functional genomics
Protein binding site matching
Protein function prediction
Protein surface matching
Structure genomics

Access to Document

10.1145/2147805.2147837

Cite this

@inproceedings{c965f2aa65014edba3b06ff48c6fdcab,
  title = "An efficient algorithm for matching protein binding sites for protein function prediction",
  abstract = "Comparing the binding sites of proteins is effective for predicting protein functions based on their structure information. However, it is still very challenging to predict the binding ligands from the atomic structures of protein binding sites. In this study, we designed a new algorithm based on the iterative closest point (ICP) algorithm. Our algorithm aims to find the maximum number of atoms that can be superposed between two protein binding sites, where any pair of matched superposed atoms has a distance smaller than a given threshold. The search starts from similar tetrahedra between two binding sites obtained from 3D Delaunay triangulation and uses the Hungarian algorithm to find additional matched atoms. We show that our method finds more matched atoms than a leading method. For benchmark data, we use the Tanimoto Index as a similarity measure and the nearest neighbor classifier to achieve a classification performance comparable to the best methods in the literature among those that provide both the common atom set and atom correspondences.",
  keywords = "Functional genomics, Protein binding site matching, Protein function prediction, Protein surface matching, Structure genomics",
  author = "Leif Ellingson and Jinfeng Zhang",
  year = "2011",
  doi = "10.1145/2147805.2147837",
  language = "English",
  isbn = "9781450307963",
  series = "2011 ACM Conference on Bioinformatics, Computational Biology and Biomedicine, BCB 2011",
  pages = "289--293",
  booktitle = "2011 ACM Conference on Bioinformatics, Computational Biology and Biomedicine, BCB 2011",
  note = "2011 ACM Conference on Bioinformatics, Computational Biology and Biomedicine, ACM-BCB 2011 ; Conference date: 01-08-2011 Through 03-08-2011",
}

Ellingson, L & Zhang, J 2011, An efficient algorithm for matching protein binding sites for protein function prediction. in 2011 ACM Conference on Bioinformatics, Computational Biology and Biomedicine, BCB 2011. 2011 ACM Conference on Bioinformatics, Computational Biology and Biomedicine, BCB 2011, pp. 289-293, 2011 ACM Conference on Bioinformatics, Computational Biology and Biomedicine, ACM-BCB 2011, Chicago, IL, United States, 08/1/11. https://doi.org/10.1145/2147805.2147837

An efficient algorithm for matching protein binding sites for protein function prediction. / Ellingson, Leif; Zhang, Jinfeng.
2011 ACM Conference on Bioinformatics, Computational Biology and Biomedicine, BCB 2011. 2011. p. 289-293 (2011 ACM Conference on Bioinformatics, Computational Biology and Biomedicine, BCB 2011).

TY - GEN
T1 - An efficient algorithm for matching protein binding sites for protein function prediction
AU - Ellingson, Leif
AU - Zhang, Jinfeng
PY - 2011
Y1 - 2011
N2 - Comparing the binding sites of proteins is effective for predicting protein functions based on their structure information. However, it is still very challenging to predict the binding ligands from the atomic structures of protein binding sites. In this study, we designed a new algorithm based on the iterative closest point (ICP) algorithm. Our algorithm aims to find the maximum number of atoms that can be superposed between two protein binding sites, where any pair of matched superposed atoms has a distance smaller than a given threshold. The search starts from similar tetrahedra between two binding sites obtained from 3D Delaunay triangulation and uses the Hungarian algorithm to find additional matched atoms. We show that our method finds more matched atoms than a leading method. For benchmark data, we use the Tanimoto Index as a similarity measure and the nearest neighbor classifier to achieve a classification performance comparable to the best methods in the literature among those that provide both the common atom set and atom correspondences.
AB - Comparing the binding sites of proteins is effective for predicting protein functions based on their structure information. However, it is still very challenging to predict the binding ligands from the atomic structures of protein binding sites. In this study, we designed a new algorithm based on the iterative closest point (ICP) algorithm. Our algorithm aims to find the maximum number of atoms that can be superposed between two protein binding sites, where any pair of matched superposed atoms has a distance smaller than a given threshold. The search starts from similar tetrahedra between two binding sites obtained from 3D Delaunay triangulation and uses the Hungarian algorithm to find additional matched atoms. We show that our method finds more matched atoms than a leading method. For benchmark data, we use the Tanimoto Index as a similarity measure and the nearest neighbor classifier to achieve a classification performance comparable to the best methods in the literature among those that provide both the common atom set and atom correspondences.
KW - Functional genomics
KW - Protein binding site matching
KW - Protein function prediction
KW - Protein surface matching
KW - Structure genomics
UR - http://www.scopus.com/inward/record.url?scp=84858962229&partnerID=8YFLogxK
U2 - 10.1145/2147805.2147837
DO - 10.1145/2147805.2147837
M3 - Conference contribution
AN - SCOPUS:84858962229
SN - 9781450307963
T3 - 2011 ACM Conference on Bioinformatics, Computational Biology and Biomedicine, BCB 2011
SP - 289
EP - 293
BT - 2011 ACM Conference on Bioinformatics, Computational Biology and Biomedicine, BCB 2011
T2 - 2011 ACM Conference on Bioinformatics, Computational Biology and Biomedicine, ACM-BCB 2011
Y2 - 1 August 2011 through 3 August 2011
ER -
