Prediction of the human membrane proteome

L Fagerberg, K Jonasson, G von Heijne, M Uhlén… - …, 2010 - Wiley Online Library
L Fagerberg, K Jonasson, G von Heijne, M Uhlén, L Berglund
Proteomics, 2010Wiley Online Library
Membrane proteins are key molecules in the cell, and are important targets for
pharmaceutical drugs. Few three‐dimensional structures of membrane proteins have been
obtained, which makes computational prediction of membrane proteins crucial for studies of
these key molecules. Here, seven membrane protein topology prediction methods based on
different underlying algorithms, such as hidden Markov models, neural networks and
support vector machines, have been used for analysis of the protein sequences from the 21 …
Abstract
Membrane proteins are key molecules in the cell, and are important targets for pharmaceutical drugs. Few three‐dimensional structures of membrane proteins have been obtained, which makes computational prediction of membrane proteins crucial for studies of these key molecules. Here, seven membrane protein topology prediction methods based on different underlying algorithms, such as hidden Markov models, neural networks and support vector machines, have been used for analysis of the protein sequences from the 21 416 annotated genes in the human genome. The number of genes coding for a protein with predicted α‐helical transmembrane region(s) ranged from 5508 to 7651, depending on the method used. Based on a majority decision method, we estimate 5539 human genes to code for membrane proteins, corresponding to approximately 26% of the human protein‐coding genes. The largest fraction of these proteins has only one predicted transmembrane region, but there are also many proteins with seven predicted transmembrane regions, including the G‐protein coupled receptors. A visualization tool displaying the topologies suggested by the eight prediction methods, for all predicted membrane proteins, is available on the public Human Protein Atlas portal (www.proteinatlas.org).
Wiley Online Library