book

Algorithmic and Artificial Intelligence Methods for Protein Bioinformatics

by Yi Pan, Jianxin Wang, Min Li

November 2013

Intermediate to advanced

536 pages

16h 4m

English

Wiley-IEEE Press

Read now

Unlock full access

Cover
Series
Title Page
Copyright
Preface
Contributors
Part I: From Protein Sequence to Structure
Chapter 1: Emphasizing The Role of Proteins in Construction of the Developmental Genetic Toolkit in Plants
1.1 Introduction1.2 Evolutionary Developmental (Evo-Devo) Roles in Embryogenesis of Plants (in Developmental Plant Genetic Toolkit Formation)1.3 Phases in Embryogenesis in Arabidopsis Thaliana1.4 Analysis1.5 ConclusionsReferencesBibliography
Chapter 2: Protein Sequence Motif Information Discovery
2.1 Introduction2.2 Granule Computing Approaches2.3 Experimental Setup2.4 Protein Sequence Motif Information Discovered by FGK ModelReferences
Chapter 3: Identifying Calcium Binding Sites in Proteins
3.1 Introduction3.2 Methods3.3 Results and Discussion3.4 ConclusionReferences

Chapter 4: Review of Imbalanced Data Learning for Protein Methylation Prediction
4.1 Introduction4.2 Protein and Methylation4.3 Related Works on Methylation Prediction4.4 ConclusionAcknowledgmentsReferences
Chapter 5: Analysis and Prediction of Protein Posttranslational Modification Sites
5.1 Introduction5.2 Musite: A Machine Learning Approach5.3 Musite Implementation5.4 SummaryAcknowledgmentsReferences
Part II: Protein Analysis and Prediction
Chapter 6: Protein Local Structure Prediction
6.1 Introduction6.2 Structural Cluster Approach6.3 Sequence Cluster Approach6.4 Support Vector Machines for Local Protein Structure Prediction6.5 Clustering Support Vector Machines for Local Protein Structure Prediction6.6 Experimental ResultsReferences
Chapter 7: Protein Structural Boundary Prediction
7.1 Introduction7.2 Background7.3 New Binary Classifiers for Protein Structural Boundary Prediction7.4 ConclusionReferences
Chapter 8: Prediction of RNA Binding Sites in Proteins
8.1 Introduction8.2 Background8.3 Framework of Prediction8.4 Description Features of Protein RNA Binding Sites8.5 Existing Methods8.6 Feature Analysis and Comparison Study8.7 ConclusionAcknowledgmentsReferences
Chapter 9: Algorithmic Frameworks for Protein Disulfide Connectivity Determination
9.1 Introduction9.2 Determining Disulfide Bonds from Sequence Information: Formulations, Features, and Algorithmic Frameworks9.3 Algorithmic Methods for Determining Disulfide Bonds Using Mass Spectrometry9.4 Experimental Results9.5 Conclusions and Future DirectionsAcknowledgmentsReferences
Chapter 10: Protein Contact Order Prediction: Update
10.1 Introduction10.2 Correlated protein properties10.3 Other contact measurements10.4 Contact order calculation10.5 Contact order prediction by homology10.6 Contact order prediction from sequence10.7 The public contact order web server10.8 ConclusionsReferences
Chapter 11: Progress in Prediction of Oxidation States of Cysteines via Computational Approaches
11.1 Introduction11.2 Survey of Previous Efforts to Predict Bonding State of Cysteine Residues on Protein Via Computational Approaches11.3 SummaryReferences
Chapter 12: Computational Methods in CryoElectron Microscopy 3D Structure Reconstruction
12.1 Introduction12.2 Iterative image reconstruction methods12.3 Adaptive simultaneous algebraic reconstruction technique (ASART)12.4 Multilevel parallel strategy for iterative reconstruction algorithm12.5 Experimental results and discussion12.6 SummaryAcknowledgmentsReferences
Part III: Protein Structure Alignment and Assessment
Chapter 13: Fundamentals of Protein Structure Alignment
13.1 Introduction13.2 Biological Motivation of Protein Structure Alignment13.3 Mathematical Frameworks13.4 More Recent Advances with Database QueriesReferences
Chapter 14: Discovering 3D Protein Structures for Optimal Structure Alignment
14.1 Introduction14.2 Protein Structure14.3 Protein Databases14.4 Vector Space Model14.5 Suffix Trees14.6 Indexing 3D Protein Structures14.7 Protein Similarity Algorithm14.8 SummaryReferences
Chapter 15: Algorithmic Methodologies for Discovery of Nonsequential Protein Structure Similarities
15.1 Introduction15.2 Structural Alignment15.3 Global Sequence Order–Independent Structural Alignment15.4 Local Sequence Order–Independent Structural Alignment15.5 ConclusionAcknowledgmentsReferences
Chapter 16: Fractal Related Methods for Predicting Protein Structure Classes and Functions
16.1 Introduction16.2 Methods16.3 Results and conclusionsAcknowledgmentReferences
Chapter 17: Protein Tertiary Model Assessment
17.1 Introduction17.2 Overview of Protein Model Assessment17.3 Design and Method17.4 Implementation Using Svm17.5 Implementation Using IFID317.6 ConclusionReferencesBibliography
Part IV: Protein–Protein Analysis of Biological Networks
Chapter 18: Network Algorithms For Protein Interactions
18.1 Introduction18.2 Optimization approaches to clustering18.3 Hierarchical algorithms18.4 Features of PPI networks18.5 Implementation of hierarchical methods18.6 ConclusionReferences
Chapter 19: Identifying Protein Complexes from Protein–Protein Interaction Networks
19.1 Introduction19.2 Density-Based and Local Search Methods19.3 Hierarchical Clustering Methods19.4 Finding Overlapping Clusters19.5 Identification of Protein Complexes by Integrating Multiple Biological Sources19.6 Identifying Protein Complexes From Dynamic PPI Network19.7 Challenges and Future ResearchReferences
Chapter 20: Protein Functional Module Analysis With Protein–Protein Interaction (PPI) Networks
20.1 Introduction20.2 Properties of PPI Networks20.3 Previous Module Detection Approaches20.4 Weighted Graph Model of Protein Interaction Networks20.5 Theories and Methods20.6 Experimental Results20.7 ConclusionReferences
Chapter 21: Efficient Alignments of Metabolic Networks with Bounded Treewidth
21.1 Introduction21.2 An overview of metabolic network alignment and mining approaches21.3 Generalized Network Alignment Problem21.4 A generalized dynamic programming algorithm21.5 Predicting pathway holes and resolving enzyme ambiguityReferences
Chapter 22: Protein–protein Interaction Network Alignment: Algorithms and Tools
22.1 Introduction22.2 Preliminaries22.3 METHODS (Point 5)22.4 Coarse-Grain Comparison22.5 Concluding RemarksReferences
Part V: Application of Protein Bioinformatics
Chapter 23: Protein-Related Drug Activity Comparison Using Support Vector Machines
23.1 Introduction23.2 Related Studies for Pyrimidines Drug Activity Comparison23.3 Feature Granules and Hierarchical Kernel Design23.4 Experimental Results for Different Machine Learning Models23.5 SummaryReferences
Chapter 24: Finding repetitions in biological networks: challenges, trends, and applications
24.1 Introduction24.2 The Biological Networks Domain24.3 Problem Formulation24.4 Methods24.5 Concluding RemarksReferences
Chapter 25: MeTaDoR: Online Resource and Prediction Server for Membrane Targeting Peripheral Proteins
25.1 Introduction25.2 Resource Content25.3 Summary and ConclusionAcknowledgmentReferences
Chapter 26: Biological networks–based analysis of gene expression signatures*
26.1 Introduction26.2 Gene expression signatures26.3 Biological Network–based identification of gene expression signatures26.4 Biological Network–based integration of gene expression signatures26.5 Discussion and ConclusionReferences
Index
Series

Content preview from Algorithmic and Artificial Intelligence Methods for Protein Bioinformatics

Chapter 2 Protein Sequence Motif Information Discovery

BERNARD CHEN

2.1 Introduction

Proteins can be regarded as one of the most important elements in the process of life; they can be grouped into different families according to their sequential or structural similarities. Many biochemical tests suggest that a sequence determines conformation completely, because all the information that is necessary for specifying protein interaction sites with other molecules is embedded into the protein's amino acid sequence. The close relationship between protein sequence and structure plays an important role in current analysis and prediction technologies. Therefore, understanding the hidden relationships between protein structures and their sequences is an important task in modern bioinformatics research. The biological term sequence motif denotes a relatively small number of functionally or structurally conserved sequence patterns that occur repeatedly in a group of related proteins. These motif patterns may be able to predict the structural or functional area of other proteins, such as enzyme binding sites, DNA or RNA binding sites, prosthetic attachment sites, and protein–protein interaction sites.

PROSITE [1], PRINTS [2], and BLOCKS [3] are three of the most popular motif databases. PROSITE is a method for determining the function of uncharacterized proteins translated from genomic or cyclic DNA (cDNA) sequences. It consists of a database of biologically significant sites and patterns ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Computational Intelligence and Pattern Analysis in Biological Informatics

Publisher Resources

ISBN: 9781118567814Purchase book

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

Algorithmic and Artificial Intelligence Methods for Protein Bioinformatics

by Yi Pan, Jianxin Wang, Min Li

Chapter 2

Protein Sequence Motif Information Discovery

2.1 Introduction

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.