# A new decoding algorithm for hidden Markov models improves the prediction of the topology of all-beta membrane proteins

^{1}Department of Biology, University of Bologna via Irnerio 42, 40126 Bologna, Italy

^{}Corresponding author.

#### Supplement

#### Conference

This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

## Abstract

### Background

Structure prediction of membrane proteins is still a challenging computational problem. Hidden Markov models (HMM) have been successfully applied to the problem of predicting membrane protein topology. In a predictive task, the HMM is endowed with a *decoding algorithm *in order to assign the most probable state path, and in turn the labels, to an unknown sequence. The Viterbi and the posterior decoding algorithms are the most common. The former is very efficient when one path dominates, while the latter, even though does not guarantee to preserve the HMM grammar, is more effective when several concurring paths have similar probabilities. A third good alternative is 1-best, which was shown to perform equal or better than Viterbi.

### Results

In this paper we introduce the posterior-Viterbi (PV) a new decoding which combines the posterior and Viterbi algorithms. PV is a two step process: first the posterior probability of each state is computed and then the best posterior allowed path through the model is evaluated by a Viterbi algorithm.

### Conclusion

We show that PV decoding performs better than other algorithms when tested on the problem of the prediction of the topology of beta-barrel membrane proteins.

## Background

All-beta membrane proteins constitute a well structurally conserved class of proteins, that span the outer membrane of Gram-negative bacteria with a barrel-like structure. In all cases known so far with atomic resolution, the barrel consists of an even number of anti-parallel beta strands, whose number ranges from 8 to 22 strands, depending on the protein and/or its functional role [1,2]. In eukaryotes, it is known that similar architectures are present in the outer membrane of chloroplasts and mitochondria, although so far none of the so-called "porins", mainly acting as Voltage Dependent Anion Channels (VDAC), have been solved with atomic resolution ([3] and references therein). It is therefore urgent to devise methods for the prediction of the topology of this class of membrane proteins. Indeed the correct prediction of the protein topology, given the conservation of the barrel architecture may greatly help in threading procedures, especially when sequence homology is low. Furthermore reliable methods, endowed with a low rate of false positives, can also help in genome annotation on the basis of protein structure prediction [3,4]. The problem of predicting beta barrel membrane proteins has been recently addressed with machine learning approaches, and among them Hidden Markov Models (HMMs) have been shown to outperform previously existing methods [5]. HMMs were developed for alignments [6,7], pattern detection [8,9] and also for predictions, as in the case of the topology of all-alpha and all-beta membrane proteins [10-17]. When HMMs are implemented for predicting a given feature, a *decoding *algorithm is needed. With decoding we refer to the assignment of a path through the HMM states (which is the best under *a suitable measure*) given an observed sequence *O*. In this way, we can also assign a label to each sequence element [18,19]. More generally, as stated in [20], the decoding is the prediction of the labelsof an unknownpath. The most famous decoding procedure is the Viterbi algorithm, which finds the most probable allowed path through the HMM model. Viterbi decoding is particularly effective when there is a *single best path *among others much less probable. When several paths have similar probabilities, the posterior decoding or the 1-best algorithms are more convenient [20]. The posterior decoding assigns the state path on the basis of the posterior probability, although the selected path might be not allowed.

In this paper we address the problem of preserving the automaton grammar and concomitantly exploiting the posterior probabilities, without the need of the post-processing algorithm [12,21]. Prompted by this, we design a new decoding algorithm, the posterior-Viterbidecoding (PV), which preserves the automaton grammars and at thesame time exploits the posterior probabilities. A related idea, that is specific for pairwise alignments was previously introduced to improve the sequence alignment accuracy [22]. We show that PV performs better than the other algorithms when we test it on the problem of predicting the topology of beta-barrel membrane proteins.

## Results and Discussion

### Testing the decoding algorithms on all-beta membrane proteins

In order to test our decoding algorithm on real biological data, we used a previously developed HMM, devised for the prediction of the topology of beta-barrel membrane proteins [12]. The hidden Markov model is a sequence-profile-based HMM and takes advantage of emitting vectors instead of symbols, as described in [12].

Since the previously designed and trained HMM [12] emits profile vectors, sequence profiles have been computed from the alignments as derived with PSI-BLAST [23] on the non-redundant database of protein sequences ftp://ftp.ncbi.nlm.nih.gov/blast/db/.

The results obtained using the four different decoding algorithms are shown in Table Table1,1, where the performance is tested with a leave-one-out cross validation procedure for the first 15 proteins and as blind-test for the latter 5 (see Methods). It is evident that for the problem at hand the Viterbi decoding and the 1-best are unreliable, since only one of the proteins is correctly assigned. In this case the posterior decoding is more efficient and can correctly assign 60% and 40% of the proteins, in cross-validation and on the blind set, respectively. Here the posterior decoding is used without MaxSubSeq, introduced before to recast the grammar [12].

From Table Table11 it evident that the new PV decoding is the best performing decoding achieving 80% and 60% accuracy in cross-validation and on the blind set, respectively. This is done ensuring that predictions are consistent with the designed automaton grammar.

### Comparison with other available HMMs

In Table Table22 we show the results of the comparison between our HMM-decoding with those obtained from the available web servers, based on similar approaches [16,17,21]. The **pred-tmbb **server [16] allows the user to test three different algorithms (namely Viterbi, 1-best and posterior). Differently from us they find that their HMM does not show significant differences among the three decoding algorithms. This dissimilar behaviour may be due to several concurring facts: *i*) the different HMM models, *ii*) **pred-tmbb **runs on a single-sequence input, *iii*) **pred-tmbb **is trained using the *Conditional Maximum Likelihood *[24]. The second server **PROFtmb **[17] is based on a method that exploits multiple sequence information and posterior probabilities. Their decoding is related to the posterior-Viterbi; however, in their algorithm the authors first obtained the posterior sum contracted into two possible labeling (inner/outer loops and transmembrane as we did in [12]), then they made use of the explicit value of the HMM transition probabilities (*a*_{i,j}). In this way they count the transition probabilities twice (implicitly in the posterior-probability and directly into their algorithm) and the **PROFtmb **performance is not very different from ours.

Finally, the third server **HMMB2TMR **[21] achieves a performance quite similar to that obtained with PV decoding. To do that **HMMB2TMR **takes advantage of the MaxSubSeq algorithm on top of the posterior sum decoding. However, although MaxSubSeq is a very general two-class segment optimization algorithm, it is a post processing procedure that has to be applied after a HMM decoding. On the contrary, PV is a general decoding algorithm and it is more useful when the underlying predictor is a HMM, where more than two labels and different constraints can be introduced into the automaton grammars.

## Conclusion

The new PV decoding algorithm is more convenient in that it overcomes the difficulties of introducing a problem-dependent optimization algorithm when the automaton grammar is to be re-cast. When one-state-path dominates we may expect that PV does not perform better than the other decoding algorithms, and in these cases the 1-best is preferred [20]. Nevertheless, we show that when several concurring paths are present, as in the case of our beta-barrel HMM, PV performs better than the others. Although PV takes more time than other algorithms (the posterior + the Viterbi time), the PV asymptotic computational time-complexity still remains *O*(*N*^{2}·*L*) (where *L *and *N *are the protein length and the number of states, respectively) as for the other decodings. As far as the memory requirement is concerned, PV needs the same space-complexity of the Viterbi and posterior (*O*(*N*·*L*)), while 1-*best *in the average case requires less memory, and can also be reduced [20]. When computational speed is an issue, Viterbi algorithm is the fastest and the time complexity order is *time*(*viterbi*) ≤ *time*(l - *best*) ≤ *time*(*PV*). Finally, PV satisfies any HMM grammar structures, including automata containing silent states, and it is applicable to all the possible HMM models with an arbitrary number of labels and without having to work out a problem-dependent optimization algorithm.

## Methods

### The hidden Markov model definitions

For sake of clarity and compactness, in what follows we make use of explicit *BEGIN *(*B*) and *END *states and we do not treat the case of the *silent (null) *states. However, their inclusion in the algorithms is only a technical matter and can be done following the prescriptions indicated in [18,19].

An observed sequence of length *L *is indicated as *O *(= *O*_{1}...*O*_{L}) both for a single-symbol-sequence (as in the standard HMMs) or for a vector-sequence as described before [12]. *λ*(*s*) indicates the label associated to the state *s*, while Λ (= Λ_{i},... Λ_{L}) is the list of the labels associated to each sequence position *i *obtained after the application of a decoding algorithm. Depending on the problem at hand, the labels may identify transmembrane regions, loops, secondary structures of proteins, coding/non coding regions, intergenic regions, etc. A HMM consisting of *N *states (indicated below with *s *and *k*) is therefore defined by three probability distributions:

Starting probabilities

*a*_{B,k }= *P*(*k*|*B*) (1)

Transition probabilities

*a*_{k,s }= *P*(*s*|*k*) (2)

Emission probabilities

*e*_{k}(*O*_{i}) = *P*(*O*_{i}|*k*) (3)

The forward probability is

*f*_{k}(*i*) = *P*(*O*_{1},*O*_{2}...*O*_{i},*π*_{i }= *k*) (4)

which is the probability of having emitted the first partial sequence up to position *i *ending at state *k*. The backward probability is:

*b*_{k}(*i*) = *P*(*O*_{i+1},... *O*_{L - 1}, *O*_{L}|*π*_{i }= *k*) (5)

which is the probability of having emitted the sequence starting from the last element back to the (*i*+l)th element, given that we end at position *i *in state *k*. The probability of emitting the whole sequence can be computed using either the forward or backward probabilities according to:

*P*(*O*|*M*) = *f*_{END}(*L *+ 1) = *b*_{B}(0) (6)

Forward and backward probabilities are also necessary for updating the HMM parameters, using the Baum-Welch algorithm [18,19]. Alternatively a gradient-based training algorithm can be applied [18,20].

### Viterbi decoding

Viterbi decoding finds the path (*π*) through the model which has the maximal probability [18,19]. This means that we look for the path which is

*π*^{v }= *argmax*_{{π}}*P*(*π*|*O*, *M*) (7)

where *O*(= *O*_{1},... *O*_{L}) is the observed sequence of length *L *and *M *is the trained HMM model. Since the *P*(*O*|*M*) is independent of a particular path *π*, Equation 7 is equivalent to

*π*^{v }= *argmax*_{{π}}*P*(*π*, *O*|*M*) (8)

*P*(*π*, *O*|*M*) can be easily computed as

where by construction *π*(0) is always the *BEGIN *state (*B*).

Defining *v*_{k}(*i*) as the probability of the most likely path ending in state *k *at position *i*, and *p*_{i}(*k*) as the trace-back pointer, *π*^{v }can be obtained running the following dynamic programming algorithm called Viterbi decoding:

• Initialization

*v*_{B}(0) = 1 *v*_{k}(0) = 0 *for k *≠ *B*

• Recursion

• Termination

• Traceback

• Label assignment

where *λ*(*s*) is the label associated to the *s *state.

### 1-best decoding

The 1-best labeling algorithm described here is Krogh's previously described variant of the N-best decoding [20]. Since there is no exact algorithm for finding the most probable labeling, 1-best is an approximate algorithm which usually achieves good results in solving this task [20]. Differently from Viterbi, the 1-best algorithm ends when the most probable labeling is computed, so that no trace-back is needed.

For sake of clarity, here we present a redundant description, in which we define *H*_{i }as the set of all labeling hypotheses surviving as 1-best for each state *s *up to sequence position *i*. In the worst case the number of distinct labeling-hypotheses is equal to the number of states, is the current partial labeling hypothesis associated to the *s *state from the beginning to the *i*-th sequence position. In general several states may share the same labeling hypothesis. Finally, we use as the *string concatenation operator*, so that 'AAAA''B' = 'AAAAB' (the empty string is " and the empty set is ). Thus 1-best algorithm can be described as

• Initialization

*v*_{B}(*"*) = 1 *v*_{k}(*"*) = 0 *for k *≠ *B*

*v*_{k}(*λ*(*k*)) = *a*_{B,k}·*e*_{k}(*O*_{1})

*H*_{1 }= {*λ*(*k*) : *a*_{B,k }≠ 0} *H*_{i }= *for i *≠ 1

• Recursion

• Termination

With 1-best decoding, we do not need to keep a backtrace matrix since Λ is computed during the forward steps.

### Posterior decoding

The *posterior *decoding finds the path which maximizes the product of the *posterior *probability of the states [18,19]. Using the usual notation for forward (*f*_{k}(*i*)) and backward (*b*_{k}(*i*)) we have

*P*(*π*_{i }= *k*|*O*,*M*) = *f*_{k}(*i*)*b*_{k}(*i*)/*P*(*O*|*M*) (10)

The path *π*^{p }which maximizes the posterior probability is then computed as

for *i = *1... *L*. The corresponding label assignment is

If we have more than one state sharing the same label, labeling can be improved by summing over the states that share the same label *(posterior sum*). In this way we can have a path through the model which maximizes the posterior probability of being in a state with *label **λ *when emitting the observed sequence element, or more formally:

Λ_{i }= *argmax*_{{λ}}*P*(*label*(*O*_{i}) = *λ *|*O*, *S*) (14)

where *i *ranges from 1 to *L*.

The posterior-decoding drawback is that the state path sequences *π*^{p }or Λ may be not feasible paths.

However, this decoding can perform better than Viterbi, when more than one highly probable path exists [18,19]. In this case a post-processing algorithm that recasts the original topological constraints is recommended [21].

In the sequel, if not differently indicated, with the term *posterior *we mean the posterior sum.

### Posterior-Viterbi decoding

Posterior-Viterbi decoding is based on the combination of the Viterbi and posterior algorithms. After having computed the *posterior *probabilities we use a Viterbi algorithm to find the best *allowed posterior *path through the model. A related idea, specific for pairwise alignments was previously introduced to improve the sequence alignment accuracy [22].

In the PV algorithm, the basic idea is to compute the path *π*^{PV}

where *A*_{p }is the set of the allowed paths through the model, and *P*(*π*_{i}|*O*,*M*) is the *posterior *probability of the state assigned by the path *π *at position *i *(as computed in Eq. 10).

Defining a function δ*(*s*, *t*) equal to 1 if *s *→ *t *is an allowed transition of the model *M*, 0 otherwise; *v*_{k}(*i*) as the probability of the most probable *allowed-posterior *path ending at state *k *having observed the partial *O*_{1},... *O*_{i }and *p*_{i }as the trace-back pointer, we can compute the best path *π*^{PV }using the Viterbi algorithm:

• Initialization

*v*_{B}(0) = 1 *v*_{k}(0) = 0 *for k *≠ *B*

• Recursion

• Termination

• Traceback

• Label assignment

An alternative approach, that directly maximizes the most probable labelling, is to substitute the posterior probability of a given state *P*(*π*_{i }= *k*|*O*, *M*), with the posterior sum *P*(*label*(*O*_{i}) = *λ*|*O*, *M*) (equation 14). In this case all the states that share the same label have the same probability for each sequence position. However, since the performances of this second version are slightly worse we do not show them.

### Datasets

The problem of the prediction of the all-beta transmembrane regions is used to test the algorithm on a real data application. In this case we use a set that includes 20 constitutive beta-barrel membrane proteins whose sequences are less than 25% homologous and whose 3D structure have been resolved. The number of beta-strands forming the transmembrane barrel ranges from 2 to 22. Among the 20 proteins, 15 were used to train a circular HMM (described in [12]), and here are tested in cross-validation (1a0sP, 1bxwA, 1e54, 1ek9A, 1fcpA, 1fep, 1i78A, 1k24, 1kmoA, 1prn, 1qd5A, 1qj8A, 2mprA, 2omf, 2por). Since there is no detectable sequence identity among the selected 15 proteins, we adopted a leave-one-out approach for training the HMM and testing it. All the reported results are obtained during the testing phase, and the complete set of results is available at http://www.biocomp.unibo.it/piero/posvit. The other 5 new proteins (1mm4, 1nqf, 1p4t, 1uyn, 1t16) are used as a blind new test. Since our goal is to predict the beta-strands that span the membrane we score the methods using the annotations derived from the PDB files. An alternative approach not addressed here, is to predict the portion of the transmembrane beta-strands in contact with the lipid bilayer. This prediction is however out of the scope of our approach, since in real porins the localization of the beta-strands in contact with the membrane, has been so far estimated by means of different computational methods and assumptions [25].

### Measures of accuracy

We used three indices to score the accuracy of the algorithms. The first one is *Q*_{2 }which computes the number of correctly assigned labels divided by the total number of observed symbols. Then we use the *SOV *index [26] to evaluate the segment overlaps. Finally, we also adopt a very stringent measure called *Q*_{ok }: a prediction is considered correct *only if the number of transmembrane segments coincides with the observed one and the corresponding segments have a minimal overlap of m residues *[21]. The value *m *is segment-dependent and for each segment pairs, is computed as

*m *= *min*{|*seg*_{pr}*|/*2, |*seg*_{ob}|/2} (16)

where |*seg*_{pr}| and |*seg*_{ob}| are the predicted and observed segment lengths, respectively.

## List of abbreviations

• HMM: hidden Markov model.

• PV: Posterior-Viterbi.

• *B: *Begin state.

## Authors' contributions

PF developed the Posterior-Viterbi algorithm. PLM designed and trained the Hidden Markov Models. RC contributed to the problem. PF, PLM and RC authored the manuscript.

## Acknowledgements

We thank Anders Krogh for the help with the 1-best algorithm. This work was partially supported by the BioSapiens Network of Excellence, two grants of the Ministero della Istruzione dellUniversitá e della Ricerca (MIUR) 'Hydrolases from Thermophiles: Structure, Function and Homologous and Heterologous Expression' delivered to R.C. and 'Large scale modelling of proteases' delivered to P.F., a PNR 2001–2003 (FIRB art.8) and a PNR 2003–2007 (FIRB art.8).

## References

- Schulz G. Beta-barrel membrane proteins. Curr Opin Struct Biol. 2000;10:443–447. doi: 10.1016/S0959-440X(00)00120-2. [PubMed] [Cross Ref]
- Casadio R, Fariselli P, PL M. In silico prediction of the structure of membrane proteins: Is it feasible. Brief Bioinf. 2003;4:341–348. doi: 10.1093/bib/4.4.341. [PubMed] [Cross Ref]
- Casadio R, Jacoboni I, Messina A, V DP. A 3D model of the voltage-dependent anion channel (VDAC) FEBS Lett. 2003;520:1–7. doi: 10.1016/S0014-5793(02)02758-8. [PubMed] [Cross Ref]
- Casadio R, Fariselli P, Finocchiaro G, Martelli P. Fishing new proteins in the twilight zone of genomes: The test case of outer membrane proteins in Escherichia coli K12, Escherichia coli O157:H7, and other Gram-negative bacteria. Protein Sci. 2003;11:1158–1168. doi: 10.1110/ps.0223603. [PMC free article] [PubMed] [Cross Ref]
- Bagos P, Liakopoulos T, SJ H. Evaluation of methods for predicting the topology of -barrel outer membrane proteins and a consensus prediction method. BMC Bioinformatics. 2005;1:1–7. [PMC free article] [PubMed]
- Krogh A, Brown M, Mian I, Sjolander K, Haussler D. Hidden Markov models in computational biology: Applications to protein modeling. Journal of Molecular Biology. 1994;235:1501–1531. doi: 10.1006/jmbi.1994.1104. [PubMed] [Cross Ref]
- Baldi P, Chauvin Y, Hunkapiller T, McClure M. Hidden Markov Models of Biological Primary Sequence Information. PNAS USA. 1994;91:1059–1063. doi: 10.1073/pnas.91.3.1059. [PMC free article] [PubMed] [Cross Ref]
- Mamitsuka H. Predicting peptides that bind to MHC molecules using supervised learning of hidden Markov models. Proteins. 1998;33:460–474. doi: 10.1002/(SICI)1097-0134(19981201)33:4<460::AID-PROT2>3.0.CO;2-M. [PubMed] [Cross Ref]
- Bateman A, Birney E, Cerruti L, Durbin R, Etwiller L, Eddy S, Griffiths-Jones S, Howe K, Marshall M, Sonnhammer E. The Pfam Protein Families Database. Nucleic Acids Research. 2002;30:276–280. doi: 10.1093/nar/30.1.276. [PMC free article] [PubMed] [Cross Ref]
- Tusnady G, Simon I. Principles governing amino acid composition of integral membrane proteins: application to topology prediction. J Mol Biol. 1998;283:489–506. doi: 10.1006/jmbi.1998.2107. [PubMed] [Cross Ref]
- Krogh A, Larsson B, von Heijne G, Sonnhammer E. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol. 2001;305:567–580. doi: 10.1006/jmbi.2000.4315. [PubMed] [Cross Ref]
- Martelli P, Fariselli P, Krogh A, Casadio R. A sequence-profile-based HMM for predicting and discriminating beta barrel membrane proteins. Bioinformaticsi. 2002;18:S46–S53. [PubMed]
- Martelli P, Fariselli P, Casadio R. An ENSEMBLE machine learning approach for the prediction of all-alpha membrane proteins. Bioinformatics. 2003;19:i205–i211. doi: 10.1093/bioinformatics/btg1027. [PubMed] [Cross Ref]
- Liu Q, Zhu Y, Wang B, Li Y. A HMM-based method to predict the transmembrane regions of beta-barrel membrane proteins. Comput Biol Chem. 2003;27:69–76. doi: 10.1016/S0097-8485(02)00051-7. [PubMed] [Cross Ref]
- Viklund H, Elofsson A. Best alpha-helical transmembrane protein topology predictions are achieved using hidden Markov models and evolutionary information. Protein Sci. 2004;13:1908–1917. doi: 10.1110/ps.04625404. [PMC free article] [PubMed] [Cross Ref]
- Bagos P, Liakopoulos T, Spyropoulos I, SJ H. PRED-TMBB: a web server for predicting the topology of beta-barrel outer membrane proteins. Nucleic Acids Res. 2004;32:W400–W404. doi: 10.1093/nar/gkh417. [PMC free article] [PubMed] [Cross Ref]
- Bigelow H, Petrey D, Liu J, Przybylski D, B R. Predicting transmembrane beta-barrels in proteomes. Nucleic Acids Res. 2004;32:2566–2577. doi: 10.1093/nar/gkh580. [PMC free article] [PubMed] [Cross Ref]
- Baldi P, Brunak S. Bioinformatics: the Machine Learning Approach. Cambridge: MIT Press; 2001.
- Durbin R, Eddy S, Krogh A, Mitchinson G. Biological sequence analysis: probabilistic models of proteins and nucleic acids. Cambridge: Cambridge Univ Press; 1998.
- Krogh A. Proceedings of the Fifth International Conference on Intelligent Systems for Molecular Biology. ISCB, AAAI Press; 1997. Two methods for improving performance of a HMM and their application for gene finding; pp. 179–186.
- Fariselli P, Finelli M, Marchignoli D, Martelli P, Rossi I, R C. MaxSubSeq: an algorithm for segment-length optimization. The case study of the transmembrane spanning segments. Bioinformatics. 2003;19:500–505. doi: 10.1093/bioinformatics/btg023. [PubMed] [Cross Ref]
- Holmes I, Durbin R. Dynamic programming alignment accuracy. J Comput Biol. 1998;5:493–504. [PubMed]
- Altschul S, Madden T, Schaffer A, Zhang J, Zhang Z, Miller W, DJ L. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acid Res. 1997;25:3389–3402. doi: 10.1093/nar/25.17.3389. [PMC free article] [PubMed] [Cross Ref]
- Krogh A. Proceedings 12th International Conference on Pattern Recognition Singapore. IEEE Comp Soc Press; 1994. Hidden Markov models for labeled sequences; pp. 140–144.
- Tusnady G, Dosztanyi Z, Simon I. Transmembrane proteins in the Protein Data Bank: identification and classification. Bioinformatics. 2004;20:2964–2972. doi: 10.1093/bioinformatics/bth340. [PubMed] [Cross Ref]
- Zemla A, Venclovas C, Fidelis K, B R. A modified definition of Sov, a segment-based measure for protein secondary structure prediction assessment. Proteins. 1999;34:220–223. doi: 10.1002/(SICI)1097-0134(19990201)34:2<220::AID-PROT7>3.0.CO;2-K. [PubMed] [Cross Ref]

**BioMed Central**

## Formats:

- Article |
- PubReader |
- ePub (beta) |
- PDF (267K) |
- Citation

- Evaluation of methods for predicting the topology of beta-barrel outer membrane proteins and a consensus prediction method.[BMC Bioinformatics. 2005]
*Bagos PG, Liakopoulos TD, Hamodrakas SJ.**BMC Bioinformatics. 2005 Jan 12; 6:7. Epub 2005 Jan 12.* - Combined prediction of transmembrane topology and signal peptide of beta-barrel proteins: using a hidden Markov model and genetic algorithms.[Comput Biol Med. 2010]
*Zou L, Wang Z, Wang Y, Hu F.**Comput Biol Med. 2010 Jul; 40(7):621-8. Epub 2010 May 21.* - An HMM posterior decoder for sequence feature prediction that includes homology information.[Bioinformatics. 2005]
*Käll L, Krogh A, Sonnhammer EL.**Bioinformatics. 2005 Jun; 21 Suppl 1:i251-7.* - HMMEditor: a visual editing tool for profile hidden Markov model.[BMC Genomics. 2008]
*Dai J, Cheng J.**BMC Genomics. 2008; 9 Suppl 1:S8.* - Efficient decoding algorithms for generalized hidden Markov model gene finders.[BMC Bioinformatics. 2005]
*Majoros WH, Pertea M, Delcher AL, Salzberg SL.**BMC Bioinformatics. 2005 Jan 24; 6:16. Epub 2005 Jan 24.*

- Annotation of genomics data using bidirectional hidden Markov models unveils variations in Pol II transcription cycle[Molecular Systems Biology. ]
*Zacher B, Lidschreiber M, Cramer P, Gagneur J, Tresch A.**Molecular Systems Biology. 10(12)768* - Algorithms for Hidden Markov Models Restricted to Occurrences of Regular Expressions[Biology. ]
*Tataru P, Sand A, Hobolth A, Mailund T, Pedersen CN.**Biology. 2(4)1282-1295* - MicroRNA Target Site Identification by Integrating Sequence and Binding Information[Nature methods. 2013]
*Majoros WH, Lekprasert P, Mukherjee N, Skalsky RL, Corcoran DL, Cullen BR, Ohler U.**Nature methods. 2013 Jul; 10(7)10.1038/nmeth.2489* - CORECLUST: identification of the conserved CRM grammar together with prediction of gene regulation[Nucleic Acids Research. 2012]
*Nikulova AA, Favorov AV, Sutormin RA, Makeev VJ, Mironov AA.**Nucleic Acids Research. 2012 Jul; 40(12)e93* - A Classification of Bioinformatics Algorithms from the Viewpoint of Maximizing Expected Accuracy (MEA)[Journal of Computational Biology. 2012]
*Hamada M, Asai K.**Journal of Computational Biology. 2012 May; 19(5)532-549*

- PubMedPubMedPubMed citations for these articles
- TaxonomyTaxonomyRelated taxonomy entry
- Taxonomy TreeTaxonomy Tree

- A new decoding algorithm for hidden Markov models improves the prediction of the...A new decoding algorithm for hidden Markov models improves the prediction of the topology of all-beta membrane proteinsBMC Bioinformatics. 2005; 6(Suppl 4)S12

Your browsing activity is empty.

Activity recording is turned off.

See more...