Inference algorithms for pattern-based CRFs on sequence data

From MaRDI portal
Publication:334918

DOI10.1007/S00453-015-0017-7zbMATH Open1353.68237arXiv1210.0508OpenAlexW2157550084MaRDI QIDQ334918FDOQ334918


Authors: Vladimir Kolmogorov, Rustem Takhanov Edit this on Wikidata


Publication date: 1 November 2016

Published in: Algorithmica (Search for Journal in Brave)

Abstract: We consider Conditional Random Fields (CRFs) with pattern-based potentials defined on a chain. In this model the energy of a string (labeling) x1...xn is the sum of terms over intervals [i,j] where each term is non-zero only if the substring xi...xj equals a prespecified pattern alpha. Such CRFs can be naturally applied to many sequence tagging problems. We present efficient algorithms for the three standard inference tasks in a CRF, namely computing (i) the partition function, (ii) marginals, and (iii) computing the MAP. Their complexities are respectively O(nL), O(nLellmax) and O(nLmin|D|,log(ellmax+1)) where L is the combined length of input patterns, ellmax is the maximum length of a pattern, and D is the input alphabet. This improves on the previous algorithms of (Ye et al., 2009) whose complexities are respectively O(nL|D|), O(n|Gamma|L2ellmax2) and O(nL|D|), where |Gamma| is the number of input patterns. In addition, we give an efficient algorithm for sampling. Finally, we consider the case of non-positive weights. (Komodakis & Paragios, 2009) gave an O(nL) algorithm for computing the MAP. We present a modification that has the same worst-case complexity but can beat it in the best case.


Full work available at URL: https://arxiv.org/abs/1210.0508




Recommendations




Cites Work


Cited In (6)





This page was built for publication: Inference algorithms for pattern-based CRFs on sequence data

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q334918)