2001


Unsupervised Segmentation and Classification of Mixtures of Markovian Sources

Seldin, Y., Bejerano, G., Tishby, N.

In The 33rd Symposium on the Interface of Computing Science and Statistics (Interface 2001 - Frontiers in Data Mining and Bioinformatics), pages: 1-15, 2001 (inproceedings)

Abstract
We describe a novel algorithm for unsupervised segmentation of sequences into alternating Variable Memory Markov sources, first presented in [SBT01]. The algorithm is based on competitive learning between Markov models, implemented as Prediction Suffix Trees [RST96] using the MDL principle. By applying a model clustering procedure, based on rate distortion theory combined with deterministic annealing, we obtain a hierarchical segmentation of sequences between alternating Markov sources. The method is applied successfully to unsupervised segmentation of multilingual texts into languages, where it is able to infer correctly both the number of languages and the language switching points. When applied to protein sequence families (results of the [BSMT01] work), we demonstrate the method's ability to identify biologically meaningful sub-sequences within the proteins, which correspond to signatures of important functional sub-units called domains. Our approach to protein classification (through the obtained signatures) is shown to have both conceptual and practical advantages over the currently used methods.

ei

PDF Web [BibTex]



Markovian domain fingerprinting: statistical segmentation of protein sequences

Bejerano, G., Seldin, Y., Margalit, H., Tishby, N.

Bioinformatics, 17(10):927-934, 2001 (article)

ei

PDF Web [BibTex]


Unsupervised Sequence Segmentation by a Mixture of Switching Variable Memory Markov Sources

Seldin, Y., Bejerano, G., Tishby, N.

In Proceedings of the 18th International Conference on Machine Learning (ICML 2001), pages: 513-520, 2001 (inproceedings)

Abstract
We present a novel information-theoretic algorithm for unsupervised segmentation of sequences into alternating Variable Memory Markov sources. The algorithm is based on competitive learning between Markov models, implemented as Prediction Suffix Trees (Ron et al., 1996) using the MDL principle. By applying a model clustering procedure, based on rate distortion theory combined with deterministic annealing, we obtain a hierarchical segmentation of sequences between alternating Markov sources. The algorithm appears to be self-regulating and automatically avoids over-segmentation. The method is applied successfully to unsupervised segmentation of multilingual texts into languages, where it is able to infer correctly both the number of languages and the language switching points. When applied to protein sequence families, we demonstrate the method's ability to identify biologically meaningful sub-sequences within the proteins, which correspond to important functional sub-units called domains.

ei

PDF [BibTex]


Inference Principles and Model Selection

Buhmann, J., Schölkopf, B.

(01301), Dagstuhl Seminar, 2001 (techreport)

ei

Web [BibTex]


Isotropic second-order nonlinear optical susceptibilities

Fischer, P., Buckingham, A., Albrecht, A.

Physical Review A, 64(5), 2001 (article)

Abstract
The second-order nonlinear optical susceptibility, in the electric dipole approximation, is only nonvanishing for materials that are noncentrosymmetric. Should the medium be isotropic, then only a chiral system, such as an optically active liquid, satisfies this symmetry requirement. We derive the quantum-mechanical form of the isotropic component of the sum- and difference-frequency susceptibility and discuss its unusual spectral properties. We show that any coherent second-order nonlinear optical process in a system of randomly oriented molecules requires the medium to be chiral, and the incident frequencies to be different and nonzero. Furthermore, a minimum of two nondegenerate excited molecular states are needed for the isotropic part of the susceptibility to be nonvanishing. The rotationally invariant susceptibility is zero in the static field limit and shows exceptionally sensitive resonance and dephasing effects that are particular to chiral centers.

pf

DOI [BibTex]


Reply to “Comment on ‘Phenomenological damping in optical response tensors’”

Buckingham, A., Fischer, P.

Physical Review A, 63(4), 2001 (article)

Abstract
We show that damping factors must not be incorporated in the perturbation of the ground state by a static electric field. If they are included, as in the theory of Stedman et al. [preceding Comment, Phys. Rev. A 63, 047801 (2001)], then there would be an electric dipole in the y direction induced in a hydrogen atom in the M_s = +1/2 state by a static electric field in the x direction. Such a dipole is excluded by symmetry.

pf

DOI [BibTex]