Header logo is


2003


no image
Concentration Inequalities for Sub-Additive Functions Using the Entropy Method

Bousquet, O.

Stochastic Inequalities and Applications, 56, pages: 213-247, Progress in Probability, (Editors: Giné, E., C. Houdré and D. Nualart), November 2003 (article)

Abstract
We obtain exponential concentration inequalities for sub-additive functions of independent random variables under weak conditions on the increments of those functions, like the existence of exponential moments for these increments. As a consequence of these general inequalities, we obtain refinements of Talagrand's inequality for empirical processes and new bounds for randomized empirical processes. These results are obtained by further developing the entropy method introduced by Ledoux.

ei

PostScript [BibTex]

2003


PostScript [BibTex]


no image
Statistical Learning Theory, Capacity and Complexity

Schölkopf, B.

Complexity, 8(4):87-94, July 2003 (article)

Abstract
We give an exposition of the ideas of statistical learning theory, followed by a discussion of how a reinterpretation of the insights of learning theory could potentially also benefit our understanding of a certain notion of complexity.

ei

Web DOI [BibTex]


no image
Dealing with large Diagonals in Kernel Matrices

Weston, J., Schölkopf, B., Eskin, E., Leslie, C., Noble, W.

Annals of the Institute of Statistical Mathematics, 55(2):391-408, June 2003 (article)

Abstract
In kernel methods, all the information about the training data is contained in the Gram matrix. If this matrix has large diagonal values, which arises for many types of kernels, then kernel methods do not perform well: We propose and test several methods for dealing with this problem by reducing the dynamic range of the matrix while preserving the positive definiteness of the Hessian of the quadratic programming problem that one has to solve when training a Support Vector Machine, which is a common kernel approach for pattern recognition.

ei

PDF DOI [BibTex]

PDF DOI [BibTex]


no image
The em Algorithm for Kernel Matrix Completion with Auxiliary Data

Tsuda, K., Akaho, S., Asai, K.

Journal of Machine Learning Research, 4, pages: 67-81, May 2003 (article)

ei

PDF [BibTex]

PDF [BibTex]


no image
Constructing Descriptive and Discriminative Non-linear Features: Rayleigh Coefficients in Kernel Feature Spaces

Mika, S., Rätsch, G., Weston, J., Schölkopf, B., Smola, A., Müller, K.

IEEE Transactions on Pattern Analysis and Machine Intelligence, 25(5):623-628, May 2003 (article)

Abstract
We incorporate prior knowledge to construct nonlinear algorithms for invariant feature extraction and discrimination. Employing a unified framework in terms of a nonlinearized variant of the Rayleigh coefficient, we propose nonlinear generalizations of Fisher‘s discriminant and oriented PCA using support vector kernel functions. Extensive simulations show the utility of our approach.

ei

DOI [BibTex]

DOI [BibTex]


no image
Tractable Inference for Probabilistic Data Models

Csato, L., Opper, M., Winther, O.

Complexity, 8(4):64-68, April 2003 (article)

Abstract
We present an approximation technique for probabilistic data models with a large number of hidden variables, based on ideas from statistical physics. We give examples for two nontrivial applications. © 2003 Wiley Periodicals, Inc.

ei

PDF GZIP Web [BibTex]

PDF GZIP Web [BibTex]


no image
Feature selection and transduction for prediction of molecular bioactivity for drug design

Weston, J., Perez-Cruz, F., Bousquet, O., Chapelle, O., Elisseeff, A., Schölkopf, B.

Bioinformatics, 19(6):764-771, April 2003 (article)

Abstract
Motivation: In drug discovery a key task is to identify characteristics that separate active (binding) compounds from inactive (non-binding) ones. An automated prediction system can help reduce resources necessary to carry out this task. Results: Two methods for prediction of molecular bioactivity for drug design are introduced and shown to perform well in a data set previously studied as part of the KDD (Knowledge Discovery and Data Mining) Cup 2001. The data is characterized by very few positive examples, a very large number of features (describing three-dimensional properties of the molecules) and rather different distributions between training and test data. Two techniques are introduced specifically to tackle these problems: a feature selection method for unbalanced data and a classifier which adapts to the distribution of the the unlabeled test data (a so-called transductive method). We show both techniques improve identification performance and in conjunction provide an improvement over using only one of the techniques. Our results suggest the importance of taking into account the characteristics in this data which may also be relevant in other problems of a similar type.

ei

Web [BibTex]


no image
Use of the Zero-Norm with Linear Models and Kernel Methods

Weston, J., Elisseeff, A., Schölkopf, B., Tipping, M.

Journal of Machine Learning Research, 3, pages: 1439-1461, March 2003 (article)

Abstract
We explore the use of the so-called zero-norm of the parameters of linear models in learning. Minimization of such a quantity has many uses in a machine learning context: for variable or feature selection, minimizing training error and ensuring sparsity in solutions. We derive a simple but practical method for achieving these goals and discuss its relationship to existing techniques of minimizing the zero-norm. The method boils down to implementing a simple modification of vanilla SVM, namely via an iterative multiplicative rescaling of the training data. Applications we investigate which aid our discussion include variable and feature selection on biological microarray data, and multicategory classification.

ei

PDF PostScript PDF [BibTex]

PDF PostScript PDF [BibTex]


no image
An Introduction to Variable and Feature Selection.

Guyon, I., Elisseeff, A.

Journal of Machine Learning, 3, pages: 1157-1182, 2003 (article)

ei

[BibTex]

[BibTex]


no image
New Approaches to Statistical Learning Theory

Bousquet, O.

Annals of the Institute of Statistical Mathematics, 55(2):371-389, 2003 (article)

Abstract
We present new tools from probability theory that can be applied to the analysis of learning algorithms. These tools allow to derive new bounds on the generalization performance of learning algorithms and to propose alternative measures of the complexity of the learning task, which in turn can be used to derive new learning algorithms.

ei

PostScript [BibTex]

PostScript [BibTex]


Thumb xl toc image
New electro-optic effect: Sum-frequency generation from optically active liquids in the presence of a dc electric field

Fischer, P., Buckingham, A., Beckwitt, K., Wiersma, D., Wise, F.

PHYSICAL REVIEW LETTERS, 91(17), 2003 (article)

Abstract
We report the observation of sum-frequency signals that depend linearly on an applied electrostatic field and that change sign with the handedness of an optically active solute. This recently predicted chiral electro-optic effect exists in the electric-dipole approximation. The static electric field gives rise to an electric-field-induced sum-frequency signal (an achiral third-order process) that interferes with the chirality-specific sum-frequency at second order. The cross-terms linear in the electrostatic field constitute the effect and may be used to determine the absolute sign of second- and third-order nonlinear-optical susceptibilities in isotropic media.

pf

DOI [BibTex]

DOI [BibTex]


Thumb xl toc image
Chiral and achiral contributions to sum-frequency generation from optically active solutions of binaphthol

Fischer, P., Wise, F., Albrecht, A.

JOURNAL OF PHYSICAL CHEMISTRY A, 107(40):8232-8238, 2003 (article)

Abstract
The nonlinear sum- and difference-frequency generation spectroscopies can be probes of molecular chirality in optically active systems. We present a tensorial analysis of the chirality-specific electric-dipolar sum-frequency-generation susceptibility and the achiral electric-quadrupolar and magnetic-dipolar nonlinearities at second order in isotropic media. The chiral and achiral contributions to the sum-frequency signal from the bulk of optically active solutions of 1,1'-bi-2-naphthol (2,2'-dehydroxy-1,1'-binaphthyl) can be distinguished, and the former dominates. Ab initio computations reveal the dramatic resonance enhancement that the isotropic component of the electric-dipolar three-wave mixing hyperpolarizability experiences. Away from resonance its magnitude rapidly decreases, as-unlike the vector component-it is zero in the static limit. The dispersion of the first hyperpolarizability is computed by a configuration interaction singles sum-over-states approach with explicit regard to the Franck-Condon active vibrational substructure for all resonant electronic states.

pf

DOI [BibTex]

DOI [BibTex]

2002


no image
Constructing Boosting algorithms from SVMs: an application to one-class classification.

Rätsch, G., Mika, S., Schölkopf, B., Müller, K.

IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(9):1184-1199, September 2002 (article)

Abstract
We show via an equivalence of mathematical programs that a support vector (SV) algorithm can be translated into an equivalent boosting-like algorithm and vice versa. We exemplify this translation procedure for a new algorithm—one-class leveraging—starting from the one-class support vector machine (1-SVM). This is a first step toward unsupervised learning in a boosting framework. Building on so-called barrier methods known from the theory of constrained optimization, it returns a function, written as a convex combination of base hypotheses, that characterizes whether a given test point is likely to have been generated from the distribution underlying the training data. Simulations on one-class classification problems demonstrate the usefulness of our approach.

ei

DOI [BibTex]

2002


DOI [BibTex]


no image
The contributions of color to recognition memory for natural scenes

Wichmann, F., Sharpe, L., Gegenfurtner, K.

Journal of Experimental Psychology: Learning, Memory and Cognition, 28(3):509-520, May 2002 (article)

Abstract
The authors used a recognition memory paradigm to assess the influence of color information on visual memory for images of natural scenes. Subjects performed 5-10% better for colored than for black-and-white images independent of exposure duration. Experiment 2 indicated little influence of contrast once the images were suprathreshold, and Experiment 3 revealed that performance worsened when images were presented in color and tested in black and white, or vice versa, leading to the conclusion that the surface property color is part of the memory representation. Experiments 4 and 5 exclude the possibility that the superior recognition memory for colored images results solely from attentional factors or saliency. Finally, the recognition memory advantage disappears for falsely colored images of natural scenes: The improvement in recognition memory depends on the color congruence of presented images with learned knowledge about the color gamut found within natural scenes. The results can be accounted for within a multiple memory systems framework.

ei

PDF Web DOI [BibTex]

PDF Web DOI [BibTex]


no image
Training invariant support vector machines

DeCoste, D., Schölkopf, B.

Machine Learning, 46(1-3):161-190, January 2002 (article)

Abstract
Practical experience has shown that in order to obtain the best possible performance, prior knowledge about invariances of a classification problem at hand ought to be incorporated into the training procedure. We describe and review all known methods for doing so in support vector machines, provide experimental results, and discuss their respective merits. One of the significant new results reported in this work is our recent achievement of the lowest reported test error on the well-known MNIST digit recognition benchmark task, with SVM training times that are also significantly faster than previous SVM methods.

ei

PDF DOI [BibTex]

PDF DOI [BibTex]


no image
Contrast discrimination with sinusoidal gratings of different spatial frequency

Bird, C., Henning, G., Wichmann, F.

Journal of the Optical Society of America A, 19(7), pages: 1267-1273, 2002 (article)

Abstract
The detectability of contrast increments was measured as a function of the contrast of a masking or “pedestal” grating at a number of different spatial frequencies ranging from 2 to 16 cycles per degree of visual angle. The pedestal grating always had the same orientation, spatial frequency and phase as the signal. The shape of the contrast increment threshold versus pedestal contrast (TvC) functions depend of the performance level used to define the “threshold,” but when both axes are normalized by the contrast corresponding to 75% correct detection at each frequency, the (TvC) functions at a given performance level are identical. Confidence intervals on the slope of the rising part of the TvC functions are so wide that it is not possible with our data to reject Weber’s Law.

ei

PDF [BibTex]

PDF [BibTex]


no image
Support Vector Machines and Kernel Methods: The New Generation of Learning Machines

Cristianini, N., Schölkopf, B.

AI Magazine, 23(3):31-41, 2002 (article)

ei

[BibTex]


no image
Contrast discrimination with pulse-trains in pink noise

Henning, G., Bird, C., Wichmann, F.

Journal of the Optical Society of America A, 19(7), pages: 1259-1266, 2002 (article)

Abstract
Detection performance was measured with sinusoidal and pulse-train gratings. Although the 2.09-c/deg pulse-train, or line gratings, contained at least 8 harmonics all at equal contrast, they were no more detectable than their most detectable component. The addition of broadband pink noise designed to equalize the detectability of the components of the pulse train made the pulse train about a factor of four more detectable than any of its components. However, in contrast-discrimination experiments, with a pedestal or masking grating of the same form and phase as the signal and 15% contrast, the noise did not affect the discrimination performance of the pulse train relative to that obtained with its sinusoidal components. We discuss the implications of these observations for models of early vision in particular the implications for possible sources of internal noise.

ei

PDF [BibTex]

PDF [BibTex]


Thumb xl toc images
Chirality-specific nonlinear spectroscopies in isotropic media

Fischer, P., Albrecht, A.

BULLETIN OF THE CHEMICAL SOCIETY OF JAPAN, 75(5):1119-1124, 2002, 10th International Conference on Time-Resolved Vibrational Spectroscopy (TRVS 2001), OKAZZAKI, JAPAN, MAY 21-25, 2001 (article)

Abstract
Sum or difference frequency generation (SFG or DFG) in isotropic media is in the electric-dipole approximation only symmetry allowed for optically active systems. The hyperpolarizability giving rise to these three-wave mixing processes features only one isotropic component. It factorizes into two terms, an energy (denominator) factor and a triple product of transition moments. These forbid degenerate SFG, i.e., second harmonic generation, as well as the existence of the linear electrooptic effect (Pockels effect) in isotropic media. This second order response also has no static limit, which leads to particularly strong resonance phenomena that are qualitatively different from those usually seen in the ubiquitous even-wave mixing spectroscopies. In particular, the participation of two (not the usual one) excited states is essential to achieve dramatic resonance enhancement, We report our first efforts to see such resonantly enhanced chirality specific SFG.

pf

DOI [BibTex]

DOI [BibTex]


Thumb xl toc image
The chiral specificity of sum-frequency generation in solutions

Fischer, P., Beckwitt, K., Wise, F., Albrecht, A.

CHEMICAL PHYSICS LETTERS, 352(5-6):463-468, 2002 (article)

Abstract
Sum-frequency generation in isotropic media is in the electric-dipole approximation the only symmetry allowed for chiral systems. We demonstrate that the sum-frequency intensity from an optically active liquid depends quadratically on the difference in concentration of the two enantiomers. The dominant contribution to the signal is found to be due to the chirality specific electric-dipolar three-wave mixing nonlinearity. Selecting the polarization of all fields allows the chiral electric-dipolar contributions to the bulk sum-frequency signal to be discerned from any achiral magnetic-dipolar and electric-quadrupolar contributions. (C) 2002 Published by Elsevier Science B.V.

pf

DOI [BibTex]

DOI [BibTex]


Thumb xl toc image
On optical rectification in isotropic media

Fischer, P., Albrecht, A.

LASER PHYSICS, 12(8):1177-1181, 2002 (article)

Abstract
Coherent nonlinear optical processes at second-order are only electric-dipole allowed in isotropic media that are optically active. Sum-frequency generation in chiral liquids has recently been observed, and difference-frequency and optical rectification have been predicted to exist in isotropic chiral media. Both Rayleigh-Schrodinger perturbation theory and the density matrix approach are used to discuss the quantum-chemical basis of optical rectification in optically active liquids. For pinene we compute the corresponding orientationally averaged hyperpolarizability, and estimate the light-induced dc electric polarization and the consequent voltage across a measuring capacitor it may give rise to near resonance.

pf

[BibTex]

[BibTex]