Header logo is


2015


Thumb xl grassmanteaser
Scalable Robust Principal Component Analysis using Grassmann Averages

Hauberg, S., Feragen, A., Enficiaud, R., Black, M.

IEEE Trans. Pattern Analysis and Machine Intelligence (PAMI), December 2015 (article)

Abstract
In large datasets, manual data verification is impossible, and we must expect the number of outliers to increase with data size. While principal component analysis (PCA) can reduce data size, and scalable solutions exist, it is well-known that outliers can arbitrarily corrupt the results. Unfortunately, state-of-the-art approaches for robust PCA are not scalable. We note that in a zero-mean dataset, each observation spans a one-dimensional subspace, giving a point on the Grassmann manifold. We show that the average subspace corresponds to the leading principal component for Gaussian data. We provide a simple algorithm for computing this Grassmann Average (GA), and show that the subspace estimate is less sensitive to outliers than PCA for general distributions. Because averages can be efficiently computed, we immediately gain scalability. We exploit robust averaging to formulate the Robust Grassmann Average (RGA) as a form of robust PCA. The resulting Trimmed Grassmann Average (TGA) is appropriate for computer vision because it is robust to pixel outliers. The algorithm has linear computational complexity and minimal memory requirements. We demonstrate TGA for background modeling, video restoration, and shadow removal. We show scalability by performing robust PCA on the entire Star Wars IV movie; a task beyond any current method. Source code is available online.

ps sf

preprint pdf from publisher supplemental Project Page [BibTex]

2015


preprint pdf from publisher supplemental Project Page [BibTex]


Thumb xl toc image
Enzymatically active biomimetic micropropellers for the penetration of mucin gels

Walker (Schamel), D., Käsdorf, B. T., Jeong, H. H., Lieleg, O., Fischer, P.

Science Advances, 1(11):e1500501, December 2015 (article)

Abstract
In the body, mucus provides an important defense mechanism by limiting the penetration of pathogens. It is therefore also a major obstacle for the efficient delivery of particle-based drug carriers. The acidic stomach lining in particular is difficult to overcome because mucin glycoproteins form viscoelastic gels under acidic conditions. The bacterium Helicobacter pylori has developed a strategy to overcome the mucus barrier by producing the enzyme urease, which locally raises the pH and consequently liquefies the mucus. This allows the bacteria to swim through mucus and to reach the epithelial surface. We present an artificial system of reactive magnetic micropropellers that mimic this strategy to move through gastric mucin gels by making use of surface-immobilized urease. The results demonstrate the validity of this biomimetic approach to penetrate biological gels, and show that externally propelled microstructures can actively and reversibly manipulate the physical state of their surroundings, suggesting that such particles could potentially penetrate native mucus.

pf

link (url) DOI [BibTex]

link (url) DOI [BibTex]


Thumb xl splitbodieswebteaser2
SMPL: A Skinned Multi-Person Linear Model

Loper, M., Mahmood, N., Romero, J., Pons-Moll, G., Black, M. J.

ACM Trans. Graphics (Proc. SIGGRAPH Asia), 34(6):248:1-248:16, ACM, New York, NY, October 2015 (article)

Abstract
We present a learned model of human body shape and pose-dependent shape variation that is more accurate than previous models and is compatible with existing graphics pipelines. Our Skinned Multi-Person Linear model (SMPL) is a skinned vertex-based model that accurately represents a wide variety of body shapes in natural human poses. The parameters of the model are learned from data including the rest pose template, blend weights, pose-dependent blend shapes, identity-dependent blend shapes, and a regressor from vertices to joint locations. Unlike previous models, the pose-dependent blend shapes are a linear function of the elements of the pose rotation matrices. This simple formulation enables training the entire model from a relatively large number of aligned 3D meshes of different people in different poses. We quantitatively evaluate variants of SMPL using linear or dual-quaternion blend skinning and show that both are more accurate than a Blend-SCAPE model trained on the same data. We also extend SMPL to realistically model dynamic soft-tissue deformations. Because it is based on blend skinning, SMPL is compatible with existing rendering engines and we make it available for research purposes.

ps

pdf video code/model errata DOI Project Page Project Page [BibTex]

pdf video code/model errata DOI Project Page Project Page [BibTex]


Thumb xl toc image
The EChemPen: A Guiding Hand To Learn Electrochemical Surface Modifications

Valetaud, M., Loget, G., Roche, J., Hueken, N., Fattah, Z., Badets, V., Fontaine, O., Zigah, D.

J. of Chem. Ed., 92(10):1700-1704, September 2015 (article)

Abstract
The Electrochemical Pen (EChemPen) was developed as an attractive tool for learning electrochemistry. The fabrication, principle, and operation of the EChemPen are simple and can be easily performed by students in practical classes. It is based on a regular fountain pen principle, where the electrolytic solution is dispensed at a tip to locally modify a conductive surface by triggering a localized electrochemical reaction. Three simple model reactions were chosen to demonstrate the versatility of the EChemPen for teaching various electrochemical processes. We describe first the reversible writing/erasing of metal letters, then the electrodeposition of a black conducting polymer "ink", and finally the colorful writings that can be generated by titanium anodization and that can be controlled by the applied potential. These entertaining and didactic experiments are adapted for teaching undergraduate students that start to study electrochemistry by means of surface modification reactions.

pf

DOI [BibTex]

DOI [BibTex]


Thumb xl dynateaser
Dyna: A Model of Dynamic Human Shape in Motion

Pons-Moll, G., Romero, J., Mahmood, N., Black, M. J.

ACM Transactions on Graphics, (Proc. SIGGRAPH), 34(4):120:1-120:14, ACM, August 2015 (article)

Abstract
To look human, digital full-body avatars need to have soft tissue deformations like those of real people. We learn a model of soft-tissue deformations from examples using a high-resolution 4D capture system and a method that accurately registers a template mesh to sequences of 3D scans. Using over 40,000 scans of ten subjects, we learn how soft tissue motion causes mesh triangles to deform relative to a base 3D body model. Our Dyna model uses a low-dimensional linear subspace to approximate soft-tissue deformation and relates the subspace coefficients to the changing pose of the body. Dyna uses a second-order auto-regressive model that predicts soft-tissue deformations based on previous deformations, the velocity and acceleration of the body, and the angular velocities and accelerations of the limbs. Dyna also models how deformations vary with a person’s body mass index (BMI), producing different deformations for people with different shapes. Dyna realistically represents the dynamics of soft tissue for previously unseen subjects and motions. We provide tools for animators to modify the deformations and apply them to new stylized characters.

ps

pdf preprint video data DOI Project Page Project Page [BibTex]

pdf preprint video data DOI Project Page Project Page [BibTex]


Thumb xl objs2acts
Linking Objects to Actions: Encoding of Target Object and Grasping Strategy in Primate Ventral Premotor Cortex

Vargas-Irwin, C. E., Franquemont, L., Black, M. J., Donoghue, J. P.

Journal of Neuroscience, 35(30):10888-10897, July 2015 (article)

Abstract
Neural activity in ventral premotor cortex (PMv) has been associated with the process of matching perceived objects with the motor commands needed to grasp them. It remains unclear how PMv networks can flexibly link percepts of objects affording multiple grasp options into a final desired hand action. Here, we use a relational encoding approach to track the functional state of PMv neuronal ensembles in macaque monkeys through the process of passive viewing, grip planning, and grasping movement execution. We used objects affording multiple possible grip strategies. The task included separate instructed delay periods for object presentation and grip instruction. This approach allowed us to distinguish responses elicited by the visual presentation of the objects from those associated with selecting a given motor plan for grasping. We show that PMv continuously incorporates information related to object shape and grip strategy as it becomes available, revealing a transition from a set of ensemble states initially most closely related to objects, to a new set of ensemble patterns reflecting unique object-grip combinations. These results suggest that PMv dynamically combines percepts, gradually navigating toward activity patterns associated with specific volitional actions, rather than directly mapping perceptual object properties onto categorical grip representations. Our results support the idea that PMv is part of a network that dynamically computes motor plans from perceptual information. Significance Statement: The present work demonstrates that the activity of groups of neurons in primate ventral premotor cortex reflects information related to visually presented objects, as well as the motor strategy used to grasp them, linking individual objects to multiple possible grips. PMv could provide useful control signals for neuroprosthetic assistive devices designed to interact with objects in a flexible way.

ps

publisher link DOI Project Page [BibTex]

publisher link DOI Project Page [BibTex]


Thumb xl toc image
Optimal Length of Low Reynolds Number Nanopropellers

Walker (Schamel), D., Kuebler, M., Morozov, K. I., Fischer, P., Leshansky, A. M.

Nano Letters, 15(7):4412-4416, June 2015 (article)

Abstract
Locomotion in fluids at the nanoscale is dominated by viscous drag. One efficient propulsion scheme is to use a weak rotating magnetic field that drives a chiral object. Froth bacterial flagella to artificial drills, the corkscrew is a universally useful chiral shape for propulsion in viscous environments. Externally powered magnetic micro- and nanomotors have been recently developed that allow for precise fuel-free propulsion in complex media. Here, we combine analytical and numerical theory with experiments on nanostructured screw-propellers to show that the optimal length is surprisingly short only about one helical turn, which is shorter than most of the structures in use to date. The results have important implications for the design of artificial actuated nano- and micropropellers and can dramatically reduce fabrication times, while ensuring optimal performance.

pf

DOI [BibTex]

DOI [BibTex]


Thumb xl toc image
A theoretical study of potentially observable chirality-sensitive NMR effects in molecules

Garbacz, P., Cukras, J., Jaszunski, M.

Phys. Chem. Chem. Phys., 17(35):22642-22651, May 2015 (article)

Abstract
Two recently predicted nuclear magnetic resonance effects, the chirality-induced rotating electric polarization and the oscillating magnetization, are examined for several experimentally available chiral molecules. We discuss in detail the requirements for experimental detection of chirality-sensitive NMR effects of the studied molecules. These requirements are related to two parameters: the shielding polarizability and the antisymmetric part of the nuclear magnetic shielding tensor. The dominant second contribution has been computed for small molecules at the coupled cluster and density functional theory levels. It was found that DFT calculations using the KT2 functional and the aug-cc-pCVTZ basis set adequately reproduce the CCSD(T) values obtained with the same basis set. The largest values of parameters, thus most promising from the experimental point of view, were obtained for the fluorine nuclei in 1,3-difluorocyclopropene and 1,3-diphenyl-2-fluoro-3-trifluoromethylcyclopropene.

pf

DOI [BibTex]

DOI [BibTex]


Thumb xl af4e540bee9e66bef88de83541787fe62fb3803ca149aa6a76018772ebe5b95f
Dynamic Inclusion Complexes of Metal Nanoparticles Inside Nanocups

Alarcon-Correa, M., Lee, T. C., Fischer, P.

Angew. Chem. Int. Ed., 54(23):6730-6734, May 2015, Featured cover article. (article)

Abstract
Host-guest inclusion complexes are abundant in molecular systems and of fundamental importance in living organisms. Realizing a colloidal analogue of a molecular dynamic inclusion complex is challenging because inorganic nanoparticles (NPs) with a well-defined cavity and portal are difficult to synthesize in high yield and with good structural fidelity. Herein, a generic strategy towards the fabrication of dynamic 1: 1 inclusion complexes of metal nanoparticles inside oxide nanocups with high yield (> 70%) and regiospecificity (> 90%) by means of a reactive double Janus nanoparticle intermediate is reported. Experimental evidence confirms that the inclusion complexes are formed by a kinetically controlled mechanism involving a delicate interplay between bipolar galvanic corrosion and alloying-dealloying oxidation. Release of the NP guest from the nanocups can be efficiently triggered by an external stimulus. Featured cover article.

pf

DOI [BibTex]

DOI [BibTex]


Thumb xl toc image
Surface roughness-induced speed increase for active Janus micromotors

Choudhury, U., Soler, L., Gibbs, J. G., Sanchez, S., Fischer, P.

Chem. Comm., 51(41):8660-8663, April 2015 (article)

Abstract
We demonstrate a simple physical fabrication method to control surface roughness of Janus micromotors and fabricate self-propelled active Janus microparticles with rough catalytic platinum surfaces that show a four-fold increase in their propulsion speed compared to conventional Janus particles coated with a smooth Pt layer.

pf

DOI [BibTex]

DOI [BibTex]


Thumb xl screen shot 2015 10 14 at 08.57.57
Multi-view and 3D Deformable Part Models

Pepik, B., Stark, M., Gehler, P., Schiele, B.

Pattern Analysis and Machine Intelligence, 37(11):14, IEEE, March 2015 (article)

Abstract
As objects are inherently 3-dimensional, they have been modeled in 3D in the early days of computer vision. Due to the ambiguities arising from mapping 2D features to 3D models, 3D object representations have been neglected and 2D feature-based models are the predominant paradigm in object detection nowadays. While such models have achieved outstanding bounding box detection performance, they come with limited expressiveness, as they are clearly limited in their capability of reasoning about 3D shape or viewpoints. In this work, we bring the worlds of 3D and 2D object representations closer, by building an object detector which leverages the expressive power of 3D object representations while at the same time can be robustly matched to image evidence. To that end, we gradually extend the successful deformable part model [1] to include viewpoint information and part-level 3D geometry information, resulting in several different models with different level of expressiveness. We end up with a 3D object model, consisting of multiple object parts represented in 3D and a continuous appearance model. We experimentally verify that our models, while providing richer object hypotheses than the 2D object models, provide consistently better joint object localization and viewpoint estimation than the state-of-the-art multi-view and 3D object detectors on various benchmarks (KITTI [2], 3D object classes [3], Pascal3D+ [4], Pascal VOC 2007 [5], EPFL multi-view cars [6]).

ps

DOI Project Page [BibTex]

DOI Project Page [BibTex]


Thumb xl toc image
Active colloidal microdrills

Gibbs, J. G., Fischer, P.

Chem. Comm., 51(20):4192-4195, Febuary 2015 (article)

Abstract
We demonstrate a chemically driven, autonomous catalytic microdrill. An asymmetric distribution of catalyst causes the helical swimmer to twist while it undergoes directed propulsion. A driving torque and hydrodynamic coupling between translation and rotation at low Reynolds number leads to drill-like swimming behaviour.

pf

DOI [BibTex]

DOI [BibTex]


Thumb xl ssimssmall
Spike train SIMilarity Space (SSIMS): A framework for single neuron and ensemble data analysis

Vargas-Irwin, C. E., Brandman, D. M., Zimmermann, J. B., Donoghue, J. P., Black, M. J.

Neural Computation, 27(1):1-31, MIT Press, January 2015 (article)

Abstract
We present a method to evaluate the relative similarity of neural spiking patterns by combining spike train distance metrics with dimensionality reduction. Spike train distance metrics provide an estimate of similarity between activity patterns at multiple temporal resolutions. Vectors of pair-wise distances are used to represent the intrinsic relationships between multiple activity patterns at the level of single units or neuronal ensembles. Dimensionality reduction is then used to project the data into concise representations suitable for clustering analysis as well as exploratory visualization. Algorithm performance and robustness are evaluated using multielectrode ensemble activity data recorded in behaving primates. We demonstrate how Spike train SIMilarity Space (SSIMS) analysis captures the relationship between goal directions for an 8-directional reaching task and successfully segregates grasp types in a 3D grasping task in the absence of kinematic information. The algorithm enables exploration of virtually any type of neural spiking (time series) data, providing similarity-based clustering of neural activity states with minimal assumptions about potential information encoding models.

ps

pdf: publisher site pdf: author's proof DOI Project Page [BibTex]

pdf: publisher site pdf: author's proof DOI Project Page [BibTex]


Thumb xl thumb teaser mrg
Metric Regression Forests for Correspondence Estimation

Pons-Moll, G., Taylor, J., Shotton, J., Hertzmann, A., Fitzgibbon, A.

International Journal of Computer Vision, pages: 1-13, 2015 (article)

ps

springer PDF Project Page [BibTex]

springer PDF Project Page [BibTex]


Thumb xl advs201570022 gra 0001 m
Selectable Nanopattern Arrays for Nanolithographic Imprint and Etch-Mask Applications

Jeong, H. H., Mark, A. G., Lee, T., Son, K., Chen, W., Alarcon-Correa, M., Kim, I., Schütz, G., Fischer, P.

Adv. Science, 2(7):1500016, 2015, Featured cover article. (article)

Abstract
A parallel nanolithographic patterning method is presented that can be used to obtain arrays of multifunctional nanoparticles. These patterns can simply be converted into a variety of secondary nanopatterns that are useful for nanolithographic imprint, plasmonic, and etch-mask applications.

pf

link (url) DOI [BibTex]

link (url) DOI [BibTex]


Thumb xl fotorobos
Formation control driven by cooperative object tracking

Lima, P., Ahmad, A., Dias, A., Conceição, A., Moreira, A., Silva, E., Almeida, L., Oliveira, L., Nascimento, T.

Robotics and Autonomous Systems, 63(1):68-79, 2015 (article)

Abstract
In this paper we introduce a formation control loop that maximizes the performance of the cooperative perception of a tracked target by a team of mobile robots, while maintaining the team in formation, with a dynamically adjustable geometry which is a function of the quality of the target perception by the team. In the formation control loop, the controller module is a distributed non-linear model predictive controller and the estimator module fuses local estimates of the target state, obtained by a particle filter at each robot. The two modules and their integration are described in detail, including a real-time database associated to a wireless communication protocol that facilitates the exchange of state data while reducing collisions among team members. Simulation and real robot results for indoor and outdoor teams of different robots are presented. The results highlight how our method successfully enables a team of homogeneous robots to minimize the total uncertainty of the tracked target cooperative estimate while complying with performance criteria such as keeping a pre-set distance between the teammates and the target, avoiding collisions with teammates and/or surrounding obstacles.

ps

DOI [BibTex]

DOI [BibTex]

2009


Thumb xl foe2009
Fields of Experts

Roth, S., Black, M. J.

International Journal of Computer Vision (IJCV), 82(2):205-29, April 2009 (article)

Abstract
We develop a framework for learning generic, expressive image priors that capture the statistics of natural scenes and can be used for a variety of machine vision tasks. The approach provides a practical method for learning high-order Markov random field (MRF) models with potential functions that extend over large pixel neighborhoods. These clique potentials are modeled using the Product-of-Experts framework that uses non-linear functions of many linear filter responses. In contrast to previous MRF approaches all parameters, including the linear filters themselves, are learned from training data. We demonstrate the capabilities of this Field-of-Experts model with two example applications, image denoising and image inpainting, which are implemented using a simple, approximate inference scheme. While the model is trained on a generic image database and is not tuned toward a specific application, we obtain results that compete with specialized techniques.

ps

pdf pdf from publisher [BibTex]

2009


pdf pdf from publisher [BibTex]


Thumb xl toc image
Full phase and amplitude control in computer-generated holography

Fratz, M., Fischer, P., Giel, D. M.

OPTICS LETTERS, 34(23):3659-3661, 2009 (article)

Abstract
We report what we believe to be the first realization of a computer-generated complex-valued hologram recorded in a single film of photoactive polymer. Complex-valued holograms give rise to a diffracted optical field with control over its amplitude and phase. The holograms are generated by a one-step direct laser writing process in which a spatial light modulator (SLM) is imaged onto a polymer film. Temporal modulation of the SLM during exposure controls both the strength of the induced birefringence and the orientation of the fast axis. We demonstrate that complex holograms can be used to impart arbitrary amplitude and phase profiles onto a beam and thereby open new possibilities in the control of optical beams. (C) 2009 Optical Society of America

pf

[BibTex]

[BibTex]


Thumb xl toc image
Digital polarization holograms with defined magnitude and orientation of each pixel’s birefringence

Fratz, M., Giel, D. M., Fischer, P.

OPTICS LETTERS, 34(8):1270-1272, 2009 (article)

Abstract
A new form of digital polarization holography is demonstrated that permits both the amplitude and the phase of a diffracted beam to be independently controlled. This permits two independent intensity images to be stored in the same hologram. To fabricate the holograms, a birefringence with defined retardance and orientation of the fast axis is recorded into a photopolymer film. The holograms are selectively read out by choosing the polarization state of the read beam. Polarization holograms of this kind increase the data density in holographic data storage and allow higher quality diffractive optical elements to be written. (C) 2009 Optical Society of America

pf

[BibTex]


Thumb xl toc images
Controlled Propulsion of Artificial Magnetic Nanostructured Propellers

Ghosh, A., Fischer, P.

NANO LETTERS, 9(6):2243-2245, 2009, Featured highlight ‘Nanotechnology: The helix that delivers’ Nature 459, 13 (2009). (article)

Abstract
For biomedical applications, such as targeted drug delivery and microsurgery, it is essential to develop a system of swimmers that can be propelled wirelessly in fluidic environments with good control. Here, we report the construction and operation of chiral colloidal propellers that can be navigated in water with micrometer-level precision using homogeneous magnetic fields. The propellers are made via nanostructured surfaces and can be produced in large numbers. The nanopropellers can carry chemicals, push loads, and act as local probes in rheological measurements.

Featured highlight ‘Nanotechnology: The helix that delivers’ Nature 459, 13 (2009).

pf

Video - Nanospropellers DOI [BibTex]

Video - Nanospropellers DOI [BibTex]


Thumb xl toc image
Absolute Asymmetric Reduction Based on the Relative Orientation of Achiral Reactants

Kuhn, A., Fischer, P.

ANGEWANDTE CHEMIE-INTERNATIONAL EDITION, 48(37):6857-6860, 2009 (article)

pf

DOI [BibTex]

DOI [BibTex]


Thumb xl ajp1
Left Ventricular Regional Wall Curvedness and Wall Stress in Patients with Ischemic Dilated Cardiomyopathy

Liang Zhong, Yi Su, Si Yong Yeo, Ru San Tan Dhanjoo Ghista, Ghassan Kassab

American Journal of Physiology – Heart and Circulatory Physiology, 296(3):H573-84, 2009 (article)

Abstract
Geometric remodeling of the left ventricle (LV) after myocardial infarction is associated with changes in myocardial wall stress. The objective of this study was to determine the regional curvatures and wall stress based on three-dimensional (3-D) reconstructions of the LV using MRI. Ten patients with ischemic dilated cardiomyopathy (IDCM) and 10 normal subjects underwent MRI scan. The IDCM patients also underwent delayed gadolinium-enhancement imaging to delineate the extent of myocardial infarct. Regional curvedness, local radii of curvature, and wall thickness were calculated. The percent curvedness change between end diastole and end systole was also calculated. In normal heart, a short- and long-axis two-dimensional analysis showed a 41 +/- 11% and 45 +/- 12% increase of the mean of peak systolic wall stress between basal and apical sections, respectively. However, 3-D analysis showed no significant difference in peak systolic wall stress from basal and apical sections (P = 0.298, ANOVA). LV shape differed between IDCM patients and normal subjects in several ways: LV shape was more spherical (sphericity index = 0.62 +/- 0.08 vs. 0.52 +/- 0.06, P < 0.05), curvedness at end diastole (mean for 16 segments = 0.034 +/- 0.0056 vs. 0.040 +/- 0.0071 mm(-1), P < 0.001) and end systole (mean for 16 segments = 0.037 +/- 0.0068 vs. 0.067 +/- 0.020 mm(-1), P < 0.001) was affected by infarction, and peak systolic wall stress was significantly increased at each segment in IDCM patients. The 3-D quantification of regional wall stress by cardiac MRI provides more precise evaluation of cardiac mechanics. Identification of regional curvedness and wall stresses helps delineate the mechanisms of LV remodeling in IDCM and may help guide therapeutic LV restoration.

ps

[BibTex]

[BibTex]


Thumb xl mbec1
A Curvature-Based Approach for Left Ventricular Shape Analysis from Cardiac Magnetic Resonance Imaging

Si Yong Yeo, Liang Zhong, Yi Su, Ru San Tan, Dhanjoo Ghista

Medical & Biological Engineering & Computing, 47(3):313-322, 2009 (article)

Abstract
It is believed that left ventricular (LV) regional shape is indicative of LV regional function, and cardiac pathologies are often associated with regional alterations in ventricular shape. In this article, we present a set of procedures for evaluating regional LV surface shape from anatomically accurate models reconstructed from cardiac magnetic resonance (MR) images. LV surface curvatures are computed using local surface fitting method, which enables us to assess regional LV shape and its variation. Comparisons are made between normal and diseased hearts. It is illustrated that LV surface curvatures at different regions of the normal heart are higher than those of the diseased heart. Also, the normal heart experiences a larger change in regional curvedness during contraction than the diseased heart. It is believed that with a wide range of dataset being evaluated, this approach will provide a new and efficient way of quantifying LV regional function.

ps

link (url) [BibTex]

link (url) [BibTex]

2003


Thumb xl hedvig
Learning the statistics of people in images and video

Sidenbladh, H., Black, M. J.

International Journal of Computer Vision, 54(1-3):183-209, August 2003 (article)

Abstract
This paper address the problems of modeling the appearance of humans and distinguishing human appearance from the appearance of general scenes. We seek a model of appearance and motion that is generic in that it accounts for the ways in which people's appearance varies and, at the same time, is specific enough to be useful for tracking people in natural scenes. Given a 3D model of the person projected into an image we model the likelihood of observing various image cues conditioned on the predicted locations and orientations of the limbs. These cues are taken to be steered filter responses corresponding to edges, ridges, and motion-compensated temporal differences. Motivated by work on the statistics of natural scenes, the statistics of these filter responses for human limbs are learned from training images containing hand-labeled limb regions. Similarly, the statistics of the filter responses in general scenes are learned to define a “background” distribution. The likelihood of observing a scene given a predicted pose of a person is computed, for each limb, using the likelihood ratio between the learned foreground (person) and background distributions. Adopting a Bayesian formulation allows cues to be combined in a principled way. Furthermore, the use of learned distributions obviates the need for hand-tuned image noise models and thresholds. The paper provides a detailed analysis of the statistics of how people appear in scenes and provides a connection between work on natural image statistics and the Bayesian tracking of people.

ps

pdf pdf from publisher code DOI [BibTex]

2003


pdf pdf from publisher code DOI [BibTex]


Thumb xl delatorreijcvteaser
A framework for robust subspace learning

De la Torre, F., Black, M. J.

International Journal of Computer Vision, 54(1-3):117-142, August 2003 (article)

Abstract
Many computer vision, signal processing and statistical problems can be posed as problems of learning low dimensional linear or multi-linear models. These models have been widely used for the representation of shape, appearance, motion, etc., in computer vision applications. Methods for learning linear models can be seen as a special case of subspace fitting. One draw-back of previous learning methods is that they are based on least squares estimation techniques and hence fail to account for “outliers” which are common in realistic training sets. We review previous approaches for making linear learning methods robust to outliers and present a new method that uses an intra-sample outlier process to account for pixel outliers. We develop the theory of Robust Subspace Learning (RSL) for linear models within a continuous optimization framework based on robust M-estimation. The framework applies to a variety of linear learning problems in computer vision including eigen-analysis and structure from motion. Several synthetic and natural examples are used to develop and illustrate the theory and applications of robust subspace learning in computer vision.

ps

pdf code pdf from publisher Project Page [BibTex]

pdf code pdf from publisher Project Page [BibTex]


Thumb xl ijcvcoverhd
Guest editorial: Computational vision at Brown

Black, M. J., Kimia, B.

International Journal of Computer Vision, 54(1-3):5-11, August 2003 (article)

ps

pdf pdf from publisher [BibTex]

pdf pdf from publisher [BibTex]


Thumb xl cviu91teaser
Robust parameterized component analysis: Theory and applications to 2D facial appearance models

De la Torre, F., Black, M. J.

Computer Vision and Image Understanding, 91(1-2):53-71, July 2003 (article)

Abstract
Principal component analysis (PCA) has been successfully applied to construct linear models of shape, graylevel, and motion in images. In particular, PCA has been widely used to model the variation in the appearance of people's faces. We extend previous work on facial modeling for tracking faces in video sequences as they undergo significant changes due to facial expressions. Here we consider person-specific facial appearance models (PSFAM), which use modular PCA to model complex intra-person appearance changes. Such models require aligned visual training data; in previous work, this has involved a time consuming and error-prone hand alignment and cropping process. Instead, the main contribution of this paper is to introduce parameterized component analysis to learn a subspace that is invariant to affine (or higher order) geometric transformations. The automatic learning of a PSFAM given a training image sequence is posed as a continuous optimization problem and is solved with a mixture of stochastic and deterministic techniques achieving sub-pixel accuracy. We illustrate the use of the 2D PSFAM model with preliminary experiments relevant to applications including video-conferencing and avatar animation.

ps

pdf [BibTex]

pdf [BibTex]


Thumb xl toc image
New electro-optic effect: Sum-frequency generation from optically active liquids in the presence of a dc electric field

Fischer, P., Buckingham, A., Beckwitt, K., Wiersma, D., Wise, F.

PHYSICAL REVIEW LETTERS, 91(17), 2003 (article)

Abstract
We report the observation of sum-frequency signals that depend linearly on an applied electrostatic field and that change sign with the handedness of an optically active solute. This recently predicted chiral electro-optic effect exists in the electric-dipole approximation. The static electric field gives rise to an electric-field-induced sum-frequency signal (an achiral third-order process) that interferes with the chirality-specific sum-frequency at second order. The cross-terms linear in the electrostatic field constitute the effect and may be used to determine the absolute sign of second- and third-order nonlinear-optical susceptibilities in isotropic media.

pf

DOI [BibTex]

DOI [BibTex]


Thumb xl toc image
Chiral and achiral contributions to sum-frequency generation from optically active solutions of binaphthol

Fischer, P., Wise, F., Albrecht, A.

JOURNAL OF PHYSICAL CHEMISTRY A, 107(40):8232-8238, 2003 (article)

Abstract
The nonlinear sum- and difference-frequency generation spectroscopies can be probes of molecular chirality in optically active systems. We present a tensorial analysis of the chirality-specific electric-dipolar sum-frequency-generation susceptibility and the achiral electric-quadrupolar and magnetic-dipolar nonlinearities at second order in isotropic media. The chiral and achiral contributions to the sum-frequency signal from the bulk of optically active solutions of 1,1'-bi-2-naphthol (2,2'-dehydroxy-1,1'-binaphthyl) can be distinguished, and the former dominates. Ab initio computations reveal the dramatic resonance enhancement that the isotropic component of the electric-dipolar three-wave mixing hyperpolarizability experiences. Away from resonance its magnitude rapidly decreases, as-unlike the vector component-it is zero in the static limit. The dispersion of the first hyperpolarizability is computed by a configuration interaction singles sum-over-states approach with explicit regard to the Franck-Condon active vibrational substructure for all resonant electronic states.

pf

DOI [BibTex]

DOI [BibTex]

1998


Thumb xl bildschirmfoto 2012 12 06 um 10.05.20
Summarization of video-taped presentations: Automatic analysis of motion and gesture

Ju, S. X., Black, M. J., Minneman, S., Kimber, D.

IEEE Trans. on Circuits and Systems for Video Technology, 8(5):686-696, September 1998 (article)

Abstract
This paper presents an automatic system for analyzing and annotating video sequences of technical talks. Our method uses a robust motion estimation technique to detect key frames and segment the video sequence into subsequences containing a single overhead slide. The subsequences are stabilized to remove motion that occurs when the speaker adjusts their slides. Any changes remaining between frames in the stabilized sequences may be due to speaker gestures such as pointing or writing, and we use active contours to automatically track these potential gestures. Given the constrained domain, we define a simple set of actions that can be recognized based on the active contour shape and motion. The recognized actions provide an annotation of the sequence that can be used to access a condensed version of the talk from a Web page.

ps

pdf pdf from publisher DOI [BibTex]

1998


pdf pdf from publisher DOI [BibTex]


Thumb xl bildschirmfoto 2012 12 06 um 12.22.18
Robust anisotropic diffusion

Black, M. J., Sapiro, G., Marimont, D., Heeger, D.

IEEE Transactions on Image Processing, 7(3):421-432, March 1998 (article)

Abstract
Relations between anisotropic diffusion and robust statistics are described in this paper. Specifically, we show that anisotropic diffusion can be seen as a robust estimation procedure that estimates a piecewise smooth image from a noisy input image. The edge-stopping; function in the anisotropic diffusion equation is closely related to the error norm and influence function in the robust estimation framework. This connection leads to a new edge-stopping; function based on Tukey's biweight robust estimator that preserves sharper boundaries than previous formulations and improves the automatic stopping of the diffusion. The robust statistical interpretation also provides a means for detecting the boundaries (edges) between the piecewise smooth regions in an image that has been smoothed with anisotropic diffusion. Additionally, we derive a relationship between anisotropic diffusion and regularization with line processes. Adding constraints on the spatial organization of the line processes allows us to develop new anisotropic diffusion equations that result in a qualitative improvement in the continuity of edges

ps

pdf pdf from publisher [BibTex]

pdf pdf from publisher [BibTex]


Thumb xl toc image
Surface second-order nonlinear optical activity

Fischer, P., Buckingham, A.

JOURNAL OF THE OPTICAL SOCIETY OF AMERICA B-OPTICAL PHYSICS, 15(12):2951-2957, 1998 (article)

Abstract
Following the recent observation of a large second-harmonic intensity difference from a monolayer of chiral molecules with left and right circularly polarized light, the scattering theory is generalized and extended to predict linear and circular intensity differences for the more Versatile sum-frequency spectroscopy. Estimates indicate that intensity differences should be detectable for a typical experimental arrangement. The second-order nonlinear surface susceptibility tensor is given for different surface point groups in the electric dipole approximation; it is shown that nonlinear optical activity phenomena unambiguously probe molecular chirality only for molecular monolayers that are symmetric about the normal. Other surface symmetries can give rise to intensity differences from monolayers composed of achiral molecules. A water surface is predicted to show Linear and nonlinear optical activity in the presence of an electric field parallel to the surface. (C) 1998 Optical Society of America {[}S0740-3224(98)01311-3] OCIS codes: 190.0190, 190.4350, 240.6490.

pf

DOI [BibTex]

DOI [BibTex]


Thumb xl toc image
Linear electro-optic effect in optically active liquids

Buckingham, A., Fischer, P.

CHEMICAL PHYSICS LETTERS, 297(3-4):239-246, 1998 (article)

Abstract
A linear effect of an electrostatic field F on the intensity of sum- and difference-frequency generation in a chiral liquid is predicted. It arises in the electric dipole approximation. The effect changes sign with the enantiomer and on reversing the direction of the electrostatic field. The sum-frequency generator chi(alpha beta gamma)((2)) (-omega(3);omega(1),omega(2)), where omega(3) = omega(1) + omega(2), and the electric field-induced sum-frequency generator chi(alpha beta gamma delta)((3))(-omega(3);omega(1),omega(2),0)F-delta interfere and their contributions to the scattering power can be distinguished. Encouraging predictions are given for a typical experimental arrangement. (C) 1998 Elsevier Science B.V. All rights reserved.

pf

DOI [BibTex]

DOI [BibTex]


Thumb xl toc image
Monolayers of hexadecyltrimethylammonium p-tosylate at the air-water interface. 1. Sum-frequency spectroscopy

Bell, G., Li, Z., Bain, C., Fischer, P., Duffy, D.

JOURNAL OF PHYSICAL CHEMISTRY B, 102(47):9461-9472, 1998 (article)

Abstract
Sum-frequency vibrational spectroscopy has been used to determine the structure of monolayers of the cationic surfactant, hexadecyltrimethylammonium p-tosylate (C(16)TA(+)Ts(-)), at the surface of water. Selective deuteration of the cation or the anion allowed the separate detection of sum-frequency spectra of the surfactant and of counterions that are bound to the monolayer. The p-tosylate ions an oriented with their methyl groups pointing away from the aqueous subphase and with the C-2 axis tilted, on average, by 30-40 degrees from the surface normal. The vibrational spectra of C(16)TA(+) indicate that the number of gauche defects in the monolayer does not change dramatically when bromide counterions are replaced by p-tosylate. The ends of the hydrocarbon chains of C16TA+ are, however, tilted much further from the surface normal in the presence of p-tosylate than in the presence of bromide. A quantitative analysis of the sum-frequency spectra requires a knowledge of the molecular hyperpolarizability tensor: the role of ab initio calculations and Raman spectroscopy in determining the components of this tensor is discussed.

pf

DOI [BibTex]

DOI [BibTex]


Thumb xl toc image
Ultraviolet resonance Raman study of drug binding in dihydrofolate reductase, gyrase, and catechol O-methyltransferase

Couling, V., Fischer, P., Klenerman, D., Huber, W.

BIOPHYSICAL JOURNAL, 75(2):1097-1106, 1998 (article)

Abstract
This paper presents a study of the use of ultraviolet resonance Raman (UVRR) spectroscopic methods as a means of elucidating aspects of drug-protein interactions. Some of the RR vibrational bands of the aromatic amino acids tyrosine and tryptophan are sensitive to the microenvironment, and the use of UV excitation radiation allows selective enhancement of the spectral features of the aromatic amino acids, enabling observation specifically of their change in microenvironment upon drug binding. The three drug-protein systems investigated in this study are dihydrofolate reductase with its inhibitor trimethoprim, gyrase with novobiocin, and catechol O-methyltransferase with dinitrocatechol. It is demonstrated that UVRR spectroscopy has adequate sensitivity to be a useful means of detecting drug-protein interactions in those systems for which the electronic absorption of the aromatic amino acids changes because of hydrogen bonding and/or possible dipole-dipole and dipole-polarizability interactions with the ligand.

pf

DOI [BibTex]

DOI [BibTex]


Thumb xl paybotteaser
PLAYBOT: A visually-guided robot for physically disabled children

Tsotsos, J. K., Verghese, G., Dickinson, S., Jenkin, M., Jepson, A., Milios, E., Nuflo, F., Stevenson, S., Black, M., Metaxas, D., Culhane, S., Ye, Y., Mann, R.

Image & Vision Computing, Special Issue on Vision for the Disabled, 16(4):275-292, 1998 (article)

Abstract
This paper overviews the PLAYBOT project, a long-term, large-scale research program whose goal is to provide a directable robot which may enable physically disabled children to access and manipulate toys. This domain is the first test domain, but there is nothing inherent in the design of PLAYBOT that prohibits its extension to other tasks. The research is guided by several important goals: vision is the primary sensor; vision is task directed; the robot must be able to visually search its environment; object and event recognition are basic capabilities; environments must be natural and dynamic; users and environments are assumed to be unpredictable; task direction and reactivity must be smoothly integrated; and safety is of high importance. The emphasis of the research has been on vision for the robot this is the most challenging research aspect and the major bottleneck to the development of intelligent robots. Since the control framework is behavior-based, the visual capabilities of PLAYBOT are described in terms of visual behaviors. Many of the components of PLAYBOT are briefly described and several examples of implemented sub-systems are shown. The paper concludes with a description of the current overall system implementation, and a complete example of PLAYBOT performing a simple task.

ps

pdf pdf from publisher DOI [BibTex]

pdf pdf from publisher DOI [BibTex]


Thumb xl bildschirmfoto 2012 12 06 um 12.33.38
EigenTracking: Robust matching and tracking of articulated objects using a view-based representation

Black, M. J., Jepson, A.

International Journal of Computer Vision, 26(1):63-84, 1998 (article)

Abstract
This paper describes an approach for tracking rigid and articulated objects using a view-based representation. The approach builds on and extends work on eigenspace representations, robust estimation techniques, and parameterized optical flow estimation. First, we note that the least-squares image reconstruction of standard eigenspace techniques has a number of problems and we reformulate the reconstruction problem as one of robust estimation. Second we define a “subspace constancy assumption” that allows us to exploit techniques for parameterized optical flow estimation to simultaneously solve for the view of an object and the affine transformation between the eigenspace and the image. To account for large affine transformations between the eigenspace and the image we define a multi-scale eigenspace representation and a coarse-to-fine matching strategy. Finally, we use these techniques to track objects over long image sequences in which the objects simultaneously undergo both affine image motions and changes of view. In particular we use this “EigenTracking” technique to track and recognize the gestures of a moving hand.

ps

pdf pdf from publisher video [BibTex]