Thiago T. Santos

personal site

Archive for the ‘research’ Category

Image-based 3D digitizing for plant architecture analysis and phenotyping

leave a comment »

3D reconstruction for a basil specimen

Workshop on Industry Applications (WGARI) in SIBGRAPI 2012 (XXV Conference on Graphics, Patterns and Images)
August, 2012

Thiago T. Santos and Alberto A. de Oliveira

Functional-structural plant modeling and plant phenotyping require the measurement of geometric features in specimens. This data acquisition is called plant digitizing. Actually, these measurements are performed manually, in invasive or even destructive ways, or using expensive laser scanning equipment. Computer vision based 3D reconstruction is an accurate and low cost alternative for the digitizing of plants not presenting a dense canopy. Sparse canopies are found in several important annual plants in agriculture as soybean and maize, at least in their early stages of development.

This paper shows as the state of the art in structure from motion and multiple view stereo is able to produce accurate 3D models for specimens presenting sparse canopies. Three-dimensional triangular meshes are computed from a set of non-calibrated images, modeling a basil and an Ixora specimens and accurately representing their leaves and branches.

paper website

Written by Thiago Santos

August 30, 2012 at 7:51 pm

Automatic camera control in virtual environments augmented using multiple sparse videos

leave a comment »

Novel view renderings from the PETS 2006 scenes

Computers & Graphics
Volume 35, Issue 2, April 2011, Pages 412-421
Virtual Reality in Brazil; Visual Computing in Biology and Medicine; Semantic 3D media and content; Cultural Heritage

Jeferson R. Silva, Thiago T. Santos and Carlos H. Morimoto

Automated virtual camera control has been widely used in animation and interactive virtual environments. We have developed a multiple sparse camera based free view video system prototype that allows users to control the position and orientation of a virtual camera, enabling the observation of a real scene in three dimensions (3D) from any desired viewpoint. Automatic camera control can be activated to follow selected objects by the user. Our method combines a simple geometric model of the scene composed of planes (virtual environment), augmented with visual information from the cameras and pre-computed tracking information of moving targets to generate novel perspective corrected 3D views of the virtual camera and moving objects. To achieve real-time rendering performance, view-dependent textured mapped billboards are used to render the moving objects at their correct locations and foreground masks are used to remove the moving objects from the projected video streams. The current prototype runs on a PC with a common graphics card and can generate virtual 2D views from three cameras of resolution 768×576 with several moving objects at about 11 fps.

doi:10.1016/j.cag.2011.01.012

Multiple camera people detection and tracking using support integration

leave a comment »

Results, camera 2, frame 1721

Multi-view people detection

Pattern Recognition Letters
Volume 32, Issue 1, 1 January 2011, Pages 47-55
Image Processing, Computer Vision and Pattern Recognition in Latin America

Thiago T. Santos and Carlos H. Morimoto

This paper proposes a method to locate and track people by combining evidence from multiple cameras using the homography constraint. The proposed method use foreground pixels from simple background subtraction to compute evidence of the location of people on a reference ground plane. The algorithm computes the amount of support that basically corresponds to the “foreground mass” above each pixel. Therefore, pixels that correspond to ground points have more support. The support is normalized to compensate for perspective effects and accumulated on the reference plane for all camera views. The detection of people on the reference plane becomes a search for regions of local maxima in the accumulator. Many false positives are filtered by checking the visibility consistency of the detected candidates against all camera views. The remaining candidates are tracked using Kalman filters and appearance models. Experimental results using challenging data from PETS’06 show good performance of the method in the presence of severe occlusion. Ground truth data also confirms the robustness of the method.

doi:10.1016/j.patrec.2010.05.016

Written by Thiago Santos

January 1, 2011 at 5:02 pm

People Detection under Occlusion in Multiple Camera Views

leave a comment »

Resuls for frame 3300, PETS 2006 dataset

Five subjects found, marked on the floorplan

XXI Brazilian Symposium on Computer Graphics and Image Processing, 2008. SIBGRAPI ’08

Thiago T. Santos and Carlos H. Morimoto

This paper proposes a method to locate people on a reference plane using multiple cameras. Previous works rely on people trajectories and color models to solve occlusion.This new approach solves people detection under occlusion by accumulating evidence from multiple views instantaneously and does not rely on previous segmentation of individuals in foreground data or any tracking information.First, foreground data from one view, segmented using background subtraction, is projected onto the ground plane or reference image. The projected foreground of a second view overlaps the first projected foreground only on the points where the foreground intersects the ground plane.Thus, by accumulating the evidence from multiple views,people can be located by detecting local maxima on the accumulated reference image. Experimental results using publicly available data from PETSpsila06 [9] show that the method robustly locates people in very challenging situations with occlusion in most of the views. The locations on the ground plane can further be used for segmentation and tracking on each camera view under severe occlusion.

doi:10.1109/SIBGRAPI.2008.25

full paper

Written by Thiago Santos

October 12, 2008 at 5:34 pm

Follow

Get every new post delivered to your Inbox.