huangqx/image_shape_align
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% License and attribution:
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
This package provides software modules for image and shape analysis. The code is released under the MIT license and can be used for any purpose with proper attribution. The code accompanies the following paper, which should be cited in publications that use the provided modules:
Single-View Reconstruction via Joint Analysis of Image and Shape Collections
Qixing Huang, Hai Wang, and Vladlen Koltun
ACM Transactions on Graphics 34(4), 2015
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Loading the parameters
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
load(fullfile('data', 'parameters.mat'));
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Attention: The rendering settings in Para (i.e., Para.rHeights,
% Para.rWidhts) were tested on Windows machines. If you are using
% Linux or Mac, please call the following function to update
% these two vectors:
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
Para = update_para(Para);
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Loading the images and shapes
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
load(fullfile('data', 'car.mat'));
Note that the shapes are assumed to be consistently oriented in a world coordinate system.
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Module I: Pose estimation
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
The pose estimation module predicts the camera pose of the underlying object with respect to the world coordinate system of the shapes. It samples camera poses, renders the shapes from those poses, and compares the real images to the rendered images. Image rendering is the most time-consuming step, so it is done as a precomputation.
Cameras = cam_camera_sampling(Para); % Sample the camera poses
hogRender = cam_shape_hog_dess(...
    Shapes,...   % The aligned input shapes
    Cameras,...  % The sampled camera poses
    Para,...     % Please refer to the function body
    verbose);    % Set verbose = 1 to print output
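The comparison step described above can be pictured as a nearest-neighbor search over descriptors of the rendered views. The following is a minimal sketch on synthetic data, not the package's implementation: the variable names (renderedDesc, imageDesc, truePose), the descriptor layout (one column per sampled pose), and the squared Euclidean distance are all assumptions made for illustration.

```matlab
% Hypothetical toy data: one descriptor per column, one column per sampled pose.
rng(0);
numPoses = 8;
dim = 16;
renderedDesc = rand(dim, numPoses);   % stand-in for descriptors of rendered views
truePose = 5;                         % pose used to generate the toy image descriptor
imageDesc = renderedDesc(:, truePose) + 0.01 * randn(dim, 1);

% Score every sampled pose by squared Euclidean distance to the image descriptor
% and keep the closest one.
diffs = renderedDesc - repmat(imageDesc, 1, numPoses);
dists = sum(diffs .^ 2, 1);
[~, bestPose] = min(dists);
fprintf('Best pose index: %d\n', bestPose);
```

Because the rendered descriptors do not depend on the input image, they can be computed once and reused for every image, which is why precomputing them pays off.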
Camera estimation can be done in two ways: for a single image in isolation or for a collection of images.
Camera_init = cam_pose_est_single(...
    Image,...      % The input image object
    Shapes,...     % The aligned input shapes
    Cameras,...    % The sampled camera poses
    hogRender,...  % The rendered HOG descriptors
    Para,...       % Please refer to the function body
    verbose);      % Set verbose = 1 to print output
Cameras_init = cam_pose_est_joint(...
    Images,...     % The input images
    Shapes,...     % The aligned input shapes
    Cameras,...    % The sampled camera poses
    hogRender,...  % The rendered HOG descriptors
    Para,...       % Please refer to the function body
    verbose);      % Set verbose = 1 to print output
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Module II: Pairwise image-shape alignment
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
This module performs image-shape alignment for one image. It selects the closest shape and deforms it to fit the image object. Note that the pose estimation module is included in the following main function:
[Camera_opt, Shape_opt] = demo_i2s_align(...
    Image,...   % Input image
    Shapes,...  % Input shapes
    Para);      % Parameters used in alignment. Please refer to the function body for details
Output arguments:
'Shape_opt' : The optimized shape, which aligns with the input image object
'Camera_opt': The associated optimized camera configuration
Demo:
>load(fullfile('data', 'chair.mat'));
>load(fullfile('data', 'parameters.mat'));
>[Camera_opt, Shape_opt] = demo_i2s_align(Images{1}, Shapes, Para);
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Module III: Assembly-based reconstruction
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
Assembly-based reconstruction aims to recover the underlying 3D model of an image object by assembling parts from a small collection of relevant shapes. The input consists of one image and a few shapes, each of which comprises a number of components. We assume that the shapes are consistently aligned in a world coordinate system and that the pose of the image object is given. The main function is:
[Shape_opt, Camera_opt] = demo_ibm_mix_and_match(...
    Image,...        % The input image
    Camera_init,...  % The initial camera configuration for all input shapes
    Shapes,...       % The input shapes
    Para);           % Please refer to the function body for details
Output arguments:
'Shape_opt' : The reconstructed shape in the world coordinate system
'Camera_opt': The associated optimized camera configuration
Demo:
>load(fullfile('data', 'mix_and_match.mat'));
>load(fullfile('data', 'parameters.mat'));
>[Shape_opt, Camera_opt] = demo_ibm_mix_and_match(Image, Camera_init, Shapes, Para);
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Module IV: Joint image-shape correspondence
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
The joint image-shape correspondence module estimates dense pixelwise correspondences among a collection of images and a collection of shapes. We assume that (i) the camera poses of the images are precomputed (e.g., using Module I), and (ii) the shapes are consistently aligned in a world coordinate system. The main function is:
PairMatches = demo_joint_i2s_corres_main(...
    Images,...        % Input images
    ImageCameras,...  % The camera configurations associated with each input image
    Shapes,...        % The input shapes (aligned in a world coordinate system)
    Top,...           % A sparse (#Images x #Shapes) matrix that specifies the image-shape pairs for matching
    Para);
Warning: If you use more than 100 images and/or more than 100 shapes, the computation may take a very long time.
Demo:
>load(fullfile('data', 'joint_matching.mat'));
>load(fullfile('data', 'parameters.mat'));
>Corres_rs = demo_joint_i2s_corres_main(Images(1:10), ImageCameras(1:10), Shapes(1:5), ones(10,5), Para);
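The demo above passes ones(10,5), which selects every image-shape pair. To match only a subset of pairs, Top can be built with MATLAB's sparse function. The sketch below is illustrative only: the pairing rule (each image matched against two shapes chosen by index) is a hypothetical choice, and it assumes that a nonzero entry Top(i,j) marks image i and shape j as a pair to match, as the argument description suggests.

```matlab
numImages = 10;
numShapes = 5;

% Hypothetical selection: pair image i with two distinct shapes chosen by index.
imgIds   = [1:numImages, 1:numImages];
shapeIds = [mod(0:numImages-1, numShapes) + 1, mod(1:numImages, numShapes) + 1];

% Nonzero entries of Top mark the (image, shape) pairs selected for matching.
Top = sparse(imgIds, shapeIds, 1, numImages, numShapes);
fprintf('%d of %d possible pairs selected\n', nnz(Top), numImages * numShapes);
```

Since the warning above indicates that large collections are expensive, a sparser Top is one way to limit the number of pairs that are matched.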
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% External libraries
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
The software package uses the following external libraries:
Piotr's Image & Video Matlab Toolbox:
http://vision.ucsd.edu/~pdollar/toolbox/doc/
http://research.microsoft.com/en-us/downloads/389109f6-b4e8-404c-84bf-239f7cbf4e3d/
The SIFT flow package:
http://people.csail.mit.edu/celiu/SIFTflow/
The OPENGM2 structured prediction package:
http://hci.iwr.uni-heidelberg.de/opengm2/?l0=library
HOG descriptor: functions 'imResample', 'hog'
Edge map: function 'edgesDetect'
OPENGM2: the trws algorithm
References:
Navneet Dalal, Bill Triggs: Histograms of Oriented Gradients for Human Detection. CVPR 2005
Piotr Dollár, C. Lawrence Zitnick: Fast Edge Detection Using Structured Forests. IEEE Trans. Pattern Anal. Mach. Intell. 37(8): 1558-1570 (2015)
Ce Liu, Jenny Yuen, Antonio Torralba: SIFT Flow: Dense Correspondence across Scenes and Its Applications. IEEE Trans. Pattern Anal. Mach. Intell. 33(5): 978-994 (2011)
B. Andres, T. Beier, J. H. Kappes: OpenGM: A C++ Library for Discrete Graphical Models. http://arxiv.org/abs/1206.0111