University of Houston • University of Houston-Clear Lake • ISSO Annual Report Y2002—pp. 37-46

Optical Tracking

Ioannis Kakadiaris (UH), Karolos Grigoriadis (UH), Darby Magruder (NASA-JSC), Kenneth Baker (NASA-JSC), and Carlos Barrón (UH)

Abstract
Recently, we developed a technique that allows semi-automatic estimation of anthropometry and pose from a single image. However, estimation was limited to a class of images for which an adequate number of human body segments were almost parallel to the image plane. In this paper, we present a generalization of that estimation algorithm that exploits pairwise geometric relationships of body segments to allow estimation from a broader class of images. In addition, we refine our search space by constructing a fully populated discrete hyper-ellipsoid of stick human body models in order to capture the variance of the statistical anthropometric information. As a result, a better initial estimate can be computed by our algorithm; thus, the number of iterations needed during minimization are reduced tenfold. We present our results over a variety of images to demonstrate the broad coverage of our algorithm.

MOST OF THE RESEARCH IN THE AREA OF HUMAN MOTION analysis is directed at tracking humans in monocular or multi-camera image sequences1-4 because of the wide range of applications that could benefit from estimating and analyzing human motion. These applications include surveillance, performance measurement for athletes, the performance of patients with disabilities, motion capture for entertainment applications, and vision-based user interfaces.5,6 The important problem of estimating an individual’s anthropometry and pose from a single uncalibrated image has, however, received considerably less attention even though it constitutes the first step in many human tracking (from monocular images) algorithms.

Recently, we developed a technique that allows semiautomatic estimation of anthropometry (up to a scale parameter) and pose from a single image.7,8 We initially selected a set of image points that constituted the projection of selected landmarks. Using this information, along with a priori statistical information about the human body, we generated a set of plausible segment-length estimates. The third step produced a set of plausible poses based on joint limit constraints using a geometric method. In the fourth step, pose and anthropometric measurements were obtained by minimizing an appropriate cost function subject to the associated constraints. The novelty of that approach was the use of anthropometric statistics to constrain the estimation process that allowed the simultaneous estimation of both anthropometry and pose. Estimation was limited, however, to a class of images for which an adequate number of human body segments were almost parallel to the image plane. For example, that method could handle images like the one depicted in Fig. 1(a), but not images like the one depicted in Fig. 1(b).

(a)

(b)

Figure 1. (a) Instance of an image that can be handled by the algorithm described in Barrón and Kakadiaris7 and Yamamoto et al.19; (b) Instance of an image that could not be handled using the algorithm in Barrón and Kakadiaris7 and Yamamoto et al.,19 but can be handled using the algorithm described in this paper.

In this study, we present a generalization of that reconstruction algorithm to allow estimation from a broader class of images including the one depicted in Fig. 1(b). In particular, we first explore the projective properties of segments that have pairwise similar orientation with respect to the camera. Second, we extend our technique for exploiting prior statistical anthropometric information to allow us to obtain a better initial estimate of the human body model. As a result, the number of iterations needed by our algorithm to converge is reduced tenfold. Also, our algorithm can signal the absence of adequate information for producing a reliable estimate.

The following discussion describes our enhancements in more detail.9 In section 2, we review prior work in the area; in section 3 we provide our analysis and our proposed enhancements. In particular, we explore how to exploit additional geometric relationships of the human subject’s body segments by considering their foreshortening in the image. We describe our refinements of the prior statistical models to further constrain the selection of initial estimates for the minimization process. Finally, we illustrate results from our system.

Prior work
One of the challenges in model-based human tracking algorithms is the initialization of the model in the first frame of the image sequence. Tracking and posture estimation methods have been presented that use either one10-19 or multiple cameras.20-25 In most of the existing tracking approaches, the user specifies an approximate position and posture from the human model at the first frame of the image sequence.13,22,26-28 For the initialization step of their human tracking method, Bregler and Malik minimize a cost function over position, angles, and body dimensions.12 Unfortunately, no information is provided about the accuracy of their method nor for what class of postures their method worked.

Taylor29 presented a method for recovering information about the configuration of articulated objects from a single image. Their effort is similar to our work in that both methods assume a scaled orthographic projection, and both use geometrical information as constraints. The difference is that our work combines the geometrical constraints with the prior statistical anthropometric information to drive the estimation process. Thus, our method begins and ends with a plausible human model. Rosales and Sclaroff30 presented a method for pose estimation based on selecting the most likely pose given the learned probability distribution and the visual features’ similarities between hypothesis and input. For our algorithm, there is no need for a learning phase. Moeslund and Granum31 represented the human model in phase space spanned by its different degrees of freedom, and they used an analysis-bysynthesis approach to match the phase space model with real images for the case of a human-arm model. They currently examine what would be the impact of a significantly increased size of the phase space (as in the case of estimating the pose of a full human body model) in the efficiency of their algorithm. The contribution of our research is a systematic study and an improved technique that takes into consideration statistical anthropometric information to constrain the estimation process.

Analysis
The problem of anthropometry and pose estimation from a single image can be formulated as follows: Given a set of points in an image that corresponds to the projection of landmark points of a human subject, estimate both the anthropometric measurements (up to a scale) of the subject and his/her pose that best matches the observed image. In the following description, it is assumed that the selected landmarks at the image are the result of a scaled orthographic projection of three-dimensional landmark points of a human subject.

Stick human body model
For the purposes of this research, we have developed a generic stick human body model, SM, whose complete description can be found in Barrón and Kakadiaris.7,8 Briefly, SM is a tree (s, S,A), where S is a set of sites/landmarks and A is a collection of edges (segments) with endpoints in S, and s Î S is the root. In our case, A = { HD, RY, LY, NK, UT, RC, LC, RUA, LUA, RLA, LLA, RHD, LHD, LT, RHP, LHP, RUL, LUL, RLL, LLL, RF, LF} as enumerated in Fig. 2b, and the set of landmarks consists of a set of joints J ={at, sp, la, lc, le, lh, lk, ls, lw, ra, rc, re, rh, rk, rs, rw, wt} (information about the SM’s joints is provided in Table 1), and other landmarks M = {ry (right eye), ly (left eye), rhd (base of the right middle finger), lhd (base of the left middle finger), rf (tip of the right foot), lf (tip of the left foot)} (S = J È M). Default data for the joints and the anthropometric measurements are extracted from Churchill et al.32

kakadiaris-f2.gif (4055 bytes)

(a)

ID Segment
HD Head
RY Right Eye
LY Left Eye
NK Neck
UT Upper Torso
RC Right Clavicle
LC Left Clavicle
RUA Right Upper Arm
LUA Left Upper Arm
RLA Right Lower Arm
LLA Left Lower Arm
RHD Right Hand
LHD Left Hand
LT Lower Torso
RHP Right Hip
LHP Left Hip
RUL Right Upper Leg
LUL Left Upper Leg
RLL Right Lower Leg
LLL Left Lower Leg
RF Right Foot
LF Left Foot

(b)

Figure 2. (a) SM and associated coordinate systems; (b) Names of the SM’s segments (adapted from Barrón and Kakadiaris7,8)

Table 1. Information Related to Joints of the SM

ID Joint From To DOF
at atlanto occipital NK HD Tz*Rz*Ry*Rx
sp solar plexus UT NK Tz*Ry*Rz*x
la left ankle LLL LF Tx*Rz*Rx*Ry
lc left clavicle UT LC T Tz*Rx*Ry
le left elbow LUA LLA Tz*Ry
lh left hip LT LUL Tz*Rz*Rx*Ry
lk left knee LUL LLL Tz*R-y
ls left shoulder LC LUA Tz*Rz*Rx*Ry
lw left wrist LLA LHD Tz*Ry*Rx*Rz
ra right ankle RLL RF Tx*R-z*R-x*Ry
rc right clavicle UT RC Tz*R-x*Ry
re right elbow RUA RLA Tz*Ry
rh right hip LT RUL Tz*R-z*R-x*Ry
rk right knee RUL RLL Tz*R-y
rs right shoulder RC RUA Tz*R-z*R-x*Ry
rw right wrist RLA RHD Tz*Ry*R-x*R-z
wt waist LT UT Tz*Ry*Rz*Rx

Exploiting geometric relationships
In this section, we examine the foreshortening of the body segments in the image, under the assumption of scaled orthographic projection. Let = [XC, YC, ZC]T be the origin of the camera (see Fig. 3) and assume that the image plane is located at ZIM on the z-axis of the camera. As known, under scaled orthographic projection the point  = [X1, Y1, Z1] (see Fig. 3) projects to the point

     = [x1, y1, z1] = [Xc + l1(X1 - Xc), Yc + l1(Y1 - Yc), Zim]T,

where l1 = ZIM - ZC.

Figure 3. Notation pertaining to a scaled orthographic projection camera model assumed in this paper

Similarly, the point  = [X2, Y2, Z2] projects to the point

     = [x2, y2, z2] = [Xc + l2(X2 - Xc), Yc + l2(Y2 - Yc), Zim]T,

where l2 = ZIM - ZC/Z2 - ZC.

If we assume that this point lies on the same plane (normal to the camera z-axis) with point P1 then l1 = l2. Thus, for any point on the line P1P2, its projection is given by the equation [x, y]T = l1S[X, Y, Z]T, where

    

Similarly, for any point on the line , its projection is given by the equation [x,y]T == l3S[X, Y, Z]T, where l3 = (ZIM - ZC/Z3 - ZC) = (ZIM - ZC/Z4 - ZC) = l4. If az is a real number such that

     (1 +az) = (Z1 - ZC/Z3 - ZC),

(1)

then Z3 - ZC = Z1 - ZC/1 + az
and l3 = l1(1 + az).

Therefore, the scaled orthographic projection for the points of  is given by:

     [x,y]T = l1(1 + az)S[X, Y, Z]T.

Let L12 = || - || and l12 = || - ||, then
l
12 = [(x2 - x1)2 + (y2 - y1)2]1/2 = l1[ (X2 - X1)2 + (Y2 - Y1)2]1 / 2 = l1L12.

Also, we can obtain that l34 = l3L34, where L34 = || - || and l34 = || - ||. Using the relation l3 = l1(1 + az), we can obtain that l34 = l1(1 + az)L34. Finally, the ratio between l12 and l34 is given by l12/l34 = l1L12/(l1(1 + az)L34) = L12/(1 + az)L34 that implies the relation

     L12/L34 = (1 + az) l12/l34,

(2)

which suggests Proposition 1.

Proposition 1. For segments that lie in planes almost parallel to the image plane, the ratio of segment lengths in 3D and the ratio of their corresponding projected lengths are similar if and only if az is very small.

Proof. The result is obtained from Eq. (2).

Our previous work7,8 was based on the above proposition. In this work, we examine the relationship of segment lengths that lie in different planes using the distance between planes. For example, let’s examine the ratio of L13 = || - || and L24 = || - ||, and the ratio of l13 = || - || and l24 = || - ||.

Since, l13 = [(x3 - x1)2 + (y3 - y1)2]1/2 = |l1| {[(1 + az)X3 - X1]2 + [(1 + az)Y3 - Y1]2}1/2 = |l1| {L132 - (Z3 - Z1)2 + 2az[X3(X3 - X1) + Y3(Y3 - Y1)] + az2(X32 + Y32)}1/2, then, l12L132 = l132 + l12{(Z3 - Z1)2 - 2az[X3(X3 - X1) + Y3(Y3 - Y1)] - az2(X32 + Y32)}.

Using Eq. (1), we obtain L13 = l13/ 1(1 + ez), where 1 + ez = {1 + l2/l132 {az2(Z3 - ZC)2 - (X32 + Y32)] - 2az[X3(X3 - X1) + Y3(Y3 - Y1)]}1/2.

Similar arguments hold for L24 and l24. Thus,

     L13/L24 = l13 (1 + e13)/l24(1 + e24),

(3)

where ij depends on the position of points .

Proposition 2. Assume that two segments have the following two properties: (a) their endpoints lie on planes perpendicular to the camera’s Z axis, and (b) their orientation with regard to the camera is almost similar. Then the ratio of lengths of these two segments is similar to the ratio of lengths of their corresponding projected segments.

Proof. According to our hypotheses, e13 and e24 must have similar values; thus, the conclusion follows from Eq. (3).

In summary, the following two observations hold: (a) For segments that are almost parallel to the image plane, az takes a small value, as per Eq. (2); (b) For segments that have orientation almost parallel to each other, the corresponding factors, 1 + ez, for these segments must have similar value, as shown in Eq. (3). Based on these observations, during the initialization step of our algorithm, we mark the segments that have orientation almost parallel to the image plane (as in Barrón and Kakadiaris7,8); in addition, we mark any pair of segments that have similar orientation with respect to the camera. These projected segments are employed to compute the ratios that will be used as input for the selection of the initial stick model.

Exploiting prior statistical information
Using prior statistical anthropometric information, we build our cadre family as a multivariate representation of the extremes of the population distribution, as described in Barrón and Kakadiaris.7,8 The probability density function of the multivariate normal distribution is defined by

    

(4)

where k is the number of dimensions. In our case, the variables are the lengths of the 22 segments of our stick model, is a random vector, , and S are the mean and the covariance matrix of the population, and we use the quadratic form Q() = ( - )TS( - ) whose shape depends on S. We compute the principal components of S, and we select the first seven (i = 1, . . . , 7) with large eigenvalues such that l1 > l2 > · · · > l7. The following notation relates the SM and the eigenvectors of

    

where = [a1, . . . , a7]T such that

    

(5)

That is, we are using linear combinations of the eigenvectors constrained by Eq. (5) to span on the hyper-ellipsoid of S. In particular, we build a family of q different models (SM(q)), and we compute ratios of lengths rk,q as

    

(6)

where m(ln) is the mean length of the segment ln.

Furthermore, we compute the covariance coefficients for these ratios as

     vkj = m[(rk,q - m(rk))(rj,q - m(rj))].

(7)

We select an initial SM(q*) that satisfies the equation

    

(8)

where rk,q, rj,q are ratios obtained from the SMs in our cadre family and sk, sj are the corresponding ratios computed from the input data as follows:

    

Our objective is to find a finite subset of

    

in order to build our cadre family SM(q) that will provide a good initial estimate SM(q*). The complete algorithm for the SM selection is described in Barrón and Kakadiaris.8

Intuitively, we build all the linear combinations of the eigenvectors with at least one nonzero coefficient under the constraint that all nonzero coefficients must be equal, and we minimize Eq. (8) using a discrete cadre family. Note that in Eq. (8) we can use instead of

The following proposition justifies this modification:

Proposition 3.

Proof. Indeed,

    

Therefore, we can restrict D to

    

To analyze the extended set of selected models {SM() | Î DD, we introduce a metric space using the quadratic form Q, since DD depends on the matrix S. Let êi = , i = 1, . . . ,7 where ||.|| is the Euclidean norm of R22. The following proposition presents the matrix S and an explanation of the bounds of our cadre family.

Proposition 4. For every (,) Î DD and for every unit vector êi, the following properties hold:

  1. The inner product induced by S, (,)S = aTSb , "a,b Î R22 is commutative.
  2. (êi,êi)S = li ¹ 0, and (êi,êj)S = 0, i ¹ j (thus {êj} is an orthogonal set).

Proposition 3 explains why we restrict by the constraint

     .

The following equality holds:

    

Thus,

    

and by the previous proposition we obtain the inequality

    

This expression provides a theoretical bound for the extension of hyper-ellipsoid of Q(SM()) with the center in .

In the following, we describe our new full discretization DD of DD. Let I = {1, 2, . . . , 7} and k Í I. We select Î R7 such that SiÎI ai2 = 1, aj = 0, "j Ï I(k) and (ai =aj) " i, j Î k. For example, from this formulation, we can use a1 = a2 = . . . = a7 = ± 1/Ö7, or a1 = 0, and a2 = . . . = a7 = ± 1/Ö6, etc. This formulation extends our previous approach, and the number of SMs that we employ is

    

The coefficients and the number of elements for each SM are given in Table 2. The distribution of the SM(), Î DD can be estimated in terms of S and {}.

Table 2. Factor and Its Number of Linear Combinations

Coefficeint      Linear combinations

Proposition 5. Given , Î DD, then

    

and "k

    

Proof. The result follows from

    

However, this approach provides a large number of possible combinations for the initial anthropometric estimates. In particular, if Î DD, then

    

where = [l1m, l2m,. . . ,l22m] is the average human model, and = (dl1i, dl2i, . . . , dl22i), i = 1, . . . , 7 are the eigenvectors of S. For example, for a particular that corresponds to the index q, the ratios rk,q are given by the Eq. (6)

    

The terms and are used to compute the ratio, and they depend on the selection of ai,q,q. The benefit of using a larger cadre family than the one used in our previous approach is the improvement of the initial estimation using only a few ratios. It enables finding a solution for the images that depict humans in complicated postures and pruning the user-selected ratios in order to compute a good initial SM which, in turn, improves the accuracy of our minimization process.

Algorithm
Our technique for simultaneously estimating the anthropometry and the pose from a single uncalibrated image has four basic steps.

Algorithm. Anthropometry and pose estimation
Step 1: Selection of projected landmarks and ratios.
Step 2: Ratio pruning and choice of initial Stick Model.
Step 3: Initial estimates for pose.
Step 4: Iterative minimization over lengths and angles.

In this work, we have modified the first and second step of our algorithm presented in Barrón and Kakadiaris,7,8 as described in the following paragraphs.

Selection of projected landmarks and ratio computation. We have extended the user-interface developed for Barrón and Kakadiaris7,8 to allow the user not only to select the projection of the visible landmarks of a subject’s body, but also to define the ratios to be used for the initial estimates using the properties described in Section 3.2. Each ratio is treated independently, since the properties described in Section 3.2 are local and are valid for at least two segments. Note that the selected segments must be in parallel planes or in a similar orientation.

Ratio pruning and choice of initial stick model. Our basic assumption is that there are a number of segments that have the properties described in Section 3.2. The user defines the input ratios from segments with one of these properties. In this step, our algorithm compares the selected ratios with the range of our cadre family and selects the ones that lie inside the range. For the case in which at least one selected ratio lies within the range of our cadre family, the algorithm selects an SM using the technique described in Barrón and Kakadiaris.8 This step weighs the ratios using the Mahalanobis distance in order to select a model that closely matches the input ratios obtained from the image. Otherwise, the user is informed that the selected ratios lie outside the range of the cadre family (which means that our algorithm cannot handle this image), and the user is asked to select additional ratios, if possible.

Experimental results
We have performed a number of experiments on synthetic and real data to assess the accuracy, limitations, and advantages of our approach. In the first experiment, we applied our technique to an image created using the virtual human modeling tool EAI Jack. Figures 4(a) and 4(b) depict the reconstructed 3D model from two views. Tables 4 and 5 contain statistical information related to the accuracy of the estimation process. In this experiment, the maximum estimated error of the anthropometry dimensions decreases from 3.1743% to 0.4782%. In the second experiment, we applied our technique to a real image from the subject Vanessa whose anthropometric dimensions were manually measured. Figure 4(c) depicts the selected points, and Figs. 4(d)–(f) depict the reconstructed model from a novel view. Table 3 captures the percentage errors (PE) of the estimation process. We observe that the estimation of anthropometric information is within 1% of the anthropometric dimensions of the subject, clearly an improvement of our previous method whose accuracy was within 3.2%. In general, we have performed numerous other experiments with a variety of subjects whose anthropometric dimensions are known with similar, very encouraging results. In the third experiment, we applied our algorithm to a variety of images from a variety of application domains where anthropometric information about the subjects was not available. Figure 4(g) depicts the input image along with the selected points. Figure 4(h) is the reconstructed SM from another point of view. Figures 5(a)-(c), 6(a)-(c), and 7 depict the input images and their reconstructed 3D models.

Figure 4. (a) Front and (b) side views of the virtual human along with our estimates. (c) Input image for the subject Vanessa, and (d) camera model and reconstructed 3D model. (e) and (f) novel view of the reconstructed 3D model. (g) landmarks selected by a user on this baseball player image; (h) anthropometry and pose estimates from our algorithm.

Table 3. Accuracy of the Length Estimates for the Subject Vanessa

LC/(UT+LT) LLA/LUA LHP/LUA LF/LUL LF/LLL
Actual 0.6279 0.8625 0.6949 0.5517 0.4778
Estimated 0.6402 0.8516 0.6728 0.5594 0.4888
PE % 1.9589 1.2638 3.1803 1.3957 2.3022
New Estimates 0.6316 0.8652 0.6887 0.5501 0.4807
PE % 0.5893 0.313 0.8922 0.29 0.6069

Table 4. Accuracy of the Length Estimates for the Synthetic Experiment

LC/(UT+LT) LLA/LUA LHP/LUA LF/LUL LF/LLL
Actual 0.6553 0.9829 0.5700 0.6397 0.6341
Estimated 0.6517 0.9517 0.5713 0.6595 0.6460
PE % 0.5494 3.1743 0.2281 3.0952 1.8767
New Estimates 0.6526 0.9782 0.5692 0.6414 0.6350
PE % 0.4120 0.4782 0.1404 0.2657 0.1419

Table 5. Accuracy of the Pose Estimates for the Synthetic Experiment

Joint

Actual Values Estimated PE%

at

(0.0o,0.0o,0.0o) (0.00o,0.00o,0.00o) 0

sp

(-44.0o,0.0o,0.0o) (-44.05o,0.08o,0.0o) 0.21

la

(10.0o,-20.00o,10.0o) (10.02o,-19.90o,10.0o) 0.42

lc

(0.0o,0.0o) (0.0o,0.0o) 0

le

(90.0o) (90.11o) 0.12

lh

(0.0o,15.0o,10.0o) (0.0o,15.40o,9.96o) 2.23

lk

(15.0o) (15.15o) 1

ls

(44.0o,45.0o,145.0o) (43.38o,44.68o,145.31o) 0.48

lw

(0.0o,0.0o,0.0o) (0.0o,0.0o,0.0o) 0

ra

(0.0o,0.0o,0.0o) (0.0o,0.0o,0.0o) 0

rc

(0.0o,0.0o) (0.0o,0.0o) 0

re

(110.0o) (110.02o) 0.02

rh

(0.0o,15.0o,50.0o) (-0.18o,15.04o,49.87o) 0.43

rk

(55.0o) (54.94o) 0.11

rs

(70.0o,50.0o,20.0o) (70.24o,50.05o,19.91o) 0.3

rw

(0.0o,0.0o,0.0o) (0.0o,0.0o,0.0o) 0

wt

(25.0o,0.0o,0.0o) (25.15o,-0.12o,0.01o) 0.77

Figure 5. (a) Front input image depicting a basketball player along with the user-selected landmarks. (b) reconstructed model overlaid on the image. (c) novel view of the reconstructed model.

Figure 6. Examples from sports: (a) cyclist, (b) tennis player, and (c) golfer

Figure 7. Examples drawn from the automotive industry

Concluding Remarks
In this study, we investigated the problem of recovering the anthropometry (up to a scale parameter) and pose of the human figure from a single image. Specifically, we have to constrain the estimation process and allow the simultaneous estimation of both anthropometry and pose. Experimental results on a variety of images indicate that our method produces accurate results for a broad class of images.

Acknowledgments
This work was supported in part by the University of Houston Institute for Space Systems Operations.

References
1I. A. Kakadiaris and R. Sharma, ed. Proc., IEEE Human Motion Analysis and Synthesis Workshop, New York: IEEE Press, 2000.
2J. Aggarwal and Q. Cai. "Human Motion Analysis: AReview," Comput Vision Image Underst 73.3 (1999):428-40.
3D. Gavrila. "The Visual Analysis of Human Movement: A Survey," Comput Vision Image Underst 73.1 (1999): 82-88.
4T. B. Moeslund, E. Granum. "A Survey of Computer Vision Based Human Motion Capture," IEEE Trans. Syst. Man Cybernet 81.3 (2001): 231-68.
5I. A. Kakadiaris. "Optical Tracking for Telepresence and Teleoperation Space Applications," White paper, University of Houston, Houston, TX (1999).
6G. Martinez, I. A. Kakadiaris, D. Magruder. "Optical Tracking for Teleoperation Space Applications," Technical Report UH-CS-02-01, University of Houston, Houston, TX, 2002.
7C. Barrón and I. A. Kakadiaris. "Estimating Anthropometry and Pose from a Single Image," Proc., 2000 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, New York: IEEE Press, 2002. pp 669-76.
8C. Barrón and I. A. Kakadiaris. "Estimating Anthropometry and Pose from a Single Image," Comput Vision Image Underst 81 (2001): 269-84.
9Parts of this work appear in C. Barrón and I. A. Kakadiaris. "On the Improvement of Anthropometry and Pose Estimation from a Single Uncalibrated Image," IEEE Workshop on Human Motion, Austin, TX, Dec. 7-8, 2000. 53-60.
10H. J. Lee and Z. Chen. "Determination of 3D Human Body Postures from a Single View," Comput Vision Graph Image Process 30 (1985): 148-68.
11S. Wachter and H.-H. Nagel. "Tracking of Persons in Monocular Image Sequences," Proc., IEEE Nonrigid and Articulated Motion Workshop, New York: IEEE Press, 1997. 2-9.
12C. Bregler and J. Malik. "Tracking People with Twists and Exponential Maps," Proc., IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Santa Barbara, CA, June 23-25, 1998. 8-15.
13T. Cham and J. Rehg. "A Multiple Hypothesis Approach to Figure Tracking," Proc., 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Vol. 2, New York: IEEE Press, 1999. 239-45.
14R. Bowden. "Learning Statistical Models of Human Motion," in I. Kakadiaris and R. Sharma, ed. Proc., IEEE Workshop on Human Modeling, Analysis and Synthesis, New York: IEEE Press, 2000. 10-17.
15J. Deutscher, A. Blake, and I. Reid. "Articulated Body Motion Capture by Annealed Particle Filtering," Proc., IEEE Computer Society Conference on Computer Vision and Pattern Recognition, New York: IEEE Press, 2000. 126-33.
16I. Kakadiaris and D. Metaxas. "Model-Based Estimation of 3D Human Motion," IEEE Trans. on Pattern Anal Mach Intell 22.12 (2000): 1453-59.
17D. Ormoneit, H. Sidenbladth, M. Black, T. Hastie, D. Fleet. "Learning and Tracking Human Motion using Functional Analysis," in P ro c., IEEE Workshop on Human Modeling, Analysis and Synthesis. I. A. Kakadiaris and R. Sharma, ed. New York: IEEE Press, 2000. 2-9. (Preliminary publication.)
18M. Yamamoto and K. Yagishita. "Scene Constraints-Aided Tracking of Human Body," Proc., 2000 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, New York: IEEE Press, 2000. 151-56.
19M. Yamamoto, Y. Ohta, T. Yamagiwa, and K. Yagishita. "Human Action Tracking Guided by Key-Frames," Proc., 4th International Conference on Automatic Face and Gesture Recognition, March 2000. 354-61.
20A. Azarbayejani, C. Wren, A. Pentland. "Real-Time 3-D Tracking of the Human Body," Proc., IMAGE’COM 96, Bordeaux, France, May 1996.
21D. Gavrila and L. Davis. "3-D Model-Based Tracking of Humans in Action: A Multi-View Approach," Proc., 1996 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, June 18-20, 1996. 73-80.
22I. A. Kakadiaris and D. Metaxas. "Model-Based Estimation of 3D Human Motion with Occlusion Based on Active Multi-Viewpoint Selection," Proc., IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, June 18-20, 1996. 81-87.
23M. M. Covell, A. Rahimi, T. J. Darrell. "Articulated-Pose Estimation using Brightness- and Depth-Constancy Constraints," Proc., IEEE Computer Society Conference on Computer Vision and Pattern Recognition, New York: IEEE Press, 2000. 438-45.
24Q. Delamarre and O. Faugeras. "3D Articulated Models and Multi-View Tracking with Silhouettes," Proc., 7th International Conference on Computer Vision, Kerkyra, Greece, Sept. 20-27, 1999. 716-21.
25S. Iwasawa, J. Ohya, K. Takahashi, T. Sakaguchi, S. Kawato, K. Ebihara, and S. Morishima. "Real-Time, 3D Estimation of Human Body Postures from Trinocular Images," IEEE International Workshop on Modeling People, Corfu, Greece, Sept. 20, 1999. 3-10.
26R. Plankers, P. Fua, and N. D’Apuzzo. "Automated Body Modeling from Video Sequences," Proc., IEEE International Workshop on Modeling People, Corfu, Greece, Sept. 20, 1999. 45-52.
27I. A. Kakadiaris and D. Metaxas. "3D Human Body Model Acquisition from Multiple Views," Int. J. Comput. Vision 30.3 (1998): 191-218.
28I. A. Kakadiaris and D. Metaxas. "3D Human Body Model Acquisition from Multiple Views," Proc., International Conference on Computer Vision, Boston, MA, June 20-23, 1995. 618-23.
29C. J. Taylor. "Reconstruction of Articulated Objects from Point Correspondences," Proc., 2000 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, New York: IEEE Press, 2000. 677-84.
30R. Rosales and S. Sclaroff. "Inferring Body Pose without Tracking Body Parts," Proc., 2000 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, New York: IEEE Press, 2000. 721-27.
31T. B. Moeslund and E. Granum. "3D Human Pose Estimation Using 2D-Data and an Alternative Phase Space Representation," I. A. Kakadiaris and R. Sharma, ed. Proc., IEEE Workshop on Human Modeling, Analysis and Synthesis, New York: IEEE Press, 2000. 26-33.
32E. Churchill, J. T. McConville, L. L. Laubach, T. Churchill, P. Erskine, and K. Downing. Anthropometric Source Book, Vol. II: A Handbook of Anthropometric Data. NASA Technical Report 1024, Johnson Space Center, Houston, TX: NASA Scientific and Technical Information Office, 1978.

Publications
Barrón, C. and I. A. Kakadiaris. "Estimating Anthropometry and Pose from a Single Image," Computer Vision and Image Understanding 81.3 (2001): 269-84.
Barrón, C. and I. A. Kakadiaris. "On the Improvement of Anthropometry and Pose Estimation from a Single Uncalibrated Image," Machine Vision & Applications (2003). (In press.)

Presentations
Barrón, C. and I. A. Kakadiaris. "Estimating Anthropometry and Pose from a Single Image," IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Hilton Head Island, SC, June 13-15, 2000. 669-76.
Barrón, C. and I. A. Kakadiaris. "On the Improvement of Anthropometry and Pose Estimation from a Single Uncalibrated Image," IEEE Workshop on Human Motion, Austin, TX, Dec. 7-8 2000. 53-60.

Funding and proposals
"Control and Coordination for Teleoperation in Space Robotics." National Science Foundation, Co-PI: K. Grigoriadis (UH) and C. Layne (UH), June 2003-May 2006, $494,588 (not funded).

Investigative Team

UH PI: Ioannis Kakadiaris, Ph.D., Assistant Professor
Visual Computing Laboratory
Department of Computer Science
School of Natural Sciences and Mathematics
University of Houston
Houston, TX 77204-3010
Phone: (713) 743-1255; Fax: (713) 743-1250
Email: ioannisk@uh.edu

UH Co-PI: Karolos Grigoriadis, Ph.D., Associate Professor
Department of Mechanical Engineering
Cullen College of Engineering
University of Houston
Houston, TX 77204-4006
Phone: (713) 743-4387; Fax: (713) 743-4503
Email: karolos@uh.edu

NASA-JSC PI: Darby F. Magruder
Robotic Systems Technology Branch
Automation and Robotics Division
2101 NASA Road 1, Code ER4
Houston, TX 77058
Phone: (281) 483-7069; Fax: (281) 483-7580
Email: darby.f.magruder1@jsc.nasa.gov

NASA-JSC Co-PI: Kenneth Baker, Ph.D.
Robotic Systems Technology Branch
Automation and Robotics Division
2101 NASA Road 1, Code ER4
Houston, TX-77058
Phone: (281) 483-2041; Fax: (281) 483-7580
Email: kenneth.baker1@jsc.nasa.gov

UH PDAF: Carlos Barrón, Ph.D.
Visual Computing Laboratory
Department of Computer Science
School of Natural Sciences and Mathematics
University of Houston
Houston, TX 77204-3010
Phone: (713) 743-1303; Fax: (713) 743-1250
Email: cbarron@uh.edu

PDF (K)
Table of Contents

Institute for Space Systems Operations - Y2002 Annual Report
Copyright © 2003

Navigation Bar

foot-black.gif (4301 bytes)