An Effective Approach for NRSFM of Small-Size Image Sequences.

Ya-Ping Wang, Zhan-Li Sun, Kin-Man Lam.

Abstract

In recent years, non-rigid structure from motion (NRSFM) has become one of the hottest issues in computer vision due to its wide applications. In practice, the number of available high-quality images may be limited in many cases. Under such a condition, the performances may not be satisfactory when existing NRSFM algorithms are applied directly to estimate the 3D coordinates of a small-size image sequence. In this paper, a sub-sequence-based integrated algorithm is proposed to deal with the NRSFM problem with small sequence sizes. In the proposed method, sub-sequences are first extracted from the original sequence. In order to obtain diversified estimations, multiple weaker estimators are constructed by applying the extracted sub-sequences to a recent NRSFM algorithm with a rotation-invariant kernel (RIK). Compared to other first-order statistics, the trimmed mean is a relatively robust statistic. Considering the fact that the estimations of some weaker estimators may have large errors, the trimmed means of the outputs for all the weaker estimators are computed to determine the final estimated 3D shapes. Compared to some existing methods, the proposed algorithm can achieve a higher estimation accuracy, and has better robustness. Experimental results on several widely used image sequences demonstrate the effectiveness and feasibility of the proposed algorithm.

Year:  2015        PMID: 26161521      PMCID: PMC4498923          DOI: 10.1371/journal.pone.0132370

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Non-rigid structure from motion (NRSFM) is the process of recovering the relative camera motion, and the time-varying 3D coordinates of feature points on a deforming object, by means of the corresponding 2D points in a sequence of images. In many cases, the recovered 3D shapes can effectively enhance the performances of existing systems in object recognition, face perception, etc. [1-3]. Nevertheless, in the NRSFM model, the objects generally undergo a series of shape deformations and pose variations. Thus, in the absence of necessary prior knowledge on shape deformation, recovering the 3D shape and motion of nonrigid objects from 2D point tracks remains a difficult and ill-posed problem. As a pioneering work, a non-rigid model was proposed in [4] by formulating the 3D shape in each frame of a sequence as a linear combination of a set of basis shapes. Nevertheless, due to a lack of sufficient constraints on the shape deformation, the recovered 3D shapes are not unique under this model. In order to alleviate the ambiguities, recent research works have attempted to define additional constraints to make NRSFM more tractable [5]. More determined solutions are given in [6] by utilizing the facts that the bases degenerate under some special cases. In [7, 8], the 3D shape at each time instant is assumed to be drawn from a Gaussian distribution. Assuming that the 3D shape deformation is smooth over time, the time-varying structure of a nonrigid object is represented as a linear combination of a set of basis trajectories [9-11], e.g. the Discrete Cosine Transform (DCT) basis. Since the basis trajectories are known a priori, this method can significantly reduce the number of unknown parameters and improve the estimation stability. Instead of the time-varying structure, the camera’s trajectory is modeled as a linear combination of DCT basis vectors, which provides better results on complex articulated deformations [12, 13]. 
In [14], the complex deformable 3D shapes are represented as the outputs of a non-linear mapping via the kernel trick [15]. Recently, a novel NRSFM algorithm with a rotation-invariant kernel (RIK) was proposed in [16], which utilizes the spatial-variation constraint. A prominent advantage of this method is that it can deal with data that lack temporal ordering or contain abrupt deformations.

In practice, the number of available high-quality images may be limited in many cases, such as the face images in a surveillance system. If the existing NRSFM algorithms are directly used to estimate the 3D coordinates of a small-size image sequence, the estimation accuracy may be relatively low. In this paper, a sub-sequence-based integrated algorithm is proposed to deal with the small-sequence problem. In the proposed method, the 3D coordinates of each frame are estimated one by one. For a test frame, a few other frames are first randomly extracted from the original sequence. Then, the extracted frames, together with the test frame, form a sub-sequence to be applied to RIK. Similar to classifier-committee learning [17], the sub-sequence and the estimation process of RIK constitute a weaker estimator. Finally, the z-coordinates obtained by multiple weaker estimators are integrated and used as the final estimation for the test frame. Experimental results on several widely used image sequences demonstrate the effectiveness and feasibility of the proposed algorithm.

Methodology

Fig 1 shows the flowchart of the sub-sequence-based integrated RIK algorithm. There are three main steps in our algorithm: extract the sub-sequences from the original sequences, construct the weaker estimators based on the RIK algorithm, and integrate the outputs of the weaker estimators. A detailed description of these three steps is presented in the following subsections.
Fig 1

Flowchart of the sub-sequence-based integrated RIK algorithm.

Sub-Sequence Extraction

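The first step of our proposed method is to extract sub-sequences from a small-size sequence, as shown in Fig 2. For a sequence with F frames and n feature points in each of the frames, denote $[x_{tj}, y_{tj}]^T$ ($t = 1, 2, \cdots, F$; $j = 1, 2, \cdots, n$) as the 2D projection of the jth 3D point observed on the tth image. The n 2D point tracks of the F images can be represented as a 2F × n observation matrix W, i.e.

$$W = \begin{bmatrix} x_{11} & x_{12} & \cdots & x_{1n} \\ y_{11} & y_{12} & \cdots & y_{1n} \\ \vdots & \vdots & & \vdots \\ x_{F1} & x_{F2} & \cdots & x_{Fn} \\ y_{F1} & y_{F2} & \cdots & y_{Fn} \end{bmatrix}.$$

For the tth frame, the observation $w_t$ is a 2 × n matrix, as follows:

$$w_t = \begin{bmatrix} x_{t1} & x_{t2} & \cdots & x_{tn} \\ y_{t1} & y_{t2} & \cdots & y_{tn} \end{bmatrix}.$$

The observations $w_1, \cdots, w_F$ of an original sequence with F images are thus derived. When the 3D coordinates of the tth image are to be estimated, the matrix $\bar{W}_t$ shown in Fig 2 collects the observations of all the other frames:

$$\bar{W}_t = \begin{bmatrix} w_1^T & \cdots & w_{t-1}^T & w_{t+1}^T & \cdots & w_F^T \end{bmatrix}^T.$$

Assuming that the number of frames in a sub-sequence is $F_s$, the sub-sequence observation matrix $W_t^j$ is constructed by randomly selecting $F_s - 1$ observations from $\bar{W}_t$ and merging them with $w_t$. Thus, N sub-sequences $W_t^1, \cdots, W_t^N$ are obtained when the sub-sequence extraction process is repeated N times.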
Fig 2

The extraction of sub-sequences.
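
The extraction step described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation: the function names `extract_subsequences` and `stack_observations` are hypothetical, frames are indexed from 0 rather than 1, the random draw is taken without replacement, and the selected frames are sorted (the source does not say how the merged frames are ordered).

```python
import random

def extract_subsequences(num_frames, t, sub_len, num_subseq, seed=None):
    """Return num_subseq lists of frame indices, each containing test frame t.

    Each list holds sub_len indices: sub_len - 1 drawn at random (without
    replacement) from the other frames, plus the test frame t itself.
    """
    if not 0 <= t < num_frames:
        raise ValueError("t must index a frame of the sequence")
    if not 1 <= sub_len <= num_frames:
        raise ValueError("sub_len must lie between 1 and num_frames")
    rng = random.Random(seed)
    others = [i for i in range(num_frames) if i != t]
    subsequences = []
    for _ in range(num_subseq):
        picked = rng.sample(others, sub_len - 1)
        # Sorting is an assumption made for readability only.
        subsequences.append(sorted(picked + [t]))
    return subsequences

def stack_observations(W, frames):
    """Stack the x-row and y-row of each chosen frame from a 2F x n matrix W."""
    rows = []
    for f in frames:
        rows.append(W[2 * f])      # x-coordinates of frame f
        rows.append(W[2 * f + 1])  # y-coordinates of frame f
    return rows

# Settings used later in the experiments: F = 15, F_s = 6, N = 10.
subs = extract_subsequences(num_frames=15, t=0, sub_len=6, num_subseq=10, seed=0)
W = [[float(r)] * 3 for r in range(30)]   # toy 2F x n matrix with n = 3
W_sub = stack_observations(W, subs[0])    # 2*F_s x n sub-sequence matrix
```

Each entry of `subs` plays the role of one sub-sequence, and `W_sub` is the corresponding 2F_s × n observation matrix that would be passed to the NRSFM estimator.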

RIK-based Weaker Estimator

For each test frame $w_t$, we construct N sub-sequence observation matrices $W_t^j$ ($j = 1, \cdots, N$). In order to estimate the 3D coordinates of the tth frame, one sub-sequence is applied to the RIK algorithm. Assume that the number of basis shapes is K. In terms of the linear-subspace model [8], $W_t^j$ is factorized as a product of two matrices via singular value decomposition, i.e.

$$W_t^j = M S,$$

where M is a 2F × 3K camera matrix, and S includes K basis shapes, i.e.

$$S = \begin{bmatrix} S_1^T & S_2^T & \cdots & S_K^T \end{bmatrix}^T.$$

Further, M is decomposed as follows:

$$M = D (C \otimes I_3),$$

where the block-diagonal rotation matrix D is obtained via a Euclidean upgrade step [10], and C and $I_3$ represent a shape coefficient matrix and a 3 × 3 identity matrix, respectively. The operator ⊗ denotes the Kronecker product. Further, C is represented as a product of the coefficient matrix X and a new basis matrix B [13], i.e.

$$C = B X.$$

In the optimization procedure, X can be initialized as a low-rank identity matrix, and B is computed via the kernel mapping [15]. Let $c_t$ be the tth row of C. The 3D shape of the tth image can be given as follows:

$$\hat{S}_t = (c_t \otimes I_3) M^{\dagger} W_t^j,$$

where $M^{\dagger}$ denotes the Moore-Penrose pseudo-inverse of M [16].
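
To make the matrix shapes in this subsection concrete, the following NumPy sketch factorizes a toy observation matrix by truncated SVD and evaluates an expression of the form (c_t ⊗ I_3) M† W. It is a sketch under assumptions, not the RIK algorithm: it omits the Euclidean upgrade and the kernel-based optimization of C, the coefficient row `c_t` is made up, and the helper names `rank_3k_factorization` and `shape_from_coefficients` are hypothetical. The SVD factors are only defined up to an invertible 3K × 3K transform, which is exactly what the omitted steps would resolve.

```python
import numpy as np

def rank_3k_factorization(W, K):
    """Split a 2F x n matrix W into M (2F x 3K) and S (3K x n) by truncated SVD."""
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    r = 3 * K
    root = np.sqrt(s[:r])
    M = U[:, :r] * root          # scale the first r columns of U
    S = root[:, None] * Vt[:r]   # scale the first r rows of V^T
    return M, S

def shape_from_coefficients(c_t, M, W):
    """Evaluate (c_t kron I_3) pinv(M) W, giving a 3 x n array for one frame."""
    S = np.linalg.pinv(M) @ W    # 3K x n stacked basis shapes
    return np.kron(c_t.reshape(1, -1), np.eye(3)) @ S

rng = np.random.default_rng(0)
F, n, K = 6, 20, 2
# Toy observation matrix of rank 3K, standing in for one sub-sequence.
W = rng.standard_normal((2 * F, 3 * K)) @ rng.standard_normal((3 * K, n))
M, S = rank_3k_factorization(W, K)
c_t = np.array([0.7, 0.3])       # made-up shape coefficients for one frame
shape_t = shape_from_coefficients(c_t, M, W)
```

Here `M @ S` reproduces `W` because the toy matrix has rank 3K, and `shape_t` has one row per spatial coordinate and one column per feature point.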

Integration of Weaker Estimators

For the tth test frame, we can see from the preceding subsection that one set of estimated z-coordinates $z_t^j$ can be obtained for the jth sub-sequence $W_t^j$. When each sub-sequence is applied in turn to RIK, we can obtain N sets of estimated z-coordinates $z_t^j$ ($j = 1, \cdots, N$). Similar to the notion of classifier-committee learning [17] in pattern recognition, here each input and the corresponding reconstruction model can be considered as a weaker estimator. In order to integrate the results obtained by the N weaker estimators, the arithmetic average of $z_t^1, \cdots, z_t^N$ is a relatively simple implementation, i.e.

$$\bar{z}_t = \frac{1}{N} \sum_{j=1}^{N} z_t^j,$$

which can be used as the final estimated z-coordinates of the tth test image. Compared to the arithmetic average, the trimmed mean is a more robust integration estimation. Assuming that P percent of the observations are trimmed, the number $N_p$ of the smallest or the largest observations to be discarded is computed from N and P with a rounding operation [⋅]. Further, assuming that the entries of z are ordered such that $z^{(1)} < z^{(2)} < \cdots < z^{(N)}$, the trimmed mean can be computed as follows:

$$\tilde{z}_t = \frac{1}{N - 2N_p} \sum_{j=N_p+1}^{N-N_p} z_t^{(j)}.$$
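
The integration step can be illustrated with a small pure-Python sketch. It assumes that the trimming is applied separately to each feature point across the N estimators and that the same count is removed from both ends. The helper names `trimmed_mean` and `integrate_estimates` are hypothetical, and the per-side count `n_trim` is passed in directly rather than derived from P.

```python
def trimmed_mean(values, n_trim):
    """Mean of values after dropping the n_trim smallest and n_trim largest."""
    if n_trim < 0 or 2 * n_trim >= len(values):
        raise ValueError("n_trim must leave at least one value")
    ordered = sorted(values)
    kept = ordered[n_trim:len(ordered) - n_trim]
    return sum(kept) / len(kept)

def integrate_estimates(z_sets, n_trim):
    """Combine N per-estimator z-vectors point by point with a trimmed mean."""
    return [trimmed_mean(column, n_trim) for column in zip(*z_sets)]

# Five toy estimators, two feature points; two estimates are gross outliers.
z_sets = [
    [0.10, 0.50],
    [0.12, 0.52],
    [0.11, 0.49],
    [0.90, 0.51],   # outlier in the first point
    [0.09, -0.40],  # outlier in the second point
]
z_trimmed = integrate_estimates(z_sets, n_trim=1)
z_average = integrate_estimates(z_sets, n_trim=0)  # plain arithmetic average
```

With these toy numbers the trimmed result is close to 0.11 and 0.50, whereas the plain average of the first point is pulled up to roughly 0.26 by the single outlier, which is the robustness argument made above.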

Experimental results

Experimental data

We evaluate the performance of our proposed method on three synthetic-image sequences (stretch, face1, face2) and three real-image sequences (cubes, dance, matrix), which are widely used sequences and are publicly available [11, 16]. For these 6 sequences, the corresponding number of frames (T) and the number of point tracks (n) are shown in Table 1.
Table 1

The number of frames (T) and the number of point tracks (n) for 6 sequences.

No.  Sequence  T    n
1    stretch   370  41
2    face1     74   37
3    face2     316  40
4    cubes     200  14
5    matrix    105  30
6    dance     264  75
Besides these data, some real face-image sequences from the Bosphorus database are also used in the experiments. Bosphorus is a relatively new 3D face database that includes face images with a rich set of expressions and a systematic variation in poses [18]. To evaluate the estimation accuracy, two performance indices are adopted here to compare the true 3D shapes and the estimated results. One performance index is Pearson's linear correlation coefficient between the true z-coordinates $z$ and the estimated z-coordinates $\hat{z}$, i.e.

$$\rho(z, \hat{z}) = \frac{E\left[(z - \mu)(\hat{z} - \hat{\mu})\right]}{\sigma \hat{\sigma}},$$

where μ and σ are the respective mean and standard deviation of $z$, and $\hat{\mu}$ and $\hat{\sigma}$ are the respective mean and standard deviation of $\hat{z}$. A higher absolute value of $\rho$ means that $\hat{z}$ is closer to $z$. The other performance index is the mean error between the true z-coordinates $z$ and the estimated z-coordinates $\hat{z}$.
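
The two performance indices can be sketched as follows. The correlation is the standard Pearson coefficient. The exact form of the mean error is not recoverable from the text, so the mean absolute difference used here is an assumption, and the function names `pearson_correlation` and `mean_abs_error` are hypothetical.

```python
import math

def pearson_correlation(z_true, z_est):
    """Pearson's linear correlation coefficient between two equal-length lists."""
    n = len(z_true)
    mu_t = sum(z_true) / n
    mu_e = sum(z_est) / n
    cov = sum((a - mu_t) * (b - mu_e) for a, b in zip(z_true, z_est))
    var_t = sum((a - mu_t) ** 2 for a in z_true)
    var_e = sum((b - mu_e) ** 2 for b in z_est)
    return cov / math.sqrt(var_t * var_e)

def mean_abs_error(z_true, z_est):
    """Mean absolute difference (an assumed form of the 'mean error')."""
    return sum(abs(a - b) for a, b in zip(z_true, z_est)) / len(z_true)

z_true = [0.0, 1.0, 2.0, 3.0]
z_shifted = [0.1, 1.1, 2.1, 3.1]      # same profile, constant offset
z_flipped = [0.0, -1.0, -2.0, -3.0]   # depth-reversed profile

rho_shifted = pearson_correlation(z_true, z_shifted)
rho_flipped = pearson_correlation(z_true, z_flipped)
err_shifted = mean_abs_error(z_true, z_shifted)
```

The depth-reversed example has a correlation of magnitude one with a negative sign, which is why the absolute value of the coefficient is the quantity of interest.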

Experiments

In order to verify the performance of our proposed sub-sequence-based integrated RIK algorithm (denoted as SSI-RIK), we compare it to the original RIK method [16], EM-SFM [7], and CSF [14], which have relatively good performances among existing algorithms. As the challenge addressed in this paper is the NRSFM problem with small-size image sequences, we first extract a small sequence from an original sequence, to be used as the experimental data. Take the sequence stretch, for example: the first 15 frames are used to form a small sequence, i.e. F = 15. The length of sub-sequences ($F_s$) and the number of weaker estimators (N) are set at 6 and 10, respectively. For the four algorithms, Table 2 shows the correlation coefficients of the 15 frames, and the corresponding mean (μ) and standard deviation (σ). Table 3 shows the correlation coefficient increasing percentages (%) of SSI-RIK compared to EM-SFM, CSF and RIK. Additionally, Tables 4 and 5 show the corresponding performance comparisons for the z-coordinate errors. In these tables, the numbers 1 to 15 denote the 1st to 15th frames in the small sequence.
Table 2

The correlation coefficients, and the corresponding mean (μ) and standard deviation (σ), of 15 frames of the sequence stretch for 4 algorithms.

Frame Number  EM-SFM  CSF     RIK     SSI-RIK
1             0.1873  0.6087  0.1700  0.9813
2             0.1997  0.6459  0.2425  0.9879
3             0.2083  0.6966  0.3295  0.9859
4             0.2134  0.7719  0.4602  0.9882
5             0.1997  0.8420  0.6325  0.9944
6             0.1632  0.9011  0.8696  0.9915
7             0.1426  0.9612  0.9205  0.9954
8             0.1138  0.9921  0.9497  0.9960
9             0.1007  0.9893  0.9601  0.9946
10            0.0728  0.9554  0.9551  0.9934
11            0.0375  0.8951  0.9207  0.9903
12            0.0268  0.8120  0.9010  0.9874
13            0.0219  0.7251  0.8933  0.9802
14            0.0224  0.6436  0.8843  0.9817
15            0.0262  0.5705  0.8711  0.9762
μ             0.1157  0.8007  0.7307  0.9883
σ             0.0765  0.1461  0.2851  0.0062
Table 3

The correlation coefficient increasing percentages (%) of SSI-RIK compared to EM-SFM, CSF and RIK.

Frame Number  (SSI-RIK/EM-SFM - 1)*100  (SSI-RIK/CSF - 1)*100  (SSI-RIK/RIK - 1)*100
1             423.9241                  61.2107                477.2671
2             394.6490                  52.9373                307.3797
3             373.4054                  41.5431                199.2156
4             362.9820                  28.0223                114.7493
5             398.0821                  18.1051                57.2236
6             507.6249                  10.0365                14.0165
7             597.9663                  3.5580                 8.1411
8             775.5285                  0.3979                 4.8737
9             888.1652                  0.5341                 3.5889
10            1264.5                    3.9763                 4.0102
11            2540.4                    10.6278                7.5507
12            3588.5                    21.5968                9.5802
13            4366.2                    35.1860                9.7305
14            4284.4                    52.5452                11.0215
15            3619.7                    71.1259                12.0690
μ             753.8268                  23.4300                35.2578
Table 4

The z-coordinate errors, and the corresponding mean (μ) and standard deviation (σ), of 15 frames of the sequence stretch for 4 algorithms.

Frame Number  EM-SFM  CSF     RIK     SSI-RIK
1             0.5783  0.2936  0.4793  0.0420
2             0.5353  0.2680  0.4400  0.0370
3             0.4960  0.2350  0.3868  0.0441
4             0.4602  0.1852  0.3067  0.0388
5             0.4247  0.1433  0.2165  0.0309
6             0.4007  0.1078  0.1109  0.0295
7             0.3989  0.0630  0.0833  0.0218
8             0.3948  0.0269  0.0720  0.0210
9             0.3934  0.0304  0.0645  0.0273
10            0.3946  0.0661  0.0640  0.0284
11            0.3997  0.1066  0.0922  0.0325
12            0.4033  0.1514  0.1082  0.0357
13            0.4068  0.1963  0.1130  0.0449
14            0.4110  0.2403  0.1169  0.0422
15            0.4118  0.2843  0.1227  0.0497
μ             0.4342  0.1599  0.1851  0.0351
σ             0.0577  0.0917  0.1439  0.0086
Table 5

The z-coordinate error decreasing percentages (%) of SSI-RIK compared to EM-SFM, CSF and RIK.

Frame Number  (1 - SSI-RIK/EM-SFM)*100  (1 - SSI-RIK/CSF)*100  (1 - SSI-RIK/RIK)*100
1             92.7335                   85.6856                91.1333
2             93.0961                   86.2091                91.6044
3             91.1128                   81.2403                88.6056
4             91.5725                   79.0604                87.3552
5             92.7245                   78.4350                87.7288
6             92.6255                   72.5794                73.3644
7             94.5323                   65.3521                75.2942
8             94.6739                   21.8112                70.8067
9             93.0505                   10.1219                57.6105
10            92.8134                   57.0992                55.6592
11            91.8814                   69.5715                64.8051
12            91.1454                   76.4122                66.9998
13            88.9547                   77.1146                60.2482
14            89.7311                   82.4328                63.9039
15            88.0274                   82.5346                59.5209
μ             91.9268                   78.0756                81.0650
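
As a worked check of how the percentage columns in Tables 3 and 5 relate to the raw values in Tables 2 and 4, the two ratios can be written as small Python helpers. The function names `increase_percent` and `decrease_percent` are hypothetical, and the inputs are the four-decimal values printed in the tables for frame 1, so the results agree with the tabulated percentages only up to rounding.

```python
def increase_percent(ours, baseline):
    """(ours / baseline - 1) * 100, as used for the correlation coefficients."""
    return (ours / baseline - 1.0) * 100.0

def decrease_percent(ours, baseline):
    """(1 - ours / baseline) * 100, as used for the z-coordinate errors."""
    return (1.0 - ours / baseline) * 100.0

# Frame 1 of the sequence stretch, SSI-RIK against EM-SFM.
corr_gain = increase_percent(0.9813, 0.1873)   # Table 3 lists 423.9241
err_drop = decrease_percent(0.0420, 0.5783)    # Table 5 lists 92.7335
```

Both values land within a few hundredths of a percent of the table entries; the remaining gap presumably reflects rounding of the printed inputs.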
From Tables 2 and 3, we can see that the correlation coefficients of SSI-RIK are obviously higher than those of EM-SFM, CSF and RIK. Moreover, it can be seen from Tables 4 and 5 that the z-coordinate errors of SSI-RIK are significantly lower than those of EM-SFM, CSF and RIK. Thus, SSI-RIK has a higher estimation accuracy than the other methods. In addition, we can see from Tables 2 and 4 that the standard deviations of SSI-RIK are lower than those of the other three methods. This indicates that SSI-RIK is a more robust approach. Taking the first frame of stretch as an example, Figs 3 and 4 show the comparisons of the true values and the estimated values for the z-coordinate values and the 3D feature points, respectively. We can see that the z-coordinate values and the 3D feature points estimated by SSI-RIK are closer to the true values than those estimated by the other three methods, which coincides with the performance indices of the correlation coefficients and the z-coordinate errors.
Fig 3

The comparisons of the true z-coordinate values and the estimated z-coordinate values of the first frame of stretch for the four methods.

Fig 4

The comparisons of the true 3D feature points and the estimated 3D feature points of the first frame of stretch for the four methods.

In order to investigate the effect of sequence size (F) on the performances of the various algorithms, Tables 6 and 7 tabulate the mean and standard deviation (μ ± σ) of the correlation coefficients and the z-coordinate errors, respectively, when the sequence sizes vary from 15 to 50 in steps of 5. Moreover, for the mean values of the correlation coefficients and the z-coordinate errors, Tables 8 and 9 show the corresponding increasing percentages and decreasing percentages of SSI-RIK compared to EM-SFM, CSF and RIK, respectively.
Table 6

The mean and standard deviation (μ ± σ) of correlation coefficients when the sequence sizes vary from 15 to 50 with an equal interval 5.

F   EM-SFM         CSF            RIK            SSI-RIK
15  0.1157±0.0765  0.8007±0.1461  0.7307±0.2851  0.9883±0.0062
20  0.3508±0.0568  0.7093±0.1841  0.9702±0.0204  0.9868±0.0114
25  0.3104±0.0998  0.6626±0.2156  0.9659±0.0318  0.9870±0.0134
30  0.3042±0.1175  0.5896±0.2455  0.8659±0.1565  0.9763±0.0213
35  0.1448±0.0897  0.5185±0.2706  0.9705±0.0449  0.9708±0.0220
40  0.3888±0.0767  0.4432±0.2983  0.9490±0.0527  0.9719±0.0191
45  0.4564±0.0666  0.3875±0.3108  0.8095±0.2874  0.9678±0.0393
50  0.1167±0.0752  0.3658±0.3024  0.8296±0.2427  0.9710±0.0330
Table 7

The mean and standard deviation (μ ± σ) of z-coordinate errors when the sequence sizes vary from 15 to 50 with an equal interval 5.

F   EM-SFM         CSF            RIK            SSI-RIK
15  0.4342±0.0245  0.1599±0.0917  0.1851±0.1439  0.0351±0.0086
20  1.0501±0.0064  0.2217±0.1196  0.0551±0.0177  0.0352±0.0148
25  1.6247±0.0107  0.2620±0.1441  0.0522±0.0238  0.0328±0.0176
30  1.5909±0.0137  0.3101±0.1635  0.1028±0.0549  0.0409±0.0198
35  1.5841±0.0140  0.3534±0.1751  0.0488±0.0259  0.0473±0.0210
40  1.7107±0.0059  0.3935±0.1840  0.0666±0.0365  0.0482±0.0175
45  1.1868±0.0115  0.4153±0.1835  0.1504±0.1529  0.0478±0.0279
50  0.4857±0.0125  0.4232±0.1701  0.1341±0.1255  0.0458±0.0214
Table 8

The mean correlation coefficient increasing percentages (%) of SSI-RIK compared to EM-SFM, CSF and RIK, when the sequence sizes vary from 15 to 50 with an equal interval 5.

F   (SSI-RIK/EM-SFM - 1)*100  (SSI-RIK/CSF - 1)*100  (SSI-RIK/RIK - 1)*100
15  753.8268                  23.4300                35.2578
20  181.2612                  39.1249                1.7095
25  217.9668                  48.9574                2.1781
30  220.9478                  65.6084                12.7546
35  570.2348                  87.2287                0.0319
40  149.9892                  119.3056               2.4215
45  112.0418                  149.7742               19.5537
50  732.0983                  165.4305               17.0438
Table 9

The mean z-coordinate error decreasing percentages (%) of SSI-RIK compared to EM-SFM, CSF and RIK, when the sequence sizes vary from 15 to 50 with an equal interval 5.

F   (1 - SSI-RIK/EM-SFM)*100  (1 - SSI-RIK/CSF)*100  (1 - SSI-RIK/RIK)*100
15  91.9268                   78.0756                81.0650
20  96.6438                   84.1055                36.0142
25  97.9831                   87.4934                37.1732
30  97.4313                   86.8213                60.2506
35  97.0155                   86.6232                3.0704
40  97.1840                   87.7593                27.6980
45  95.9684                   88.4783                68.1849
50  90.5625                   89.1686                65.8147
Further, Figs 5 and 6 show the overall mean and standard deviation (μ ± σ) of the correlation coefficients and the z-coordinate errors for different sequence sizes, respectively. In these two figures, the x axis denotes image sequences in terms of the numbers shown in Table 1. From Tables 6–9 and Figs 5 and 6, we can see that SSI-RIK has a better performance than EM-SFM, CSF and RIK for different sequence sizes.
Fig 5

The overall mean and standard deviation (μ ± σ) of correlation coefficients for different sequence sizes.

Fig 6

The overall mean and standard deviation (μ ± σ) of z-coordinate errors for different sequence sizes.

We also present the experimental results on the real Bosphorus database. In experiments, the z-coordinates of the frontal-view images are estimated. As an example, Tables 10 and 11 show the correlation coefficients and the z-coordinate errors, respectively, when the sequence sizes vary from 7 to 14 for one individual. Moreover, Tables 12 and 13 show the corresponding increasing and decreasing percentages of SSI-RIK compared to EM-SFM, CSF and RIK, respectively. It can be seen that, for different sequence sizes, SSI-RIK generally achieves a better performance than EM-SFM, CSF and RIK.
Table 10

The correlation coefficients, and the corresponding mean (μ) and standard deviation (σ), when the sequence sizes vary from 7 to 14 for one individual.

F   EM-SFM  CSF     RIK     SSI-RIK
7   0.3403  0.3531  0.4884  0.8279
8   0.4070  0.3740  0.4807  0.7392
9   0.5498  0.0557  0.3836  0.6499
10  0.5487  0.5362  0.1682  0.6713
11  0.5513  0.7893  0.7490  0.4687
12  0.5740  0.8234  0.4467  0.6623
13  0.5978  0.8129  0.7675  0.7120
14  0.6020  0.1920  0.5513  0.6531
μ   0.5214  0.4921  0.5044  0.6731
σ   0.0952  0.2968  0.1939  0.1019
Table 11

The z-coordinate errors, and the corresponding mean (μ) and standard deviation (σ), when the sequence sizes vary from 7 to 14 for one individual.

F   EM-SFM  CSF     RIK     SSI-RIK
7   0.8670  0.4939  0.1774  0.1152
8   0.9005  0.3342  0.2935  0.2265
9   1.1727  0.5746  0.6061  0.2901
10  1.1730  0.1737  0.2184  0.3266
11  1.1560  0.2191  0.1494  0.3216
12  1.1092  0.1100  0.4518  0.2804
13  1.0099  0.1507  0.1296  0.1813
14  1.0289  0.4231  0.5520  0.2551
μ   1.0522  0.3099  0.3223  0.2496
σ   0.1212  0.1730  0.1889  0.0727
Table 12

The correlation coefficient increasing percentages (%) of SSI-RIK compared to EM-SFM, CSF and RIK, when the sequence sizes vary from 7 to 14 for one individual.

F   (SSI-RIK/EM-SFM - 1)*100  (SSI-RIK/CSF - 1)*100  (SSI-RIK/RIK - 1)*100
7   143.2793                  134.5014               69.5317
8   81.5967                   97.6609                53.7704
9   18.1888                   1066.9                 69.4185
10  22.3452                   25.1935                299.1271
11  -14.9772                  -40.6099               -37.4219
12  15.3750                   -19.5635               48.5639
13  19.0956                   -12.4199               -7.2393
14  8.5008                    240.2560               18.4719
μ   29.0906                   36.7832                33.4294
Table 13

The z-coordinate error decreasing percentages (%) of SSI-RIK compared to EM-SFM, CSF and RIK, when the sequence sizes vary from 7 to 14 for one individual.

F   (1 - SSI-RIK/EM-SFM)*100  (1 - SSI-RIK/CSF)*100  (1 - SSI-RIK/RIK)*100
7   84.7816                   76.6792                35.0948
8   71.2805                   32.2333                22.8317
9   76.8843                   49.5067                52.1282
10  73.9893                   -88.0403               -49.5110
11  74.0310                   -46.7908               -115.2227
12  76.4741                   -154.7749              37.9501
13  83.4065                   -20.2712               -39.8828
14  77.0480                   39.7177                53.7929
μ   77.0231                   19.4680                22.5591
Further, Figs 7 and 8 show the overall mean and standard deviation (μ ± σ) of correlation coefficients and z-coordinate errors for 10 individuals, respectively. In these two figures, the x axis denotes the individuals in terms of their corresponding number in the database. We can see that, again, SSI-RIK has a better performance than EM-SFM, CSF and RIK for different individuals.
Fig 7

The overall mean and standard deviation (μ ± σ) of correlation coefficients for 10 individuals.

Fig 8

The overall mean and standard deviation (μ ± σ) of z-coordinate errors for 10 individuals.

Discussions

There are two possible methods to integrate the outputs of the weaker estimators, i.e. the arithmetic average (denoted as AA-SSI-RIK) and the trimmed mean (denoted as TM-SSI-RIK). For the results given in Tables 10 and 11, Table 14 tabulates the correlation coefficients, the z-coordinate errors, and the corresponding mean (μ) and standard deviation (σ) when the sequence sizes vary from 7 to 14 using the different integration methods. Moreover, Table 15 shows the corresponding increasing and decreasing percentages of TM-SSI-RIK compared to AA-SSI-RIK. We can see that TM-SSI-RIK generally has a higher estimation accuracy than AA-SSI-RIK. Therefore, the trimmed mean is adopted in our proposed method to integrate the outputs of the weaker estimators.
Table 14

The correlation coefficients, z-coordinate errors, and the corresponding mean (μ) and standard deviation (σ), when the sequence sizes vary from 7 to 14 for different integration methods.

    Correlation coefficients    z-coordinate errors
F   AA-SSI-RIK  TM-SSI-RIK      AA-SSI-RIK  TM-SSI-RIK
7   0.6333      0.8279          0.3114      0.1152
8   0.6262      0.7392          0.4437      0.2265
9   0.6585      0.6499          0.3584      0.2901
10  0.6408      0.6713          0.4801      0.3266
11  0.5689      0.4687          0.3589      0.3216
12  0.6470      0.6623          0.3869      0.2804
13  0.6681      0.7120          0.2147      0.1813
14  0.6775      0.6531          0.2449      0.2551
μ   0.6400      0.6731          0.3499      0.2496
σ   0.0335      0.1019          0.0911      0.0727
Table 15

The corresponding increasing and decreasing percentages of TM-SSI-RIK compared to AA-SSI-RIK for the results given in Table 14.

F   (TM-SSI-RIK/AA-SSI-RIK - 1)*100  (1 - TM-SSI-RIK/AA-SSI-RIK)*100
7   30.7334                          63.0102
8   18.0353                          48.9562
9   -1.3173                          19.0484
10  4.7741                           31.9808
11  -17.6120                         10.3814
12  2.3555                           27.5474
13  6.5665                           15.5656
14  -3.5952                          -4.1499
μ   5.1567                           28.6650
As RIK was originally developed for long sequences, we also present here an experimental comparison of RIK and SSI-RIK when the entire sequence is used to estimate the 3D shapes. Tables 16 and 17 show the mean and standard deviation (μ ± σ) of the correlation coefficients and the z-coordinate errors, respectively. We can see that the performance of SSI-RIK is better than that of RIK for most sequences.
Table 16

The mean and standard deviation (μ ± σ) of the correlation coefficients when the entire sequences are used in the experiments for RIK and SSI-RIK.

Sequence  RIK             SSI-RIK
stretch   0.9421±0.0086   0.9778±0.0200
face1     0.9994±4.48e-4  0.9791±0.0234
face2     0.7447±0.1777   0.9209±0.0617
cubes     0.9793±0.0013   0.9855±0.0018
matrix    0.3081±0.1560   0.4577±0.1912
dance     0.9065±0.0132   0.9500±0.0109
Table 17

The mean and standard deviation (μ ± σ) of the z-coordinate errors when the entire sequences are used in the experiments for RIK and SSI-RIK.

Sequence  RIK            SSI-RIK
stretch   0.0705±0.0103  0.0460±0.0232
face1     0.0079±0.0031  0.0503±0.0380
face2     0.1384±0.0539  0.0657±0.0345
cubes     0.1212±0.0118  0.0766±0.0155
matrix    0.5439±0.0724  0.2540±0.0378
dance     0.0737±0.0051  0.0622±0.0071
Similar to pattern recognition, we tried to search for the optimal values of the parameters Fs, N and P with the cross-validation method, which is a widely used parameter selection approach. After the small-size sequences are extracted from the original sequences, the remaining frames are divided into 5 folds and used as the validation sets. A grid search is then carried out over the three parameters: the z-coordinates of the validation sets are estimated via the proposed method with each possible set of parameters Fs, N and P. Taking the sequence stretch as an example, Fig 9 shows the mean z-coordinate errors of the 5-fold validation sets for different Fs, N and P. Correspondingly, Fig 10 shows the z-coordinate errors of the testing sequences. We can see that a set of parameters with a small validation error does not necessarily give a small testing error. Thus, cross validation is not an effective way to search for the optimal parameters here. On the other hand, it can be seen from Fig 10 that the z-coordinate errors vary with the parameter values, but the variations are not large. Besides cross validation, there are many other parameter selection methods. Devising a more effective method to accurately determine the optimal parameter values is therefore a meaningful direction for future work.
Fig 9

The mean z-coordinate errors of the 5-fold validation sets with different parameters Fs, N and P for the sequence stretch.

Fig 10

The z-coordinate errors of the testing sets with different parameters Fs, N and P for the sequence stretch.
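
The parameter search described in this section (5 validation folds, a grid over Fs, N and P) can be sketched generically as follows. This is an illustration only: the helper names `k_fold_indices` and `grid_search` are hypothetical, the folds are taken as contiguous blocks (the source does not say how frames are assigned to folds), the grid values are made up, and `toy_error` is a stand-in for running the proposed method on a validation fold and measuring its z-coordinate error.

```python
import itertools

def k_fold_indices(indices, k):
    """Split indices into k nearly equal, contiguous folds."""
    folds, start = [], 0
    base, extra = divmod(len(indices), k)
    for i in range(k):
        size = base + (1 if i < extra else 0)
        folds.append(indices[start:start + size])
        start += size
    return folds

def grid_search(validation_frames, grid, error_fn, k=5):
    """Return the parameter setting with the lowest mean error over k folds."""
    folds = k_fold_indices(validation_frames, k)
    scores = {}
    for params in itertools.product(*grid.values()):
        setting = dict(zip(grid.keys(), params))
        fold_errors = [error_fn(fold, **setting) for fold in folds]
        scores[params] = sum(fold_errors) / len(fold_errors)
    best = min(scores, key=scores.get)
    return dict(zip(grid.keys(), best)), scores

def toy_error(fold, Fs, N, P):
    """Placeholder error surface; a real run would call the NRSFM estimator."""
    return abs(Fs - 6) + abs(N - 10) / 10 + P

# Frames left over after the first 15 of a 370-frame sequence are set aside.
validation_frames = list(range(15, 370))
grid = {"Fs": [4, 6, 8], "N": [5, 10], "P": [0.1, 0.2]}  # made-up grid values
best, scores = grid_search(validation_frames, grid, toy_error)
```

As the discussion above notes, the setting chosen this way did not reliably give the lowest testing error in the reported experiments, so the sketch shows the procedure rather than a recommended selection rule.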

Conclusions

In this paper, a sub-sequence-based integrated RIK algorithm is proposed for NRSFM with small-size sequences. Compared to some existing algorithms, the proposed method has a higher estimation accuracy. Moreover, the robustness of the proposed method is better than that of the existing algorithms. The experimental results on both the synthetic and the real data have verified the effectiveness and feasibility of the proposed method.