Feature extraction and pattern classification are two key problems in face recognition. In order to solve the high-dimensional and Small Sample Size (SSS) problem of face recognition, start with the feature extraction of human face and dimensionality reduction algorithms, a quadratic feature extraction and dimensionality reduction algorithm model was put forward based on Restricted Boltzmann Machine (RBM). At first, the image was evenly divided into a number of local image blocks and quantified, then the image was processed by Gabor wavelet transformation. The Gabor facial features were encoded by RBM to learn more intrinsic characteristics of data, so as to achieve the purpose of dimensionality reduction of high-dimensional facial features. On the basis of that, a multimodal face recognition algorithm based on Deep Belief Network (DBN) was proposed. The recognition results on ORL, UMIST and FERET face databases with different sample sizes and different resolution images show that, compared with the linear dimension reduction method and shallow network method, the proposed method achieves better learning efficiency and good recognition result.
Video background completion is attracting more and more attention. For videos captured by complexly moving camera, the problem is even more challenging. In order to solve the problem, a motion-guided optimizing algorithm was proposed to complete the spatio-temporal hole left by foreground object removal. First, to estimate the motion field in the hole, a global objective function was established, and a hierarchical iterative method was applied to find its optimal solution. Completion problem was then abstracted into a Markov Random Field (MRF) problem. Using motion field as the guidance, video background was completed by optimally assigning available pixels from other parts to the missing regions. Finally, traditional illumination transfer strategy was improved, and a new illumination adjusting method was proposed to eliminate the illumination inconsistency in the completed parts. This approach got good results on a variety of videos. Compared with previous methods, this approach works better in keeping spatio-temporal coherence, and can be applied on videos with complex background captured by complexly moving camera.