Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (4): 1300-1308.DOI: 10.11772/j.issn.1001-9081.2025040398
• Multimedia computing and computer simulation • Previous Articles
Received:2025-04-14
Revised:2025-06-30
Accepted:2025-07-02
Online:2025-07-07
Published:2026-04-10
Contact:
Bo LIU
About author:CUI Xuan, born in 1999, M. S. candidate. His research interests include computer vision, image processing.
Supported by:通讯作者:
刘波
作者简介:崔选(1999—),男,湖南娄底人,硕士研究生,主要研究方向:计算机视觉、图像处理
基金资助:CLC Number:
Xuan CUI, Bo LIU. Unsupervised face attribute editing method based on dynamic convolutional autoencoder[J]. Journal of Computer Applications, 2026, 46(4): 1300-1308.
崔选, 刘波. 基于动态卷积自编码器的无监督人脸属性编辑方法[J]. 《计算机应用》唯一官方网站, 2026, 46(4): 1300-1308.
Add to citation manager EndNote|Ris|BibTeX
URL: https://www.joca.cn/EN/10.11772/j.issn.1001-9081.2025040398
| 环境 | 项目 | 具体内容 |
|---|---|---|
| 硬件配置 | GPU | NVIDIA A100 80 GB PCIe GPU |
| CPU | Intel Xeon Platinum 8358 | |
| 软件配置 | 操作系统 | Ubuntu 18.04.1 |
| 开发环境 | PyTorch 1.10 |
Tab. 1 Experimental environment configuration
| 环境 | 项目 | 具体内容 |
|---|---|---|
| 硬件配置 | GPU | NVIDIA A100 80 GB PCIe GPU |
| CPU | Intel Xeon Platinum 8358 | |
| 软件配置 | 操作系统 | Ubuntu 18.04.1 |
| 开发环境 | PyTorch 1.10 |
| 生成模型 | 属性名 | InterFaceGAN | AdaTrans | SDFlow | AUFAE | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| FID | LPIPS | SSIM | FID | LPIPS | SSIM | FID | LPIPS | SSIM | FID | LPIPS | SSIM | ||
| ProGAN | Age | 79.10 | 0.32 | 0.69 | 70.65 | 0.29 | 0.72 | 62.95 | 0.26 | 0.74 | 34.54 | 0.18 | 0.81 |
| Hair | 50.31 | 0.25 | 0.77 | 76.77 | 0.30 | 0.71 | 60.76 | 0.24 | 0.77 | 38.06 | 0.20 | 0.79 | |
| Gender | 89.52 | 0.36 | 0.65 | 76.47 | 0.30 | 0.71 | 74.35 | 0.31 | 0.69 | 47.44 | 0.25 | 0.73 | |
| Pose | 65.06 | 0.39 | 0.61 | — | — | — | — | — | — | 45.90 | 0.28 | 0.70 | |
| Smile | 45.82 | 0.21 | 0.80 | 75.86 | 0.32 | 0.69 | 60.66 | 0.23 | 0.78 | 33.76 | 0.18 | 0.81 | |
| 平均 | 65.96 | 0.31 | 0.70 | 74.94 | 0.30 | 0.71 | 64.68 | 0.26 | 0.75 | 39.94 | 0.22 | 0.77 | |
| StyleGAN | Age | 68.93 | 0.33 | 0.68 | 62.34 | 0.26 | 0.74 | 59.39 | 0.24 | 0.72 | 41.57 | 0.21 | 0.80 |
| Hair | 68.63 | 0.32 | 0.71 | 72.55 | 0.35 | 0.66 | 66.50 | 0.25 | 0.71 | 35.78 | 0.19 | 0.81 | |
| Gender | 82.38 | 0.38 | 0.63 | 78.59 | 0.33 | 0.68 | 68.08 | 0.26 | 0.69 | 46.36 | 0.24 | 0.76 | |
| Pose | 75.65 | 0.41 | 0.61 | — | — | — | — | — | — | 43.25 | 0.25 | 0.76 | |
| Smile | 30.65 | 0.17 | 0.84 | 68.21 | 0.33 | 0.68 | 57.42 | 0.22 | 0.73 | 39.59 | 0.20 | 0.81 | |
| 平均 | 65.25 | 0.32 | 0.69 | 70.42 | 0.32 | 0.69 | 62.85 | 0.24 | 0.71 | 41.31 | 0.22 | 0.79 | |
| StyleGAN2 | Age | 69.39 | 0.37 | 0.76 | 74.03 | 0.26 | 0.79 | 52.41 | 0.18 | 0.81 | 28.98 | 0.15 | 0.85 |
| Hair | 61.16 | 0.34 | 0.78 | 81.85 | 0.25 | 0.78 | 45.57 | 0.28 | 0.78 | 37.70 | 0.20 | 0.80 | |
| Gender | 62.77 | 0.33 | 0.79 | 79.92 | 0.26 | 0.79 | 59.00 | 0.22 | 0.77 | 27.23 | 0.14 | 0.86 | |
| Pose | 74.95 | 0.38 | 0.76 | — | — | — | — | — | — | 30.18 | 0.16 | 0.84 | |
| Smile | 69.95 | 0.36 | 0.77 | 44.18 | 0.25 | 0.80 | 50.21 | 0.26 | 0.78 | 30.29 | 0.16 | 0.84 | |
| 平均 | 67.64 | 0.37 | 0.77 | 70.00 | 0.26 | 0.79 | 51.80 | 0.24 | 0.79 | 30.88 | 0.16 | 0.84 | |
Tab. 2 Comparison of facial attribute editing effects by different methods
| 生成模型 | 属性名 | InterFaceGAN | AdaTrans | SDFlow | AUFAE | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| FID | LPIPS | SSIM | FID | LPIPS | SSIM | FID | LPIPS | SSIM | FID | LPIPS | SSIM | ||
| ProGAN | Age | 79.10 | 0.32 | 0.69 | 70.65 | 0.29 | 0.72 | 62.95 | 0.26 | 0.74 | 34.54 | 0.18 | 0.81 |
| Hair | 50.31 | 0.25 | 0.77 | 76.77 | 0.30 | 0.71 | 60.76 | 0.24 | 0.77 | 38.06 | 0.20 | 0.79 | |
| Gender | 89.52 | 0.36 | 0.65 | 76.47 | 0.30 | 0.71 | 74.35 | 0.31 | 0.69 | 47.44 | 0.25 | 0.73 | |
| Pose | 65.06 | 0.39 | 0.61 | — | — | — | — | — | — | 45.90 | 0.28 | 0.70 | |
| Smile | 45.82 | 0.21 | 0.80 | 75.86 | 0.32 | 0.69 | 60.66 | 0.23 | 0.78 | 33.76 | 0.18 | 0.81 | |
| 平均 | 65.96 | 0.31 | 0.70 | 74.94 | 0.30 | 0.71 | 64.68 | 0.26 | 0.75 | 39.94 | 0.22 | 0.77 | |
| StyleGAN | Age | 68.93 | 0.33 | 0.68 | 62.34 | 0.26 | 0.74 | 59.39 | 0.24 | 0.72 | 41.57 | 0.21 | 0.80 |
| Hair | 68.63 | 0.32 | 0.71 | 72.55 | 0.35 | 0.66 | 66.50 | 0.25 | 0.71 | 35.78 | 0.19 | 0.81 | |
| Gender | 82.38 | 0.38 | 0.63 | 78.59 | 0.33 | 0.68 | 68.08 | 0.26 | 0.69 | 46.36 | 0.24 | 0.76 | |
| Pose | 75.65 | 0.41 | 0.61 | — | — | — | — | — | — | 43.25 | 0.25 | 0.76 | |
| Smile | 30.65 | 0.17 | 0.84 | 68.21 | 0.33 | 0.68 | 57.42 | 0.22 | 0.73 | 39.59 | 0.20 | 0.81 | |
| 平均 | 65.25 | 0.32 | 0.69 | 70.42 | 0.32 | 0.69 | 62.85 | 0.24 | 0.71 | 41.31 | 0.22 | 0.79 | |
| StyleGAN2 | Age | 69.39 | 0.37 | 0.76 | 74.03 | 0.26 | 0.79 | 52.41 | 0.18 | 0.81 | 28.98 | 0.15 | 0.85 |
| Hair | 61.16 | 0.34 | 0.78 | 81.85 | 0.25 | 0.78 | 45.57 | 0.28 | 0.78 | 37.70 | 0.20 | 0.80 | |
| Gender | 62.77 | 0.33 | 0.79 | 79.92 | 0.26 | 0.79 | 59.00 | 0.22 | 0.77 | 27.23 | 0.14 | 0.86 | |
| Pose | 74.95 | 0.38 | 0.76 | — | — | — | — | — | — | 30.18 | 0.16 | 0.84 | |
| Smile | 69.95 | 0.36 | 0.77 | 44.18 | 0.25 | 0.80 | 50.21 | 0.26 | 0.78 | 30.29 | 0.16 | 0.84 | |
| 平均 | 67.64 | 0.37 | 0.77 | 70.00 | 0.26 | 0.79 | 51.80 | 0.24 | 0.79 | 30.88 | 0.16 | 0.84 | |
| 方法序号 | 组成 | FID | LPIPS | SSIM |
|---|---|---|---|---|
| 1 | DCAE-Net | 47.16 | 0.24 | 0.78 |
| 2 | MLP | 45.01 | 0.24 | 0.77 |
| 3 | DCAE-Net | 53.24 | 0.28 | 0.71 |
| 4 | DCAE-Net | 42.34 | 0.23 | 0.77 |
| 5 | DCAE-Net | 41.31 | 0.22 | 0.79 |
Tab. 3 Quantitative results of ablation experiments
| 方法序号 | 组成 | FID | LPIPS | SSIM |
|---|---|---|---|---|
| 1 | DCAE-Net | 47.16 | 0.24 | 0.78 |
| 2 | MLP | 45.01 | 0.24 | 0.77 |
| 3 | DCAE-Net | 53.24 | 0.28 | 0.71 |
| 4 | DCAE-Net | 42.34 | 0.23 | 0.77 |
| 5 | DCAE-Net | 41.31 | 0.22 | 0.79 |
| [1] | 靳聪,周满玲,林美秀,等. VR/AR-AdaptFace:面向虚拟现实与增强现实的自适应多模态面部替换模型[J]. 中国传媒大学学报(自然科学版), 2024, 31(4): 55-63. |
| JIN C, ZHOU M L, LIN M X, et al. VR/AR-AdaptFace: an adaptive multimodal face replacement model for virtual and augmented reality [J]. Journal of Communication University of China (Science and Technology), 2024, 31(4): 55-63. | |
| [2] | XU Y, YIN Y, JIANG L, et al. TransEditor: Transformer-based dual-space GAN for highly controllable facial editing [C]// Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2022: 7673-7682. |
| [3] | HE X, ZHU M, CHEN D, et al. Diff-Privacy: diffusion-based face privacy protection [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34(12): 13164-13176. |
| [4] | ZHU J Y, PARK T, ISOLA P, et al. Unpaired image-to-image translation using cycle-consistent adversarial networks [C]// Proceedings of the 2017 IEEE International Conference on Computer Vision. Piscataway: IEEE, 2017: 2242-2251. |
| [5] | TORBUNOV D, HUANG Y, YU H, et al. UVCGAN: UNet vision Transformer cycle-consistent GAN for unpaired image-to-image translation [C]// Proceedings of the 2023 IEEE/CVF Winter Conference on Applications of Computer Vision. Piscataway: IEEE, 2023: 702-712. |
| [6] | HUANG X, LIU M Y, BELONGIE S, et al. Multimodal unsupervised image-to-image translation [C]// Proceedings of the 2018 European Conference on Computer Vision, LNCS 11207. Cham: Springer, 2018: 179-196. |
| [7] | MIRZA M, OSINDERO S. Conditional generative adversarial nets[EB/OL]. [2025-04-10].. |
| [8] | LIU M, DING Y, XIA M, et al. STGAN: a unified selective transfer network for arbitrary image attribute editing [C]// Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2019: 3668-3677. |
| [9] | PERNUŠ M, ŠTRUC V, DOBRIŠEK S. MaskFaceGAN: high-resolution face editing with masked GAN latent code optimization [J]. IEEE Transactions on Image Processing, 2023, 32: 5893-5908. |
| [10] | 陶玲玲,刘波,李文博,等. 有闭解的可控人脸编辑算法[J]. 计算机应用, 2023, 43(2): 601-607. |
| TAO L L, LIU B, LI W B, et al. Controllable face editing algorithm with closed-form solutions[J]. Journal of Computer Applications, 2023, 43(2): 601-607. | |
| [11] | CHOI Y, CHOI M, KIM M, et al. StarGAN: unified generative adversarial networks for multi-domain image-to-image translation[C]// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2018: 8789-8797. |
| [12] | CHOI Y, UH Y, YOO J, et al. StarGAN v2: diverse image synthesis for multiple domains [C]// Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2019: 8185-8194. |
| [13] | HE Z, ZUO W, KAN M, et al. AttGAN: facial attribute editing by only changing what you want [J]. IEEE Transactions on Image Processing, 2019, 28(11): 5464-5478. |
| [14] | NAVEH C. Multi-directional subspace editing in style-space [C]// Proceedings of the 2023 IEEE/CVF International Conference on Computer Vision. Piscataway: IEEE, 2023: 7104-7114. |
| [15] | ZHUANG P, KOYEJO O, SCHWING A G. Enjoy your editing: controllable GANs for image editing via latent space navigation[EB/OL]. [2025-04-14]. . |
| [16] | SHEN Y, YANG C, TANG X, et al. InterFaceGAN: interpreting the disentangled face representation learned by GANs [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 44(4): 2004-2018. |
| [17] | HÄRKÖNEN E, HERTZMANN A, LEHTINEN J, et al. GANSpace: discovering interpretable GAN controls [C]// Proceedings of the 34th International Conference on Neural Information Processing Systems. Red Hook: Curran Associates Inc., 2020: 9841-9850. |
| [18] | CHOI J, LEE J, YOON C, et al. Do not escape from the manifold: discovering the local coordinates on the latent space of GANs[EB/OL]. [2025-04-14].. |
| [19] | SHEN Y, ZHOU B. Closed-form factorization of latent semantics in GANs [C]// Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2021: 1532-1540. |
| [20] | HUANG Z, MA S, ZHANG J, et al. Adaptive nonlinear latent transformation for conditional face editing [C]// Proceedings of the 2023 IEEE/CVF International Conference on Computer Vision. Piscataway: IEEE, 2023: 20965-20974. |
| [21] | LI B, HUANG Z, SHAN H, et al. Semantic latent decomposition with normalizing flows for face editing [C]// Proceedings of the 2024 IEEE International Conference on Acoustics, Speech and Signal Processing. Piscataway: IEEE, 2024: 4165-4169. |
| [22] | ABDAL R, ZHU P, MITRA N J, et al. StyleFlow: attribute-conditioned exploration of StyleGAN-generated images using conditional continuous normalizing flows [J]. ACM Transactions on Graphics, 2021, 40(3): No.21. |
| [23] | BAYKAL A C, ANEES A B, CEYLAN D, et al. CLIP-guided StyleGAN inversion for text-driven real image editing [J]. ACM Transactions on Graphics, 2023, 42(5): No.172. |
| [24] | PATASHNIK O, WU Z, SHECHTMAN E, et al. StyleCLIP: text-driven manipulation of StyleGAN imagery [C]// Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision. Piscataway: IEEE, 2021: 2065-2074. |
| [25] | KINGMA D P, WELLING M. Auto-encoding variational Bayes[EB/OL]. [2025-03-11].. |
| [26] | RADFORD A, METZ L, CHINTALA S. Unsupervised representation learning with deep convolutional generative adversarial networks[EB/OL]. [2025-02-15].. |
| [27] | JOGIN M, MOHANA, MADHULIKA M S, et al. Feature extraction using Convolution Neural Networks (CNN) and deep learning [C]// Proceedings of the 3rd IEEE International Conference on Recent Trends in Electronics, Information and Communication Technology. Piscataway: IEEE, 2018: 2319-2323. |
| [28] | MENG C, YANG J, LIN W, et al. CTA-Net: a CNN-Transformer aggregation network for improving multi-scale feature extraction[EB/OL]. [2025-04-09].. |
| [29] | CUI T, LI J, LIU L. TAOTF: a two-stage approximately orthogonal training framework in deep neural networks [C]// Proceedings of the 26th European Conference on Artificial Intelligence/ 12th Conference on Prestigious Applications of Intelligent Systems. Amsterdam: IOS Press, 2023: 509-516. |
| [30] | KARRAS T, AILA T, LAINE S, et al. Progressive growing of GANs for improved quality, stability, and variation [EB/OL]. [2025-04-14].. |
| [31] | KARRAS T, LAINE S, AILA T. A style-based generator architecture for generative adversarial networks [C]// Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2019: 4396-4405. |
| [32] | KARRAS T, LAINE S, AITTALA M, et al. Analyzing and improving the image quality of StyleGAN [C]// Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2020: 8107-8116. |
| [33] | LIU Z, LUO P, WANG X, et al. Deep learning face attributes in the wild [C]// Proceedings of the 2015 IEEE International Conference on Computer Vision. Piscataway: IEEE, 2015: 3730-3738. |
| [34] | HEUSEL M, RAMSAUER H, UNTERTHINER T, et al. GANs trained by a two time-scale update rule converge to a local nash equilibrium [C]// Proceedings of the 31st International Conference on Neural Information Processing Systems. Red Hook: Curran Associates Inc., 2017: 6629-6640. |
| [35] | ZHANG R, ISOLA P, EFROS A A, et al. The unreasonable effectiveness of deep features as a perceptual metric [C]// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2018: 586-595. |
| [36] | WANG Z, BOVIK A C, SHEIKH H R, et al. Image quality assessment: from error visibility to structural similarity [J]. IEEE Transactions on Image Processing, 2004, 13(4): 600-612. |
| [1] | Min CHEN, Xiaolin QIN, Shaohan LI, Hao YANG, Taohong LI. Review of deep learning applications in severe convective weather prediction [J]. Journal of Computer Applications, 2026, 46(3): 980-992. |
| [2] | Yinlong JIAN, Xuebin CHEN, Zhongrui JING, Qi ZHONG, Zhenbo ZHANG. Data augmentation scheme based on conditional generative adversarial network in federated learning [J]. Journal of Computer Applications, 2026, 46(1): 21-32. |
| [3] | Na FAN, Chuang LUO, Zehui ZHANG, Mengyao ZHANG, Ding MU. Semantic privacy protection mechanism of vehicle trajectory based on improved generative adversarial network [J]. Journal of Computer Applications, 2026, 46(1): 169-180. |
| [4] | Lifang WANG, Wenjing REN, Xiaodong GUO, Rongguo ZHANG, Lihua HU. Trident generative adversarial network for low-dose CT image denoising [J]. Journal of Computer Applications, 2026, 46(1): 270-279. |
| [5] | Yilin DENG, Fajiang YU. Pseudo random number generator based on LSTM and separable self-attention mechanism [J]. Journal of Computer Applications, 2025, 45(9): 2893-2901. |
| [6] | Jin ZHOU, Yuzhi LI, Xu ZHANG, Shuo GAO, Li ZHANG, Jiachuan SHENG. Modulation recognition network for complex electromagnetic environments [J]. Journal of Computer Applications, 2025, 45(8): 2672-2682. |
| [7] | Ying HUANG, Shengmei GAO, Guang CHEN, Su LIU. Low-light image enhancement network combining signal-to-noise ratio guided dual-branch structure and histogram equalization [J]. Journal of Computer Applications, 2025, 45(6): 1971-1979. |
| [8] | Hui LI, Bingzhi JIA, Chenxi WANG, Ziyu DONG, Jilong LI, Zhaoman ZHONG, Yanyan CHEN. Generative adversarial network underwater image enhancement model based on Swin Transformer [J]. Journal of Computer Applications, 2025, 45(5): 1439-1446. |
| [9] | Lihu PAN, Shouxin PENG, Rui ZHANG, Zhiyang XUE, Xuzhen MAO. Video anomaly detection for moving foreground regions [J]. Journal of Computer Applications, 2025, 45(4): 1300-1309. |
| [10] | Hong SHANGGUAN, Huiying REN, Xiong ZHANG, Xinglong HAN, Zhiguo GUI, Yanling WANG. Low-dose CT denoising model based on dual encoder-decoder generative adversarial network [J]. Journal of Computer Applications, 2025, 45(2): 624-632. |
| [11] | Jing HUANG, Xin PENG, Wenhao LI, Kai HU, Teng WANG, Yamin HUANG, Yuanqiao WEN. High-quality sonar image generation method based on multi-scale feature fusion [J]. Journal of Computer Applications, 2025, 45(12): 3987-3994. |
| [12] | Guoyu XU, Xiaolong YAN, Yidan ZHANG. DU-FastGAN: lightweight generative adversarial network based on dynamic-upsample [J]. Journal of Computer Applications, 2025, 45(10): 3067-3073. |
| [13] | You SHANG, Xianghua MIAO. Bayesian membership inference attacks for generative adversarial networks [J]. Journal of Computer Applications, 2025, 45(10): 3252-3258. |
| [14] | Li LIU, Haijin HOU, Anhong WANG, Tao ZHANG. Generative data hiding algorithm based on multi-scale attention [J]. Journal of Computer Applications, 2024, 44(7): 2102-2109. |
| [15] | Haoran WANG, Dan YU, Yuli YANG, Yao MA, Yongle CHEN. Domain transfer intrusion detection method for unknown attacks on industrial control systems [J]. Journal of Computer Applications, 2024, 44(4): 1158-1165. |
| Viewed | ||||||
|
Full text |
|
|||||
|
Abstract |
|
|||||
