Journal of Computer Applications

Unsupervised face attribute editing method based on dynamic convolutional autoencoder

  • Received: 2025-04-14  Revised: 2025-06-30  Accepted: 2025-07-02  Online: 2025-07-07  Published: 2025-07-07
  • Supported by:
    2023 China University-Industry-Research Collaborative Innovation Fund

CUI Xuan 1,2, LIU Bo 1,2

  1. School of Artificial Intelligence, Chongqing Technology and Business University, Chongqing 400067, China; 2. Chongqing Key Laboratory of Intelligent Perception and Blockchain Technology (Chongqing Technology and Business University), Chongqing 400067, China
  • Corresponding author: LIU Bo

Abstract: Unsupervised facial attribute editing methods based on the latent space of generative adversarial networks (GANs) offer advantages of high efficiency and annotation-free operation, yet they still face challenges in terms of attribute disentanglement and controllability—for instance, modifying a specific facial attribute may inadvertently alter other attributes, compromising editing quality, while precise control over the degree of attribute modification remains difficult. To address these issues, an Autoencoder-based Unsupervised Face Attribute Editing (AUFAE) method was proposed, which achieved precise facial attribute editing by learning effective semantic vectors in the latent space. Specifically, a Dynamic Convolutional Autoencoder Network (DCAE-Net) was employed as the backbone, where Dynamic Convolution (DyConv) was utilized by the encoder to adaptively extract local latent-space features, thereby enabling the learning of semantically meaningful vectors with localized characteristics. A Channel Attention (CA) mechanism was incorporated into the decoder to establish nonlinear dependencies between channels, allowing the model to autonomously focus on feature channels relevant to different semantics and enhancing the independence of learned semantic vectors. To improve disentanglement and controllability, an attribute boundary vector-based loss function was introduced to train the DCAE-Net. Additionally, a soft orthogonality loss was applied to ensure mutual independence among semantic vectors, further boosting disentanglement performance. Experiments conducted on three pre-trained GAN models compare AUFAE with three state-of-the-art face attribute editing methods. The experimental results demonstrate that compared to the supervised method InterFaceGAN, the proposed AUFAE achieves an average reduction of 9% in the Learned Perceptual Image Patch Similarity (LPIPS)metric and an average improvement of 7% in the Structural Similarity Index Measure (SSIM) metric. When compared to the unsupervised method SDFlow, AUFAE shows an average reduction of 5% in LPIPS and an average improvement of 5% in SSIM. In terms of visual perception, AUFAE also did not exhibit any attribute coupling phenomenon during the facial attribute editing process. The above results demonstrate that AUFAE can effectively mitigate the issue of attribute coupling in facial editing and achieve more precise face attribute manipulation.

Key words: Generative Adversarial Network (GAN), semantic vectors, face attribute editing, attribute boundary vector, Dynamic Convolution (DyConv)

