In medical image segmentation networks, Convolutional Neural Network (CNN) can extract rich local feature details, but has the problem of insufficient capture of long-range information, and Transformer can capture long-range global feature dependencies, but destroys local feature details. To make full use of the complementarity of characteristics of the two networks, a parallel fusion network of CNN and Transformer for medical image segmentation was proposed, named PFNet. In the parallel fusion module of this network, a pair of interdependent parallel branches based on CNN and Transformer were used to learn both local and global discriminative features efficiently, and fuse local features and long-distance feature dependencies interactively. At the same time, to recover the spatial information lost during downsampling to enhance detail retention, a Multi-Scale Interaction (MSI) module was proposed to extract the local context of multi-scale features generated by hierarchical CNN branches for long-range dependency modeling. Experimental results show that PFNet outperforms other advanced methods such as MISSFormer (Medical Image Segmentation tranSFormer) and UCTransNet (U-Net with Channel Transformer module). On Synapse and ACDC (Automated Cardiac Diagnosis Challenge) datasets, compared to the optimal baseline method MISSFormer, PFNet increases the average Dice Similarity Coefficient (DSC) by 1.27% and 0.81%, respectively. It can be seen that PFNet can realize more accurate medical image segmentation.
Convolutional Neural Networks (CNN) are used for image forensics because of their high recognizable property, easy understanding, and strong learnability. However, their inherent disadvantages of the receptive field increasing slowly and neglecting long-range dependencies, and high computational cost cause the unsatisfactory accuracy and lightweight deployment of deep learning algorithms. To solve the above problems, a lightweight network-based image copy-paste tamper detection algorithm namely LKA-EfficientNet (Large Kernel Attention EfficientNet) was proposed. The characteristics of long-range dependencies and global receptive field were contained in LKA-EfficientNet, and the number of EfficientNetV2 parameters was optimized. As a result, the localization speed and detection accuracy of image tamper were improved. Firstly, the image was inputted into and processed in the backbone network based on Large Kernel Attention (LKA) to obtain the candidate feature maps. Then, the feature maps of different scales were used to construct the feature pyramid for feature matching. Finally, the candidate feature maps after feature matching were fused to locate the tampered area of the image. In addition, the triple cross entropy loss function was used by LKA-EfficientNet to further improve the accuracy of the algorithm in image tamper localization. Experimental results show that LKA-EfficientNet can not only reduce the floating-point operations by 29.54% but also increase the F1 by 4.88% compared to the same type algorithm — Dense-InceptionNet. The above verifies that LKA-EfficientNet can reduce computational cost and maintain high detection performance at the same time.