The traditional Segment Anything Model (SAM) relies on manual prompts during segmentation of Meibomian gland images, making it difficult to handle issues such as dense glands, irregular shapes, and blurred boundaries. To address this, an improved model, namely ResSAM, was proposed. ResSAM eliminated the reliance on manual intervention by introducing an automatic prompt encoder. The backbone network was pruned and optimized to further enhance the model's segmentation efficiency. Focal Loss and Smooth IoU Loss were used for training optimization, and the SE (Squeeze-and-Excitation) and cross-attention mechanisms were integrated to reduce the impact of individual differences and blurred boundaries, thereby improving the model's segmentation accuracy. Experimental results on two self-built datasets, Lower Lid and Upper Lid, showed that ResSAM achieved the best performance in terms of the number of parameters and Giga FLoating-point OPerations (GFLOPs); its segmentation results obtained the highest Dice scores (88.69% and 87.75%, respectively) and the highest Intersection-over-Union (IoU) values (79.69% and 78.58%, respectively). The research results indicate that the ResSAM optimizes both efficiency and accuracy, supporting early prevention and clinical diagnosis of Meibomian Gland Dysfunction (MGD).
Existing underwater small object detection methods are primarily based on deep learning algorithms, which face challenges in balancing lightweight design and detection accuracy, so that they unable to meet the requirements of real-time and resource-constrained platforms. Therefore, Graph-DETR, a lightweight underwater small object detection model based on RT-DETR (Real-Time DEtection TRansformer) and a graph Transformer, was proposed. The model used a lightweight MobileNetV4 backbone improved with the Large Separable Kernel Attention mechanism (LSKAttention) and the Context-Mixing dynamic convolutional block (CM block) to enhance feature extraction efficiency and reduce model complexity. Additionally, a hierarchical Graph Transformer Feature Pyramid Network (GTFPN) was proposed to strengthen multi-scale feature fusion, and the hybrid encoder was optimized via Wavelet Transform Convolution (WTConv), Adaptive downsampling (Adown), and path pruning, thereby achieving convolutional receptive field expansion of the CNN-based Cross-scale Feature Fusion (CCFF) module with low parameterization. Experimental results on the underwater public dataset URPC2020 show that, compared to RT-DETR, Graph-DETR reduces the parameters by 66.9% and the reasoning latency by 6.8 ms, achieving a mean Average Precision (mAP) of 53.2% and an Average Precision of 86.8% at an IoU threshold of 0.5 (AP@0.5); on URPC2021, it has 81.3% recall, 54.1% mAP, 87.6% AP@0.5 with only 10.5 ms latency, outperforming the existing methods. Graph-DETR exhibits excellent performance in underwater small object detection and is practical for deployment on resource-constrained underwater platforms.
To address the problems posed by highly realistic AI-generated text, driven by the rapid development of Large Language Models (LLMs), and the performance degradation of traditional detection methods, an adaptive multi-feature fusion detection method for AI-generated text was proposed. Firstly, a language style feature set covering text statistical features, language structural features, and language uncertainty features was constructed to capture differences between real and AI-generated texts; then, deep semantic features of texts were extracted using independent encoding technology. Based on these, a dual-path mapping feature-adaptive fusion strategy was designed: language-style features and deep semantic features were first fused at a primary level, and secondary fusion was then performed using deep learning to enhance the capability of adaptive feature fusion. Experimental results demonstrate that the proposed method achieves detection accuracies of 98.1% on the Chinese SocialAI-Detect dataset and 98.5% on the English TuringBench dataset; compared with the best-performing baseline, J-Guard (Journalism Guided adversarially robust detection of AI-generated news), the improvements are 2.3 and 2.1 percentage points, respectively, verifying the effectiveness of the proposed method.
Distributed Denial of Service (DDoS) attacks, as a highly destructive type of cyber attacks, have become one of the most severe threats and challenges in the field of cybersecurity in recent years due to their low attack costs, high attack efficiency, and strong concealment. DDoS attacks employ a distributed control approach to mix malicious traffic with legitimate network requests, making it difficult for traditional security defense mechanisms such as Intrusion Detection System (IDS) and firewalls to identify and mitigate such attacks effectively. Consequently, the efficient detection and effective defense against DDoS attacks have become research hotspots and difficulties in the field of cybersecurity. Based on systematic survey of the existing research on DDoS attacks, the following was performed. Firstly, the classification methods of DDoS attacks were sorted out, and DDoS attacks were summed up from multiple perspectives, so as to provide a deeper understanding of DDoS attack mechanisms. Secondly, an analysis of the current development of DDoS attacks was conducted, with particular focuses on discussing the development trends in attack intensity, attack methods, and attack distribution, thereby providing support for the research on more efficient DDoS defense technologies. Thirdly, an in-depth analysis and evaluation of the status of DDoS attack defense technologies was conducted from both industrial and academic perspectives, which focused on DDoS detection and defense methods based on programmable switches and machine learning in the academic aspect, and compared and analyzed the defense architectures adopted by different participants in DDoS defense in the industrial aspect as well as summarized the technical characteristics, application scenarios, and the existing challenges of the architecture. Finally, based on a comprehensive analysis of the current DDoS attack situations, the future development directions, opportunities, and challenges of DDoS defense technology were prospected, providing new ideas and directions for researchers in the field of cybersecurity and promoting further innovation and development of DDoS defense technology.
For Expensive Multi-objective Optimization Problem (EMOP), although numerous related algorithms have been proposed, most existing algorithms have not achieved satisfactory results. The primary reason is that the infill sampling criteria in these algorithms fail to balance the convergence, diversity and uncertainty of selected individuals. Therefore, a Two-stage Infill Sampling-based Expensive Multi-Objective Evolutionary Algorithm (TISEMOEA) was proposed. In the first stage, a convergence-based infill sampling criterion was proposed, so as to select individuals with both good convergence and diversity, and then balance convergence and diversity. In the second stage, a diversity-based infill sampling criterion was proposed, so as to select individuals with great uncertainty without damaging convergence, and then improve the accuracy of the model and the diversity of the population. Furthermore, an adaptive diversity enhancement strategy was proposed to adjust the frequency of selecting individuals using the diversity-based infill sampling criterion, thereby enhancing population diversity and balancing exploration and exploitation capabilities of the algorithm. TISEMOEA was compared with five state-of-the-art algorithms, MOEA/D-EGO (MOEA/D with the Gaussian process model), HeE-MOEA (Heterogeneous Ensemble-based infill criterion for MOEA), TISS-EMOA (Two-stage Infill Sampling-based Semi-supervised EMOA), PCSAEA (Pairwise Comparison based Surrogate-Assisted Evolutionary Algorithm), and SFA/DE (Evolutionary multiobjective optimization assisted by scalarization function approximation for high-dimensional expensive problems), on the DTLZ and WFG test sets with 28 and 27 test problems, and the Inverted Generational Distance (IGD) metric was analyzed. The results show that TISEMOEA achieves the best results in 19 and 16 test problems, respectively.
When detecting small targets in multi-scale remote sensing images, target detection algorithms based on deep learning are prone to false detection and missed detection. One of the reasons is that the feature extraction module carries out multiple down-sampling operations. The second reason is the failure to pay attention to the contextual information required by different categories and different scales of targets. To solve this problem, a small object detection algorithm in remote sensing images integrating attention and contextual information ACM-YOLO (Attention-Context-Multiscale YOLO) was proposed. Firstly, to reduce the loss of small target feature information, fine-grained query aware sparse attention was applied, thereby avoiding missed detection. Secondly, to pay more attention to the contextual information required by different categories of remote sensing targets, the Local Contextual Enhancement (LCE) function was designed, thereby avoiding false detection. Finally, to strengthen multi-scale feature fusion capability of the feature fusion module on small targets in remote sensing images, the weighted Bi-directional Feature Pyramid Network (BiFPN) was adopted, thereby improving detection effect of the algorithm. Comparison experiments and ablation experiments were performed on DOTA dataset and NWPU VHR-10 dataset to verify effectiveness and generalization of the proposed algorithm. Experimental results show that on the two datasets, the proposed algorithm has the mean Average Precision (mAP) reached 77.33% and 96.12% respectively, and the Recall increases by 10.00 and 7.50 percentage points, respectively, compared with YOLOv5 algorithm. It can be seen that the proposed algorithm improves mAP and recall effectively, which reduces false detection and missed detection.
Both the Damped Least Squares (DLS) and Genetic Algorithm (GA) are applicable to automatic design of optical systems. Although DLS has a high search efficiency, it is susceptible to falling into local optima traps. Conversely, GA has strong global search capability in the parameter space of optical structures but weak local search capability. To address these challenges, a Correctable Reinforced Search GA (CRSGA) was proposed. Firstly, DLS was introduced after the GA crossover operation to enhance local search capability. Additionally, a correction strategy was introduced to rollback individuals with deteriorated fitness values before the next iteration, thereby achieving corrective evolutionary results. The improvement of two aspects to genetic algorithm enhanced strengths and compensated for weaknesses. Three typical optical system design experiments, including Double Gaussian (DG), Reversed Telephoto (RT), and Finite Conjugate Distance Imaging (FCDI), were conducted to validate the effectiveness of CRSGA. CRSGA outperforms both DLS and GA, and its optimization outcomes are about 8.92%, 12.19%, and 9.39% respectively better than those of commercial optical design software Zemax DLS. In particularly, the optimization outcomes achieve a significant improvement, reaching 99.98%, 94.33%, and 88.45% respectively compared to the Zemax HAMMER algorithm. In conclusion, it is shown that the proposed algorithm is effective for optical system optimization and can be used for automatic optical system design.
Aiming at the problem of the existing drift detection methods in balancing the detection delay, false positives, false negatives, and spatiotemporal efficiency, a new stage transition threshold parameter was proposed, and a multi-stage weighting mechanism including “stable stage-warning stage-drift stage” was introduced in the concept drift detection to weight the instances in stages, and the mechanism was applied to the double sliding window. Then a Multi-Stage weighted Drift Detection Method (MSDDM) based on Hoeffding inequality was proposed. On artificial datasets, MSDDM detected abrupt and gradual concept drift faster than Fast Hoeffding Drift Detection Method (FHDDM), Drift Detection Method based on Hoeffding’s bound (HDDM) and other drift detection methods, while maintained a low false detection rate and a false alarm rate. At the same time, MSDDM had the highest classification accuracy in most cases compared with other methods on real-world datasets. Experimental results show that MSDDM can detect concept drift in data streams with high drift detection performance and great spatiotemporal efficiency.
Aiming at the problem that the existing adversarial example generation methods require a lot of queries to the target model, which leads to poor attack effects, a Text Adversarial Examples Generation Method based on BERT (Bidirectional Encoder Representations from Transformers) model (TAEGM) was proposed. Firstly, the attention mechanism was adopted to locate the keywords that significantly influence the classification results without query of the target model. Secondly, word-level perturbation of keywords was performed by BERT model to generate candidate adversarial examples. Finally, the candidate examples were clustered, and the adversarial examples were selected from the clusters that have more influence on the classification results. Experimental results on Yelp Reviews, AG News, and IMDB Review datasets show that compared to the suboptimal adversarial example generation method CLARE (ContextuaLized AdversaRial Example generation model) on Success Rate (SR), TAEGM can reduce the Query Counts (QC) to the target model by 62.3% and time consumption by 68.6% averagely while ensuring the SR of adversarial attacks. Based on the above, further experimental results verify that the adversarial examples generated by TAEGM not only have good transferability, but also improve the robustness of the model through adversarial training.
To strengthen the control and management of local airspace routes, a route discovery method based on trajectory point clustering was proposed. Firstly, for the simulation data generated according to the distribution characteristics of the real data, the pre-processing module was used to weaken and remove the noise of the trajectory data. Secondly, a route discovery method including outlier elimination, trajectory resampling, trajectory point clustering, clustering center correction, and connecting clustering centers was proposed to extract the routes. Finally, the result of route extraction was visualized and the proposed method was validated using civil aviation data. The experimental results on the simulated data show that the node coverage and the length coverage of the proposed method is 99% and 94% respectively, under the noise intensity of 0.1° and the buffer area of 30 km. Compared with the rasterization method, the proposed method has higher accuracy and can extract the routes more effectively, achieving the purpose of extracting the common routes of aircraft.
Aiming at the problem of neglecting some narrow roads due to the formation constraints in the multi-UAV (Unmanned Aerial Vehicle) cooperative trajectory planning, a Fast Particle Swarm Optimization method based on Adaptive Distributed Model Predictive Control (ADMPC-FPSO) was proposed. In the method, the formation strategy combining leader-follower method and virtual structure method was used to construct adaptive virtual formation guidance points to complete the cooperative formation control task. According to the idea of model predictive control, combined with the distributed control method, the cooperative trajectory planning was transformed into a rolling online optimization problem, and the minimum distance and other performance indicators were used as cost functions. By designing the evaluation function criterion, the variable weight fast particle swarm optimization algorithm was used to solve the problem. The simulation results show that the proposed algorithm can effectively realize the multi-UAV cooperative trajectory planning, can quickly complete the adaptive formation transformation according to the environmental changes, and has lower cost than the traditional formation strategy.
The teaching of Chinese architecture history has building structures too complex, is limited to 2D planar teaching and is not easy for students to master and apply, therefore an implementation method of Chinese architecture history teaching system based on mixed reality technology was proposed. The wooden structure system of Baoguo Temple in Ningbo was taken as an example, and the mixed reality device Microsoft HoloLens was used as the teaching platform. Firstly, 3ds Max was applied to the 3D simulation modeling of the wooden structure system of Baoguo Temple based on the collected data, and a building model library was built. Then, the 3D human-computer interface of the virtual teaching system was constructed in unity3D, the key technologies were used including environment understanding and human-computer interaction based on C# scripts, and a Chinese architectural history teaching system using HoloLens was implemented with core functions of building structure recognition and cultural cognition. The results show that the system has good 3D visual effects and natural effective human-computer interaction, which can improve the efficiency of knowledge transfer and the initiative of students.
To deal with poor robustness in strong noise environment, weak adaptability to complex mixed noise that appear in the existing unsupervised image segmentation models, an improved noise-robust image segmentation model based on One-class SVM (Support Vector Machine) method was proposed. Firstly, a data outlier detection mechanism was constructed based on One-class SVM. Secondly, an outlier degree was introduced into the energy function, so that more accurate image information could be obtained by the proposed model under multiple noise intensities and the failure of weight-descend mechanism in strong noise environment was avoided. Finally, the segmentation contour was driven to the target edge by minimizing the energy function. In noise image segmentation experiments, the proposed model could obtain ideal segmentation results with different types and intensities of noise. Under F1-score metric, the proposed model is 0.2 to 0.3 higher than LCK (Local Correntropy-based K-means) model, and has better stability in strong noise environments. The segmentation convergence time of the proposed model is only slightly longer than that of LCK model by about 0.1 s. Experimental results show that the proposed model is more robust to probabilistic, extreme values and mixed noise without significantly increase of segmentation time, and can segment natural images with noise.