Visual perception, as one of the core technologies of environmental understanding, provides accurate environmental information for intelligent mobile systems (such as autonomous driving) and is an important prerequisite for ensuring safety decisions. 3D object detection technology based on Bird’s Eye View (BEV) has become the mainstream paradigm in the field of environmental perception because of its efficiency and accuracy. To further promote the research of 3D object detection algorithms based on BEV, the following was performed. Firstly, the BEV 3D object detection algorithms were classified systematically, and according to the modals of the input data, they were divided into three categories: pure camera algorithms, pure LiDAR algorithms and camera-LiDAR fusion algorithms. Secondly, the role of pre-training algorithms in improving detection performance was explored. Thirdly, the advantages and disadvantages of the algorithms fusing temporal features in dynamic scenarios and the performance of the algorithms fusing height features in complex environments were analyzed. Fourthly, the breakthrough progress made by large model collaborative BEV object detection in object detection accuracy and scenario understanding was sorted out. Finally, the core conclusions of BEV 3D object detection algorithms were summarized, and future research directions were looked forward, so as to provide new ideas for research work in this field.
Aiming at the problems that engine operation data are multi-modal and it is difficult to achieve effective engine life prediction, a parallel branching engine life prediction method was proposed on the basis of multi-modal features integrating potential relationship between images and engine operation time data. Firstly, a sliding window was used to segment the engine operation data, so as to construct sequence samples of engine operation data, and Gramian Angular Field (GAF) was used to convert the constructed sequence samples into images. Then, the sequence samples and images were processed by a Bi-directional Long Short-Term Memory (BiLSTM) network and a Convolutional Neural Network (CNN) to obtain potential relationship features between sensors such as trends and cycles. Finally, Cross-Attention Mechanism (CAM) was introduced to achieve fusion of the two modal features and realize life prediction of the engine. Experimental results on the public C-MAPSS dataset show that the R-squared (R2) of the prediction method is higher than 0.99 and the Root Mean Square Error (RMSE) of the method is less than 1. It can be seen that the method can improve computational efficiency while ensuring the prediction accuracy.
Betweenness centrality is a common metric for evaluating the importance of nodes in a graph. However, the update efficiency of betweenness centrality in large-scale dynamic graphs is not high enough to meet the application requirements. With the development of multi-core technology, algorithm parallelization has become one of the effective ways to solve this problem. Therefore, a Parallel Algorithm of Betweenness centrality for dynamic networks (PAB) was proposed. Firstly, the time cost of redundant point pairs was reduced through operations such as community filtering, equidistant pruning and classification screening. Then, the determinacy of the algorithm was analyzed and processed to realize parallelization. Comparison experiments were conducted on real datasets and synthetic datasets, and the results show that the update efficiency of PAB is 4 times that of the latest batch-iCENTRAL algorithm on average when adding edges. It can be seen that the proposed algorithm can improve the update efficiency of betweenness centrality in dynamic networks effectively.
The outlier detection algorithm based on autoencoder is easy to over-fit on small- and medium-sized datasets, and the traditional outlier detection algorithm based on ensemble learning does not optimize and select the base detectors, resulting in low detection accuracy. Aiming at the above problems, an Ensemble learning and Autoencoder-based Outlier Detection (EAOD) algorithm was proposed. Firstly, the outlier values and outlier label values of the data objects were obtained by randomly changing the connection structure of the autoencoder generate different base detectors. Secondly, local region around the object was constructed according to the Euclidean distance between the data objects calculated by the nearest neighbor algorithm. Finally, based on the similarity between the outlier values and the outlier label values, the base detectors with strong detection ability in the region were selected and combined together, and the object outlier value after combination was used as the final outlier value judged by EAOD algorithm. In the experiments, compared with the AutoEncoder (AE) algorithm, the proposed algorithm has the Area Under receiver operating characteristic Curve (AUC) and Average Precision (AP) scores increased by 8.08 percentage points and 9.17 percentage points respectively on Cardio dataset; compared with the Feature Bagging (FB) ensemble learning algorithm, the proposed algorithm has the detection time cost reduced by 21.33% on Mnist dataset. Experimental results show that the proposed algorithm has good detection performance and real-time performance under unsupervised learning.
A texture image segmentation algorithm based on wavelet transform and kd-tree clustering was studied in this paper. Firstly, texture features of an image were extracted using wavelet transform. Secondly, an improved algorithm based on quarter partition was given to smooth the texture feature image. Thirdly, the clustering algorithm using the kd-tree data structure was applied to the texture segmentation, and then a fast texture feature clustering effect was achieved. At last, simulations were performed on the presented algorithm, and the simulation result showed that the presented algorithm has lower segmentation error rate, higher accuracy and better in-time performance.