Overview

Knowledge graph survey: representation, construction, reasoning and knowledge hypergraph theory

TIAN Ling, ZHANG Jinchuan, ZHANG Jinhao, ZHOU Wangtao, ZHOU Xue

Journal of Computer Applications 2021, 41 (8): 2161-2186. DOI: 10.11772/j.issn.1001-9081.2021040662

Abstract （3261）

PDF （2811KB）（4260）

Knowledge Graphs (KGs) strongly support research on knowledge-driven artificial intelligence. Accordingly, the existing technologies for knowledge graphs and knowledge hypergraphs were analyzed and summarized. First, starting from the definition and development history of the knowledge graph, its classification and architecture were introduced. Second, the existing knowledge representation and storage methods were explained. Then, following the knowledge graph construction process, several construction techniques were analyzed. In particular, for knowledge reasoning, an important part of the knowledge graph, three typical approaches were analyzed: logic rule-based, embedding representation-based, and neural network-based. Furthermore, the research progress on knowledge hypergraphs was introduced along with heterogeneous hypergraphs. To effectively represent and extract hyper-relational characteristics, model hyper-relational data and support fast knowledge reasoning, a three-layer knowledge hypergraph architecture was proposed. Finally, the typical application scenarios of knowledge graphs and knowledge hypergraphs were summarized, and future research directions were discussed.

Review of event causality extraction based on deep learning

WANG Zhujun, WANG Shi, LI Xueqing, ZHU Junwu

Journal of Computer Applications 2021, 41 (5): 1247-1255. DOI: 10.11772/j.issn.1001-9081.2020071080

Abstract （3082）

PDF （1460KB）（4007）

Causality extraction is a relation extraction task in Natural Language Processing (NLP) that mines causally related event pairs from text by constructing an event graph, and it plays an important role in applications in finance, security, biology and other fields. First, concepts such as event extraction and causality were introduced, and the evolution of mainstream methods and the common datasets for causality extraction were described. Then, the current mainstream causality extraction models were listed. Based on a detailed analysis of pipeline-based models and joint extraction models, the advantages and disadvantages of the various methods and models were compared. Furthermore, the experimental performance and related experimental data of the models were summarized and analyzed. Finally, the research difficulties and key future research directions of causality extraction were given.

Federated learning survey: concepts, technologies, applications and challenges

Tiankai LIANG, Bi ZENG, Guang CHEN

Journal of Computer Applications 2022, 42 (12): 3651-3662. DOI: 10.11772/j.issn.1001-9081.2021101821

Abstract （2994）

HTML （205）

PDF （2464KB）（2213）

Against a background that emphasizes data right confirmation and privacy protection, federated learning, as a new machine learning paradigm, can address the data island and privacy protection problems without exposing the data of any participant. Since modeling methods based on federated learning have become mainstream and achieved good results, it is worthwhile to summarize and analyze the concepts, technologies, applications and challenges of federated learning. First, the development of machine learning and the inevitability of the emergence of federated learning were elaborated, and the definition and classification of federated learning were given. Second, the three federated learning methods currently recognized by the industry (horizontal federated learning, vertical federated learning and federated transfer learning) were introduced and analyzed. Third, regarding privacy protection in federated learning, the existing common privacy protection technologies were summarized. In addition, the recent mainstream open-source frameworks were introduced and compared, and the application scenarios of federated learning were given. Finally, the challenges and future research directions of federated learning were discussed.

Summarization of natural language generation

LI Xueqing, WANG Shi, WANG Zhujun, ZHU Junwu

Journal of Computer Applications 2021, 41 (5): 1227-1235. DOI: 10.11772/j.issn.1001-9081.2020071069

Abstract （2787）

PDF （1165KB）（3908）

Natural Language Generation (NLG) technologies use artificial intelligence and linguistic methods to automatically generate understandable natural language text. NLG reduces the difficulty of communication between humans and computers, is widely used in machine news writing, chatbots and other fields, and has become one of the research hotspots of artificial intelligence. First, the current mainstream NLG methods and models were listed, and their advantages and disadvantages were compared in detail. Then, for three NLG technologies (text-to-text, data-to-text and image-to-text), the application fields, existing problems and current research progress were summarized and analyzed. Furthermore, the common evaluation methods for these generation technologies and their scopes of application were described. Finally, the development trends and research difficulties of NLG technologies were given.

Review of image edge detection algorithms based on deep learning

LI Cuijin, QU Zhong

Journal of Computer Applications 2020, 40 (11): 3280-3288. DOI: 10.11772/j.issn.1001-9081.2020030314

Abstract （2693）

PDF （922KB）（4054）

Edge detection is the process of extracting the important information carried by abrupt changes in an image. It is a research hotspot in computer vision and the basis of many middle- and high-level vision tasks such as image segmentation, target detection and recognition. In recent years, to address the problems of thick edge contour lines and low detection accuracy, deep learning-based edge detection algorithms using techniques such as spectral clustering, multi-scale fusion and cross-layer fusion have been proposed by the industry. To help more researchers understand the state of edge detection research, the theory and methods of traditional edge detection were introduced first. Then, the main deep learning-based edge detection methods of recent years were summarized and classified according to their implementation technologies. The analysis of the key technologies of these methods shows that multi-scale multi-level fusion and the selection of the loss function are important research directions. The methods were compared with each other using evaluation indicators. The Optimal Dataset Scale (ODS) score of edge detection algorithms on the Berkeley Segmentation Data Set and benchmark 500 (BSDS500) increased from 0.598 to 0.828, which is close to the level of human vision. Finally, the development direction of edge detection algorithm research was forecast.

Review of recommendation system

Meng YU, Wentao HE, Xuchuan ZHOU, Mengtian CUI, Keqi WU, Wenjie ZHOU

Journal of Computer Applications 2022, 42 (6): 1898-1913. DOI: 10.11772/j.issn.1001-9081.2021040607

Abstract （2162）

HTML （201）

PDF （3152KB）（1755）

With the continuous development of network applications, network resources are growing exponentially and information overload is becoming increasingly serious, so how to efficiently obtain resources that meet user needs has become a troubling problem. A recommendation system can effectively filter massive information and recommend resources that meet users' needs. The research status of recommendation systems was introduced in detail, including three traditional recommendation methods (content-based recommendation, collaborative filtering recommendation and hybrid recommendation), and the research progress of four common deep learning recommendation models based on Convolutional Neural Network (CNN), Deep Neural Network (DNN), Recurrent Neural Network (RNN) and Graph Neural Network (GNN) was analyzed in particular. The commonly used datasets in the recommendation field were summarized, and the differences between traditional recommendation algorithms and deep learning-based recommendation algorithms were analyzed and compared. Finally, the representative recommendation models in practical applications were summarized, and the challenges and future research directions of recommendation systems were discussed.

Review of deep learning-based medical image segmentation

CAO Yuhong, XU Hai, LIU Sun'ao, WANG Zixiao, LI Hongliang

Journal of Computer Applications 2021, 41 (8): 2273-2287. DOI: 10.11772/j.issn.1001-9081.2020101638

Abstract （2060）

PDF （2539KB）（1680）

As a fundamental and key task in computer-aided diagnosis, medical image segmentation aims to accurately recognize target regions such as organs, tissues and lesions at the pixel level. Unlike natural images, medical images show high complexity in texture and have boundaries that are difficult to judge because of ambiguity, which results from heavy noise caused by the limitations of imaging technology and equipment. Furthermore, annotating medical images depends heavily on the expertise and experience of experts, which leads to limited available annotations for training and to potential annotation errors. Because medical images suffer from ambiguous boundaries, limited annotated data and large annotation errors, it is a great challenge for auxiliary diagnosis systems based on traditional image segmentation algorithms to meet the demands of clinical applications. Recently, with the wide application of Convolutional Neural Network (CNN) in computer vision and natural language processing, deep learning-based medical image segmentation algorithms have achieved tremendous success. First, the latest research progress in deep learning-based medical image segmentation was summarized, including the basic architecture, loss function and optimization method of the segmentation algorithms. Then, to address the limited availability of annotated medical image data, the mainstream semi-supervised research on medical image segmentation was summarized and analyzed. In addition, the studies related to measuring the uncertainty of annotation errors were introduced. Finally, a summary and analysis of the characteristics of medical image segmentation, as well as its potential future trends, were given.

Survey of communication overhead of federated learning

Xinyuan QIU, Zecong YE, Xiaolong CUI, Zhiqiang GAO

Journal of Computer Applications 2022, 42 (2): 333-342. DOI: 10.11772/j.issn.1001-9081.2021020232

Abstract （1980）

HTML （305）

PDF （1356KB）（2552）

Federated learning was proposed to resolve the irreconcilable conflict between the demand for data sharing and the requirements of privacy protection. As a form of distributed machine learning, federated learning requires a large number of model parameters to be exchanged between the participants and the central server, resulting in high communication overhead. At the same time, federated learning is increasingly deployed on mobile devices with limited communication bandwidth and limited power, and the limited network bandwidth and the sharply rising number of clients make the communication bottleneck worse. To address the communication bottleneck of federated learning, the basic workflow of federated learning was analyzed first. Then, from a methodological perspective, three mainstream types of methods, based respectively on reducing the frequency of model updates, model compression and client selection, were introduced together with special methods such as model partition, and an in-depth comparative analysis of specific optimization schemes was carried out. Finally, the development trends of research on federated learning communication overhead were summarized and discussed.

Review of multi-modal medical image segmentation based on deep learning

Meng DOU, Zhebin CHEN, Xin WANG, Jitao ZHOU, Yu YAO

Journal of Computer Applications 2023, 43 (11): 3385-3395. DOI: 10.11772/j.issn.1001-9081.2022101636

Abstract （1881）

HTML （93）

PDF （3904KB）（2333）

Multi-modal medical images can provide clinicians with rich information about target areas (such as tumors, organs or tissues). However, effective fusion and segmentation of multi-modal images remains challenging because of the independence and complementarity of the modalities. Traditional image fusion methods have difficulty addressing this problem, which has led to widespread research on deep learning-based multi-modal medical image segmentation algorithms. The deep learning-based multi-modal medical image segmentation task was reviewed in terms of principles, techniques, problems and prospects. First, the general theory of deep learning and multi-modal medical image segmentation was introduced, including the basic principles and development of deep learning and Convolutional Neural Network (CNN), as well as the importance of the multi-modal medical image segmentation task. Second, the key concepts of multi-modal medical image segmentation were described, including data dimension, preprocessing, data enhancement, loss function and post-processing. Third, multi-modal segmentation networks based on different fusion strategies were summarized and analyzed. Finally, several common problems in medical image segmentation were discussed, and a summary and prospects for future research were given.

Survey of multimodal pre-training models

Huiru WANG, Xiuhong LI, Zhe LI, Chunming MA, Zeyu REN, Dan YANG

Journal of Computer Applications 2023, 43 (4): 991-1004. DOI: 10.11772/j.issn.1001-9081.2022020296

Abstract （1733）

HTML （148）

PDF （5539KB）（1401）

PDF（mobile）（3280KB）（111）

By using complex pre-training objectives and a large number of model parameters, a Pre-Training Model (PTM) can effectively acquire rich knowledge from unlabeled data. However, the development of multimodal PTMs is still in its infancy. According to the modalities involved, most current multimodal PTMs were divided into image-text PTMs and video-text PTMs. According to the data fusion method, multimodal PTMs were divided into two types: single-stream models and two-stream models. First, the common pre-training tasks and the downstream tasks used in validation experiments were summarized. Second, the common models in multimodal pre-training were sorted out, and the downstream tasks of each model, together with the models' performance and experimental data, were listed in tables for comparison. Third, the application scenarios of the M6 (Multi-Modality to Multi-Modality Multitask Mega-transformer) model, the Cross-modal Prompt Tuning (CPT) model, the VideoBERT (Video Bidirectional Encoder Representations from Transformers) model and the AliceMind (Alibaba's collection of encoder-decoders from Mind) model in specific downstream tasks were introduced. Finally, the challenges and future research directions of multimodal PTM work were summarized.

Review on privacy-preserving technologies in federated learning

Teng WANG, Zheng HUO, Yaxin HUANG, Yilin FAN

Journal of Computer Applications 2023, 43 (2): 437-449. DOI: 10.11772/j.issn.1001-9081.2021122072

Abstract （1725）

HTML （165）

PDF （2014KB）（1293）

In recent years, federated learning has become a new way to address the data island and privacy leakage problems in machine learning. A federated learning architecture does not require multiple parties to share data resources: participants only need to train local models on their local data and periodically upload parameters to the server to update the global model, so that a machine learning model can be built on large-scale global data. The architecture is privacy-preserving by nature and is a new scheme for large-scale data machine learning in the future. However, its parameter interaction mode may still lead to data privacy disclosure. At present, strengthening the privacy-preserving mechanism in federated learning architecture has become a new research hotspot. Starting from the privacy disclosure problem in federated learning, the attack models and sensitive information disclosure paths were discussed, and several types of privacy-preserving techniques were highlighted and reviewed, including those based on differential privacy, homomorphic encryption and Secure Multiparty Computation (SMC). Finally, the key issues of privacy protection in federated learning were discussed, and future research directions were outlined.
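
The training loop described in this abstract (local training on private data, periodic parameter upload, server-side update of the global model) can be illustrated with a minimal sketch. It is not taken from the surveyed paper: it assumes a plain weighted-averaging server in the style of federated averaging, a linear regression model, and synthetic client data, and every function name below is hypothetical.

```python
import numpy as np

def local_update(weights, X, y, lr=0.1, epochs=5):
    """Run a few gradient-descent steps of linear regression on one client's private data."""
    w = weights.copy()
    for _ in range(epochs):
        grad = 2.0 * X.T @ (X @ w - y) / len(y)  # gradient of mean squared error
        w -= lr * grad
    return w

def federated_round(global_w, clients):
    """One round: clients train locally, the server averages the uploaded weights."""
    updates = [local_update(global_w, X, y) for X, y in clients]
    sizes = np.array([len(y) for _, y in clients], dtype=float)
    # Weighted average by sample count; raw data never leaves the clients.
    return np.average(updates, axis=0, weights=sizes)

def make_clients(true_w, n_clients=3, n_samples=50, seed=0):
    """Synthetic stand-in for private client datasets (an assumption for this sketch)."""
    rng = np.random.default_rng(seed)
    clients = []
    for _ in range(n_clients):
        X = rng.normal(size=(n_samples, len(true_w)))
        y = X @ true_w + 0.01 * rng.normal(size=n_samples)
        clients.append((X, y))
    return clients

def train(clients, dim, rounds=30):
    """Repeat federated rounds starting from a zero-initialized global model."""
    w = np.zeros(dim)
    for _ in range(rounds):
        w = federated_round(w, clients)
    return w

if __name__ == "__main__":
    true_w = np.array([2.0, -1.0])
    print(train(make_clients(true_w), dim=2))
```

Only model weights cross the client boundary here, which is the property the abstract calls privacy-preserving by nature; the uploaded weights are also the channel through which the disclosure risks discussed in the abstract can arise.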

Transformer based U-shaped medical image segmentation network: a survey

Liyao FU, Mengxiao YIN, Feng YANG

Journal of Computer Applications 2023, 43 (5): 1584-1595. DOI: 10.11772/j.issn.1001-9081.2022040530

Abstract （1691）

HTML （85）

PDF （1887KB）（1179）

U-shaped Network (U-Net) based on Fully Convolutional Network (FCN) is widely used as the backbone of medical image segmentation models, but Convolutional Neural Network (CNN) is not good at capturing long-range dependencies, which limits further performance improvement of segmentation models. To solve this problem, researchers have applied Transformer to medical image segmentation models to compensate for the deficiency of CNN, and U-shaped segmentation networks combined with Transformer have become a hot research topic. After a detailed introduction to U-Net and Transformer, the related medical image segmentation models were categorized by the position of the Transformer module: only in the encoder or decoder, in both the encoder and decoder, as a skip connection, and others. The basic contents, design concepts and possible improvements of these models were discussed, and the advantages and disadvantages of placing Transformer in different positions were analyzed. The analysis shows that the main factor in deciding the position of Transformer is the characteristics of the target segmentation task, and that segmentation models combining Transformer with U-Net can make better use of the advantages of both CNN and Transformer to improve segmentation performance, which gives them great development prospects and research value.

Review on interpretability of deep learning

Xia LEI, Xionglin LUO

Journal of Computer Applications 2022, 42 (11): 3588-3602. DOI: 10.11772/j.issn.1001-9081.2021122118

Abstract （1637）

HTML （94）

PDF （1703KB）（1283）

With the widespread application of deep learning, people increasingly rely on a large number of complex systems that adopt deep learning techniques. However, the black-box nature of deep learning models poses challenges to the use of these models in mission-critical applications and raises ethical and legal concerns. Therefore, making deep learning models interpretable is the first problem to be solved in order to make them trustworthy. As a result, research in the field of interpretable artificial intelligence has emerged, focusing mainly on explaining model decisions or behaviors explicitly to human observers. A review of interpretability for deep learning was performed to lay a good foundation for further in-depth research and for building more efficient and interpretable deep learning models. First, the interpretability of deep learning was outlined, and the requirements and definitions of interpretability research were clarified. Then, several typical models and algorithms of interpretability research were introduced from three aspects: explaining the logic rules, the decision attribution and the internal structure representation of deep learning models. In addition, three common methods for constructing intrinsically interpretable models were pointed out. Finally, the four evaluation indicators of fidelity, accuracy, robustness and comprehensibility were introduced briefly, and the possible future development directions of deep learning interpretability were discussed.

Review of application analysis and research progress of deep learning in weather forecasting

Runting DONG, Li WU, Xiaoying WANG, Tengfei CAO, Jianqiang HUANG, Qin GUAN, Jiexia WU

Journal of Computer Applications 2023, 43 (6): 1958-1968. DOI: 10.11772/j.issn.1001-9081.2022050745

Abstract （1593）

HTML （130）

PDF （1570KB）（3949）

With the advancement of technologies such as sensor networks and global positioning systems, the volume of meteorological data with both temporal and spatial characteristics has exploded, and research on deep learning models for Spatiotemporal Sequence Forecasting (STSF) has developed rapidly. The traditional machine learning methods long applied to weather forecasting perform unsatisfactorily in extracting the temporal correlations and spatial dependencies of data, whereas deep learning methods can extract features automatically through artificial neural networks, effectively improving the accuracy of weather forecasting, and perform very well in encoding and modeling long-term spatial information. At the same time, deep learning models driven by observational data are combined with Numerical Weather Prediction (NWP) models based on physical theories to build hybrid models with higher prediction accuracy and longer prediction time. On this basis, the application and research progress of deep learning in weather forecasting were reviewed. First, deep learning problems in weather forecasting were compared with classical deep learning problems in three aspects: data format, problem model and evaluation metrics. Then, the development history and application status of deep learning in weather forecasting were reviewed, and the latest progress in combining deep learning technologies with NWP was summarized and analyzed. Finally, future development directions and research focuses were discussed to provide a reference for future deep learning research in weather forecasting.

Survey of anonymity and tracking technology in Monero

Dingkang LIN, Jiaqi YAN, Nandeng BA, Zhenhao FU, Haochen JIANG

Journal of Computer Applications 2022, 42 (1): 148-156. DOI: 10.11772/j.issn.1001-9081.2021020296

Abstract （1587）

HTML （78）

PDF （723KB）（765）

Virtual digital currency provides a breeding ground for terrorist financing, money laundering, drug trafficking and other criminal activities. As a representative emerging digital currency, Monero is widely acknowledged to be highly anonymous. To address the use of Monero's anonymity to commit crimes, Monero's anonymity technology and tracking technology were explored and the research progress of recent years was reviewed, so as to provide technical support for effectively tackling crimes based on blockchain technology. Specifically, the evolution of Monero's anonymity technology was summarized, and the tracking strategies proposed in academia against it were sorted out. First, among the anonymity technologies, ring signature, guaranteed unlinkability (one-off public key), guaranteed untraceability, and the important version upgrades for improving anonymity were introduced. Then, among the tracking technologies, attacks such as the zero mixin attack, output merging attack, guess-newest attack, closed set attack, transaction flooding attack, tracing attacks from remote nodes and the Monero ring attack were introduced. Finally, based on the analysis of anonymity technologies and tracking strategies, four conclusions were obtained: the development of Monero's anonymity technology and that of its tracking technology promote each other; the application of Ring Confidential Transactions (RingCT) is a double-edged sword, which makes passive attack methods based on currency value ineffective but also makes active attack methods easier to succeed; the output merging attack and the zero mixin attack complement each other; and Monero's system security chain still needs to be sorted out.

Review of anomaly detection algorithms for multidimensional time series

HU Min, BAI Xue, XU Wei, WU Bingjian

Journal of Computer Applications 2020, 40 (6): 1553-1564. DOI: 10.11772/j.issn.1001-9081.2019101805

Abstract （1562）

PDF （930KB）（3566）

With the continuous development of information technology, the scale of time series data has grown exponentially, which brings both opportunities and challenges to the development of time series anomaly detection algorithms and has made such algorithms a new research hotspot in data analysis. However, research in this area is still in its initial stage and is not yet systematic. Therefore, by sorting out and analyzing domestic and foreign literature, this paper divides the research content of multidimensional time series anomaly detection, in logical order, into three aspects: dimension reduction, time series pattern representation and anomaly pattern detection, and summarizes the mainstream algorithms to give a comprehensive picture of the current research status and characteristics of anomaly detection. On this basis, the research difficulties and trends of multidimensional time series anomaly detection algorithms are summarized to provide a useful reference for related theoretical and applied research.

Overview of blockchain consensus mechanism for internet of things

TIAN Zhihong, ZHAO Jindong

Journal of Computer Applications 2021, 41 (4): 917-929. DOI: 10.11772/j.issn.1001-9081.2020111722

Abstract （1545）

PDF （1143KB）（2312）

With the continuous development of digital currency, blockchain technology has attracted increasing attention, and research on its key technology, the consensus mechanism, is particularly important. The application of blockchain technology in the Internet of Things (IoT) is a hot topic. As one of the core technologies of blockchain, the consensus mechanism has an important impact on IoT in terms of degree of decentralization, transaction processing speed, transaction confirmation delay, security and scalability. First, the architectural characteristics of IoT and the lightweight problem caused by resource limitations were described, the problems faced in implementing blockchain in IoT were briefly summarized, and the requirements for blockchain in IoT were analyzed with reference to the operation flow of Bitcoin. Second, consensus mechanisms were divided into the proof class, the Byzantine class and the Directed Acyclic Graph (DAG) class; the working principles of these classes were studied, their adaptability to IoT was analyzed in terms of communication complexity, their advantages and disadvantages were summarized, and the existing architectures combining consensus mechanisms with IoT were investigated and analyzed. Finally, IoT problems such as high operating cost, poor scalability and security risks were studied in depth. The analysis results show that the Internet of Things Application (IOTA) and Byteball consensus mechanisms based on DAG technology offer fast transaction processing, good scalability and strong security when there are a large number of transactions, and that they represent the future development direction of blockchain consensus mechanisms in the field of IoT.

Review on deep learning-based pedestrian re-identification

YANG Feng, XU Yu, YIN Mengxiao, FU Jiacheng, HUANG Bing, LIANG Fangxuan

Journal of Computer Applications 2020, 40 (5): 1243-1252. DOI: 10.11772/j.issn.1001-9081.2019091703

Abstract （1301）

PDF （1156KB）（1389）

Pedestrian Re-IDentification (Re-ID) is a hot topic in computer vision that mainly focuses on “how to associate a specific person captured by different cameras in different physical locations”. Traditional Re-ID methods were mainly based on the extraction of low-level features, such as local descriptors, color histograms and human poses. In recent years, to address problems of traditional methods such as pedestrian occlusion and posture misalignment, deep learning-based pedestrian Re-ID methods built on regions, attention mechanisms, posture and Generative Adversarial Network (GAN) were proposed, and the experimental results became significantly better than before. Therefore, research on deep learning in pedestrian Re-ID was summarized and classified; unlike previous reviews, this review divides pedestrian Re-ID methods into four categories for discussion. First, the deep learning-based pedestrian Re-ID methods were summarized under four categories: region, attention, posture and GAN. Then, the performance of these methods on mainstream datasets in terms of the mean Average Precision (mAP) and Rank-1 indicators was analyzed. The results show that deep learning-based methods can reduce model overfitting by enhancing the connection between local features and narrowing domain gaps. Finally, the development direction of pedestrian Re-ID research was forecast.

Research advances in disentangled representation learning

Keyang CHENG, Chunyun MENG, Wenshan WANG, Wenxi SHI, Yongzhao ZHAN

Journal of Computer Applications 2021, 41 (12): 3409-3418. DOI: 10.11772/j.issn.1001-9081.2021060895

Abstract （1252）

HTML （151）

PDF （877KB）（573）

The purpose of disentangled representation learning is to model the key factors that affect the form of data, so that a change in one key factor changes the data in only one feature while the other features are not affected. It helps to meet the challenges of machine learning in model interpretability, object generation and operation, zero-shot learning and other issues. Therefore, disentangled representation learning has always been a research hotspot in machine learning. Starting from the history and motivations of disentangled representation learning, its research status and applications were summarized, and its invariance, reusability and other characteristics were analyzed. Research on the factors of variation via generative entangling, with manifold interaction, and using adversarial training was introduced, as were the latest research trends such as the Variational Auto-Encoder (VAE) variant named β-VAE. In addition, the typical applications of disentangled representation learning were presented, and future research directions were discussed.


Review of fine-grained image categorization

SHEN Zhijun, MU Lina, GAO Jing, SHI Yuanhang, LIU Zhiqiang

Journal of Computer Applications 2023, 43 (1): 51-60. DOI: 10.11772/j.issn.1001-9081.2021122090

Abstract （1202）

HTML （64）

PDF （2674KB）（692）


Fine-grained images are characterized by large intra-class variance and small inter-class variance, which makes Fine-Grained Image Categorization (FGIC) much more difficult than traditional image classification tasks. The application scenarios, task difficulties, algorithm development history and related common datasets of FGIC were described, and an overview of related algorithms was presented. Classification methods based on local detection usually use operations of connection, summation and pooling; their model training is complex and they have many limitations in practical applications. Classification methods based on linear features simulate the two neural pathways of human vision for recognition and localization respectively, and their classification effect is relatively better. Classification methods based on attention mechanism simulate the way humans observe external things, first scanning the panorama, then locking onto the key attention area and forming the attention focus, and their classification effect is further improved. In view of the shortcomings of the current research, the next research directions of FGIC were proposed.


Review of remote sensing image change detection

REN Qiuru, YANG Wenzhong, WANG Chuanjian, WEI Wenyu, QIAN Yunyun

Journal of Computer Applications 2021, 41 (8): 2294-2305. DOI: 10.11772/j.issn.1001-9081.2020101632

Abstract （1186）

PDF （1683KB）（1687）


As a key technology of land use/land cover detection, change detection aims to detect the changed parts and their types in the remote sensing data of the same region in different periods. In view of the problems in traditional change detection methods, such as heavy manual labor and poor detection results, a large number of change detection methods based on remote sensing images have been proposed. In order to better understand change detection technology based on remote sensing images and to support further study of change detection methods, a comprehensive review of change detection was carried out by sorting, analyzing and comparing a large number of studies on change detection. Firstly, the development process of change detection was described. Then, the research progress of change detection was summarized in detail from three aspects: data selection and preprocessing, change detection technology, and post-processing and precision evaluation, where the change detection technology was summarized mainly in terms of analysis unit and comparison method. Finally, the problems in each stage of change detection were summarized and the future development directions were proposed.


Review of spatio-temporal trajectory sequence pattern mining methods

KANG Jun, HUANG Shan, DUAN Zongtao, LI Yixiu

Journal of Computer Applications 2021, 41 (8): 2379-2385. DOI: 10.11772/j.issn.1001-9081.2020101571

Abstract （1098）

PDF （1204KB）（1623）


With the rapid development of global positioning technology and mobile communication technology, huge amounts of trajectory data have appeared. These data truly reflect the moving patterns and behavior characteristics of moving objects in the spatio-temporal environment, and they contain a wealth of information of important application value for fields such as urban planning, traffic management, service recommendation, and location prediction. The applications of spatio-temporal trajectory data in these fields usually rely on sequence pattern mining of spatio-temporal trajectory data. Spatio-temporal trajectory sequence pattern mining aims to find frequently occurring sequence patterns in a spatio-temporal trajectory dataset, such as location patterns (frequent trajectories, hot spots), activity periodic patterns, and semantic behavior patterns, so as to mine hidden information in the spatio-temporal data. The research progress of spatio-temporal trajectory sequence pattern mining in recent years was summarized. Firstly, the data characteristics and applications of spatio-temporal trajectory sequences were introduced. Then, the mining process of spatio-temporal trajectory patterns was described: the research situation in this field was introduced from the perspectives of mining location patterns, periodic patterns and semantic patterns based on spatio-temporal trajectory sequences. Finally, the problems existing in the current spatio-temporal trajectory sequence pattern mining methods were elaborated, and the future development trends of spatio-temporal trajectory sequence pattern mining methods were discussed.


Survey on interpretability research of deep learning

Lingmin LI, Mengran HOU, Kun CHEN, Junmin LIU

Journal of Computer Applications 2022, 42 (12): 3639-3650. DOI: 10.11772/j.issn.1001-9081.2021091649

Abstract （1089）

HTML （77）

PDF （4239KB）（731）


In recent years, deep learning has been widely used in many fields. However, due to the highly nonlinear operations of deep neural network models, the interpretability of these models is poor; they are often referred to as “black box” models and cannot be applied to some key fields with high performance requirements. Therefore, it is necessary to study the interpretability of deep learning. Firstly, deep learning was introduced briefly. Then, around the interpretability of deep learning, the existing research work was analyzed from eight aspects, including hidden layer visualization, Class Activation Mapping (CAM), sensitivity analysis, frequency principle, robust disturbance test, information theory, interpretable module and optimization method. At the same time, the applications of deep learning in the fields of network security, recommender systems, medicine and social networks were demonstrated. Finally, the existing problems and future development directions of deep learning interpretability research were discussed.


Survey of label noise learning algorithms based on deep learning

Boyi FU, Yuncong PENG, Xin LAN, Xiaolin QIN

Journal of Computer Applications 2023, 43 (3): 674-684. DOI: 10.11772/j.issn.1001-9081.2022020198

Abstract （1081）

HTML （83）

PDF （2083KB）（765）

PDF（mobile）（733KB）（49）


In the field of deep learning, a large number of correctly labeled samples are essential for model training. However, in practical applications, labeling data is costly. At the same time, the quality of labeled samples is affected by subjective factors or by the tools and techniques of manual labeling, which inevitably introduces label noise in the annotation process. Therefore, the training data available for practical applications is subject to a certain amount of label noise. How to train effectively on data with label noise has become a research hotspot. Aiming at label noise learning algorithms based on deep learning, firstly, the source, classification and impact of label noise learning strategies were elaborated; secondly, four label noise learning strategies based on data, loss function, model and training method were analyzed according to different elements of machine learning; then, a basic framework for learning with label noise in various application scenarios was provided; finally, some optimization ideas were given, and challenges and future development directions of label noise learning algorithms were proposed.


Survey of single target tracking algorithms based on Siamese network

Mengting WANG, Wenzhong YANG, Yongzhi WU

Journal of Computer Applications 2023, 43 (3): 661-673. DOI: 10.11772/j.issn.1001-9081.2022010150

Abstract （1064）

HTML （132）

PDF （2647KB）（784）


Single object tracking is an important research direction in the field of computer vision, and has a wide range of applications in video surveillance, autonomous driving and other fields. Although many surveys of single object tracking algorithms exist, most of them focus on correlation filters or deep learning. In recent years, Siamese network-based tracking algorithms have received extensive attention from researchers for their balance between accuracy and speed, but there are relatively few surveys of this type of algorithm, and a systematic analysis of the algorithms at the architectural level is lacking. In order to deeply understand the single object tracking algorithms based on Siamese network, a large body of related literature was organized and analyzed. Firstly, the structures and applications of the Siamese network were expounded, and each tracking algorithm was introduced according to a classification by the components of the Siamese tracking algorithm architectures. Then, the commonly used datasets and evaluation metrics in the field of single object tracking were listed, the overall and per-attribute performance of 25 mainstream tracking algorithms was compared and analyzed on the OTB 2015 (Object Tracking Benchmark) dataset, and the performance and inference speed of 23 Siamese network-based tracking algorithms on the LaSOT (Large-scale Single Object Tracking) and GOT-10K (Generic Object Tracking) test sets were listed. Finally, the research on Siamese network-based tracking algorithms was summarized, and the possible future research directions of this type of algorithm were discussed.


Survey of event extraction

Chunming MA, Xiuhong LI, Zhe LI, Huiru WANG, Dan YANG

Journal of Computer Applications 2022, 42 (10): 2975-2989. DOI: 10.11772/j.issn.1001-9081.2021081542

Abstract （1057）

HTML （149）

PDF （3054KB）（605）


Event extraction is the task of extracting the events a user is interested in from unstructured information and presenting them to the user in a structured way. Event extraction has a wide range of applications in information collection, information retrieval, document synthesis, and question answering. From an overall perspective, event extraction algorithms can be divided into four categories: pattern matching algorithms, trigger lexical methods, ontology-based algorithms, and cutting-edge joint model methods. In the research process, different evaluation methods and datasets can be used according to the related needs, and different event representation methods are also related to event extraction research. Distinguished by task type, meta-event extraction and subject event extraction are the two basic tasks of event extraction. Among them, meta-event extraction has three kinds of methods, based on pattern matching, machine learning and neural networks respectively, while there are two ways to extract subject events: based on the event framework and based on ontology. Event extraction research has achieved excellent results in single languages such as Chinese and English, but cross-language event extraction still faces many problems. Finally, the related works of event extraction were summarized and the future research directions were discussed in order to provide guidelines for subsequent research.


Review of pre-trained models for natural language processing tasks

LIU Ruiheng, YE Xia, YUE Zengying

Journal of Computer Applications 2021, 41 (5): 1236-1246. DOI: 10.11772/j.issn.1001-9081.2020081152

Abstract （1023）

PDF （1296KB）（3221）


In recent years, deep learning technology has developed rapidly. In Natural Language Processing (NLP) tasks, with text representation technology rising from the word level to the document level, unsupervised pre-training on a large-scale corpus has been shown to effectively improve the performance of models in downstream tasks. Firstly, according to the development of text feature extraction technology, typical models were analyzed at the word level and the document level. Secondly, the research status of the current pre-trained models was analyzed from the two stages of pre-training target task and downstream application, and the characteristics of the representative models were summed up. Finally, the main challenges faced by the development of pre-trained models were summarized and the future prospects were discussed.


Survey on imbalanced multi‑class classification algorithms

Mengmeng LI, Yi LIU, Gengsong LI, Qibin ZHENG, Wei QIN, Xiaoguang REN

Journal of Computer Applications 2022, 42 (11): 3307-3321. DOI: 10.11772/j.issn.1001-9081.2021122060

Abstract （996）

HTML （99）

PDF （1861KB）（646）


Imbalanced data classification is an important research topic in machine learning, but most of the existing imbalanced data classification algorithms focus on binary classification, and there are relatively few studies on imbalanced multi-class classification. However, datasets in practical applications usually have multiple classes and imbalanced data distribution, and the diversity of classes further increases the difficulty of imbalanced data classification, so the multi-class classification problem has become a research topic to be solved urgently. The imbalanced multi-class classification algorithms proposed in recent years were reviewed. According to whether a decomposition strategy was adopted, imbalanced multi-class classification algorithms were divided into decomposition methods and ad-hoc methods. Furthermore, according to the decomposition strategy adopted, the decomposition methods were divided into two frameworks: One Vs. One (OVO) and One Vs. All (OVA). And according to the technologies used, the ad-hoc methods were divided into data-level methods, algorithm-level methods, cost-sensitive methods, ensemble methods and deep network-based methods. The advantages and disadvantages of these methods and their representative algorithms were systematically described, the evaluation indicators of imbalanced multi-class classification methods were summarized, the performance of the representative methods was deeply analyzed through experiments, and the future development directions of imbalanced multi-class classification were discussed.


Review of YOLO algorithm and its applications to object detection in autonomous driving scenes

Yaping DENG, Yingjiang LI

Journal of Computer Applications 2024, 44 (6): 1949-1958. DOI: 10.11772/j.issn.1001-9081.2023060889

Abstract （993）

HTML （42）

PDF （1175KB）（881）


Object detection in autonomous driving scenes is one of the important research directions in computer vision. Research focuses on ensuring real-time and accurate detection of objects by autonomous vehicles. Recently, deep learning technology has developed rapidly, and its wide application in the field of autonomous driving has prompted substantial progress in this field. An analysis was conducted on the research status of object detection by YOLO (You Only Look Once) algorithms in the field of autonomous driving from the following four aspects. Firstly, the ideas and improvement methods of the single-stage YOLO series of detection algorithms were summarized, and the advantages and disadvantages of the YOLO series of algorithms were analyzed. Secondly, the YOLO algorithm-based object detection applications in autonomous driving scenes were introduced, and the research status and applications for the detection and recognition of traffic vehicles, pedestrians, and traffic signals were expounded and summarized respectively. Additionally, the commonly used evaluation indicators in object detection, as well as the object detection datasets and autonomous driving scene datasets, were summarized. Lastly, the problems and future development directions of object detection were discussed.


Survey of sentiment analysis based on image and text fusion

MENG Xiangrui, YANG Wenzhong, WANG Ting

Journal of Computer Applications 2021, 41 (2): 307-317. DOI: 10.11772/j.issn.1001-9081.2020060923

Abstract （929）

PDF （1277KB）（1821）


With the continuous improvement of information technology, the amount of image-text data carrying sentiment orientation on various social platforms is growing rapidly, and sentiment analysis with image and text fusion has attracted wide attention. A single-modality sentiment analysis method can no longer meet the demands of multi-modal data. Aiming at the technical problems of image and text sentiment feature extraction and fusion, firstly, the widely used image-text sentiment analysis datasets were listed, and the extraction methods of text features and image features were introduced. Then, the current fusion modes of image features and text features were examined in detail, and the problems existing in the process of image-text sentiment analysis were briefly described. Finally, the future research directions of sentiment analysis were summarized and discussed. In order to gain a deeper understanding of image-text fusion technology, the literature research method was adopted to review the study of image-text sentiment analysis, which helps to compare the differences between fusion methods and to find more valuable research schemes.


Project Articles