    2024,35(4):1585-1586, DOI: 10.13328/j.cnki.jos.007020
    2024,35(4):1587-1600, DOI: 10.13328/j.cnki.jos.007014
    [Abstract] (571) [HTML] (45) [PDF 2.18 M] (1137)
    Abstract:
    With the development of technologies such as big data, computing, and the Internet, artificial intelligence techniques represented by machine learning and deep learning have achieved tremendous success. Particularly, the emergence of various large-scale models has greatly accelerated the application of artificial intelligence in various fields. However, the success of these techniques heavily relies on massive training data and abundant computing resources, which significantly limits their application in data- or resource-scarce domains. Therefore, how to learn from limited samples, known as few-shot learning, has become a crucial research problem in the new wave of industrial transformation led by artificial intelligence. The most commonly used approach in few-shot learning is based on meta-learning. Such methods learn meta-knowledge for solving similar tasks by training on a series of related training tasks, which enables fast learning on new testing tasks using the acquired meta-knowledge. Although these methods have achieved sound results in few-shot classification tasks, they assume that the training and testing tasks come from the same distribution. This implies that a sufficient number of training tasks is required for the model to generalize the learned meta-knowledge to continuously changing testing tasks. However, in some real-world scenarios with truly limited data, ensuring an adequate number of training tasks is challenging. To address this issue, this study proposes a robust few-shot classification method based on diverse and authentic task generation (DATG). The method generates additional training tasks by applying Mixup to a small number of existing tasks, aiding the model in learning. By constraining the diversity and authenticity of the generated tasks, this method effectively improves the generalization of few-shot classification methods. Specifically, the base classes in the training set are first clustered to obtain different clusters, and then tasks are selected from different clusters for Mixup to increase task diversity. Furthermore, performing inter-cluster task Mixup helps alleviate the learning of pseudo-discriminative features highly correlated with the categories. To ensure that the generated tasks do not deviate too much from the real distribution and mislead the model’s learning, the maximum mean discrepancy (MMD) between the generated tasks and real tasks is minimized, thus ensuring the authenticity of the generated tasks. Finally, it is theoretically analyzed why the inter-cluster task Mixup strategy can improve the model’s generalization performance. Experimental results on multiple datasets further demonstrate the effectiveness of the proposed method.
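As a rough, self-contained sketch of the authenticity constraint described above (not the authors' implementation), the following code estimates the squared MMD between feature batches of a real task and a Mixup-generated task using a Gaussian kernel; the feature dimension, kernel bandwidth, and Mixup coefficient are all hypothetical.

```python
import numpy as np

def gaussian_kernel(x, y, sigma=1.0):
    """RBF kernel matrix between two batches of feature vectors: (n, d) x (m, d) -> (n, m)."""
    sq_dists = ((x[:, None, :] - y[None, :, :]) ** 2).sum(-1)
    return np.exp(-sq_dists / (2.0 * sigma ** 2))

def mmd_squared(real_feats, gen_feats, sigma=1.0):
    """V-statistic estimate of squared MMD between real and generated task features."""
    k_rr = gaussian_kernel(real_feats, real_feats, sigma).mean()
    k_gg = gaussian_kernel(gen_feats, gen_feats, sigma).mean()
    k_rg = gaussian_kernel(real_feats, gen_feats, sigma).mean()
    return k_rr + k_gg - 2.0 * k_rg

# Toy usage: features of a real task and a Mixup-generated task
rng = np.random.default_rng(0)
real = rng.normal(size=(32, 64))
lam = 0.4  # hypothetical Mixup coefficient
gen = lam * real + (1 - lam) * rng.normal(size=(32, 64))
print(mmd_squared(real, gen))
```

Minimizing such a term during task generation would penalize generated tasks whose feature distribution drifts too far from that of the real tasks.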
    2024,35(4):1601-1617, DOI: 10.13328/j.cnki.jos.007016
    [Abstract] (880) [HTML] (50) [PDF 2.51 M] (1368)
    Abstract:
    Code representation aims to extract the characteristics of source code to obtain its semantic embedding, playing a crucial role in deep learning-based code intelligence. Traditional handcrafted code representation methods mainly rely on domain expert annotations, which are time-consuming and labor-intensive. Moreover, the obtained code representations are task-specific and not easily reusable across downstream tasks, which contradicts the green and sustainable development concept. To this end, many large-scale pretraining models for source code representation have shown remarkable success in recent years. These methods utilize massive source code for self-supervised learning to obtain universal code representations, which are then easily fine-tuned for various downstream tasks. Based on the abstraction levels of programming languages, code representations have features at four levels: text level, semantic level, functional level, and structural level. Nevertheless, current models for code representation treat programming languages merely as ordinary text sequences resembling natural language. They overlook the functional-level and structural-level features, which leads to inferior performance. To overcome this drawback, this study proposes a representation enhanced contrastive multimodal pretraining (REcomp) framework for code representation pretraining. REcomp develops a novel semantic-level to structure-level feature fusion algorithm, which is employed for serializing abstract syntax trees. Through a multi-modal contrastive learning approach, this composite feature is integrated with both the textual and functional features of programming languages, enabling more precise semantic modeling. Extensive experiments are conducted on three real-world public datasets. Experimental results clearly validate the superiority of REcomp.
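The abstract does not specify the exact contrastive objective; as a hedged stand-in, the sketch below uses a standard one-directional InfoNCE loss to pull paired embeddings of two code modalities (e.g., a text-level embedding and a fused AST embedding, both hypothetical) together while pushing apart mismatched pairs within the batch.

```python
import numpy as np

def info_nce(anchor, positive, temperature=0.07):
    """One-directional InfoNCE: the i-th anchor should match the i-th positive in the batch."""
    a = anchor / np.linalg.norm(anchor, axis=1, keepdims=True)
    p = positive / np.linalg.norm(positive, axis=1, keepdims=True)
    logits = a @ p.T / temperature                # (batch, batch) cosine-similarity logits
    logits -= logits.max(axis=1, keepdims=True)   # numerical stability
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_prob))            # diagonal entries are the true pairs

# Toy usage: 4 paired code-modality embeddings of dimension 16
rng = np.random.default_rng(0)
text_emb = rng.normal(size=(4, 16))
ast_emb = text_emb + 0.1 * rng.normal(size=(4, 16))
print(info_nce(text_emb, ast_emb))
```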
    2024,35(4):1618-1650, DOI: 10.13328/j.cnki.jos.007011
    [Abstract] (2408) [HTML] (65) [PDF 4.71 M] (2470)
    Abstract:
    In recent years, deep reinforcement learning (DRL) has achieved remarkable success in many sequential decision-making tasks. However, its current success heavily relies on massive learning data and computing resources. Poor sample efficiency and weak strategy generalization ability are the key factors restricting DRL’s further development. Meta-reinforcement learning (Meta-RL) studies how to adapt to a wider range of tasks with smaller sample sizes. Related research is expected to alleviate the above limitations and promote the development of reinforcement learning. Considering the research objects and application scope of current works, this study comprehensively reviews the research progress in the field of meta-reinforcement learning. Firstly, a basic introduction is given to deep reinforcement learning and the background of meta-reinforcement learning. Then, meta-reinforcement learning is formally defined, common scene settings are summarized, and the current research progress of meta-reinforcement learning is introduced from the perspective of the application range of the research results. Finally, the research challenges and potential future development directions are discussed.
    2024,35(4):1651-1666, DOI: 10.13328/j.cnki.jos.007010
    [Abstract] (355) [HTML] (28) [PDF 2.28 M] (1096)
    Abstract:
    Unsupervised domain adaptation (UDA) has achieved success in solving the problem that the training set (source domain) and the test set (target domain) come from different distributions. In low-energy-consumption and open dynamic task environments, with the emergence of resource constraints and public classes, existing UDA methods encounter severe challenges. Source-free open-set domain adaptation (SF-ODA) aims to transfer the knowledge from the source model to the unlabeled target domain where public classes appear, thus realizing the identification of common classes and the detection of public classes without the source data. Existing SF-ODA methods focus on designing source models that accurately detect public classes or on modifying the model structures. However, they not only require extra storage space and training overhead, but are also difficult to implement in strict privacy scenarios. This study proposes a more practical scenario: active learning source-free open-set domain adaptation (ASF-ODA), which achieves a robust transfer based on a commonly trained source model and a small number of valuable target samples labeled by experts. A local consistent active learning (LCAL) algorithm is proposed to achieve this objective. First of all, LCAL includes a newly proposed active selection method, local diversity selection, which selects more valuable target domain samples and promotes the separation of threshold-fuzzy samples by taking advantage of the local label consistency of features in the target domain. Then, based on information entropy, LCAL initially selects a possible common class set and a possible public class set, and corrects these two sets with the labeled samples obtained in the first step to obtain two corresponding reliable sets. Finally, LCAL introduces an open-set loss and an information maximization loss to further promote the separation of common and public classes, and introduces a cross-entropy loss to realize the discrimination of common classes. A large number of experiments on three publicly available benchmark datasets, Office-31, Office-Home, and VisDA-C, show that with the help of a small number of valuable target samples, LCAL significantly outperforms existing active learning methods and SF-ODA methods, with over 20% HOS improvements in some transfer tasks.
    2024,35(4):1667-1681, DOI: 10.13328/j.cnki.jos.007009
    [Abstract] (580) [HTML] (35) [PDF 2.61 M] (1079)
    Abstract:
    Open-set recognition is an important issue for ensuring the efficient and robust deployment of machine learning models in the open world. It aims to address the challenge of encountering samples from unseen classes that emerge during testing, i.e., to accurately classify the seen classes while identifying and rejecting the unseen ones. Current open-set recognition studies assume that the covariate distribution of the seen classes remains constant during both training and testing. However, in practical scenarios, the covariate distribution is constantly shifting, which can cause previous methods to fail, and their performance may even be worse than the baseline method. Therefore, it is urgent to study novel open-set recognition methods that can adapt to the constantly changing covariate distribution so that they can robustly classify seen categories and identify unseen categories during testing. This novel problem is named adaptation in the open world (AOW), and a test-time adaptation method for open-set recognition, called open-set test-time adaptation (OTA), is proposed. The OTA method only utilizes unlabeled test data to update the model with an adaptive entropy loss and an open-set entropy loss, maintaining the model’s ability to discriminate seen classes while further enhancing its ability to recognize unseen classes. Comprehensive experiments are conducted on multiple benchmark datasets with different covariate shift levels. The results show that the proposed method is robust to covariate shift and demonstrates superior performance compared to many state-of-the-art methods.
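As a minimal illustration of entropy-based test-time adaptation (the actual adaptive and open-set entropy losses in OTA may differ), the sketch below computes the mean Shannon entropy of per-sample class posteriors over an unlabeled test batch; minimizing it encourages confident predictions on seen classes during adaptation. All shapes and values are hypothetical.

```python
import numpy as np

def softmax(logits, axis=-1):
    z = logits - logits.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def prediction_entropy(logits):
    """Mean Shannon entropy of per-sample class posteriors on an unlabeled test batch."""
    p = softmax(logits)
    return -(p * np.log(p + 1e-12)).sum(axis=1).mean()

# Toy usage: an unlabeled test batch of 8 samples over 5 seen classes
rng = np.random.default_rng(1)
logits = rng.normal(size=(8, 5))
print(prediction_entropy(logits))
```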
    2024,35(4):1682-1702, DOI: 10.13328/j.cnki.jos.007012
    [Abstract] (582) [HTML] (43) [PDF 2.64 M] (1173)
    Abstract:
    Neural architecture search (NAS) is an important part of automated machine learning and has been widely used in multiple fields, including computer vision, speech recognition, etc. NAS can search the optimal deep neural network structures for specific data, scenarios, and tasks. In recent years, NAS has been increasingly applied to brain data analysis, significantly improving performance in multiple application fields, such as brain image segmentation, feature extraction, brain disease auxiliary diagnosis, etc. Such research has demonstrated the advantages of low-energy automated machine learning in the field of brain data analysis. NAS-based brain data analysis is one of the current research hotspots, but it still faces certain challenges. At present, there is little review literature available for reference in this field worldwide. This study conducts a detailed survey and analysis of relevant literature from different perspectives, including search frameworks, search space, search strategies, research tasks, and experimental data. At the same time, a systematic summary of brain datasets that can be used for NAS training is also provided. In addition, challenges and future research directions of NAS in brain data analysis are discussed.
    Available online:  April 12, 2024 , DOI: 10.13328/j.cnki.jos.007146
    Abstract:
    A function-as-a-service (FaaS) workflow, composed of multiple function services, can realize a complex business application by orchestrating and controlling the function services. Current FaaS workflow execution systems achieve data transfer among function services mainly through centralized data storage, resulting in heavy data transmission overhead and significantly affecting application performance. Under high concurrency, frequent data transmission will also cause serious contention for network bandwidth resources, resulting in application performance degradation. To address the above problems, this study analyzes the fine-grained data dependency between function services and proposes a critical path-based FaaS workflow deployment optimization method. In addition, the study designs a dependency-sensitive data access and management mechanism to effectively reduce the data transmission between function services, thereby reducing the data transmission latency and end-to-end execution latency of FaaS workflow applications. The study implements a FaaS workflow system, FineFlow, and conducts experiments based on five real-world FaaS workflow applications. The experimental results show that FineFlow can effectively reduce the data transmission latency (the highest reduction and the average reduction are 74.6% and 63.8%, respectively) compared with the FaaS workflow platform whose function interaction mechanism is based on centralized data storage. On average, FineFlow reduces the latency of end-to-end FaaS workflow executions by 19.6%. In particular, for the FaaS workflow application with fine-grained data dependencies, FineFlow can further reduce its data transmission latency and end-to-end execution latency by 28.4% and 13.8%, respectively, compared with the state-of-the-art work. In addition, FineFlow can effectively alleviate the impact of network bandwidth fluctuations on application performance by reducing cross-node data transmission, improving the robustness of application performance against network bandwidth changes.
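To make the "critical path" idea concrete, here is a hedged sketch (not FineFlow's algorithm) that computes the longest path of a workflow DAG in which node weights are hypothetical function execution times and edge weights are hypothetical data-transfer costs; deployment optimization would then prioritize co-locating functions along this path.

```python
from collections import defaultdict, deque

def critical_path(tasks, edges):
    """Longest path of a workflow DAG: tasks maps name -> execution time,
    edges is a list of (u, v, transfer_cost). Returns (path, total_latency)."""
    succ = defaultdict(list)
    indeg = defaultdict(int)
    for u, v, w in edges:
        succ[u].append((v, w))
        indeg[v] += 1
    dist = {t: tasks[t] for t in tasks}      # best end-to-end latency finishing at t
    prev = {t: None for t in tasks}
    queue = deque(t for t in tasks if indeg[t] == 0)
    while queue:                              # Kahn's topological order with relaxation
        u = queue.popleft()
        for v, w in succ[u]:
            if dist[u] + w + tasks[v] > dist[v]:
                dist[v] = dist[u] + w + tasks[v]
                prev[v] = u
            indeg[v] -= 1
            if indeg[v] == 0:
                queue.append(v)
    end = max(dist, key=dist.get)
    total = dist[end]
    path = []
    while end is not None:
        path.append(end)
        end = prev[end]
    return list(reversed(path)), total

# Toy usage: a four-function workflow with data-transfer costs on edges
tasks = {"A": 2, "B": 3, "C": 1, "D": 2}
edges = [("A", "B", 4), ("A", "C", 1), ("B", "D", 2), ("C", "D", 5)]
print(critical_path(tasks, edges))  # expected critical path A -> B -> D
```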
    Available online:  April 03, 2024 , DOI: 10.13328/j.cnki.jos.007054
    Abstract:
    Deep learning has been widely employed in many fields and yields excellent performance. However, this often requires the support of large amounts of labeled data, which usually means high costs and harsh application conditions. Therefore, with the development of deep learning, how to break through data limitations in practical scenarios has become an important research problem. Specifically, as one of the most important research directions, semi-supervised learning greatly relieves the data requirement pressure of deep learning by conducting learning with the assistance of abundant unlabeled data and a small amount of labeled data. The pseudo-labeling method plays a significant role in semi-supervised learning, and the quality of its generated pseudo labels influences the final results of semi-supervised learning. Focusing on pseudo-labeling in semi-supervised learning, this study proposes a pseudo-labeling method based on optimal transport theory, which constrains the pseudo-labeling procedure with labeled data as guidance for the generation process. On this basis, the pseudo-labeling procedure is converted into an optimal transport optimization problem, which offers a new formulation for pseudo-labeling. Meanwhile, to solve this problem, the study introduces the Sinkhorn-Knopp algorithm for fast approximate solutions to avoid the heavy computation burden. As an independent module, the proposed method can be combined with other semi-supervised learning techniques such as consistency regularization to form a complete semi-supervised learning method. Finally, this study conducts experiments on four classic public image classification datasets, CIFAR-10, SVHN, MNIST, and FashionMNIST, to verify the effectiveness of the proposed method. The experimental results show that compared with state-of-the-art semi-supervised learning methods, this method yields better performance, especially with fewer labeled data.
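A minimal sketch of the Sinkhorn-Knopp iteration mentioned above, assuming a simple cost of one minus the predicted class probability and uniform marginals (the paper's exact cost, marginals, and constraints are not reproduced here); the rows of the resulting transport plan can be read as soft pseudo-labels.

```python
import numpy as np

def sinkhorn_knopp(cost, row_marginal, col_marginal, eps=0.05, n_iter=200):
    """Approximate entropic optimal transport plan via Sinkhorn-Knopp scaling.
    cost: (n_samples, n_classes); row/col marginals define the mass constraints."""
    K = np.exp(-cost / eps)                      # Gibbs kernel
    u = np.ones_like(row_marginal)
    for _ in range(n_iter):                      # alternate row/column scaling
        v = col_marginal / (K.T @ u)
        u = row_marginal / (K @ v)
    return u[:, None] * K * v[None, :]

# Toy usage: 6 unlabeled samples, 3 classes assumed balanced
rng = np.random.default_rng(0)
prob = rng.dirichlet(np.ones(3), size=6)         # model's predicted probabilities
plan = sinkhorn_knopp(1.0 - prob, np.full(6, 1 / 6), np.full(3, 1 / 3))
pseudo_labels = plan / plan.sum(axis=1, keepdims=True)
print(pseudo_labels.round(2))
```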
    Available online:  March 27, 2024 , DOI: 10.13328/j.cnki.jos.007084
    Abstract:
    The homegrown Shenwei AI acceleration card is equipped with a Shenwei many-core processor enhanced with systolic arrays, and although its intelligent computing power is comparable to that of mainstream GPUs, it still lacks basic software support. To lower the utilization threshold of the Shenwei AI acceleration card and effectively support the development of AI applications, this study designs a runtime system, SDAA, for the Shenwei AI acceleration card, whose semantics are consistent with mainstream CUDA. For key paths such as memory management, data transmission, and kernel function launch, a software-hardware co-design method is adopted to realize a multi-level memory allocation algorithm combining segmented and paged memory on the card, a pageable memory transmission model with multiple threads and channels, an adaptive data transmission algorithm with multiple heterogeneous components, and a fast kernel function launch method based on on-chip array communication. As a result, the runtime performance of SDAA is better than that of mainstream GPUs. The experimental results indicate that the memory allocation speed of SDAA is 120 times that of the corresponding NVIDIA V100 interface, the memory transmission overhead is half that of the corresponding interface, and the data transmission bandwidth is 1.7 times that of the corresponding interface. Additionally, the launch time of the kernel function is comparable to that of the corresponding interface, and thus the SDAA runtime system can support the efficient operation of mainstream frameworks and actual model training on the Shenwei AI acceleration card.
    Available online:  March 27, 2024 , DOI: 10.13328/j.cnki.jos.007081
    Abstract:
    Embedded systems are becoming increasingly complex, and the requirements analysis of their software systems has become a bottleneck in embedded system development. Device dependency and interleaving execution logic are typical characteristics of embedded software systems, necessitating effective requirement analysis methods to decouple the requirements based on device dependencies. Starting from the idea of environment-based modeling in requirement engineering, this study proposes a projection-based requirement analysis approach from system requirements to software requirements for embedded software systems, helping requirement engineers to effectively decouple the requirements. The study first summarizes the system requirement and software requirement descriptions of embedded software systems, defines the requirement decoupling strategies of embedded software systems based on interactive environment characteristics, and designs the specification process from system requirements to software requirements. A real case study is carried out in the spacecraft sun search system, and five representative case scenarios are quantitatively evaluated through two metrics of coupling and cohesion, which demonstrate the effectiveness of the proposed approach.
    Available online:  March 27, 2024 , DOI: 10.13328/j.cnki.jos.007090
    Abstract:
    Elephant flow identification is a fundamental task in network measurement. Currently, mainstream methods generally employ the sketch data structure to quickly count network traffic and efficiently find elephant flows. However, the rapid influx of numerous packets significantly decreases the identification accuracy of elephant flows under network traffic jitters. To this end, this study proposes an elastic identification method for elephant flows that supports network traffic jitters, named RobustSketch. The method first designs a stretchable mice flow filter based on a cyclic sketch chain, which adaptively increases and reduces the number of sketches according to the real-time packet arrival rate. As a result, it always completely records all arrived packets within the current period to ensure accurate mice flow filtering even under network traffic jitters. Subsequently, the study designs a scalable elephant flow record table based on dynamic segmented hashing, which adaptively increases and reduces segments according to the number of candidate elephant flows filtered out by the mice flow filter. Thus, it can fully record all candidate elephant flows and keep high storage space utilization. Furthermore, the error bounds of the proposed mice flow filter and elephant flow record table are provided by theoretical analysis. Finally, an experimental evaluation is conducted on the proposed elephant flow identification method RobustSketch with real network traffic samples. Experimental results indicate that the identification accuracy of the proposed method is significantly higher than that of existing methods, and it stably keeps an accuracy of over 99% even under network traffic jitters. Meanwhile, its average relative error is reduced by a factor of more than 2.7, which enhances the accuracy and robustness of elephant flow identification.
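For readers unfamiliar with sketch-based counting, the following hedged sketch shows a plain Count-Min structure used to screen candidate elephant flows by estimated packet count; it is a generic illustration, not RobustSketch's cyclic sketch chain or segmented hash table, and the width, depth, and threshold are hypothetical.

```python
import hashlib

class CountMinSketch:
    """Minimal Count-Min sketch: approximate per-flow packet counters."""
    def __init__(self, width=1024, depth=4):
        self.width, self.depth = width, depth
        self.tables = [[0] * width for _ in range(depth)]

    def _index(self, flow_id, row):
        h = hashlib.blake2b(f"{row}:{flow_id}".encode(), digest_size=8)
        return int.from_bytes(h.digest(), "big") % self.width

    def add(self, flow_id, count=1):
        for row in range(self.depth):
            self.tables[row][self._index(flow_id, row)] += count

    def estimate(self, flow_id):
        return min(self.tables[row][self._index(flow_id, row)]
                   for row in range(self.depth))

# Toy usage: flag flows whose estimated packet count exceeds a threshold
cms = CountMinSketch()
packets = ["flowA"] * 500 + ["flowB"] * 3 + ["flowC"] * 7
for p in packets:
    cms.add(p)
elephants = [f for f in ("flowA", "flowB", "flowC") if cms.estimate(f) >= 100]
print(elephants)  # only flowA should qualify
```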
    Available online:  March 27, 2024 , DOI: 10.13328/j.cnki.jos.007091
    Abstract:
    Internet service providers employ routing protection algorithms to meet the needs of real-time, low-latency, and high-availability applications. However, existing routing protection algorithms have the following three problems. (1) The failure protection ratio is generally low under the premise of not changing the traditional routing protocol forwarding mechanism. (2) Pursuing a high failure protection ratio requires changing the traditional routing protocol forwarding mechanism, which is difficult to deploy in practice. (3) The optimal next hop and the backup next hop cannot be utilized simultaneously, which causes poor network load balancing capability. To address these three problems, this study proposes a routing protection algorithm based on the shortest path serialization graph, which does not need to change the forwarding mechanism, supports incremental deployment, adopts both the optimal next hop and the backup next hop without routing loops, and achieves a high failure protection ratio. The proposed algorithm mainly includes the following two steps. (1) A sequence number is calculated for each node. (2) The shortest path serialization graph is generated based on the node sequence numbers and reverse-order search rules, and the next hop set between node pairs is calculated according to the backup next hop calculation rules. Tests on real and simulated network topologies show that the proposed scheme has significant advantages over other routing protection schemes in the average number of backup next hops, failure protection ratio, and path stretch.
    Available online:  March 20, 2024 , DOI: 10.13328/j.cnki.jos.007095
    Abstract:
    In recent years, with the popularity of cloud services, more and more enterprises and individuals have stored their data in cloud databases. However, enjoying the convenience of cloud services also brings data security issues. One of the crucial problems is data confidentiality protection, that is, safeguarding users’ sensitive data from being spied on or leaked. Fully encrypted databases have emerged to face this challenge. Compared with traditional databases, fully encrypted databases can encrypt data throughout the entire lifecycle of data transmission, storage, and computation, thereby ensuring data confidentiality. Currently, there are still many challenges in encrypting data while supporting all SQL functionalities and maintaining high performance. This study comprehensively investigates the key techniques of encrypted computing in fully encrypted databases, summarizes the techniques according to their types, and compares and summarizes them in terms of functionality, security, and performance. Firstly, it introduces the architectures of fully encrypted databases, including the crypto-based architecture, the trusted execution environment (TEE)-based architecture, and the hybrid architecture. Then, the key techniques of each architecture are summarized. Finally, the challenges and opportunities of current research are discussed, with some open problems provided for future research.
    Available online:  March 20, 2024 , DOI: 10.13328/j.cnki.jos.007085
    Abstract:
    As a new type of distributed machine learning paradigm, federated learning makes full use of the computing power of many distributed clients and their local data to jointly train a machine learning model under the premise of meeting user privacy and data confidentiality requirements. In cross-device federated learning scenarios, the clients usually consist of thousands or even tens of thousands of mobile or terminal devices. Due to the limitations of communication and computing costs, the aggregation server selects only a few clients for training during each round. Meanwhile, several widely employed federated optimization algorithms adopt a completely random client selection algorithm, which has been proven to have huge optimization space. In recent years, how to efficiently and reliably select a suitable set from massive heterogeneous clients to participate in training and thus optimize the resource consumption and model performance of federated learning protocols has been extensively studied, but there is still no comprehensive investigation of this key issue. Therefore, this study conducts a comprehensive survey of client selection algorithms for cross-device federated learning. Specifically, it provides a formal description of the client selection problem, then gives a classification of selection algorithms, and discusses and analyzes the algorithms one by one. Finally, some future research directions for client selection algorithms are explored.
    Available online:  March 20, 2024 , DOI: 10.13328/j.cnki.jos.007087
    Abstract:
    Financial risk prediction plays an important role in financial market regulation and financial investment, and has become a research hotspot in artificial intelligence and financial technology in recent years. Due to the complex investment, supply, and other relationships among financial event entities, existing research on financial risk prediction often employs various static and dynamic graph structures to model the relationships among financial entities. Meanwhile, graph convolutional networks and other methods are adopted to embed relevant graph structure information into the feature representations of financial entities, which enables the representation of both semantic and structural information related to financial risks. However, previous reviews of financial risk prediction only focus on studies based on static graph structures and ignore the fact that the relationships among entities in financial events change dynamically over time, which reduces the accuracy of risk prediction results. With the development of temporal graph neural networks, more and more studies have begun to pay attention to financial risk prediction based on dynamic graph structures, and a systematic and comprehensive review of these studies will help researchers form a complete understanding of financial risk prediction research. According to the different methods of extracting temporal information from dynamic graphs, this study first reviews three different temporal graph neural network models. Then, based on different graph learning tasks, it introduces the research on financial risk prediction in four areas, including stock price trend risk prediction, loan default risk prediction, fraud transaction risk prediction, and money laundering and tax evasion risk prediction. Finally, the difficulties and challenges facing existing temporal graph neural network models in financial risk prediction are summarized, and potential directions for future research are prospected.
    Available online:  March 20, 2024 , DOI: 10.13328/j.cnki.jos.007093
    Abstract:
    As a research hotspot in artificial intelligence in recent years, knowledge graphs have been applied to many fields in reality. However, with the increasingly diversified application scenarios of knowledge graphs, people have gradually found that static knowledge graphs, which do not change over time, cannot fully adapt to scenarios with high-frequency knowledge updates. To this end, researchers have proposed the concept of temporal knowledge graphs containing temporal information. This study organizes existing temporal knowledge graph representation and reasoning models and constructs a theoretical framework that summarizes them. Then, on this basis, it briefly introduces and analyzes the current research progress of temporal representation reasoning, and predicts future trends to help researchers develop and design better models.
    Available online:  March 13, 2024 , DOI: 10.13328/j.cnki.jos.007082
    Abstract:
    In recent years, service-oriented IoT architectures have received much attention from academia and industry. By encapsulating IoT resources into intelligent IoT services, interconnecting and collaborating these resource-constrained and capacity-evolving IoT services to facilitate IoT applications has become a widely adopted and flexible mechanism. Upon capacity-fluctuating and resource-varying edge devices, IoT services may experience QoS degradations or resource mismatches during their execution, making it difficult for IoT applications to continue and possibly inducing failures. Therefore, quantitative monitoring of IoT services at runtime has become the key to guaranteeing the robustness of IoT applications. Different monitoring mechanisms have been proposed in recent literature, but they lack formal interpretation and suffer from strong domain relevance and empirical subjectivity. Based on formal methods such as signal temporal logic (STL), the problem of IoT service monitoring can be formulated as a temporal logic task to achieve runtime quantitative monitoring. However, STL and its extensions suffer from issues of non-differentiability, loss of soundness, and inapplicability in dynamic environments. Moreover, existing works are inadequate for the monitoring of composite services, with a lack of integrity, linkage, and dynamics. To solve these problems, this study proposes a compositional signal temporal logic (CSTL) to achieve quantitative monitoring of different QoS constraints and time constraints upon intra-, inter-, and composite services. Specifically, CSTL extends an accumulative operator based on positively and negatively biased Riemann sums to emphasize the robust satisfaction of all sub-formulae over their entire time domains and to evaluate qualitative and quantitative constraint satisfaction for IoT service monitoring. Besides, CSTL extends a compositional operator based on constraint types and composite structures, as well as dynamic variables that can vary with the dynamic environment, to effectively monitor QoS variations and temporal violations of composite services. As a result, temporal and QoS constraints upon intra-, inter-, and composite services can be specified by CSTL formulae and formally interpreted with qualitative and quantitative satisfaction at runtime. Extensive evaluations show that the proposed CSTL performs better than baseline techniques in terms of expressiveness, applicability, and robustness.
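To ground the notion of quantitative (robustness) semantics that CSTL builds on, the hedged sketch below evaluates the standard STL robustness of "always" and "eventually" predicates over a discretely sampled QoS signal; CSTL's accumulative and compositional operators are not reproduced here, and the signal values are hypothetical.

```python
def robustness_always(signal, threshold, t_start, t_end):
    """Robustness of G_[t_start, t_end](x >= threshold): worst-case margin in the window."""
    window = signal[t_start:t_end + 1]
    return min(v - threshold for v in window)

def robustness_eventually(signal, threshold, t_start, t_end):
    """Robustness of F_[t_start, t_end](x >= threshold): best-case margin in the window."""
    window = signal[t_start:t_end + 1]
    return max(v - threshold for v in window)

# Toy usage: sampled latency-slack values (ms) of a monitored IoT service
latency_slack = [5.0, 3.2, 1.1, -0.4, 2.5, 4.0]
print(robustness_always(latency_slack, 0.0, 0, 5))      # negative => constraint violated
print(robustness_eventually(latency_slack, 0.0, 0, 5))  # positive => satisfiable at some point
```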
    Available online:  March 13, 2024 , DOI: 10.13328/j.cnki.jos.007089
    Abstract:
    As the core foundation for ensuring network security, cryptography plays a crucial role in data protection, identity verification, encrypted communication, and other aspects. With the rapid popularization of 5G and Internet of Things technology, network security is facing unprecedented challenges, and the demand for cryptographic performance is growing explosively. GPUs can utilize thousands of parallel computing cores to accelerate complex computing problems, which is very suitable for the computationally intensive nature of cryptographic algorithms. Therefore, researchers have extensively explored methods to accelerate various cryptographic algorithms on GPU platforms. Compared with platforms such as CPUs and FPGAs, GPUs have significant performance advantages. This study discusses the classification of various cryptographic algorithms and the GPU platform architecture, and provides a detailed analysis of current research on various ciphers on GPU heterogeneous platforms. Additionally, it summarizes the technical challenges currently confronted by high-performance cryptography on GPU platforms and offers prospects for future technological development. Through in-depth study and summary, this survey provides practitioners in cryptography engineering with comprehensive references on the latest research progress and application practices of GPU-based high-performance cryptography.
    Available online:  March 06, 2024 , DOI: 10.13328/j.cnki.jos.007083
    Abstract:
    Multi-modal medical image fusion provides a more comprehensive and accurate medical image description for medical diagnosis, surgical navigation, and other clinical applications by effectively combining the human tissue structure and lesion information reflected by different modal datasets. This study aims to address the partial spectral degradation, lack of edges and details, and insufficient color reproduction in regions invaded by adherent lesions that affect current fusion methods. It proposes a novel multi-modal medical image fusion method to achieve multi-feature enhancement and color preservation in the multi-scale feature frequency domain decomposition filter domain. This method decomposes the source image into four parts: smoothing, texture, contour, and edge feature layers, each of which employs specific fusion rules, and generates fusion results by image reconstruction. In particular, given the potential feature information contained in the smoothing layer, the study proposes a visual saliency decomposition strategy to explore the energy and partial fiber texture features in a multi-scale and multi-dimensional manner, enhancing the utilization of source image information. In the texture layer, the study introduces a texture enhancement operator to extract details and hierarchical information through spatial structure and information measurement, addressing the difficulty current fusion methods have in distinguishing the invasion status of adherent lesion areas. In addition, due to the lack of a public abdominal dataset, 403 sets of abdominal images are registered in this study for public access and download. Experiments are conducted on the public Atlas dataset and the abdominal dataset, with comparisons against six baseline methods. Compared to the most advanced methods, the results show that the similarity between the fused image and the source image is improved by 22.92%, and the edge retention, spatial frequency, and contrast ratio of fused images are improved by 35.79%, 28.79%, and 32.92%, respectively. In addition, the visual quality and computational efficiency of the proposed method are better than those of other methods.
    Available online:  February 28, 2024 , DOI: 10.13328/j.cnki.jos.007065
    Abstract:
    Graph data is ubiquitous in real-world applications, and graph neural networks (GNNs) have been widely used in graph data analysis. However, the performance of GNNs can be severely impacted by adversarial attacks on graph structures. Existing defense methods against adversarial attacks generally rely on low-rank graph structure reconstruction based on graph community preservation priors. However, existing graph structure adversarial defense methods cannot adaptively seek the true low-rank value for graph structure reconstruction, and low-rank graph structures are semantically mismatched with downstream tasks. To address these problems, this study proposes the over-parameterized graph neural network (OPGNN) method based on the implicit regularization effect of over-parameterization. In addition, it formally proves that this method can adaptively solve the low-rank graph structure problem and also proves that over-parameterized residual links on node deep representations can effectively address semantic mismatch. Experimental results on real datasets demonstrate that the OPGNN method is more robust than existing baseline methods, and the OPGNN framework is notably effective on different graph neural network backbones such as GCN, APPNP, and GPRGNN.
    Available online:  February 05, 2024 , DOI: 10.13328/j.cnki.jos.007064
    Abstract:
    A temporal graph is a type of graph where each edge is associated with a timestamp. A seasonal-bursting subgraph is a dense subgraph characterized by burstiness over multiple time periods, which can be applied to activity discovery and group relationship analysis in social networks. Unfortunately, most previous studies on subgraph mining in temporal networks ignore the seasonal or bursting features of subgraphs. To this end, this study proposes a maximal ($\omega,\theta $)-dense subgraph model to represent a seasonal-bursting subgraph in temporal networks. Specifically, the maximal ($\omega,\theta $)-dense subgraph is a subgraph that accumulates its density at the fastest speed during at least $ \omega $ particular periods of length no less than $ \theta $ on the temporal graph. To compute all seasonal-bursting subgraphs efficiently, the study first models the mining problem as a mixed integer programming problem, which consists of finding the densest subgraph and the maximum burstiness segment. Corresponding solutions are then given for each subproblem. The study further conceives two optimization strategies by exploiting key-core and dynamic programming algorithms to boost performance. The results of experiments show that the proposed model is indeed able to identify many seasonal-bursting subgraphs. The efficiency, scalability, and effectiveness of the proposed algorithms are also verified on five real-life datasets.
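As a simplified stand-in for the maximum burstiness segment sub-problem mentioned above (not the paper's formulation), the sketch below finds the maximum-sum contiguous segment of length at least $\theta$ over a sequence of per-snapshot density gains using a prefix-sum scan; the gain values are hypothetical.

```python
def max_sum_segment_at_least(values, theta):
    """Maximum-sum contiguous segment of length >= theta.
    Returns (best_sum, start, end) with end inclusive."""
    n = len(values)
    prefix = [0.0] * (n + 1)
    for i, v in enumerate(values):
        prefix[i + 1] = prefix[i] + v
    best, best_span = float("-inf"), (0, theta - 1)
    min_prefix, min_idx = prefix[0], 0
    for end in range(theta, n + 1):
        # valid segments ending at index end-1 must start no later than end - theta
        if prefix[end - theta] < min_prefix:
            min_prefix, min_idx = prefix[end - theta], end - theta
        if prefix[end] - min_prefix > best:
            best, best_span = prefix[end] - min_prefix, (min_idx, end - 1)
    return best, best_span[0], best_span[1]

# Toy usage: per-snapshot density gains of a candidate subgraph, window length >= 3
gains = [0.1, -0.2, 0.9, 1.3, -0.1, 0.8, -1.0, 0.2]
print(max_sum_segment_at_least(gains, theta=3))
```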
    Available online:  February 05, 2024 , DOI: 10.13328/j.cnki.jos.007062
    Abstract:
    Spoken language understanding is a key task in task-oriented dialogue systems, mainly composed of two sub-tasks: slot filling and intent detection. Currently, the mainstream method is to jointly model slot filling and intent detection. Although this method has achieved good results in both sub-tasks, there are still issues with error propagation in the interaction process between intent detection and slot filling in joint modeling, as well as incorrect correspondence between multi-intent information and slot information in multi-intent scenarios. In response to these problems, this study proposes a joint model for multi-intent detection and slot filling based on graph attention networks (WISM). WISM establishes a word-level one-to-one mapping relationship between fine-grained intents and slots to correct the incorrect correspondence between multi-intent information and slots. By constructing a word-intent-semantic slot interaction graph and utilizing a fine-grained graph attention network to establish bidirectional connections between the two tasks, error propagation during the interaction process can be reduced. Experimental results on the MixSNIPS and MixATIS datasets show that, compared with the latest existing models, WISM improves semantic accuracy by 2.58% and 3.53%, respectively. The model not only improves accuracy but also verifies the one-to-one correspondence between multiple intents and semantic slots.
    Available online:  February 05, 2024 , DOI: 10.13328/j.cnki.jos.007067
    Abstract:
    The code search method based on deep learning realizes the code search task by calculating the similarity between the representations of the code and the description statement. However, this manner does not consider the real probability distribution of relevance between the code and the description. To solve this problem, this study proposes a code search method based on a generative adversarial game, which combines the correlation between the code and the description in the classical probability model with the feature extraction in the vector space model. The generative adversarial game is then adopted to apply the probability distribution between the code and the description to the alternate training of the generator and discriminator. Meanwhile, the code encoder and the description encoder are optimized, and high-quality code representations and description statement representations are generated for the code search task. Finally, experimental verification is carried out on a public dataset, and the results show that the proposed method improves the Recall@10, MRR@10, and NDCG@10 metrics by 8.4%, 32.5%, and 24.3%, respectively, compared to the DeepCS method.
    Available online:  February 05, 2024 , DOI: 10.13328/j.cnki.jos.007055
    Abstract:
    Detecting aligned double JPEG (joint photographic experts group) compression is a challenging task in digital image forensics. Previous studies have proposed methods that can effectively detect aligned double JPEG compression, but these methods mostly rely on features extracted during the JPEG decompression process. If the aligned double-compressed JPEG image is saved in BMP format, these methods are difficult to apply directly. To address this issue, this study proposes a quantization step estimation method based on dual thresholds, which allows for the acquisition of quantization tables and the extraction of features. Furthermore, the study defines a minimum error based on the unique properties of JPEG compression with a quality factor of 100, and by removing the minimum error from the features, the detection performance of the proposed method is further improved. Finally, the study extracts first-order relative error features based on the convergence properties of the de-quantized JPEG coefficients, which further enhances the detection performance of the proposed method at lower quality factors. Experimental results demonstrate that the proposed method outperforms current state-of-the-art algorithms at different quality factors.
    Available online:  February 05, 2024 , DOI: 10.13328/j.cnki.jos.007045
    Abstract:
    Network congestion control algorithms are the key factor in determining network transport performance. In recent years, the expanding network scale, the growing network bandwidth, and the increasing user requirements for network performance have brought challenges to the design of congestion control algorithms. To adapt to different network environments, many novel design ideas for congestion control algorithms have been proposed recently, which have greatly improved network performance and user experience. This study reviews innovative congestion control algorithm design ideas and classifies them into four major categories: reservation scheduling, direct measurement, machine learning-based methods, and iterative detection. It introduces the corresponding representative congestion control algorithms, and further compares and analyzes the advantages and disadvantages of various congestion control ideas and methods. Finally, the study looks forward to future development directions of congestion control to inspire research in this field.
    Available online:  February 05, 2024 , DOI: 10.13328/j.cnki.jos.007031
    Abstract:
    The utilization range of Internet of Things (IoT) devices is expanding. Model checking is an effective approach to improving the reliability and security of such devices. However, commonly adopted model checking methods cannot adequately describe the cross-space movement and communication behavior common in such devices. To this end, this study proposes a modeling and verification method for the mobile and communication behavior of IoT devices to verify their spatio-temporal properties. Push/pull actions and a global communication mechanism are integrated into ambient calculus to propose the ambient calculus with global communication (ACGC), and a model checking algorithm for ACGC against the ambient logic is provided. Then, the modeling language for mobility and communication (MLMC) is put forward to describe the mobile and communication behavior of IoT devices, and a method to convert an MLMC-based description into an ACGC model is given. Furthermore, a model checking tool, ACGCCk, is implemented to verify whether the properties of IoT devices are satisfied, and some optimizations are conducted to accelerate the checking. Finally, the effectiveness of the proposed method is demonstrated through case studies and experimental analysis.
    Available online:  January 31, 2024 , DOI: 10.13328/j.cnki.jos.007056
    Abstract:
    Time series forecasting models have been widely used in various domains of daily life, and attacks against these models are related to the security of data in applications. At present, adversarial attacks on time series mostly perform large-scale perturbation at the global level, which makes the adversarial samples easy to perceive. At the same time, the effectiveness of adversarial attacks decreases significantly as the perturbation magnitude shrinks. Therefore, how to generate imperceptible adversarial samples while maintaining competitive attack performance is an urgent problem to be solved in the field of adversarial attacks on time series forecasting. This study first proposes a local perturbation strategy based on sliding windows to narrow the perturbation interval of the adversarial sample. Second, it employs the differential evolution algorithm to find the optimal attack points and combines a segmentation function to partition the perturbation interval, further reducing the perturbation range and completing the semi-white-box attack. Comparison experiments with existing adversarial attack methods on several different deep learning models show that the proposed method can generate less perceptible adversarial samples and effectively change the prediction trend of the model. The proposed method achieves sound attack results in four challenging tasks, namely stock trading, electricity consumption, sunspot observation, and temperature prediction.
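The hedged sketch below illustrates only the sliding-window idea: it perturbs one window at a time and keeps the window whose perturbation changes a black-box forecaster's output the most. It uses a random sign perturbation rather than differential evolution, and the window size, step, budget, and stand-in model are all hypothetical.

```python
import numpy as np

def best_window_perturbation(series, predict, window=8, eps=0.05, stride=4):
    """Scan the series with a sliding window, perturb only that window, and keep
    the perturbed copy that changes the model output the most."""
    base = predict(series)
    best_delta, best_series = -np.inf, series
    for start in range(0, len(series) - window + 1, stride):
        perturbed = series.copy()
        perturbed[start:start + window] += eps * np.sign(
            np.random.default_rng(start).normal(size=window))
        delta = abs(predict(perturbed) - base)
        if delta > best_delta:
            best_delta, best_series = delta, perturbed
    return best_series, best_delta

# Toy usage with a stand-in "forecasting model": mean of the last 16 points
toy_predict = lambda x: float(np.mean(x[-16:]))
series = np.sin(np.linspace(0, 6, 64))
adv, delta = best_window_perturbation(series, toy_predict)
print(round(delta, 4))
```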
    Available online:  January 31, 2024 , DOI: 10.13328/j.cnki.jos.007052
    Abstract:
    The performance of image classification algorithms is limited by the diversity of visual information and the influence of background noise. Existing works usually apply cross-modal constraints or heterogeneous feature alignment algorithms to learn visual representations with strong discriminative power. However, the difference in feature distribution caused by modal heterogeneity limits the effective learning of visual representations. To address this problem, this study proposes an image classification framework (CMIF) based on cross-modal semantic information inference and fusion and introduces the semantic description of images and statistical knowledge as privileged information. The study uses the privileged information learning paradigm to guide the mapping of image features from the visual space to the semantic space in the training stage, and a class-aware information selection (CIS) algorithm is proposed to learn the cross-modal enhanced representation of images. In view of the heterogeneous feature differences in representation learning, the partial heterogeneous alignment (PHA) algorithm is used to achieve cross-modal alignment of visual features and semantic features extracted from privileged information. In order to further suppress the interference caused by visual noise in the semantic space, the CIS algorithm based on graph fusion is used to reconstruct the key information in the semantic representation, so as to form an effective supplement to the visual prediction information. Experiments on the cross-modal classification datasets VireoFood-172 and NUS-WIDE show that CMIF can learn robust semantic features of images, and as a general framework it achieves stable performance improvements on both the convolution-based ResNet-50 and the Transformer-based ViT image classification models.
    Available online:  January 31, 2024 , DOI: 10.13328/j.cnki.jos.007053
    Abstract:
    Freehand sketches can intuitively present users’ creative intention with simple lines and enable users to express their thinking process and design inspiration or produce target images or videos. With the development of deep learning methods, sketch-based visual content generation performs cross-domain feature mapping by learning the feature distribution between sketches and visual objects (images and videos), enabling the automated generation of sketches from images and the automated generation of images or videos from sketches. Compared with traditional manual creation, it effectively improves the efficiency and diversity of generation, and has become one of the most important research directions in computer vision and graphics, playing an important role in design, visual creation, etc. Therefore, this study presents an overview of the research progress and future development of deep learning methods for sketch-based visual content generation. The study classifies the existing work into sketch-based image generation and sketch-based video generation according to different visual objects and analyzes the generation models in detail in combination with specific tasks, including cross-domain generation between sketches and visual content, style transfer, and editing of visual content. Then, it summarizes and compares the commonly used datasets and points out sketch propagation methods for addressing insufficient sketch data as well as evaluation methods for generative models. Furthermore, the study discusses research trends based on the challenges faced by sketches in visual content generation applications and the future development directions of generative models.
    Available online:  January 31, 2024 , DOI: 10.13328/j.cnki.jos.007063
    Abstract:
    With the development of Internet information technology, large-scale graphs have widely emerged in social networks, computer networks, and biological information networks. In view of the storage and performance limitations of traditional graph data management technology when dealing with large-scale graphs, distributed management technology has become a hotspot in industry and academia. Core decomposition computes the core number of each vertex in a graph and plays a key role in many applications, including community search, protein structure analysis, and network structure visualization. Existing distributed core decomposition algorithms apply a broadcast message delivery mechanism based on the vertex-centric mode, which may generate a large amount of redundant communication and computation overhead and lead to memory overflow when processing large-scale graphs. To address these issues, this study proposes novel distributed core decomposition algorithms based on the global activation and peeling calculation frameworks, respectively. In addition, several strategies are designed to improve algorithm performance. Based on the locality of vertex core numbers, the study proposes a new message-pruning strategy and a new worker-centric computing mode, thereby improving the efficiency of the algorithms. To verify these strategies, this study deploys the proposed models and algorithms on the distributed cluster of the National Supercomputing Center in Changsha, and the effectiveness and efficiency of the proposed methods are evaluated through a large number of experiments on real and synthetic datasets. The total time performance of the algorithms is improved by 37% to 98%.
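For reference, here is a minimal sequential sketch of core decomposition by peeling (the classic idea that the distributed algorithms above parallelize); it is an illustration of core numbers, not the paper's distributed implementation.

```python
from collections import defaultdict

def core_numbers(edges):
    """Core number of every vertex by repeatedly removing a vertex of minimum remaining degree."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    degree = {v: len(nbrs) for v, nbrs in adj.items()}
    remaining = set(adj)
    core, k = {}, 0
    while remaining:
        v = min(remaining, key=degree.get)   # vertex of minimum remaining degree
        k = max(k, degree[v])                # core numbers never decrease in peel order
        core[v] = k
        remaining.remove(v)
        for u in adj[v]:
            if u in remaining:
                degree[u] -= 1
    return core

# Toy usage: a triangle attached to a pendant vertex
print(core_numbers([(1, 2), (2, 3), (1, 3), (3, 4)]))
# expected: vertices 1-3 have core number 2, vertex 4 has core number 1
```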
    Available online:  January 31, 2024 , DOI: 10.13328/j.cnki.jos.007066
    Abstract:
    Raft is one of the most popular distributed consensus protocols. Since it was proposed in 2014, Raft and its variants have been widely used in different kinds of distributed systems. To prove the correctness of the Raft protocol, developers use the TLA+ formal specification to model and verify its design. However, due to the gap between the abstract formal specification and practical implementation, distributed systems that implement the Raft protocol can still violate the protocol design and introduce intricate bugs. This study proposes a novel testing technique based on TLA+ formal specification to unearth bugs in Raft implementations. To be specific, the study maps the formal specification to the corresponding system implementation and then uses the specification-defined state space to guide the testing in the implementations. To evaluate the feasibility and effectiveness of the proposed approach, the study applies it on two different Raft implementations and finds 3 previously unknown bugs.
    Available online:  January 24, 2024 , DOI: 10.13328/j.cnki.jos.007051
    Abstract:
    A heterogeneous graph is a graph with multiple types of nodes and edges, also known as a heterogeneous information network, which is often used to model systems with rich features and association patterns in the real world. Link prediction between heterogeneous nodes is a fundamental task in network analysis. In recent years, the development of heterogeneous graph neural networks (HGNNs) has greatly advanced the task of link prediction, which is usually regarded as a feature similarity analysis between nodes or a binary classification problem based on paired node features. However, when learning node feature representations, existing HGNNs usually only focus on the associations between adjacent nodes or the meta-path-based structural information. This not only makes it difficult for these HGNNs to capture the semantic information of the ring structures inherent in heterogeneous graphs but also ignores the complementarity of structural information at different levels. To solve the above issues, this study proposes a cascade graph convolution network based on multi-level graph structures (CGCN-MGS), which is composed of graph neural networks based on three graph structures of different levels: neighboring, meta-path, and ring structures. CGCN-MGS can mine rich and complementary information from multi-level features and improve the ability of the learned node features to represent the semantic and structural information of nodes. Experimental results on several benchmark datasets show that CGCN-MGS can achieve state-of-the-art performance on the link prediction of heterogeneous graphs.
    Available online:  January 24, 2024 , DOI: 10.13328/j.cnki.jos.007042
    Abstract:
    In recent years, machine learning has remained a research hotspot and has been applied to various fields, where it plays an important role. However, as the amount of data continues to increase, the training time of machine learning algorithms is getting longer. Meanwhile, quantum computers demonstrate powerful computing ability. Therefore, researchers have tried to use quantum computing to solve the problem of long machine learning training time, which has led to the emergence of quantum machine learning. Quantum machine learning algorithms have been proposed, including quantum principal component analysis, quantum support vector machine, and quantum deep learning. Additionally, experiments have proven that quantum machine learning algorithms have a significant acceleration effect, leading to a gradual upward trend in research on quantum machine learning. This study reviews research on quantum machine learning algorithms. First, the fundamental concepts of quantum computing are introduced. Then, five categories of quantum machine learning algorithms are presented: quantum supervised learning, quantum unsupervised learning, quantum semi-supervised learning, quantum reinforcement learning, and quantum deep learning. Next, related applications of quantum machine learning are demonstrated, with algorithm experiments provided. Finally, a summary is given and prospects for future research are discussed.
    Available online:  January 24, 2024 , DOI: 10.13328/j.cnki.jos.007080
    Abstract:
    Malware detection is a hotspot of cyberspace security research, covering areas such as Windows malware detection and Android malware detection. With the development of machine learning and deep learning, some outstanding algorithms in the fields of image recognition and natural language processing have been applied to malware detection. These algorithms have shown excellent learning performance with a large amount of data. However, there are some challenging problems in malware detection that have not been solved effectively. For instance, conventional learning methods cannot achieve effective detection based on only a few novel malware samples. Therefore, few-shot learning (FSL) is adopted to solve the few-shot malware detection (FSMD) problem. This study extracts the problem definition and the general process of FSMD from the related research. According to the principle of the methods, FSMD methods are divided into methods based on data augmentation, methods based on meta-learning, and hybrid methods combining multiple techniques. Then, the study discusses the characteristics of each FSMD method. Finally, the background, technology, and application prospects of FSMD are discussed.
    Available online:  January 24, 2024 , DOI: 10.13328/j.cnki.jos.007068
    Abstract:
    Currently, most of the published image steganalysis methods are designed for grayscale images, which cannot effectively detect color images widely used in social media. To solve this problem, this study proposes a color image steganalysis method based on central difference convolution and attention enhancement. The proposed method first designs a backbone flow consisting of three stages: preprocessing, feature extraction, and feature classification. In the preprocessing stage, the input color image is color channel-separated, and the residual images after SRM filtering are concatenated through each channel. In the feature extraction stage, the study constructs three convolutional blocks based on central difference convolution to extract deeper steganalysis feature maps. In the classification stage, the study uses global covariance pooling and two fully connected layers with dropout operation to classify the cover and stego images. Additionally, to further enhance the feature expression ability of the backbone flow at different stages, it introduces a residual spatial attention enhancement module and a channel attention enhancement module at the early and late stages of the backbone flow, respectively. Specifically, the residual spatial attention enhancement module first uses Gabor filter kernels to perform channel-separated convolution on the input image and then obtains the effective information of the residual feature map through the spatial attention mechanism. The channel attention enhancement module enhances the final feature classification ability of the model by obtaining the dependence relationship between channels. A large number of comparative experiments have been conducted, and the results show that the proposed method can significantly improve the detection performance of color image steganography and achieve the best results currently. In addition, the study also conducts corresponding ablation experiments to verify the rationality of the proposed network architecture.
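    A small sketch of central difference convolution, which the abstract above builds its feature-extraction blocks on, using the formulation commonly found in the literature (output = vanilla convolution minus a theta-weighted central-difference correction); the single-channel input, valid padding, and the value of theta are illustrative assumptions.

```python
import numpy as np

def central_difference_conv2d(x, kernel, theta=0.7):
    """Central difference convolution on a single-channel image (valid padding).

    Common formulation: theta * sum_k w_k*(x_k - x_center) + (1-theta) * sum_k w_k*x_k,
    which simplifies to   sum_k w_k*x_k - theta * x_center * sum_k w_k.
    """
    kh, kw = kernel.shape
    H, W = x.shape
    out = np.zeros((H - kh + 1, W - kw + 1))
    ksum = kernel.sum()
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            patch = x[i:i + kh, j:j + kw]
            vanilla = (patch * kernel).sum()          # ordinary convolution term
            center = patch[kh // 2, kw // 2]          # central pixel of the receptive field
            out[i, j] = vanilla - theta * center * ksum
    return out

img = np.arange(25, dtype=float).reshape(5, 5)
k = np.ones((3, 3)) / 9.0
print(central_difference_conv2d(img, k))
```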
    Available online:  January 24, 2024 , DOI: 10.13328/j.cnki.jos.007059
    Abstract:
    As big data and computing power rapidly develop, deep learning has made significant breakthroughs and rapidly become a field with numerous practical application scenarios and active research topics. In response to the growing demand for the development of deep learning tasks, deep learning frameworks have arisen. Acting as an intermediate component between application scenarios and hardware platforms, deep learning frameworks facilitate the development of deep learning applications, enabling users to efficiently construct diverse deep neural network (DNN) models, and deeply adapt to various computing hardware, meeting the computational needs across different computing architectures and environments. Any issues that arise within deep learning frameworks, which serve as the fundamental software in the realm of artificial intelligence, can have severe consequences. Even a single bug in the code can trigger widespread failures within models built upon the framework, thereby posing a serious threat to the safety of deep learning systems. As the first review exclusively focuses on the testing of deep learning frameworks, this study initially introduces the developmental history and basic architectures of deep learning frameworks. Subsequently, by systematically examining 55 academic papers directly related to the testing of deep learning frameworks, the study systematically analyzes and summarizes bug characteristics, key technologies for testing, and methods based on various input forms for testing. The study explores how to combine key technologies to address research problems. Lastly, it summarizes the unresolved difficulties in the testing of deep learning frameworks and provides insights into promising research directions for the future. This study can offer valuable references and guidance to individuals involved in the research field of deep learning framework testing, ultimately promoting the sustained development and maturity of deep learning frameworks.
    Available online:  January 24, 2024 , DOI: 10.13328/j.cnki.jos.007050
    Abstract:
    A directed acyclic graph (DAG)-based blockchain adopts a parallel topology and can significantly improve system performance compared with conventional chain-based blockchains with a serial topology, so it has attracted wide attention from industry. However, the storage model and the consensus protocol of existing DAG-based blockchains are highly coupled, which lacks the flexibility to meet diversified application demands. Furthermore, most DAG-based blockchains also lack flexibility at the consensus protocol level and are limited to probabilistic consensus protocols, which makes it difficult to balance confirmation latency and security and is especially unfriendly to delay-sensitive applications. Therefore, this study presents an elastic DAG-based blockchain, namely ElasticDAG. The core idea is to decouple the storage model and the consensus protocol so that they proceed in parallel and independently, flexibly adapting to diversified applications. To improve system throughput and liveness, an adaptive block confirmation strategy and an epoch-based block ordering algorithm are designed for the storage model. To reduce transaction confirmation latency, a low-latency hybrid consensus protocol for DAG blockchains is designed. Experimental results demonstrate that the ElasticDAG prototype achieves throughput exceeding 11 Mb/s in WAN with confirmation latency of tens of seconds. Compared with OHIE and Haootia, ElasticDAG reduces confirmation latency by a factor of 17 and improves security from 91.04% to 99.999914% while maintaining the same throughput and consensus latency.
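    A hedged sketch of what an epoch-based deterministic ordering of DAG blocks can look like, since the abstract names such an algorithm without detailing it: blocks are linearized by Kahn's topological sort with an (epoch, block id) priority. The data layout, tie-breaking rule, and example values are assumptions for illustration only.

```python
import heapq
from collections import defaultdict

def order_blocks(blocks, parents):
    """Deterministically linearize DAG blocks: respect parent-before-child edges,
    preferring lower epochs and smaller block ids among the currently ready blocks.
    blocks:  dict block_id -> epoch number
    parents: dict block_id -> list of parent block ids
    """
    indeg = {b: 0 for b in blocks}
    children = defaultdict(list)
    for b, ps in parents.items():
        for p in ps:
            children[p].append(b)
            indeg[b] += 1
    heap = [(blocks[b], b) for b in blocks if indeg[b] == 0]
    heapq.heapify(heap)
    order = []
    while heap:
        _, b = heapq.heappop(heap)
        order.append(b)
        for c in children[b]:
            indeg[c] -= 1
            if indeg[c] == 0:
                heapq.heappush(heap, (blocks[c], c))
    return order

blocks = {"g": 0, "a": 1, "b": 1, "c": 2}
parents = {"g": [], "a": ["g"], "b": ["g"], "c": ["a", "b"]}
print(order_blocks(blocks, parents))   # ['g', 'a', 'b', 'c']
```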
    Available online:  January 17, 2024 , DOI: 10.13328/j.cnki.jos.007043
    Abstract:
    The graphical user interface (GUI/UI) provides a visual bridge between the application and its end users, and users can use the application through interactive operations. With the development of mobile applications, GUI, which combines aesthetics and interaction design, has become more and more complex, and users are increasingly concerned about the accessibility and availability of applications. However, the complexity of GUI also brings great challenges to its design and implementation. Due to user-defined settings for mobile devices and different device models and screen resolutions, UI display issues frequently occur. For example, due to software or hardware compatibility, when rendering interfaces on different devices, there will always be display issues such as text overlap, component masking, and image loss. They have a negative impact on the availability and accessibility of applications, resulting in poor user experience. Unfortunately, little is known about the causes of UI display issues of mobile applications. In order to cope with this challenge, this study collects 6729 screenshots of applications with UI display issues from Baidu crowdtesting platform and 1016 screenshots of applications provided by issue reports in GitHub and identifies nine types of UI display issues using the theme analysis method. Through the analysis of 1061 UI issue reports from GitHub and the corresponding defective code, the essence and causes of UI display issues are summarized. The research found that (1) 62.1% of the total screenshots in crowdtesting dataset are defective screenshots displayed on the UI; (2) the reason for the UI display issues is that the font scaling setting does not match the adaptive setting of components to a great extent; (3) the layout setting of the interface will lead to display issues; (4) If the hardware acceleration is not turned on, the normal display of the interface will be affected.
    Available online:  January 17, 2024 , DOI: 10.13328/j.cnki.jos.007049
    Abstract:
    In the white-box attack context, an attacker has access to the implementation of a cryptographic algorithm, can observe its dynamic execution and internal details, and can modify it arbitrarily. In 2002, Chow et al. proposed the concept of white-box cryptography and presented white-box implementations of the AES and DES algorithms based on lookup-table techniques, now known as the CEJO framework. A white-box implementation obfuscates an existing cryptographic algorithm so that the key is protected in software under white-box attacks while the correctness of the algorithm's results is preserved. SIMON is a lightweight block cipher that is widely used in Internet of Things devices because of its excellent software and hardware performance, so studying its white-box implementation is of practical significance. This study presents two white-box implementations of the SIMON algorithm. The first scheme (SIMON-CEJO) uses the classical CEJO framework and protects the lookup tables with network encodings, thereby obfuscating the key; it occupies 369.016 KB of memory. The security analysis shows that SIMON-CEJO can resist the BGE attack and the affine equivalence attack, but it fails to resist differential computation analysis. The second scheme (SIMON-Masking) uses the encoding method proposed by Battistello et al. to encode the plaintext and key information and exploits the homomorphism of the encoding to convert XOR and AND operations into modular multiplications and table lookups; the corresponding ciphertext is obtained by decoding the result. During execution, a Boolean mask is added to the AND operation. The randomness of the encodings protects the real key information and improves the scheme's resistance to differential computation analysis and other attacks. SIMON-Masking occupies 655.81 KB of memory, and the time complexity of second-order differential computation analysis based on the Legendre symbol is $O(n^2k\log_2 p)$. The comparison of the two schemes shows that the classical CEJO framework cannot effectively defend against differential computation analysis, whereas using new encodings and adding masks is an effective white-box implementation approach.
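    For readers unfamiliar with the cipher being protected, a plain (non-white-box) sketch of one SIMON Feistel round from the public specification follows; it is only the reference operation that the CEJO and masking schemes above turn into protected lookup tables, not either white-box construction itself. Word width, key value, and inputs are toy assumptions.

```python
def rotl(x, r, width=32):
    """Left-rotate an unsigned integer of the given bit width."""
    mask = (1 << width) - 1
    return ((x << r) | (x >> (width - r))) & mask

def simon_round(left, right, round_key, width=32):
    """One SIMON round: f(x) = (x <<< 1 & x <<< 8) ^ (x <<< 2),
    new_left = right ^ f(left) ^ k, and the halves are swapped."""
    f = (rotl(left, 1, width) & rotl(left, 8, width)) ^ rotl(left, 2, width)
    new_left = right ^ f ^ round_key
    return new_left, left

# toy values only; real SIMON derives per-round keys from a key schedule
l, r = 0x12345678, 0x9ABCDEF0
print([hex(v) for v in simon_round(l, r, 0x0F0F0F0F)])
```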
    Available online:  January 17, 2024 , DOI: 10.13328/j.cnki.jos.007058
    Abstract:
    Due to the continuous advancements in the field of deep learning, there is growing interest in extending relational databases with collaborative query processing (CQP) techniques to handle advanced analytical queries involving structured and unstructured data. State-of-the-art CQP methods employ user-defined functions (UDFs) to implement deep neural network (NN) models for processing unstructured data while utilizing relational operations for structured data. UDF-based approaches simplify query composition, allowing users to submit analytical queries with a single SQL statement. However, they require manual selection of appropriate and efficient models based on desired performance metrics during ad-hoc data analysis, posing significant challenges to users. To address this issue, this research proposes a CQP technique based on declarative inference functions (DIF), which constructs a complete CQP framework by optimizing model selection, execution strategies, and device bindings across multiple query execution paths. Leveraging the cost model and optimization rules designed in this study, the query processor is capable of estimating the cost of different query plans and automatically selecting the optimal physical query plan. Experimental results on four datasets validate the effectiveness and efficiency of the proposed DIF-based CQP approach.
    Available online:  January 10, 2024 , DOI: 10.13328/j.cnki.jos.007060
    Abstract:
    To address the security problems of users’ private keys, this study proposes a user-oriented and practical private key protection framework that combines secret sharing with the edge computing model. Based on this framework, it designs a private key protection scheme for the SM2 public-key cryptosystem. In this scheme, a user’s SM2 private key is divided into two shares via a secret sharing scheme, kept by the user’s device and the edge server respectively. The public-key cryptographic tasks requested by Web3 applications are executed cooperatively by the user’s device and the edge server without ever reconstructing the original private key. If the user’s device or the edge server is compromised, a key updating protocol is executed between them to refresh the private key shares and discard the share that may have been leaked. Experimental results show that the computing time of the new scheme is acceptable for common devices (smartphones, laptops, etc.) in the real world.
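    A minimal sketch of the generic two-party splitting and share-refresh idea behind such schemes, using 2-of-2 additive sharing modulo the SM2 group order; it is not the paper's SM2-specific cooperative protocol, and the reconstruction shown in the asserts is only to check the arithmetic, never done in a real deployment.

```python
import secrets

# order n of the SM2 recommended curve (public parameter; verify against the standard)
N = 0xFFFFFFFEFFFFFFFFFFFFFFFFFFFFFFFF7203DF6B21C6052B53BBF40939D54123

def split_key(d):
    """Split private key d into two shares held by the device and the edge server."""
    d1 = secrets.randbelow(N - 1) + 1
    d2 = (d - d1) % N
    return d1, d2

def refresh_shares(d1, d2):
    """Key-update step: re-randomize both shares without reconstructing d."""
    delta = secrets.randbelow(N - 1) + 1
    return (d1 + delta) % N, (d2 - delta) % N

d = secrets.randbelow(N - 1) + 1           # original private key
d1, d2 = split_key(d)
assert (d1 + d2) % N == d                  # shares recombine to d (sanity check only)
d1n, d2n = refresh_shares(d1, d2)
assert (d1n + d2n) % N == d                # refreshed shares still represent the same key
print("shares refreshed, key unchanged")
```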
    Available online:  January 10, 2024 , DOI: 10.13328/j.cnki.jos.007048
    Abstract:
    Database management systems (DBMSs) are the infrastructure for efficient storage, management, and analysis of data, playing a pivotal role in modern data-intensive applications. Vulnerabilities in DBMSs pose a great threat to the security of data and the operation of applications. Fuzzing is one of the most popular dynamic vulnerability detection techniques and has been applied to analyze DBMSs, uncovering many vulnerabilities. This study analyzes the requirements and the difficulties involved in testing a DBMS and proposes a foundational framework for DBMS fuzzing. It also analyzes the challenges encountered by DBMS fuzzers and identifies the dimensions that necessitate support. It introduces typical DBMS fuzzers from the perspective of discovering different types of vulnerabilities and summarizes key techniques in DBMS fuzzing, including SQL statement synthesis, code coverage tracking, and test oracle construction. Several popular DBMS fuzzers are evaluated in terms of coverage, syntax and semantic correctness of the generated test cases, and the ability to find vulnerabilities. Finally, it presents the problems faced by current DBMS fuzzing research and practices and prospects for future research directions in DBMS fuzzing.
    Available online:  January 10, 2024 , DOI: 10.13328/j.cnki.jos.007039
    Abstract:
    Graph neural network (GNN) is a deep learning framework for directly characterizing graph-structured data and has attracted increasing attention in recent years. However, traditional GNNs based on message-passing aggregation (MP-GNNs) ignore the smoothing speed of different nodes and aggregate neighbor information indiscriminately, which makes them prone to over-smoothing. Thus, this study proposes KENN, a graph kernel neural network classification method based on linear structural entropy. KENN first adopts the graph kernel method to encode node subgraph structures and determine isomorphism among subgraphs, and then uses the isomorphism coefficient to define smoothing coefficients among different neighbors. Second, it extracts graph structural information based on low-complexity linear structural entropy to deepen and enrich the structural expressiveness of the graph data. By deeply integrating linear structural entropy, graph kernels, and GNNs, the proposed method addresses the sparse node features of biomolecular data and the information redundancy caused by using node degree as features in social network data. It also enables the GNN to adaptively adjust its ability to characterize graph structural features and to go beyond the upper bound of MP-GNNs (the WL test). Finally, experiments on seven public graph classification datasets verify that the proposed model outperforms other benchmark models.
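    To make the structural-entropy ingredient concrete, the sketch below computes the classical one-dimensional structural entropy of a graph from its degree distribution; whether this coincides with the paper's "linear structural entropy" is an assumption, and it is shown only to illustrate the kind of low-complexity quantity involved.

```python
import math
from collections import Counter

def one_dim_structural_entropy(edges):
    """H1(G) = - sum_v (d_v / 2m) * log2(d_v / 2m) for an undirected graph."""
    deg = Counter()
    for u, v in edges:
        deg[u] += 1
        deg[v] += 1
    two_m = sum(deg.values())                      # 2m = sum of degrees
    return -sum((d / two_m) * math.log2(d / two_m) for d in deg.values())

# toy graph: a triangle plus a pendant node
edges = [(0, 1), (1, 2), (0, 2), (2, 3)]
print(round(one_dim_structural_entropy(edges), 4))
```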
    Available online:  January 10, 2024 , DOI: 10.13328/j.cnki.jos.007040
    Abstract:
    Named entity recognition (NER) is a fundamental task in information extraction and aims to locate the boundaries of entities in a sentence and classify them. In response to the fuzzy boundaries of nested entities based on span detection models, this study proposes a nested NER model based on span boundary perception. Firstly, it utilizes a bidirectional affine attention mechanism to capture the semantic relevance among word tokens and then generates a span semantic representation matrix. Secondly, it designs a second-order diagonal neighborhood difference operator and establishes a span semantic difference mechanism to extract semantic difference information among spans. Additionally, a span boundary perception mechanism is introduced to employ the local feature extraction ability of sliding windows to enhance the span boundary semantic differences, thereby accurately locating the entity span. The model is validated on three benchmark datasets of ACE04, ACE05, and Genia. The experimental results show that the proposed model outperforms related work in entity recognition accuracy. Additionally, the study conducts ablation experiments and case studies to verify the effectiveness of the proposed semantic difference mechanism and span boundary perception mechanism, providing new ideas and empirical evidence for further research on NER.
    Available online:  January 10, 2024 , DOI: 10.13328/j.cnki.jos.007037
    Abstract:
    Ubiquitous computing for human-cyber-physical integration is becoming a new requirement and trend in software development. Based on this new computing paradigm, human-cyber-physical applications further extend software technology to the effective utilization of offline resources, including physical devices and human resources. As a typical human-cyber-physical scenario, the collaboration between the device and human resources in the physical world features resource selectivity, high task frequency, and worker dynamics. Traditional resource scheduling techniques cannot meet the scheduling requirements of this task type (referred to as DHRC task). Thus, this study proposes an optimal scheduling method for collaborative tasks between device and human resources. This method includes two stages of device resource scheduling and human resource scheduling. In the device resource scheduling stage, a device resource scheduling algorithm based on NSGA-II is proposed to optimize task resource selection by comprehensively considering such factors as task distance, device load, and the worker number around the device location. In the human resource scheduling stage, a human resource scheduling algorithm based on DPSO is put forward to optimize the worker selection and corresponding path planning according to such factors as worker location and collaboration dependency. Experiments in a simulated environment show that the algorithm in the first stage is equivalent in efficiency and superior in utility to the compared algorithm (discrete particle swarm optimization algorithm). The algorithm in the second stage is superior in efficiency and utility to the compared algorithm (the genetic algorithm improved by the tournament mechanism).
    Available online:  January 03, 2024 , DOI: 10.13328/j.cnki.jos.007044
    Abstract:
    Due to the complex features of multi-view data, multi-view outlier detection has become a very challenging research topic in outlier detection. There are three types of outliers in multi-view data, namely class outliers, attribute outliers, and class-attribute outliers. Most of the early multi-view outlier detection methods are based on the assumption of clustering, which makes it difficult to detect outliers when there is no clustering structure in the data. In recent years, many multi-view outlier detection methods use the multi-view consistent nearest neighbor assumption instead of the clustering assumption, but they still suffer from the problem of inefficient detection of new data. In addition, most existing multi-view outlier detection methods are unsupervised, which are affected by outliers during model learning and do not work well when dealing with datasets with high outlier rates. To address these issues, this study proposes an intra-view reconstruction and cross-view generation network for effective multi-view outlier detection to detect the three types of outliers, which consists of two modules: intra-view reconstruction and cross-view generation. By training with normal data, the proposed method can fully capture the features of each view in the normal data and reconstruct and generate the corresponding views better. In addition, a new outlier calculation method is proposed to calculate the corresponding outlier scores for each sample to efficiently detect new data. Extensive experimental results show that the proposed method significantly outperforms existing methods. It is known that this is the first work to apply a deep model based on generative adversarial networks to multi-view outlier detection.
    Available online:  January 03, 2024 , DOI: 10.13328/j.cnki.jos.007046
    Abstract:
    Smart contracts are computer programs running in the contract layer of the blockchain, which can be used to manage cryptocurrencies and data on the blockchain, realize diverse business logic, and expand the application of the blockchain. A large number of assets are stored in smart contracts, which attract attackers to steal the assets and obtain economic benefits via security vulnerabilities. In recent years, with the frequent occurrence of smart contract security incidents (such as TheDAO and Parity security incidents), the security vulnerability detection technique for smart contracts has become a hot research topic. This study proposes a research framework for detecting security vulnerabilities of smart contracts and analyzes the research progress of existing vulnerability detection techniques from three aspects: vulnerability discovery and identification, vulnerability analysis and detection, and dataset and evaluation indicators. Firstly, the basic process of collecting security vulnerability information is sorted out, and the security vulnerabilities are classified into 13 types according to their basic characteristics. A classification framework for security vulnerabilities of smart contracts is proposed. Secondly, existing techniques are studied in terms of symbolic execution, fuzzing testing, machine learning, formal verification, and static analysis, and the advantages and limitations of each technique are analyzed. Thirdly, the commonly used datasets and evaluation indicators are summarized. Finally, potential research directions for security vulnerability detection of smart contracts in the future are discussed.
    Available online:  January 03, 2024 , DOI: 10.13328/j.cnki.jos.007057
    Abstract:
    Currently, sentiment analysis research is generally based on big data-driven models, which heavily rely on expensive annotation and computational costs. Therefore, research on sentiment analysis in low-resource scenarios is particularly urgent. However, existing research on sentiment analysis in low-resource scenarios mainly focuses on a single task, making it difficult for models to acquire external task knowledge. Therefore, this study constructs successive sentiment analysis in low-resource scenarios, aiming to allow models to learn multiple sentiment analysis tasks over time by continual learning methods. This can make full use of data from different tasks and learn sentiment information from different tasks, thus alleviating the problem of insufficient training data for a single task. There are two core problems with successive sentiment analysis in low-resource scenarios. One is preserving sentiment information for a single task, and the other is fusing sentiment information between different tasks. To solve these two problems, this study proposes continual attention modeling for successive sentiment analysis in low-resource scenarios. Sentiment masked Adapter (SMA) is first constructed, which is used to generate hard attention emotion masks for different tasks. This can preserve sentiment information for different tasks and mitigate catastrophic forgetting. Secondly, dynamic sentiment attention (DSA) is proposed, which dynamically fuses features extracted by different Adapters based on the current time step and task similarity. This can fuse sentiment information between different tasks. Experimental results on multiple datasets show that the proposed approach significantly outperforms the state-of-the-art benchmark approaches. Additionally, experimental analysis indicates that the proposed approach has the best sentiment information retention ability and sentiment information fusion ability compared to other benchmark approaches while maintaining high operational efficiency.
    Available online:  December 27, 2023 , DOI: 10.13328/j.cnki.jos.007032
    Abstract:
    Federated learning has attracted much attention because it can address the problem of data islands. However, it also faces challenges such as the risk of privacy leakage and performance degradation caused by model heterogeneity under non-independent and identically distributed data. To this end, this study proposes FedBDP, a personalized federated learning method based on Bregman divergence and differential privacy. The method employs the Bregman divergence to measure the difference between local and global parameters and adopts it as a regularization term in the loss function, thereby reducing model divergence and improving model accuracy. Meanwhile, adaptive differential privacy technology is used to perturb the local model parameters, and an attenuation coefficient is defined to dynamically adjust the differential privacy noise level in each round, so that the privacy noise is allocated reasonably and model availability is improved. Theoretical analysis shows that FedBDP satisfies convergence conditions for both strongly convex and non-convex smooth functions. Experimental results demonstrate that FedBDP guarantees accuracy on the MNIST and CIFAR10 datasets while satisfying differential privacy.
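    A numpy sketch of the two ingredients just described: a Bregman-divergence regularizer added to the local objective (squared Euclidean distance is used here as the simplest Bregman divergence) and Gaussian perturbation whose scale decays each round via an attenuation coefficient. The coefficient values and parameter names are illustrative assumptions, not the paper's settings.

```python
import numpy as np

def bregman_sq_euclidean(w_local, w_global):
    """Squared Euclidean distance, the Bregman divergence of phi(w) = ||w||^2 / 2."""
    diff = w_local - w_global
    return 0.5 * float(diff @ diff)

def regularized_loss(task_loss, w_local, w_global, mu=0.1):
    """Local objective = task loss + mu * Bregman divergence to the global model."""
    return task_loss + mu * bregman_sq_euclidean(w_local, w_global)

def perturb(w_local, round_idx, sigma0=0.5, decay=0.9, rng=None):
    """Add Gaussian noise whose scale shrinks each round by an attenuation coefficient."""
    rng = rng or np.random.default_rng(0)
    sigma = sigma0 * (decay ** round_idx)
    return w_local + rng.normal(scale=sigma, size=w_local.shape)

w_g = np.zeros(4)
w_l = np.array([0.2, -0.1, 0.05, 0.3])
print("objective:", regularized_loss(task_loss=0.42, w_local=w_l, w_global=w_g))
print("uploaded model (round 3):", perturb(w_l, round_idx=3))
```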
    Available online:  December 27, 2023 , DOI: 10.13328/j.cnki.jos.007038
    Abstract:
    Penetration testing is an important means of discovering the weaknesses of critical network information systems and protecting network security. Traditional penetration testing relies heavily on manual labor and demands a high level of expertise from testers, which limits the depth and breadth of its adoption. By introducing artificial intelligence technology into the whole penetration testing process, automated penetration testing lowers the technical threshold and greatly reduces the heavy dependence on manual labor. Automated penetration testing can be broadly divided into model-based and rule-based approaches, and the two lines of research have different focuses: the former uses model algorithms to simulate hacker attacks, with attention paid to attack scene perception and attack decision-making models, while the latter concentrates on how to efficiently adapt attack rules to attack scenarios. This study analyzes the implementation principles of automated penetration testing from three aspects: attack scenario modeling, penetration testing modeling, and decision-making reasoning models. Finally, the future development directions of automated penetration testing are explored from the dimensions of attack-defense confrontation and combined vulnerability exploitation.
    Available online:  December 27, 2023 , DOI: 10.13328/j.cnki.jos.007036
    Abstract:
    Partitioned deadline-monotonic (DM) scheduling of sporadic real-time tasks is a classic research problem. This study proposes PDM-FFD (partitioned deadline-monotonic first-fit decrease), a partitioned scheduling algorithm with higher processor utilization for constrained-deadline sporadic tasks. In PDM-FFD, tasks are first sorted in non-decreasing order of relative deadline, the first-fit strategy is then used to select the processor core for each task, and each core adopts the DM scheduling policy. Finally, a tighter schedulability test is obtained by analyzing the task interference time. This study proves that the speedup factor of PDM-FFD is $3 - (3\Delta + 1)/(m + \Delta)$ and that its time complexity is $O(n^2) + O(nm)$, where $\Delta = \sum_{\tau_j \in \tau} C_j \times u_j / D_{\max}$, $\tau_j$ belongs to the task set $\tau$, $C_j$ is the worst-case execution time, $u_j$ is the utilization, $D_{\max}$ is the maximum relative deadline, $n$ is the number of tasks, and $m$ is the number of processor cores. The speedup factor of PDM-FFD is strictly less than $3 - 1/m$, which outperforms the existing multi-core partitioned scheduling algorithm FBB-FFD. Experiments show that PDM-FFD improves processor utilization by 18.5% compared with other available algorithms on a four-core processor, and its performance improves as the number of processor cores, task set utilization, and number of tasks increase. Owing to its high performance, PDM-FFD can be widely applied in typical real-time systems such as resource-constrained spacecraft, autonomous vehicles, and industrial robots.
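    A sketch of the overall shape of the algorithm as the abstract describes it: sort by relative deadline, assign tasks to cores first-fit, and accept a core only if it remains DM-schedulable. The schedulability check below is the standard exact response-time analysis, not the paper's tighter interference-based test; tasks are (C, D, T) tuples and the example task set is made up.

```python
import math

def dm_schedulable(tasks):
    """Response-time analysis under deadline-monotonic priorities.
    tasks: list of (C, D, T) with constrained deadlines D <= T; smaller D = higher priority."""
    tasks = sorted(tasks, key=lambda t: t[1])
    for i, (C, D, T) in enumerate(tasks):
        R = C
        while True:
            R_next = C + sum(math.ceil(R / Tj) * Cj for Cj, Dj, Tj in tasks[:i])
            if R_next == R:
                break
            R = R_next
            if R > D:
                return False
        if R > D:
            return False
    return True

def pdm_ffd(tasks, m):
    """Sort by relative deadline (non-decreasing), then first-fit tasks onto m cores."""
    cores = [[] for _ in range(m)]
    for task in sorted(tasks, key=lambda t: t[1]):
        for core in cores:
            if dm_schedulable(core + [task]):
                core.append(task)
                break
        else:
            return None   # task could not be placed on any core
    return cores

taskset = [(1, 4, 5), (2, 6, 8), (3, 9, 12), (2, 5, 10)]
print(pdm_ffd(taskset, m=2))
```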
    Available online:  December 20, 2023 , DOI: 10.13328/j.cnki.jos.007047
    Abstract:
    As the scale of open-source artificial intelligence (AI) systems expands, software development and maintenance become difficult. GitHub is one of the most important hosting platforms for open-source projects in the open-source community. Developers can easily participate in the development of open-source projects through pull request systems provided by GitHub. The description of pull requests can help the core teams of the project understand the content of the pull requests and the intention of the developers and promote the acceptance of the pull request. At present, a considerable proportion of developers do not provide a description for the pull request, which not only increases the workload of the core team but also is not conducive to the maintenance of the project in the future. This study proposes a method named PRSim to automatically generate descriptions for pull requests. This method extracts features including commit messages, comment updates, and code changes from pull requests, builds a syntax modification tree, and uses a tree-structured autoencoder to find other pull requests with similar code changes. Then, with the help of the description of a similar pull request, it summarizes commit messages and comment updates through an encoder-decoder network to generate the description of a new pull request. The experimental results show that the generation effect of PRSim reaches 36.47%, 27.69%, and 35.37% in terms of the F1 score of metrics Rouge-1, Rouge-2, and Rouge-L, respectively, which is 34.3%, 75.2%, and 55.3% higher than LeadCM, 16.2%, 22.9%, and 16.8% higher than Attn+PG+RL, and 23.5%, 72.0%, and 24.8% higher than PRHAN.
    Available online:  December 06, 2023 , DOI: 10.13328/j.cnki.jos.007033
    Abstract:
    In natural scenes, logos such as trademarks and traffic signs are susceptible to shooting angle, carrier deformation, and scale changes, which reduces logo detection accuracy. Thus, this study proposes an attention guided logo detection and recognition network (AGLDN) to jointly optimize the model robustness for multi-scale and complex deformation. First, a logo synthesis dataset is established by image collection and mask generation of logo templates, image selection of logo background, and logo image generation. Then, based on RetinaNet and FPN, multi-scale features are extracted and high-level semantic feature mapping is formed. Finally, the attention mechanism guided network is employed to focus on the logo area, and the influence of logo deformation on feature robustness is suppressed to improve logo detection and recognition. Experimental results show that the proposed method can reduce the influence of scale changes and non-rigid deformation, and improve detection accuracy.
    Available online:  December 06, 2023 , DOI: 10.13328/j.cnki.jos.007034
    Abstract:
    Free from state-space limitations, formal verification based on mechanized theorem proving is an important method for ensuring software correctness and avoiding the serious losses that latent software bugs may cause. Left-leaning red-black trees (LLRB) are a variant of binary search trees whose structure imposes an additional left-leaning constraint on traditional red-black trees. During verification, conventional proof strategies cannot be employed directly, which requires more manual intervention and effort, so the correctness verification of LLRB is widely acknowledged as a challenging problem. To this end, based on the Isabelle verification framework for binary search tree algorithms, this study refines the additional-property part of the framework and provides a concrete verification scheme. The LLRB insertion and deletion operations are modeled functionally in Isabelle, with the LLRB invariants treated modularly, and the functional correctness of the operations is then verified. This is the first mechanized verification of functional LLRB insertion and deletion algorithms in Isabelle. Compared with the existing Dafny verification of the LLRB algorithm, the number of theorems is reduced from 158 to 84, and no intermediate assertions need to be constructed, which alleviates the verification burden. Meanwhile, this study provides a reference for the functional modeling and verification of other complex tree-structured algorithms.
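    To make the verified invariants tangible, the sketch below checks, in plain Python rather than Isabelle, the three structural properties an LLRB must maintain (no right-leaning red link, no two consecutive red links, equal black height); it is an executable illustration of what the proof obligations state, not the paper's functional model.

```python
from dataclasses import dataclass
from typing import Optional

RED, BLACK = "R", "B"

@dataclass
class Node:
    key: int
    color: str
    left: Optional["Node"] = None
    right: Optional["Node"] = None

def is_red(n):
    return n is not None and n.color == RED

def llrb_invariant(n):
    """Return the black height if the subtree satisfies the LLRB invariants, else None."""
    if n is None:
        return 1                                # empty trees count as black leaves
    if is_red(n.right):                         # left-leaning: red links only on the left
        return None
    if is_red(n) and is_red(n.left):            # no two consecutive red links
        return None
    lh, rh = llrb_invariant(n.left), llrb_invariant(n.right)
    if lh is None or rh is None or lh != rh:    # equal black height on both sides
        return None
    return lh + (0 if is_red(n) else 1)

ok = Node(2, BLACK, left=Node(1, RED))
bad = Node(1, BLACK, right=Node(2, RED))         # red link leaning right
print(llrb_invariant(ok), llrb_invariant(bad))   # 2 None
```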
    Available online:  December 06, 2023 , DOI: 10.13328/j.cnki.jos.007035
    Abstract:
    Detecting latent topics in social media texts is a meaningful task, and the short and informal posts will cause serious data sparsity. Additionally, models based on variational auto-encoders (VAEs) ignore the social relationships among users during topic inference and VAE assumes that each input data point is independent. This results in the lack of correlation information between the inferred latent topic variables and incoherent topics. Social network structure information can not only provide clues for aggregating contextual messages but also indicate topic correlation among users. Therefore, this study proposes to utilize the microblog topic model based on message passing and graph prior distribution. This model can encode richer context information by graph convolution network (GCN) and integrate the interactive relationship of users by graph prior distribution during VAE topic inference to better understand the complex correlation among multiple data points and mine social media topic information. The experiments on three actual datasets validate the effectiveness of the proposed model.
    Available online:  November 29, 2023 , DOI: 10.13328/j.cnki.jos.007003
    Abstract:
    In the field of cyber security, the deceptive domains generated by domain generation algorithms (DGAs) are called DGA domains. Like real domains, they are usually random combinations of characters or digits, which makes DGA domains highly camouflaged. Hackers take advantage of this disguised nature to carry out cyber attacks and bypass security detection, so effective DGA domain detection has become a research hotspot. Traditional statistical machine learning detection methods require manually constructed domain feature sets, but the quality of features constructed manually or semi-automatically varies, which affects detection accuracy. In view of the powerful automatic feature extraction and representation capability of deep neural networks, this study proposes MCL4DGA, a DGA domain detection method based on multi-view contrastive learning. Unlike existing methods, it incorporates attentional neural networks, convolutional neural networks, and recurrent neural networks to effectively capture global, local, and bidirectional multi-view feature dependencies in domain sequences. Besides, the self-supervision signals derived from contrastive learning enhance the expressiveness of the multi-view feature encoders and thus improve detection accuracy. The effectiveness of the proposed method is verified by experimental comparison with current methods on a real dataset.
    Available online:  November 29, 2023 , DOI: 10.13328/j.cnki.jos.007005
    Abstract:
    Nowadays, deep neural network (DNN) is widely used in autonomous driving, medical diagnosis, speech recognition, face recognition, and other safety-critical fields. Therefore, DNN testing is critical to ensure the quality of DNN. However, labeling test cases to judge whether the DNN model predictions are correct is costly. Therefore, selecting test cases that reveal incorrect behavior of DNN models and labeling them earlier can help developers debug DNN models as soon as possible, thus improving the efficiency of DNN testing and ensuring the quality of DNN models. This study proposes a test case selection method based on data mutation, namely DMS. In this method, a data mutation operator is designed and implemented to generate a mutation model to simulate model defects and capture the dynamic pattern of test case bug-revealing, so as to evaluate the ability of test case bug-revealing. Experiments are conducted on the combination of 25 deep learning test sets and models. The results show that DMS is significantly better than the existing test case selection methods in terms of both the proportion of bug-revealing and the diversity of bug-revealing directions in the selected samples. Specifically, taking the original test set as the candidate set, DMS can filter out 53.85%–99.22% of all bug-revealing test cases when selecting 10% of the test cases. Moreover, when 5% of the test cases are selected, the selected cases by DMS can cover almost all bug-revealing directions. Compared with the eight comparison methods, DMS finds 12.38%–71.81% more bug-revealing cases on average, which proves the significant effectiveness of DMS in the task of test case selection.
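    A hedged sketch of the selection step described above: given an original model and a set of mutant models (simulated here as simple callables), each test input is scored by how often the mutants disagree with the original prediction, and the highest-scoring inputs are selected for labeling. The mutation operator itself is the paper's contribution and is not reproduced; the toy models and data are assumptions.

```python
import numpy as np

def bug_revealing_scores(original_predict, mutant_predicts, inputs):
    """Score each input by the fraction of mutant models whose prediction
    deviates from the original model's prediction (a killing-style score)."""
    base = original_predict(inputs)
    disagreements = np.zeros(len(inputs))
    for predict in mutant_predicts:
        disagreements += (predict(inputs) != base)
    return disagreements / len(mutant_predicts)

def select_top_k(scores, k):
    """Indices of the k inputs most likely to reveal bugs."""
    return np.argsort(-scores)[:k]

# toy stand-ins for a trained model and its mutants
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 8))
original = lambda x: (x[:, 0] > 0).astype(int)
mutants = [lambda x, t=t: (x[:, 0] > t).astype(int) for t in (0.1, -0.1, 0.3)]

scores = bug_revealing_scores(original, mutants, X)
print("selected test indices:", select_top_k(scores, k=10))
```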
    Available online:  November 29, 2023 , DOI: 10.13328/j.cnki.jos.007007
    Abstract:
    In real-world settings where data sources are diverse and manual labeling is difficult, semi-supervised multi-view classification algorithms have important research significance in various fields. In recent years, semi-supervised multi-view classification algorithms based on graph neural networks have made great progress. However, most existing graph neural networks fuse multi-view information only in the classification stage and neglect the interaction of multi-view information for the same sample during the training stage. To solve this issue, this study proposes a model for semi-supervised classification named multi-view interaction graph convolutional network (MIGCN). A Transformer encoder module is introduced into the graph convolution layers trained on different views, which adaptively acquires complementary information between different views of the same sample during training. More importantly, a consistency constraint loss is introduced to make the similarity relationships of the final feature representations of different views as close as possible. This allows the graph convolutional network in the classification stage to better exploit the consistency and complementarity between different views, further improving the robustness of the fused multi-view features. Extensive experiments on several real-world multi-view datasets demonstrate that, compared with graph-based semi-supervised multi-view classification models, MIGCN better learns the essential features of multi-view data and thereby improves the accuracy of semi-supervised multi-view classification.
    Available online:  November 22, 2023 , DOI: 10.13328/j.cnki.jos.006968
    Abstract:
    Apache Flink is one of the most popular stream computing platforms and is widely applied in industry. Complex event processing (CEP) is one of the important usage scenarios of stream computing, and Apache Flink defines and implements a language for it (referred to as FlinkCEP). FlinkCEP includes rich syntactic features: not only the usual features of filtering, connecting, and looping, but also the advanced features of iterative conditions and after-match skip strategies. The semantics of FlinkCEP is complex, and since no language specification defines it precisely, it can only be understood by inspecting the implementation details. This motivates defining a formal semantics for FlinkCEP so that developers can understand it precisely. This study proposes an automaton model for FlinkCEP called data stream transducers (DST), in which data variables capture the iterative conditions, data stream variables store the outputs, and transition priorities capture the after-match skip strategies. DST is used to define the formal semantics of FlinkCEP and to design query evaluation algorithms based on this semantics, and a prototype CEP engine is implemented. Finally, test case sets that cover the syntactic features of FlinkCEP comprehensively are generated and used for comparison experiments against the actual results of FlinkCEP on the Flink platform. The experimental results show that the proposed formal semantics conforms to the actual semantics of FlinkCEP in the vast majority of cases. Furthermore, the inconsistencies between the formal and actual semantics are analyzed, revealing that the Flink implementation of FlinkCEP may not handle group patterns correctly.
    Available online:  November 15, 2023 , DOI: 10.13328/j.cnki.jos.006993
    Abstract:
    Text-based person retrieval is a developing downstream task of cross-modal retrieval and derives from conventional person re-identification, which plays a vital role in public safety and person search. In view of the problem of lacking query images in traditional person re-identification, the main challenge of this task is that it combines two different modalities and requires that the model have the capability of learning both image content and textual semantics. To narrow the semantic gap between pedestrian images and text descriptions, the traditional methods usually split image features and text features mechanically and only focus on cross-modal alignment, which ignores the potential relations between the person image and description and leads to inaccurate cross-modal alignment. To address the above issues, this study proposes a novel relation alignment-based cross-modal person retrieval network. First, the attention mechanism is used to construct the self-attention matrix and the cross-modal attention matrix, in which the attention matrix is regarded as the distribution of response values between different feature sequences. Then, two different matrix construction methods are used to reconstruct the intra-modal attention matrix and the cross-modal attention matrix respectively. Among them, the element-by-element reconstruction of the intra-modal attention matrix can well excavate the potential relationships of intra-modal. Moreover, by taking the cross-modal information as a bridge, the holistic reconstruction of the cross-modal attention matrix can fully excavate the potential information from different modalities and narrow the semantic gap. Finally, the model is jointly trained with a cross-modal projection matching loss and a KL divergence loss, which helps achieve the mutual promotion between modalities. Quantitative and qualitative results on a public text-based person search dataset (CUHK-PEDES) demonstrate that the proposed method performs favorably against state-of-the-art text-based person search methods.
    Available online:  November 15, 2023 , DOI: 10.13328/j.cnki.jos.006997
    Abstract:
    Safety-critical embedded software usually has rigorous time constraints on its runtime behaviors, which raises additional requirements for enforcing security properties. To protect the information flow security of embedded software and mitigate the limitations and potential false positives of existing single-property verification approaches, this study first proposes a new timed noninterference property, timed SIR-NNI, based on the security requirements of a realistic scenario. It then presents an information flow security verification approach that unifies the verification of multiple timed noninterference properties, namely timed BNNI, timed BSNNI, and timed SIR-NNI. According to the different timed noninterference requirements, the approach constructs refined automata and test automata from the timed automata under verification and uses UPPAAL’s reachability analysis to implement the refinement relation check and the security verification. The verification tool, TINIVER, extracts timed automata from SysML sequence diagrams or C++ source code to conduct the verification procedure. Verification results on existing timed automata models and security properties justify the usability of the proposed approach, and the security verifications on the typical flight-mode switch models of the UAV flight control systems ArduPilot and PX4 demonstrate its practicability and scalability. Besides, the approach is effective in mitigating the false positives of a state-of-the-art verification approach.
    Available online:  November 15, 2023 , DOI: 10.13328/j.cnki.jos.007002
    Abstract:
    Temporal knowledge graph reasoning aims to fill in missing links or facts in knowledge graphs in which each fact is associated with a specific timestamp, and the dynamic variational framework based on the variational autoencoder is particularly effective for this task. By jointly modeling entities and relations with Gaussian distributions, this line of methods not only offers high interpretability but also handles complex probability distributions. However, traditional variational autoencoder-based methods often suffer from overfitting during training, which limits their ability to accurately capture the semantic evolution of entities over time. To address this challenge, this study proposes a new temporal knowledge graph reasoning model based on a diffusion probability distribution approach. Specifically, the model uses a bidirectional iterative process to divide entity semantic modeling into multiple sub-modules, each of which uses a forward noising transformation and backward Gaussian sampling to model a small-scale evolution of entity semantics. Compared with variational autoencoder-based methods, the joint modeling of multiple sub-modules learns the dynamic representation of entity semantics in the metric space over time more accurately. In terms of MRR, the model improves over variational autoencoder-based methods by 4.18% and 1.87% on the Yago11k and Wikidata12k datasets and by 1.63% and 2.48% on the ICEWS14 and ICEWS05-15 datasets, respectively.
    Available online:  November 15, 2023 , DOI: 10.13328/j.cnki.jos.006995
    Abstract:
    Multi-view clustering has attracted more and more attention in image processing, data mining, and machine learning. Existing multi-view clustering algorithms have two shortcomings. First, during graph construction, only the pairwise relationships of the data in each view are considered when generating the affinity matrix, which fails to characterize neighborhood relationships. Second, existing methods separate multi-view information fusion from clustering, which reduces clustering performance. Therefore, this study proposes a more accurate and robust joint spectral embedding multi-view clustering algorithm based on bipartite graphs. Following the idea of multi-view subspace clustering, bipartite graphs are first constructed and similarity graphs are generated, and the spectral embedding matrices of the similarity graphs are then fused. Second, weight constraints are applied to account for the importance of each view during fusion, and an indicator matrix is introduced to obtain the final clustering result. A model is proposed to optimize the bipartite graphs, the embedding matrices, and the clustering indicator matrix within a single framework. In addition, a fast optimization strategy is provided, which decomposes the optimization problem into small sub-problems and solves them efficiently through iterative steps. The proposed algorithm and existing multi-view clustering algorithms are experimentally analyzed on real datasets. The results show that, compared with existing methods, the proposed algorithm is more effective and robust in dealing with multi-view clustering problems.
    Available online:  November 08, 2023 , DOI: 10.13328/j.cnki.jos.007001
    Abstract:
    As an important production factor, data need to be exchanged between different entities to create value. In this process, data integrity must be ensured; that is, data cannot be tampered with without authorization, as tampering may lead to extremely serious consequences. Existing work preserves data evidence by combining distributed ledgers with data encryption and verification technology, ensuring the integrity of the data to be exchanged during transmission, storage, and other related data processing phases. However, such work can hardly confirm the integrity of the data provided by the data supplier: once the supplier provides forged data, all subsequent integrity assurance becomes meaningless. Therefore, this study proposes a method for verifying the integrity of data services based on remote attestation. Using a trusted execution environment as the trust anchor, the method measures and verifies the integrity of the static code, the execution process, and the execution result of a specific data service. It also optimizes the integrity verification of a specific data service through program slicing, thus extending the scope of data integrity assurance back to the point at which the data supplier provides the data. A series of experiments on 25 data services from three real Java information systems validate the proposed method.
    Available online:  November 08, 2023 , DOI: 10.13328/j.cnki.jos.006994
    Abstract:
    With the development of mobile services’ computing and sensing abilities, spatial crowdsourcing, which is based on location information, comes into being. There are many challenges to improving the performance of task assignments, one of which is how to assign workers the tasks that they are interested in. Existing research methods only focus on workers’ temporal preference but ignore the impact of spatial factors on workers’ preference, and they only focus on long-term preference but ignore workers’ short-term preference and face the problem of inaccurate predictions caused by sparse historical data. This study analyzes the task assignment problem based on long-term and short-term spatio-temporal preference. By comprehensively considering workers’ preferences from both long-term and short-term perspectives, as well as temporal and spatial dimensions, the quality of task assignment is improved in task assignment success rate and completion efficiency. In order to improve the accuracy of spatio-temporal preference prediction, the study proposes a sliced imputation-based context-aware tensor decomposition algorithm (SICTD) to reduce the proportion of missing values in preference tensors and calculates short-term spatio-temporal preference through the ST-HITS algorithm and short-term active range of workers under spatio-temporal constraints. In order to maximize the total task reward and the workers’ average preference for completing tasks, the study designs a spatio-temporal preference-aware greedy and Kuhn-Munkres (KM) algorithm to optimize the results of task assignment. Extensive experimental results on real datasets show the effectiveness of the long- and short-term spatio-temporal preference-aware task assignment framework. Compared with baselines, the RMSE prediction error of the proposed SICTD for temporal and spatial preferences is decreased by 22.55% and 24.17%, respectively. In terms of task assignment, the proposed preference-aware KM algorithm significantly outperforms the baseline algorithms, with the workers’ total reward and average preference for completing tasks averagely increased by 40.86% and 22.40%, respectively.
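    A compact sketch of the assignment step only: once a worker-task utility matrix has been obtained (here it would combine predicted preference and task reward), the Kuhn-Munkres algorithm yields a maximum-utility one-to-one matching. SciPy's linear_sum_assignment is used as the KM solver; the utility values are made up, and the preference-prediction side (SICTD, ST-HITS) is not reproduced.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

def assign_tasks(utility):
    """Maximum-utility one-to-one assignment of workers (rows) to tasks (columns).
    linear_sum_assignment minimizes cost, so the utilities are negated."""
    rows, cols = linear_sum_assignment(-utility)
    return list(zip(rows, cols)), float(utility[rows, cols].sum())

# toy utility matrix combining predicted preference and task reward
utility = np.array([
    [0.9, 0.2, 0.4],
    [0.3, 0.8, 0.1],
    [0.5, 0.6, 0.7],
])
pairs, total = assign_tasks(utility)
print("assignment:", pairs, "total utility:", round(total, 2))
```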
    Available online:  November 08, 2023 , DOI: 10.13328/j.cnki.jos.006988
    Abstract:
    Aiming at the growing threat of distributed denial of service (DDoS) attacks amid the rapid popularization of IPv6, this study proposes a two-stage DDoS defense mechanism, which includes a pre-detection stage that monitors the early signs of DDoS attacks in real time and a deep-detection stage that accurately filters DDoS traffic after an alarm. First, the IPv6 traffic format is analyzed, and the hexadecimal header fields are extracted from PCAP capture files as detection elements. In the pre-detection stage, a lightweight binary convolutional neural network (BCNN) model is introduced, and a two-dimensional traffic matrix is designed as the model input; the model can sensitively perceive the malicious situation caused by mixed DDoS traffic in the network as evidence of DDoS occurrence. After an alarm, the deep-detection stage intervenes with a one-dimensional convolutional neural network (1DCNN) model, which takes one-dimensional packet vectors as input, specifically distinguishes the mixed DDoS packets, and issues blocking policies. In the experiments, an IPv6-LAN topology is built, and pure IPv6 DDoS traffic is generated by replaying the CIC-DDoS2019 public dataset through NAT 4to6. The results show that the proposed mechanism effectively improves response speed, detection accuracy, and traffic filtering efficiency in DDoS defense. When DDoS traffic accounts for only 6% and 10% of total network traffic, BCNN perceives the occurrence of DDoS with 90.9% and 96.4% accuracy, respectively, and the 1DCNN model distinguishes mixed DDoS packets with 99.4% accuracy.
    Available online:  November 08, 2023 , DOI: 10.13328/j.cnki.jos.006989
    Abstract:
    The smart contract is a decentralized application widely deployed on blockchain platforms such as Ethereum. Because of their economic attributes, vulnerabilities in smart contracts can cause huge financial losses and destroy the stable ecology of Ethereum, so it is crucial to detect vulnerabilities in smart contracts before they are deployed. Existing smart contract vulnerability detection methods (e.g., Oyente and Securify) are mostly based on heuristic algorithms; their reusability across different application scenarios is weak, and they are time-consuming and of low accuracy. To improve the effectiveness of vulnerability detection, this study proposes Scruple, a smart contract timestamp vulnerability detection approach based on learning data-flow paths. It first obtains all possible propagation chains of timestamp vulnerabilities, then refines these chains, uses a graph pre-training model to learn the relationships within them, and finally applies the learned model to detect whether a smart contract contains timestamp vulnerabilities. Compared with existing detection methods, Scruple has a stronger vulnerability capture ability and better generalization. Meanwhile, learning the propagation chains is well-directed and avoids descending into unnecessarily deep program hierarchies in order to converge on vulnerabilities. To verify the effectiveness of Scruple, this study compares it with 13 state-of-the-art smart contract vulnerability detection methods on distinct real-world smart contracts. The experimental results show that Scruple achieves 96% accuracy, 90% recall, and a 93% F1-score in detecting timestamp vulnerabilities; in other words, its average improvement over the 13 methods on these three metrics is 59%, 46%, and 57%, respectively, a substantial gain in detecting timestamp vulnerabilities.
    Available online:  November 01, 2023 , DOI: 10.13328/j.cnki.jos.006986
    Abstract:
    Distributed storage systems are receiving increasing attention in mobile network scenarios. Data placement, a key technology of distributed storage, is crucial to improving the success rate of distributed data storage. However, due to unstable wireless signals and fluctuating network bandwidth in mobile environments, traditional data placement strategies, such as the random placement strategy and the storage-aware placement strategy, have low success rates of data transmission because neither of them takes network bandwidth into account during data placement. To solve this problem faced by mobile distributed storage systems, this study proposes a bandwidth-aware adaptive data placement strategy (BADP). The main breakthrough is that BADP adopts the group mobility model to sense the network bandwidth of nodes and takes the network bandwidth as an important factor for data placement, thus selecting nodes with good performance to achieve adaptive data placement and improve the success rate of data transmission. BADP has three design features: (1) adopting the group mobility model to sense the network bandwidth of nodes; (2) managing node information in groups to reduce communication overhead, and taking advantage of a heap to build a node selection tree; (3) selecting nodes with good performance through adaptive data placement to improve the success rate of data transmission. Experiments show that when the network changes dynamically, BADP gains at least 30.6% and 34.6% improvements in the success rate of data transmission compared with the random placement strategy and the storage-aware placement strategy. At the same time, it consistently keeps communication overhead low.
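    The heap-based node selection mentioned in feature (2) can be illustrated with a very small sketch: candidate nodes are placed in a heap keyed by their sensed bandwidth, and the best ones are popped first. The node attributes and the tie-breaking rule below are illustrative assumptions, not the exact design of BADP.

```python
# Minimal sketch of bandwidth-aware node selection with a heap, in the spirit
# of BADP's node selection tree. Node attributes and the scoring rule are
# illustrative assumptions, not the paper's exact design.
import heapq

def select_nodes(nodes, k):
    """nodes: dicts with 'id', 'bandwidth' (Mb/s), and 'free_space' (GB).
    Returns the k node ids with the highest sensed bandwidth (ties by space)."""
    # heapq is a min-heap, so push negated keys to pop the best nodes first.
    heap = [(-n["bandwidth"], -n["free_space"], n["id"]) for n in nodes]
    heapq.heapify(heap)
    return [heapq.heappop(heap)[2] for _ in range(min(k, len(heap)))]

if __name__ == "__main__":
    candidates = [
        {"id": "n1", "bandwidth": 20.0, "free_space": 80},
        {"id": "n2", "bandwidth": 55.5, "free_space": 40},
        {"id": "n3", "bandwidth": 55.5, "free_space": 90},
        {"id": "n4", "bandwidth": 8.0,  "free_space": 200},
    ]
    print(select_nodes(candidates, 2))   # ['n3', 'n2']
```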
    Available online:  November 01, 2023 , DOI: 10.13328/j.cnki.jos.006987
    Abstract:
    Internet users need to resolve through DNS before accessing network applications. DNS security is the first portal to ensure the normal operation of the network. If the security of DNS cannot be effectively guaranteed, even if the level of security protection measures of other network systems is high, attackers can attack the DNS system to make the network unusable. At present, DNS malignant events still have an upward trend, and the development of DNS attack detection and defense technology still cannot meet practical needs. From the perspective of recursive servers that directly serve users’ DNS requests, this study comprehensively summarizes the security problems faced in the DNS process through two classification methods, including various security events caused by attacks or system vulnerabilities, different detection methods for various security events, and various defense and protection technologies. When summarizing various security events, detection and defense protection technologies, the study analyzes the characteristics of the corresponding typical methods and prospects for the future research direction of the DNS security field.
    Available online:  October 25, 2023 , DOI: 10.13328/j.cnki.jos.006992
    Abstract:
    GitHub is a well-known open-source software development community where developers use the issue tracking system of each open-source project to address issues. During the discussion of an issue about a defect, the developer may point out issues from other projects correlated with the defect, which are called cross-project issues, so as to provide reference information for fixing the defect. However, there are more than 200 million open-source projects and 1.2 billion issues on the GitHub platform, making it time-consuming to identify and acquire cross-project issues manually. This study presents a cross-project issue recommendation method, CPIRecom, for open-source software defects. The study builds a pre-selection set by filtering issues based on the number of historical issue pairs and the time interval between reported issues. Then, the study proposes an accurate recommendation model, which extracts textual features based on the pre-trained BERT model, analyzes project features, calculates the relevance probability between defects and issues from the pre-selection set with a random forest classifier, and obtains the recommendation list according to the ranking. This study simulates the application of CPIRecom on the GitHub platform. The mean reciprocal rank of CPIRecom reaches 0.603, and the Recall@5 reaches 0.715 on the simulated test set.
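    The second stage described above amounts to scoring each pre-selected (defect, issue) pair with a binary classifier and sorting candidates by the predicted probability. A minimal sketch of that ranking step follows; the two-dimensional toy features merely stand in for the BERT text features and project features, which is an assumption for illustration.

```python
# Minimal sketch of the ranking stage: a binary classifier scores each
# candidate (defect, issue) pair and candidates are sorted by that score.
# The toy features and candidate names below are illustrative assumptions.
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
X_train = rng.random((200, 2))                     # toy pair features
y_train = (X_train.sum(axis=1) > 1.0).astype(int)  # toy "relevant" labels

clf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X_train, y_train)

candidates = {"proj-a#12": [0.9, 0.8], "proj-b#7": [0.2, 0.1], "proj-c#3": [0.7, 0.6]}
scores = clf.predict_proba(np.array(list(candidates.values())))[:, 1]
ranking = sorted(zip(candidates, scores), key=lambda t: -t[1])
print(ranking)   # candidate issues ordered by predicted relevance
```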
    Available online:  October 25, 2023 , DOI: 10.13328/j.cnki.jos.007000
    Abstract:
    The fuzzy C-means (FCM) clustering algorithm has become one of the most commonly used image segmentation techniques owing to its low learning cost and computational overhead. However, the conventional FCM clustering algorithm is sensitive to noise in images. Recently, many improved FCM algorithms have been proposed to improve the noise robustness of the conventional FCM clustering algorithm, but often at the cost of losing image details. This study presents an improved FCM clustering algorithm based on Lie group theory and applies it to image segmentation. The proposed algorithm constructs matrix Lie group features for the pixels of an image, which summarize the low-level image features of each pixel and its relationship with other pixels in the neighborhood window. In this way, the proposed method transforms the clustering problem from measuring Euclidean distances between pixels into calculating geodesic distances between the Lie group features of pixels on the Lie group manifold. Aiming at the problem of updating the clustering centers and the fuzzy membership matrix on the Lie group manifold, the proposed method uses an adaptive fuzzy weighted objective function, which improves the generalization and stability of the algorithm. The effectiveness of the proposed method is verified by comparison with conventional FCM and several classic improved algorithms in experiments on three types of medical images.
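    For reference, the standard FCM updates that such methods build on can be written with a pluggable distance function; the key change described above is to replace the Euclidean distance with a geodesic distance between Lie group features, which is not reproduced here. The sketch below uses plain Euclidean distance, and the cluster count, fuzzifier, and iteration count are generic illustrative values.

```python
# Minimal sketch of fuzzy C-means with a pluggable distance function; the
# paper replaces the Euclidean distance below with a geodesic distance
# between Lie group features of pixels (not reproduced here).
import numpy as np

def fcm(X, c=3, m=2.0, iters=50, dist=None, seed=0):
    rng = np.random.default_rng(seed)
    dist = dist or (lambda a, b: np.linalg.norm(a - b, axis=-1))
    U = rng.random((len(X), c)); U /= U.sum(axis=1, keepdims=True)
    for _ in range(iters):
        W = U ** m
        centers = (W.T @ X) / W.sum(axis=0)[:, None]         # weighted means
        D = np.stack([dist(X, ctr) for ctr in centers], 1) + 1e-12
        # Standard membership update: u_ik = 1 / sum_j (d_ik / d_ij)^(2/(m-1))
        P = D ** (2 / (m - 1))
        U = 1.0 / (P * (1.0 / P).sum(axis=1, keepdims=True))
    return centers, U

if __name__ == "__main__":
    X = np.concatenate([np.random.randn(50, 2), np.random.randn(50, 2) + 5])
    centers, U = fcm(X, c=2)
    print(np.round(centers, 2))   # two cluster centers
```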
    Available online:  October 25, 2023 , DOI: 10.13328/j.cnki.jos.006966
    Abstract:
    Current authentication protocols based on usernames and passwords can hardly meet increasing security requirements. Specifically, users choose different passwords to access different online services, which greatly increases the user's memory burden. In addition, password authentication has low security and faces many known attacks. To solve such problems, this study proposes a user-centric two-factor authentication key agreement protocol, UC-2FAKA, based on the Pointcheval-Sanders signature. Firstly, to prevent the leakage of authentication factors, two-factor credentials based on passwords and biometrics are constructed on the Pointcheval-Sanders signature, and the identity is authenticated to the service provider (SP) in a zero-knowledge proof manner. Secondly, using a user-centric single sign-on (SSO) architecture, users can request identity credentials by registering with an identity provider (IDP) and log in to different SPs, which prevents the IDP or SPs from tracking or linking users. Thirdly, the Diffie-Hellman key exchange is used to authenticate SP identities and negotiate communication keys to ensure subsequent communication security. Finally, comprehensive security analysis and performance comparison of the proposed protocol are carried out. The results show that the proposed protocol can resist various known attacks and performs better in terms of communication and computational overhead.
    Available online:  October 25, 2023 , DOI: 10.13328/j.cnki.jos.006970
    Abstract:
    Existing hypergraph network representation methods need to analyze full-batch nodes and hyperedges to recursively extend neighbors across layers, which brings huge computational costs and leads to lower generalization accuracy due to over-expansion. To solve this problem, this study proposes a hypergraph network representation method based on importance sampling. First, the method treats nodes and hyperedges as two sets of independent and identically distributed samples that satisfy specific probability measures and interprets the structural feature interactions of the hypergraph in an integral form. Second, it designs a neighbor importance sampling rule with learnable parameters and calculates sampling probabilities based on the physical relations and features of nodes and hyperedges. A fixed number of objects are recursively acquired layer by layer to construct a smaller sampled adjacency matrix. Finally, the spatial features of the entire hypergraph are approximated using Monte Carlo methods. In addition, drawing on the idea of physics-informed neural networks, the sampling variance that needs to be reduced is added to the hypergraph neural network as a physical constraint to obtain sampling rules with better generalization capability. Extensive experiments on multiple datasets show that the method proposed in this study can obtain more accurate hypergraph representation results with a faster convergence rate.
    Available online:  October 18, 2023 , DOI: 10.13328/j.cnki.jos.006971
    Abstract:
    Fast vulnerability root cause analysis is crucial for patching vulnerabilities and has always been a hotspot in academia and industry. The existing vulnerability root cause analysis methods, based on the statistical feature analysis of a large number of test sample execution records, have problems such as random noise and missing important logically correlated instructions. According to the test set measurement in this study, the proportion of random noise in the existing statistical methods reaches more than 61%. To solve the above problems, this study proposes a vulnerability root cause analysis method based on the local path graph, which extracts vulnerability-related information such as the inter-function call graph and the intra-function control flow transfer graph from the execution paths. The local path graph is utilized to eliminate irrelevant (i.e., noise) instructions, construct the logical relations among vulnerability root cause relevant points, and add missing critical instructions. An automated root cause analysis system for binary software, LGBRoot, has been implemented. The effectiveness of the system has been evaluated on a dataset of 20 public CVE memory corruption vulnerabilities. The average time for single-sample root cause analysis is 12.4 seconds. The experimental data show that the system can automatically eliminate 56.2% of noise instructions and reconstruct as well as visualize the logical structures of vulnerability root cause relevant points for the 20 vulnerabilities, speeding up analysts' vulnerability analysis.
    Available online:  October 18, 2023 , DOI: 10.13328/j.cnki.jos.006991
    Abstract:
    Conformance checking is one of the important scenarios in the field of process mining, and its goal is to determine whether the actual running business behavior is consistent with the desired behavior and then provide a basis for business process management decisions. Traditional methods of conformance checking face the problems of too many metrics and low efficiency. In addition, the existing methods for checking the conformance between process texts and process models rely heavily on expert-defined knowledge. Therefore, this study proposes a process text-oriented conformance checking method. Firstly, the study generates graph traces based on the execution semantics of the process model and obtains structural features from the graph traces with a word vector model. At the same time, Huffman trees are introduced to reduce the computational effort. Then, the word vector representations of the process text and the activities are computed, and the Siamese mechanism is used to improve training efficiency. Finally, all the features of the text and the model are fused, and the consistency score between the text and the model is predicted using a fully connected layer. Experiments show that the mean absolute error of the method in this study is two percentage points lower than that of existing methods.
    Available online:  October 18, 2023 , DOI: 10.13328/j.cnki.jos.006976
    Abstract:
    Disassembly of binary code is hard but necessary for improving the security of binary software. One of the major reasons why binary disassembly is difficult is that compilers create many jump tables in the binary code for efficiency. In order to resolve the targets of jump tables, mainstream disassembly tools use various strategies. However, the details of the implementation of these strategies and their effectiveness are not well studied. To help researchers better understand the algorithm implementation and performance of disassembly tools, this study first systematically summarizes the strategies used by disassembly tools to resolve jump tables; then the study builds an automatic framework for testing jump tables, based on which a large-scale test suite of 2,410,455 jump tables is generated. Lastly, this study evaluates the performance of the disassembly tools in resolving jump tables on the test suite and manually analyzes the errors introduced by each strategy of the disassembly tools. In addition, benefiting from the systematic summary of the disassembly tools' algorithm implementations, this study finds six bugs in the implementations of the disassembly tools.
    Available online:  October 18, 2023 , DOI: 10.13328/j.cnki.jos.006977
    Abstract:
    The database performance is affected by the database configuration parameters. The quality of parameter settings will directly affect the performance of the database. Therefore, the quality of the database parameter tuning method is important. However, traditional database parameter tuning methods have many limitations, such as the inability to make full use of historical parameter tuning data, wasting time and human resources, and so on. The counterfactual interpretation methods aim to change the original prediction to the expected prediction by making small modifications to the original data. The method plays a role of suggestion, and this can be used for database configuration optimization, namely, making small modifications to the database configuration to optimize the performance of the database. Therefore, this study proposes a counterfactual interpretation method for database configuration optimization. For databases with poor performance under specific load conditions, this method can modify the database configuration and generate corresponding database configuration counterfactuals to optimize database performance. This study conducts two kinds of experiments to evaluate the counterfactual interpretation method and verify the effect of optimizing the database. The experimental results show that the counterfactual interpretation methods proposed in this study are better than other typical counterfactual interpretation methods in terms of various evaluation indicators, and the generated counterfactuals can effectively improve database performance.
    Available online:  October 11, 2023 , DOI: 10.13328/j.cnki.jos.006978
    Abstract:
    Many two-party threshold schemes for SM2 digital signatures have been proposed in recent years, which can significantly enhance the security of private keys for SM2 digital signatures. According to different methods of key splitting, the published schemes can be divided into two types: multiplicative key splitting and additive key splitting. Further, these schemes can be subdivided into various two-party threshold schemes according to different constructions of the signature random number. This study proposes a framework of two-party threshold schemes for the SM2 digital signature, which provides a secure basic calculation process for two-party threshold schemes and introduces a signature random number that can be constructed in various ways. With the proposed framework and various constructions of the random number, this study instantiates the framework and obtains a variety of two-party threshold schemes for the SM2 digital signature. The instantiation includes 23 known two-party threshold schemes, as well as a variety of new schemes.
    Available online:  October 11, 2023 , DOI: 10.13328/j.cnki.jos.006965
    Abstract:
    The major challenges traditional operating system (OS) design faces are the increasing number, diversity, and distribution scope of resources to be managed and the frequent changes in system state. However, the structures of existing OSs have become the biggest obstacle to solving the above problems as (1) tight coupling and centralization of the structure lead to poor flexibility and scalability and separate OS ecology; (2) contradiction between various capabilities, e.g., security and performance, due to the unitary isolation mechanism such as kernel-user isolation. Therefore, this study combines the hierarchical software bus (softbus) principles with isolation mechanisms to organize the OS and proposes a new OS model termed Yggdrasil. Yggdrasil decomposes an OS into component nodes connected by softbuses, whose communications are standardized to message passing via the softbus. To support the division of isolated states such as supervisor mode and different software hierarchies, Yggdrasil introduces bridge nodes for cascading and controlled communication between softbuses, and enhances the logical representation capability and scalability of OS through self-similar topology. Additionally, the simplicity and hierarchy of the softbus help to achieve decentralization. To verify the feasibility of Yggdrasil, the study builds hierarchical softbus model for OS (HiBuOS) and demonstrates the feasibility of developing a new OS based on Yggdrasil’s ideas through three specific designs: (1) designing and planning a hierarchical softbus structure according to the scale and requirements of the target operating system; (2) selecting specific isolation and communication mechanisms to instantiate bridge nodes and softbuses; (3) realizing OS services based on the hierarchical softbus style. Finally, the evaluation shows that HiBuOS has notable potential and advantages to enhance system scalability, security, performance, and ecological development without significant performance loss.
    Available online:  October 11, 2023 , DOI: 10.13328/j.cnki.jos.006974
    Abstract:
    Functions are the smallest named units of aggregated behavior in most traditional programming languages. The readability of function names plays a vital role in programmers' understanding of program functionality and of the interaction between different modules. Low-quality function names may confuse developers, increase code smells, and eventually result in software defects caused by API misuse. Therefore, a deep learning-based method for function name consistency checking and recommendation, named DMName, is proposed. Firstly, for the given source code of the target function, the internal context, interactive context, sibling context, and closed context are constructed respectively, and the context information token sequence is obtained after merging them. Then the token sequence is converted into a context representation vector sequence using the word embedding technique FastText and fed into the encoder of a seq2seq model. The copy mechanism and the coverage mechanism are utilized to solve the OOV problem and the repeated decoding problem, respectively. Finally, the vector sequence predicted for the target function name is output, and the consistency of the function name is predicted with the help of a two-channel CNN classifier. If the function name is inconsistent, the recommended function name can be obtained by direct mapping based on vector space similarity matching. The experimental results show that the F1-measure of DMName in function name consistency checking and recommendation reaches 82.65% and 73.31%, respectively, which is 2.01% and 2.96% higher than that of the current best method, DeepName. Finally, DMName is verified on lancia, a large-scale open-source project on GitHub. A total of 16 function name inconsistency problems are found, and reasonable name recommendations are made, which further confirms the effectiveness of DMName.
    Available online:  October 11, 2023 , DOI: 10.13328/j.cnki.jos.006982
    Abstract:
    Static analysis tools often suffer from high false positive rates of reported alarms, despite their ability to aid developers in detecting potential defects early in the software development life cycle. To improve the availability of these tools, many automated warning identification techniques have been proposed to assist developers in classifying false positive alarms. However, existing approaches mainly focus on using hand-engineered features or statement-level abstract syntax tree token sequences to represent the defective code, failing to capture semantics from the reported alarms. To overcome the limitations of traditional approaches, this study employs deep neural networks with powerful feature extraction and representation abilities to generate code semantics from control flow graph paths for warning identification. The control flow graph abstractly represents the execution process of a given program. Thus, the generated path sequences of the control flow graph can guide the deep neural networks to learn semantic information about the potential defect more accurately. In this study, the pre-trained language model is fine-tuned to encode the path sequences and capture the semantic representations for model building. Finally, the study conducts extensive experiments on eight open-source projects to verify the effectiveness of the proposed approach by comparing it with the state-of-the-art baselines.
    Available online:  October 11, 2023 , DOI: 10.13328/j.cnki.jos.006990
    Abstract:
    Informationization 3.0, represented by deep mining and fusion applications of big data, is starting, and software in the traditional static environment is evolving into complex software in the open and dynamic human-cyber-physical ternary environment. How to realize trusted, manageable, and controllable data interconnection on the untrusted and uncontrollable Internet is an urgent problem to be solved. The Internet of Data technical system, represented by the digital object architecture and identifier resolution technology, provides a feasible solution to these challenges. In order to solve problems such as low transmission efficiency, high coordination cost, and security management issues in the process of data sharing on the Internet, this study proposes identifier resolution standard specifications for human-cyber-physical ternary environments. Moreover, to meet the demands that data resources owned by different entities need to be discoverable, accessible, understandable, trustworthy, and interoperable in the human-cyber-physical ternary environment, this study designs the identifier resolution protocol and implements an identifier resolution prototype system for human-cyber-physical ternary environments. At last, this study tests the performance of the prototype system, and the validity of the system is verified by applying it to application scenarios.
    Available online:  September 27, 2023 , DOI: 10.13328/j.cnki.jos.006967
    Abstract:
    With the rapid development of neural network technology, neural networks have been widely applied in safety-critical fields such as autonomous driving, intelligent manufacturing, and medical diagnosis. Thus, it is crucial to ensure the trustworthiness of neural networks. However, due to the vulnerability of neural networks, slight perturbations often lead to wrong results. Therefore, it is vital to use formal verification methods to ensure the safety and trustworthiness of neural networks. Current verification methods for neural networks are mainly concerned with the accuracy of the analysis while tending to ignore efficiency. When verifying the safety properties of complex networks, the large-scale state space may lead to problems such as infeasibility or unsolvability. To reduce the state space of neural networks and improve verification efficiency, this study presents a formal verification method for neural networks based on divide and conquer that accounts for over-approximation errors. The method uses the reachability analysis technique to calculate the upper and lower bounds of nonlinear nodes and uses an improved symbolic linear relaxation method to reduce over-approximation errors during the bound calculation of nonlinear nodes. The constraints of nodes are refined by calculating the direct and indirect effects of their over-approximation errors. Thereby, the original verification problem is split into a set of sub-problems whose mixed integer linear programming (MILP) formulations have a smaller number of constraints. The method is implemented as a tool named NNVerifier, which is evaluated through experiments on four ReLU-based fully-connected benchmark networks trained on three classic datasets. The experimental results show that the verification efficiency of NNVerifier is 37.18% higher than that of the existing complete verification methods.
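    The reachability analysis mentioned above starts from sound per-node bounds. A minimal sketch of plain interval bound propagation through a fully-connected ReLU network follows; it illustrates only the basic bound computation and omits the paper's symbolic linear relaxation and divide-and-conquer refinement, and the toy network sizes are assumptions.

```python
# Minimal sketch of interval bound propagation through a fully-connected
# ReLU network: given input bounds, compute sound lower/upper bounds for
# every layer. Symbolic relaxation and constraint splitting are omitted.
import numpy as np

def interval_forward(weights, biases, lower, upper):
    """weights/biases: lists of layer parameters; lower/upper: input bounds."""
    for i, (W, b) in enumerate(zip(weights, biases)):
        W_pos, W_neg = np.maximum(W, 0), np.minimum(W, 0)
        new_lower = W_pos @ lower + W_neg @ upper + b   # worst-case low
        new_upper = W_pos @ upper + W_neg @ lower + b   # worst-case high
        if i < len(weights) - 1:                        # ReLU on hidden layers
            new_lower, new_upper = np.maximum(new_lower, 0), np.maximum(new_upper, 0)
        lower, upper = new_lower, new_upper
    return lower, upper

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    Ws = [rng.standard_normal((4, 2)), rng.standard_normal((2, 4))]
    bs = [np.zeros(4), np.zeros(2)]
    lo, up = interval_forward(Ws, bs, np.array([-0.1, -0.1]), np.array([0.1, 0.1]))
    print(lo, up)   # sound output bounds for all inputs in the box
```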
    Available online:  September 27, 2023 , DOI: 10.13328/j.cnki.jos.006972
    Abstract:
    Subset repair for inconsistent data is an important research problem in the field of data cleaning. Most of the existing methods are based on integrity constraint rules and adopt the principle of the minimum number of deleted tuples for subset repair. However, these methods take no account of the quality of deleted tuples, and the repair accuracy is low. Therefore, this study proposes a subset repair method combining rules and probabilities. The probability of inconsistent tuples is modeled so that the average probability of correct tuples is greater than that of wrong tuples, and the optimal subset repair with the smallest sum of the probability of deleted tuples is calculated. In addition, in order to reduce the time overhead of calculating the probability of inconsistent tuples, this study proposes an efficient error detection method to reduce the size of inconsistent tuples. Experimental results on real data and synthetic data verify that the proposed method outperforms the state-of-the-art subset repair method in terms of accuracy.
    Available online:  September 27, 2023 , DOI: 10.13328/j.cnki.jos.006973
    Abstract:
    In recent years, software system security issues have been attracting increasing attention. The security threats existing in systems can be easily exploited by attackers. Attackers usually attack systems by using various attack techniques, such as password brute-force cracking, phishing, and SQL injection. Threat modeling is a method of structurally analyzing, identifying, and processing threats. Traditional testing mainly focuses on code defects and takes place in the late stages of software development, so it cannot be well connected with the results of early threat modeling and analysis for building secure software. Threat modeling tools in the industry lack the function of generating security tests. In order to tackle this problem, this study proposes a framework that is able to generate security test cases from threat models and designs and implements a tool prototype. In order to facilitate testing, this study improves the traditional attack tree model and performs compliance checks. Test scenarios can be automatically generated from the model. The test scenarios are evaluated according to the probabilities of attack nodes, and the scenarios of the threats with higher probabilities are tested first. The defense nodes are evaluated, and the defense scheme with higher benefit is selected to mitigate the threats, so as to improve the system's security design. By setting parameters for attack nodes, test scenarios can be specified as test cases. In the early stage of software development, with the threats identified by threat modeling as inputs, test cases can be generated through this framework and tool to guide subsequent security development and test design, which improves the integration of security technology into software design and development. The case study applies the framework and tool to test generation for high security risks, which shows their effectiveness.
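    To make the scenario-generation step concrete, the sketch below enumerates test scenarios from a tiny attack tree and orders them by estimated attack probability; the tree shape, gate types, and probability values are illustrative assumptions, not the paper's improved attack tree model.

```python
# Minimal sketch of enumerating test scenarios from an attack tree and
# ordering them by estimated attack probability. The tree, gate types, and
# probabilities are illustrative assumptions.
from dataclasses import dataclass, field
from itertools import product

@dataclass
class Node:
    name: str
    prob: float = 1.0                 # probability of the attack step
    gate: str = "LEAF"                # "AND", "OR", or "LEAF"
    children: list = field(default_factory=list)

def scenarios(node):
    """Return a list of (leaf-name tuple, probability) scenarios."""
    if node.gate == "LEAF":
        return [((node.name,), node.prob)]
    child_sets = [scenarios(c) for c in node.children]
    if node.gate == "OR":             # any child realizes the goal
        return [s for cs in child_sets for s in cs]
    combos = []                       # AND: all children must be realized
    for picks in product(*child_sets):
        names = tuple(n for step_names, _ in picks for n in step_names)
        prob = 1.0
        for _, p in picks:
            prob *= p
        combos.append((names, prob))
    return combos

if __name__ == "__main__":
    root = Node("steal credentials", gate="OR", children=[
        Node("phishing", 0.4),
        Node("database breach", gate="AND", children=[
            Node("SQL injection", 0.3), Node("exfiltrate table", 0.8)]),
    ])
    for names, p in sorted(scenarios(root), key=lambda s: -s[1]):
        print(round(p, 2), names)     # higher-probability scenarios tested first
```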
    Available online:  September 27, 2023 , DOI: 10.13328/j.cnki.jos.006957
    Abstract:
    As one of the ten block cipher algorithms selected for the second round of the 2018 National Cryptographic Algorithm Design Contest, Feistel-based block cipher (FBC) is an efficient and lightweight block cipher algorithm with a four-branch and two-fold Feistel structure. In this study, the FBC algorithm is abstracted as the FBC model, and the pseudorandomness and super-pseudorandomness of the model are studied. It is assumed that the FBC round functions are independent random functions, and a method to find the minimal number of FBC rounds is provided, which will keep FBC indistinguishable from a random permutation. Finally, the study comes to the conclusion that under the chosen-plaintext attack, four rounds of FBC are indistinguishable from random permutation, so the model has pseudorandomness; under the adaptive chosen-plaintext and ciphertext attack, five rounds of FBC are indistinguishable from random permutation, so the model has super-pseudorandomness.
    Available online:  September 27, 2023 , DOI: 10.13328/j.cnki.jos.006958
    Abstract:
    Few-shot learning aims at simulating the ability of human beings to quickly learn new things with only a few samples, which is of great significance for deep learning tasks when samples are limited. However, in many practical tasks with limited computing resources, the model scale may still limit the wider application of few-shot learning. This study therefore addresses the practical requirement of making few-shot learning models lightweight. As a widely used auxiliary strategy in deep learning, knowledge distillation transfers knowledge between models by using additional supervisory information and has practical applications in both improving model accuracy and reducing model scale. This study first verifies the effectiveness of the knowledge distillation strategy for model lightweighting in few-shot learning. Then, according to the characteristics of few-shot learning, two new distillation methods for few-shot learning are designed: (1) distillation based on local image features; (2) distillation based on auxiliary classifiers. Experiments on the miniImageNet and TieredImageNet datasets demonstrate that the new distillation methods are significantly superior to traditional knowledge distillation in few-shot learning tasks. The source code is available from https://github.com/cjy97/FSLKD.
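    For context, the traditional knowledge distillation baseline that the two new methods are compared against combines a temperature-softened teacher-student KL term with the usual cross-entropy loss. A minimal PyTorch sketch is given below; the temperature and weighting are illustrative assumptions, and the paper's local-feature and auxiliary-classifier variants are not reproduced.

```python
# Minimal sketch of standard soft-label knowledge distillation: a student is
# trained against temperature-softened teacher outputs plus the true labels.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.7):
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=1),
        F.softmax(teacher_logits / T, dim=1),
        reduction="batchmean",
    ) * (T * T)                                   # keep gradient scale comparable
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard

if __name__ == "__main__":
    s = torch.randn(8, 5, requires_grad=True)     # student logits, 5-way task
    t = torch.randn(8, 5)                         # teacher logits
    y = torch.randint(0, 5, (8,))
    print(distillation_loss(s, t, y).item())
```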
    Available online:  September 27, 2023 , DOI: 10.13328/j.cnki.jos.006998
    Abstract:
    Multimodal sentiment analysis is a task that uses subjective information from multiple modalities to analyze sentiment. Exploring how to effectively learn the interaction between modalities has always been an essential task in multimodal analysis. In recent research, it is found that the learning rate of different modalities is unbalanced, leading to the convergence of one modality while the rest of the modalities are under-fitting, which weakens the effect of multimodal collaborative decision-making. In order to combine multiple modalities more effectively and learn the multimodal sentiment features with rich expression, this study proposes a multimodal sentiment analysis method based on adaptive weight fusion. The method of adaptive weight fusion is divided into two phases. The first phase is to adaptively change the fusion weights of unimodal feature representations according to the difference of unimodal learning gradients to dynamically balance the modal learning rate. The study calls this phase balanced fusion (B-fusion). The second phase is to eliminate the impact of the fusion weights of B-fusion on task analysis, propose the modal attention to explore the contributions of modalities to the task, and dynamically allocate the fusion weight to each modality. The study calls this phase attention fusion (A-fusion). The experimental results show that the introduction of the B-fusion method into existing multimodal sentiment analysis methods can effectively improve the accuracy of sentiment analysis. The ablation experiment results show that adding the A-fusion method to B-fusion can effectively reduce the impact of B-fusion weights on the task, which is conducive to improving the analysis results of sentiment analysis. Compared with the existing multimodal sentiment analysis models, the proposed method has a simpler structure, lower computational consumption, and better task accuracy than these comparison models, which shows that the method has high efficiency and excellent performance in multimodal sentiment analysis tasks.
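    The second phase described above assigns each modality a weight before fusion. The sketch below shows a generic attention-weighted fusion over unimodal representations in that spirit: a small scorer produces one weight per modality, and the fused vector is their weighted sum. The feature dimension and the scoring network are illustrative assumptions, not the paper's A-fusion design.

```python
# Minimal sketch of attention-weighted fusion over unimodal representations.
# Dimensions and the scoring network are illustrative assumptions.
import torch
import torch.nn as nn

class ModalAttentionFusion(nn.Module):
    def __init__(self, dim=64):
        super().__init__()
        self.score = nn.Linear(dim, 1)       # one scalar score per modality

    def forward(self, feats):                # feats: (batch, n_modalities, dim)
        weights = torch.softmax(self.score(feats).squeeze(-1), dim=1)
        fused = (weights.unsqueeze(-1) * feats).sum(dim=1)
        return fused, weights

if __name__ == "__main__":
    text, audio, video = (torch.randn(4, 64) for _ in range(3))
    fusion = ModalAttentionFusion()
    fused, w = fusion(torch.stack([text, audio, video], dim=1))
    print(fused.shape, w[0])                 # (4, 64) and the per-modality weights
```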
    Available online:  September 27, 2023 , DOI: 10.13328/j.cnki.jos.006999
    Abstract:
    Revealing the complex relations among emotions is an important fundamental study in cognitive psychology. From the perspective of natural language processing, the key to exploring the relations among emotions lies in the embedding representation of emotion categories. Recently, there has been some interest in obtaining a category representation in the emotion space that can characterize emotion relations. However, the existing methods for emotion category representations have several drawbacks; for example, the dimensionality of the emotion category representation is fixed and depends on the selected dataset. In order to obtain better representations for the emotion categories, this study introduces a supervised contrastive learning representation method. In previous supervised contrastive learning, the similarity between samples depends on the similarity of the annotated labels of the samples. In order to better reflect the complex relations among different emotion categories, the study further proposes a partially similar supervised contrastive learning representation method, which suggests that samples of different emotion categories (e.g., anger and annoyance) may also be partially similar to each other. Finally, the study organizes a series of experiments to verify the ability of the proposed method and five other benchmark methods in representing the relationships between emotion categories. The experimental results show that the proposed method achieves satisfactory results for emotion category representations.
    Available online:  September 20, 2023 , DOI: 10.13328/j.cnki.jos.006955
    Abstract:
    The detection of the human respiration waveform in the sleep state is crucial for applications in intelligent health care as well as medical and healthcare in that different respiration waveform patterns can be examined to analyze sleep quality and monitor respiratory diseases. Traditional respiration sensing methods based on contact devices cause various inconveniences to users. In contrast, contactless sensing methods are more suitable for continuous monitoring. However, the randomness of the device deployment, sleep posture, and human motion during sleep severely restrict the application of contactless respiration sensing solutions in daily life. For this reason, the study proposes a detection method for the human respiration waveform in the sleep state based on impulse radio-ultra wide band (IR-UWB). On the basis of the periodic changes in the propagation path of the wireless pulse signal caused by the undulation of the human chest during respiration in the sleep state, the proposed method generates a fine-grained human respiration waveform and thereby achieves the real-time output of the respiration waveform and high-precision respiratory rate estimation. Specifically, to obtain the position of the human chest during respiration from the received wireless radio-frequency (RF) signals, this study proposes the indicator respiration energy ratio based on IR-UWB signals to estimate the target position. Then, it puts forward a vector projection method based on the in-phase/quadrature (I/Q) complex plane and a method of projection signal selection based on the circumferential position of the respiration vector to extract the characteristic human respiration waveform from the reflected signal. Finally, a variational encoder-decoder network is leveraged to achieve the fine-grained recovery of the respiratory waveform in the sleep state. Extensive experiments and tests are conducted under different conditions, and the results show that the human respiration waveforms monitored by the proposed method in the sleep state are highly similar to the actual waveforms captured by commercial respiratory belts. The average error of the proposed method in estimating the human respiratory rate is 0.229 bpm, indicating that the method can achieve high-precision detection of the human respiration waveform in the sleep state.
    Available online:  September 20, 2023 , DOI: 10.13328/j.cnki.jos.006956
    Abstract:
    It is essential to detect samples that fall outside the training set distribution (out-of-distribution, OOD) for a safe and reliable machine learning system. Likelihood-based generative models are popular methods to detect OOD samples because they do not require sample labels during training. However, recent studies show that likelihoods sometimes fail to detect OOD samples, and the failure reasons and solutions are underexplored, especially for text data. Therefore, this study investigates the failure reasons for text from the perspectives of the model and the data: insufficient generalization of the generative model and prior probability bias of the text. To tackle the above problems, the study proposes a new OOD text detection method, namely Pobe. To address the insufficient generalization of the generative model, the study increases model generalization via KNN retrieval. Next, to address the prior probability bias of the text, the study designs a strategy to calibrate the bias and mitigate the influence of the probability bias on OOD detection with a pre-trained language model, and it demonstrates the effectiveness of the strategy according to Bayes' theorem. Experimental results over a wide range of datasets show the effectiveness of the proposed method. Specifically, the average AUROC is over 99%, and the FPR95 is below 1% across eight datasets.
    Available online:  September 13, 2023 , DOI: 10.13328/j.cnki.jos.006948
    Abstract:
    Forgetting is the biggest problem of artificial neural networks in incremental learning and is thus called "catastrophic forgetting". In contrast, humans can continuously acquire new knowledge and retain most of the frequently used old knowledge. This continuous "incremental learning" ability of humans without extensive forgetting is related to the partitioned learning structure and memory replay ability of the human brain. To simulate this structure and ability, the study proposes an incremental learning approach of "recency bias-avoiding self-learning mask (SLM)-based partitioned incremental learning", or ASPIL for short. ASPIL involves the two stages of regional isolation and regional integration, which are alternately iterated to accomplish continuous incremental learning. Specifically, this study proposes the "Bayesian network (BN)-based sparse regional isolation" method to isolate the new learning process from the existing knowledge and thereby avoid interference with the existing knowledge. For regional integration, SLM and dual-branch fusion (GBF) methods are proposed. The SLM method accurately extracts new knowledge and improves the adaptability of the network to new knowledge, while the GBF method integrates the old and new knowledge to achieve unified and high-precision cognition. During training, a regularization term for the margin loss is proposed to avoid the "recency bias", thereby further balancing the old knowledge and avoiding the bias towards the new knowledge. To evaluate the effectiveness of the proposed method, this study also presents systematic ablation experiments performed on the standard incremental learning datasets CIFAR-100 and miniImageNet and compares the proposed method with a series of well-known state-of-the-art methods. The experimental results show that the method proposed in this study improves the memory ability of the artificial neural network and outperforms the latest well-known methods by more than 5.27% in average identification rate.
    Available online:  September 13, 2023 , DOI: 10.13328/j.cnki.jos.006949
    Abstract:
    Deep neural networks can be affected by well-designed backdoor attacks during training. Such attacks are an attack method that controls the model output at test time by injecting data carrying backdoor triggers into the training set. The attacked model performs normally on a clean test set, but inputs in which the backdoor trigger is recognized are misclassified as the attack target class. The currently available backdoor attack methods have poor invisibility, and their attack success rates can still be improved. A backdoor attack method based on singular value decomposition is proposed to address the above limitations. The proposed method can be implemented in two ways: one is to directly set some singular values of the image to zero, so that the resulting image is compressed to a certain extent and can be used as an effective backdoor trigger; the other is to inject the singular vector information of the attack target class into the left and right singular vectors of the image, which can also achieve an effective backdoor attack. The backdoor images obtained in both ways are visually almost identical to the original images. According to the experiments, the proposed method proves that singular value decomposition can be effectively leveraged in backdoor attack algorithms to attack neural networks with considerably high success rates on multiple datasets.
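    The first trigger construction, zeroing part of the singular value spectrum so that the reconstructed image is slightly compressed but visually close to the original, is easy to sketch with NumPy. The retained rank below is an illustrative assumption, not the setting used in the paper.

```python
# Minimal sketch of the first trigger construction: zero out the smallest
# singular values of an image so the reconstruction is slightly compressed
# but visually close to the original. The retained rank is an assumption.
import numpy as np

def svd_truncate(img, keep):
    """img: 2D grayscale array; keep: number of leading singular values kept."""
    U, s, Vt = np.linalg.svd(img, full_matrices=False)
    s[keep:] = 0.0                         # drop the tail of the spectrum
    return (U * s) @ Vt                    # equals U @ diag(s) @ Vt

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    img = rng.random((32, 32))
    trigger = svd_truncate(img, keep=8)
    print(np.abs(img - trigger).mean())    # small reconstruction error
```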
    Available online:  September 13, 2023 , DOI: 10.13328/j.cnki.jos.006959
    Abstract:
    The openness and ease of use of Python make it one of the most commonly used programming languages. The PyPI ecosystem formed by Python not only provides convenience for developers but also becomes an important target for attackers to launch vulnerability attacks. Thus, after discovering Python vulnerabilities, it is critical to deal with them by accurately and comprehensively assessing their impact scope. However, the current assessment methods of Python vulnerability impact scope mainly rely on package-granularity dependency analysis, which produces a large number of false positives. On the other hand, existing function-granularity Python program analysis methods have accuracy problems due to context insensitivity and produce false positives when applied to assess the impact scope of vulnerabilities. This study proposes a vulnerability impact scope assessment method for the PyPI ecosystem based on static analysis, namely PyVul++. First, it builds an index of the PyPI ecosystem, then finds the candidate packages affected by a vulnerability through vulnerable function identification, and confirms the vulnerable packages through vulnerability trigger conditions. PyVul++ realizes vulnerability impact scope assessment at function granularity, improves function-granularity call analysis for Python code, and outperforms other tools on the PyCG benchmark (accuracy of 86.71% and recall of 83.20%). PyVul++ is used to assess the impact scope of 10 Python CVE vulnerabilities on the PyPI ecosystem (385,855 packages) and finds more vulnerable packages and fewer false positives compared with other tools such as pip-audit. In addition, in the 10 assessment experiments of Python CVE vulnerability impact scope, PyVul++ newly finds that 11 packages in the current PyPI ecosystem still have security issues of referencing unpatched vulnerable functions.
    Available online:  September 13, 2023 , DOI: 10.13328/j.cnki.jos.006964
    Abstract:
    Domain names play an important role in cybercrimes. Existing malicious domain name detection methods not only have difficulty exploiting rich topology and attribute information but also require a large amount of labeled data, resulting in limited detection effectiveness and high costs. To address this problem, this study proposes a malicious domain name detection method based on graph contrastive learning. Domain names and IP addresses are taken as two types of nodes in a heterogeneous graph, and the feature matrices of the corresponding nodes are established according to their attributes. Three types of meta-paths are constructed based on the inclusion relationship between domain names, the measure of similarity, and the correspondence between domain names and IP addresses. In the pre-training stage, a contrastive learning model based on an asymmetric encoder is applied to avoid the damage to graph structure and semantics caused by graph data augmentation operations and to reduce the demand for computing resources. By using the inductive graph neural network encoders HeteroSAGE and HeteroGAT, a node-centric mini-batch training strategy is adopted to explore the aggregation relationship between the target node and its neighbor nodes, which solves the problem of the poor applicability of transductive graph neural networks such as GCN in dynamic scenarios. The downstream classification task then uses logistic regression and random forest algorithms for comparison. Experimental results on publicly available datasets show that detection performance is improved by two to six percentage points compared with that of related works.
    Available online:  September 13, 2023 , DOI: 10.13328/j.cnki.jos.006928
    Abstract:
    Detecting out-of-distribution (OOD) samples outside the training set distribution is crucial for deploying deep neural network (DNN) classifiers in the open environment. OOD sample detection is a binary classification problem, which is to classify the input samples into the in-distribution (ID) or OOD categories. However, the detector itself can in turn be bypassed by malicious adversarial attacks. These OOD samples with malicious perturbations are called adversarial OOD samples. Building robust OOD detectors to detect adversarial OOD samples is more challenging. Existing methods usually train DNNs on adversarial OOD samples generated within the neighborhood of auxiliary clean OOD samples to learn representations that are separable and robust to malicious perturbations. However, due to the distributional differences between the auxiliary OOD training set and the original ID training set, training on adversarial OOD samples is not effective enough to ensure the robustness of the ID boundary against adversarial perturbations. Adversarial ID samples generated from within the neighborhood of (clean) ID samples are closer to the ID boundary and are also effective in improving the adversarial robustness of the ID boundary. This study proposes a semi-supervised adversarial training approach, DiTing, to build robust OOD detectors to detect clean and adversarial OOD samples. This approach treats the adversarial ID samples as auxiliary near-OOD samples and trains them jointly with other auxiliary clean and adversarial OOD samples to improve the robustness of OOD detection. Experiments show that DiTing has a significant advantage in detecting adversarial OOD samples generated by strong attacks while maintaining state-of-the-art performance in classifying clean ID samples and detecting clean OOD samples. Code is available at https://gitee.com/zhiyang3344/diting.
    Available online:  September 13, 2023 , DOI: 10.13328/j.cnki.jos.006945
    Abstract:
    Jacobi computation is a kind of stencil computation that has been widely applied in the field of scientific computing. The performance optimization of Jacobi computation is a classic topic, where loop tiling is an effective optimization method. The existing loop tiling methods mainly focus on the impact of tiling on parallel communication and program locality and fail to consider other factors such as load balancing and vectorization. This study analyzes and compares several tiling methods on a multi-core computing architecture and chooses an advanced hexagonal tiling as the main method to accelerate Jacobi computation. For tile size selection, this study proposes a hexagonal tile size selection algorithm called Hexagon_TSS, which comprehensively considers the impact of tiling on load balancing, vectorization efficiency, and locality. The experimental results show that in the best case, the L1 data cache miss rate can be reduced to 5.46% of that of the original serial computation by Hexagon_TSS, and the maximum speedup reaches 24.48. The proposed method also has excellent scalability.
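    To make the computation pattern concrete, the sketch below performs 2D Jacobi sweeps with simple rectangular spatial tiling over the grid. This is only a baseline illustration; the hexagonal time-space tiling and the Hexagon_TSS size selection described above are considerably more involved and are not reproduced here.

```python
# Baseline sketch of a 2D Jacobi sweep with simple rectangular spatial tiling,
# only to illustrate the stencil pattern. The hexagonal tiling of the paper
# is not reproduced; tile size and grid size are illustrative assumptions.
import numpy as np

def jacobi_step_tiled(u, tile=64):
    n, m = u.shape
    out = u.copy()
    for bi in range(1, n - 1, tile):
        for bj in range(1, m - 1, tile):
            i1, j1 = min(bi + tile, n - 1), min(bj + tile, m - 1)
            out[bi:i1, bj:j1] = 0.25 * (
                u[bi - 1:i1 - 1, bj:j1] + u[bi + 1:i1 + 1, bj:j1] +
                u[bi:i1, bj - 1:j1 - 1] + u[bi:i1, bj + 1:j1 + 1])
    return out

if __name__ == "__main__":
    grid = np.zeros((256, 256))
    grid[0, :] = 1.0                    # hot top boundary
    for _ in range(10):
        grid = jacobi_step_tiled(grid)  # repeated sweeps diffuse the heat
    print(round(float(grid[1, 128]), 4))
```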
    Available online:  September 13, 2023 , DOI: 10.13328/j.cnki.jos.006947
    Abstract:
    Software change prediction, aimed at identifying change-prone modules, can help software managers and developers allocate resources efficiently and reduce maintenance overhead. Extracting effective features from the code plays a vital role in building accurate prediction models. In recent years, researchers have shifted from traditional hand-crafted features to semantic features with powerful representation capabilities, extracting semantic features from abstract syntax tree (AST) node sequences to build prediction models. However, existing studies have ignored the structural information in the AST and the rich semantic information in the code. How to extract the semantic features of the code is still a challenging problem. For this reason, the study proposes a change prediction method based on hybrid graph representation. To start with, the model combines the AST, control flow graph (CFG), data flow graph (DFG), and other structural information to construct a program graph representation of the code. Then, it uses a graph neural network to learn semantic features from the program graph and uses the obtained features to predict change-proneness. The model can integrate various kinds of semantic information to represent the code better. The effectiveness of the proposed method is verified by comparing it with the latest change prediction methods on various change datasets.
    Available online:  September 06, 2023 , DOI: 10.13328/j.cnki.jos.006963
    Abstract:
    Aspect-level sentiment classification task, which aims to determine the sentiment polarity of a given aspect, has attracted increasing attention due to its broad applications. The key to this task is to identify contextual descriptions relevant to the given aspect and predict the aspect-related sentiment orientation of the author according to the context. Statistically, it is found that close to 30% of reviews convey a clear sentiment orientation without any explicit sentiment description of the given aspect, which is called implicit sentiment expression. Recent attention mechanism-based neural network methods have gained great achievement in sentiment analysis. However, this kind of method can only capture explicit aspect-related sentiment descriptions but fails to effectively explore and analyze implicit sentiment, and it often models aspect words and sentence contexts separately, which makes the expression of aspect words lack contextual semantics. To solve the above two problems, this study proposes an aspect-level sentiment classification method that integrates local aspect information and global sentence context information and improves the classification performance of the model by curriculum learning according to different classification difficulties of implicit and explicit sentiment sentences. Experimental results show that the proposed method not only has a high accuracy in identifying the aspect-related sentiment orientation of explicit sentiment sentences but also can effectively learn the sentiment categories of implicit sentiment sentences.
    Available online:  September 06, 2023 , DOI: 10.13328/j.cnki.jos.006960
    Abstract:
    Thanks to the low storage cost and high retrieval speed, graph-based unsupervised cross-modal hash learning has attracted much attention from academic and industrial researchers and has been an indispensable tool for cross-modal retrieval. However, the high computational complexity of graph structures prevents its application in large-scale multi-modal applications. This study mainly attempts to solve two important challenges facing graph-based unsupervised cross-modal hash learning: 1) How to efficiently construct graphs in unsupervised cross-modal hash learning? 2) How to handle the discrete optimization in cross-modal hash learning? To address such two problems, this study presents anchor-based cross-modal learning and a differentiable hash layer. To be specific, the study first randomly samples some image-text pairs from the training set as anchor sets and uses the anchor sets as the agent to compute the graph matrix of each batch of data. The graph matrix is used to guide cross-modal hash learning, thus remarkably reducing the space and time cost; second, the proposed differentiable hash layer directly adopts binary coding for computation during network forward propagation and produces gradient to update the network without continuous-value relaxation during backpropagation, thus embracing better hash encoding performance. Finally, the study introduces cross-modal ranking loss to consider the ranking results in the training process and improve the cross-modal retrieval accuracy. To verify the effectiveness of the proposed algorithm, the study compares the algorithm with 10 cross-modal hash algorithms on three general data sets.
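    The differentiable hash layer described above outputs binary codes in the forward pass while still letting gradients update the network in the backward pass. The sketch below shows a common straight-through formulation of that behavior; it is an illustrative stand-in, and the paper's actual layer may differ in detail.

```python
# Minimal sketch of a hash layer that emits binary codes {-1, +1} in the
# forward pass and passes gradients straight through in the backward pass
# (a common straight-through estimator; details may differ from the paper).
import torch

class SignSTE(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x):
        return torch.sign(x)            # binary code used during training

    @staticmethod
    def backward(ctx, grad_output):
        return grad_output              # identity gradient, no relaxation

def hash_layer(x):
    return SignSTE.apply(x)

if __name__ == "__main__":
    feats = torch.randn(4, 16, requires_grad=True)   # modality features
    codes = hash_layer(feats)
    codes.sum().backward()
    print(codes[0], feats.grad[0][:4])               # codes are +/-1, grads flow
```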
    Available online:  September 06, 2023 , DOI: 10.13328/j.cnki.jos.006969
    Abstract:
    As an essential component of real-time system design, priority is utilized to resolve conflicts in resource sharing and to design for safety. In real-time systems that introduce priorities, each task is assigned a priority, so low-priority tasks may be preempted by high-priority tasks at runtime, which creates the preemptive scheduling problem for real-time systems. Existing research on this problem lacks a modeling and automatic verification method that can visually represent the priorities of tasks and the dependencies between tasks. To this end, preemptive priority timed automata (PPTA) are proposed, and the preemptive priority timed automata network (PPTAN) is introduced. First, the priority of a task is represented by adding priorities to the transitions of the timed automaton, and transitions are then used to correlate tasks with dependencies, so that PPTA can be applied to model real-time tasks with priorities. Blocking locations are also added to the timed automaton, so PPTAN can be used to model the priority preemptive scheduling problem. Second, a model-based transformation method is proposed to map PPTA to the automatic verification tool UPPAAL. Finally, by modeling an example of a multi-core multi-task real-time system and comparing it with other models, it is shown that this model is not only suitable for modeling the priority preemptive scheduling problem but also capable of accurately verifying and analyzing it.
    Available online:  September 06, 2023 , DOI: 10.13328/j.cnki.jos.006979
    Abstract:
    When prototypical networks are directly applied to few-shot named entity recognition (FEW-NER), there are the following problems: non-entities do not have strong semantic relationships with each other, and constructing prototypes for entities and non-entities in the same way makes non-entity prototypes fail to accurately represent the semantic characteristics of non-entities; moreover, using only the average entity vector to compute the prototype makes it difficult to capture similar entities with different semantic features. To address these problems, this study proposes a FEW-NER method based on fine-grained prototypical networks (FNFP) to improve the annotation performance of FEW-NER. Firstly, different non-entity prototypes are constructed for different query sets to capture the key semantic features of non-entities in sentences and obtain finer-grained prototypes, thereby improving the recognition of non-entities. Then, an inconsistency metric module is designed to measure the inconsistency between similar entities, and different metric functions are applied to entities and non-entities, so as to reduce the feature representation between similar samples and improve the feature representation of the prototype. Finally, a Viterbi decoder is introduced to capture the label transition relationships and optimize the final annotation sequence. The experimental results show that the performance of the proposed method is improved on the large-scale few-shot NER dataset FEW-NERD, and the generalization ability of the method in different domain scenarios is verified on a cross-domain dataset.
    Available online:  September 06, 2023 , DOI: 10.13328/j.cnki.jos.006980
    Abstract:
    A large number of bug reports are generated during software development and maintenance, which can help developers locate bugs. Information retrieval based bug localization (IRBL) analyzes the similarity between bug reports and source code files to locate bugs, achieving high accuracy at the file and function levels. However, considerable labor and time costs are still required to find bugs in suspicious files and function fragments due to the coarse localization granularity of IRBL. This study proposes a statement-level software bug localization method based on historical bug information retrieval, namely STMTLocator. Firstly, it retrieves historical bug reports that are similar to the bug report of the program under test and extracts the bug statements from the historical bug reports. Then, it retrieves suspicious files according to the text similarity between the source code files and the bug report of the program under test, and extracts suspicious statements from the suspicious files. Finally, it calculates the similarity between the suspicious statements and the historical bug statements, and ranks them in descending order to localize bug statements. To evaluate the bug localization performance of STMTLocator, comparative experiments are conducted on the Defects4J and JIRA datasets with Top@N, MRR, and other evaluation metrics. The experimental results show that STMTLocator achieves nearly four times higher MRR than the static bug localization method BugLocator and locates 7 more bug statements at Top@1. The average time used by STMTLocator to locate a buggy version is reduced by 98.37% and 63.41% compared with the dynamic bug localization methods Metallaxis and DStar, respectively, and STMTLocator has the significant advantage of not requiring the construction and execution of test cases.
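    The ranking step described above can be pictured with a small scikit-learn sketch (illustrative only, not the STMTLocator implementation): suspicious statements are scored by their TF-IDF cosine similarity to historical bug statements and sorted in descending order. The statements below are fabricated.

```python
# Score suspicious statements against historical bug statements and rank them.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

historical_bug_statements = [
    "if (index <= array.length)",        # off-by-one from an old report
    "return value / divisor",            # division without zero check
]
suspicious_statements = [
    "result = total / count",
    "for (int i = 0; i <= items.length; i++)",
    "log.info(\"request received\")",
]

vectorizer = TfidfVectorizer().fit(historical_bug_statements + suspicious_statements)
hist = vectorizer.transform(historical_bug_statements)
susp = vectorizer.transform(suspicious_statements)

# each suspicious statement is scored by its best match among historical bug statements
scores = cosine_similarity(susp, hist).max(axis=1)
for score, stmt in sorted(zip(scores, suspicious_statements), reverse=True):
    print(f"{score:.3f}  {stmt}")
```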
    Available online:  August 30, 2023 , DOI: 10.13328/j.cnki.jos.006961
    Abstract:
    Fault localization collects and analyzes the runtime information of test case sets to evaluate the suspiciousness of each statement of being faulty. Test case sets are constructed from the data of the input domain and contain two types of test cases, i.e., passing test cases and failing ones. Since failing test cases generally account for a very small portion of the input domain, and their distribution is usually random, the number of failing test cases is much smaller than that of passing ones. Previous work has shown that the lack of failing test cases leads to a class-imbalanced problem of test case sets, which severely hampers fault localization effectiveness. To address this problem, this study proposes a model-domain data augmentation approach using generative adversarial networks for fault localization. Based on the model domain (i.e., the spectrum information of fault localization) rather than the traditional input domain (i.e., program inputs), this approach uses a generative adversarial network to synthesize model-domain failing test cases covering the minimum suspicious set, so as to address the class-imbalanced problem from the model domain. The experimental results show that the proposed approach significantly improves the effectiveness of 12 representative fault localization approaches.
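    A rough sketch of this idea, assuming a plain GAN in PyTorch with made-up dimensions and data, would train a generator on the coverage vectors (spectra) of the scarce failing test cases and then sample synthetic failing spectra to rebalance the test suite; the paper's actual architecture and constraints (e.g., covering the minimum suspicious set) are not reproduced here.

```python
# Toy GAN over statement-coverage vectors of failing test cases.
import torch
import torch.nn as nn

n_stmts, latent = 32, 8
G = nn.Sequential(nn.Linear(latent, 64), nn.ReLU(), nn.Linear(64, n_stmts), nn.Sigmoid())
D = nn.Sequential(nn.Linear(n_stmts, 64), nn.ReLU(), nn.Linear(64, 1), nn.Sigmoid())
opt_g = torch.optim.Adam(G.parameters(), lr=1e-3)
opt_d = torch.optim.Adam(D.parameters(), lr=1e-3)
bce = nn.BCELoss()

real_failing = (torch.rand(16, n_stmts) > 0.5).float()   # stand-in failing spectra

for _ in range(200):
    # discriminator step: real failing spectra vs. generated ones
    fake = G(torch.randn(16, latent)).detach()
    d_loss = bce(D(real_failing), torch.ones(16, 1)) + bce(D(fake), torch.zeros(16, 1))
    opt_d.zero_grad(); d_loss.backward(); opt_d.step()
    # generator step: fool the discriminator
    fake = G(torch.randn(16, latent))
    g_loss = bce(D(fake), torch.ones(16, 1))
    opt_g.zero_grad(); g_loss.backward(); opt_g.step()

synthetic_failing = (G(torch.randn(8, latent)) > 0.5).int()  # new model-domain failing cases
```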
    Available online:  August 30, 2023 , DOI: 10.13328/j.cnki.jos.006962
    Abstract:
    With the rapid development of Internet information technologies, the explosive growth of online learning resources has caused the problem of “information overload” and “learning disorientation”. In the absence of expert guidance, it is difficult for users to identify their learning demands and select the appropriate content from the vast amount of learning resources. Educational domain recommendation methods have received a lot of attention from researchers in recent years because they can provide personalized recommendations of learning resources based on the historical learning behaviors of users. However, the existing educational domain recommendation methods ignore the modeling of complex relationships among knowledge points in learning demand perception and fail to consider the dynamic changes of users’ learning demands, which leads to inaccurate learning resource recommendations. To address the above problems, this study proposes a knowledge point recommendation method based on static and dynamic learning demand perception, which models users’ learning behaviors under complex knowledge association by combining static perception and dynamic perception. For static learning demand perception, this study innovatively designs an attentional graph convolutional network based on the first-course-following meta-path guidance of knowledge points, which can accurately capture users’ static learning demands at the fine-grained knowledge point level by modeling the complex constraints of the first-course-following relationship between knowledge points and eliminating the interference of other non-learning demand factors. For dynamic learning demand perception, the method aggregates knowledge point embeddings to characterize users’ knowledge levels at different moments by taking courses as units and then uses a recurrent neural network to encode users’ knowledge level sequences, which can effectively explore the dynamic learning demands hidden in users’ knowledge level changes. Finally, this study fuses the obtained static and dynamic learning demands, models the compatibility between static and dynamic learning demands in the same framework, and promotes the complementarity of these two learning demands to achieve fine-grained and personalized knowledge point recommendations. Experiments show that the proposed method can effectively perceive users’ learning demands, provide personalized knowledge point recommendations on two publicly available datasets, and outperform the mainstream recommendation methods in terms of various evaluation metrics.
    Available online:  August 30, 2023 , DOI: 10.13328/j.cnki.jos.006923
    Abstract:
    Kernel heap vulnerability is currently one of the main threats to operating system security. User-space attackers can leak or modify sensitive kernel information, disrupt kernel control flow, and even gain root privilege by triggering a vulnerability. However, due to the rapid increase in the number and complexity of vulnerabilities, it often takes a long time from when a vulnerability is first reported to when the developer issues a patch, and kernel mitigation mechanisms currently adopted are usually steadily bypassed. Therefore, this study proposes an eBPF-based dynamic mitigation framework for kernel heap vulnerabilities, so as to reduce kernel security risks during the time window fixing. The framework adopts data object space randomization to assign random addresses to the data objects involved in vulnerability reports at each allocation. In addition, it takes full advantage of the dynamic and secure features of eBPF to inject space-randomized objects into the kernel during runtime, so the attacker cannot place any attack payload accurately, and the heap vulnerabilities are almost unexploitable. This study evaluates 40 real kernel heap vulnerabilities and collects 12 attacks that bypass the existing mitigation mechanisms for further analysis and tests. As a result, it verifies that the dynamic mitigation framework provides sufficient security. Performance tests show that even under severe conditions, the four types of data objects only cause performance loss of about 1% and negligible memory loss to the system, and there is almost no additional performance loss when the number of protected objects increases. Compared with related work, the mechanism in this study has a wider scope of application and stronger security, and it does not require vulnerability patches issued by security experts. Furthermore, it can generate mitigation procedures according to vulnerability reports and has a broad application prospect.
    Available online:  August 30, 2023 , DOI: 10.13328/j.cnki.jos.006925
    Abstract:
    Regular expressions are widely used in various areas of computer science. However, due to their complex syntax and the use of a large number of meta-characters, regular expressions are quite error-prone when defined and used by developers. Testing is a practical and effective way to ensure the semantic correctness of regular expressions. The most common method is to generate a set of character strings according to the tested expression and check whether they comply with the intended language. Most existing test data generation approaches focus only on positive strings. However, empirical studies show that a majority of errors during actual development are manifested by the fact that the defined language is smaller than the intended one. In addition, such errors can only be detected by negative strings. This study investigates the generation of negative strings from regular expressions based on mutation. The study first obtains a set of mutants by injecting defects into the tested expression through mutation and then selects a negative character string in the complementary set of the language defined by the tested expression to reveal the error simulated by the corresponding mutant. In order to simulate complex defects and avoid the problem that negative strings cannot be obtained due to the specialization of mutants, a second-order mutation mechanism is adopted. Meanwhile, optimization techniques such as redundant mutant elimination and mutation operator selection are used to reduce the number of mutants, so as to control the size of the finally generated test set. The experimental results show that the proposed algorithm can generate negative test strings of a moderate size with strong error detection ability compared with existing tools.
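    A toy illustration of this mutation-based selection of negative strings (much simpler than the paper's algorithm, with a made-up mutation operator and random search) is shown below: the selected string is negative for the tested expression yet accepted by the mutant, so it reveals the simulated defect.

```python
# Find a string that the mutant accepts but the tested expression rejects.
import random
import re
import string

original = r"[a-z]+\d{2}"          # regex under test
mutant = r"[a-z]*\d{2}"            # mutation: '+' weakened to '*'

def random_string(max_len=6, alphabet=string.ascii_lowercase + string.digits):
    return "".join(random.choice(alphabet) for _ in range(random.randint(1, max_len)))

def negative_string(original, mutant, trials=10000):
    orig_re, mut_re = re.compile(original), re.compile(mutant)
    for _ in range(trials):
        s = random_string()
        # accepted by the mutant, rejected by the tested expression
        if mut_re.fullmatch(s) and not orig_re.fullmatch(s):
            return s
    return None

print(negative_string(original, mutant))   # e.g. "07": digits with no leading letters
```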
    Available online:  August 23, 2023 , DOI: 10.13328/j.cnki.jos.006926
    Abstract:
    Hypergraphs are generalized representations of ordinary graphs, which are common in many application areas, including the Internet, bioinformatics, and social networks. The independent set problem is a fundamental research problem in the field of graph analysis. Most of the traditional independent set algorithms are targeted for ordinary graph data, and how to achieve efficient maximum independent set mining on hypergraph data is an urgent problem to be solved. To address this problem, this study proposes a definition of hypergraph independent sets. Firstly, two properties of hypergraph independent set search are analyzed, and then a basic algorithm based on the greedy strategy is proposed. Then a pruning framework for hypergraph approximate maximum independent set search is proposed, i.e., a combination of exact pruning and approximate pruning, which reduces the size of the graph by the exact pruning strategy and speeds up the search by the approximate pruning strategy. In addition, four efficient pruning strategies are proposed in this study, and a theoretical proof of each pruning strategy is presented. Finally, experiments are conducted on 10 real hypergraph data sets, and the results show that the pruning algorithm can efficiently search for hypergraph maximum independent sets that are closer to the real results.
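    For intuition, a toy greedy search is sketched below. The paper introduces its own definition of hypergraph independent sets, so this sketch assumes the common "weak" notion that no hyperedge may be entirely contained in the chosen vertex set; vertices are considered in ascending-degree order.

```python
# Greedy baseline for a (weak) hypergraph independent set.
def greedy_independent_set(vertices, hyperedges):
    degree = {v: sum(v in e for e in hyperedges) for v in vertices}
    chosen = set()
    for v in sorted(vertices, key=lambda v: degree[v]):
        candidate = chosen | {v}
        # reject v if adding it would fully contain some hyperedge
        if all(not e.issubset(candidate) for e in hyperedges):
            chosen.add(v)
    return chosen

vertices = {1, 2, 3, 4, 5}
hyperedges = [{1, 2, 3}, {3, 4}, {4, 5}]
print(greedy_independent_set(vertices, hyperedges))   # e.g. {1, 2, 5}
```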
    Available online:  August 23, 2023 , DOI: 10.13328/j.cnki.jos.006927
    Abstract:
    Entity recognition is a key technology for information extraction. Compared with ordinary text, Chinese medical text often contains a large number of nested entities. Previous entity recognition methods often ignore the entity nesting rules unique to medical text and directly use sequence labeling methods. Therefore, a Chinese entity recognition method that incorporates entity nesting rules is proposed. This method transforms the entity recognition task into a joint training task of entity boundary recognition and boundary head-tail relationship recognition in the training process and filters the results by combining the entity nesting rules summarized from actual medical text in the decoding process. In this way, the recognition results are in line with the composition law of the nested combinations of inner and outer entities in the actual text. Good results have been achieved in public experiments on entity recognition of medical text. Experiments on the dataset show that the proposed method is significantly superior to the existing methods in terms of nested entity recognition performance, and the overall accuracy is increased by 0.5% compared with the state-of-the-art methods.
    Available online:  August 23, 2023 , DOI: 10.13328/j.cnki.jos.006929
    Abstract:
    As a privacy-preserving digital identity authentication technology, anonymous credentials not only authenticate the validity of the users’ digital identity but also protect the privacy of their identity. Anonymous credentials are widely applied in anonymous authentication, anonymous tokens, and decentralized digital identity systems. Existing anonymous credentials usually adopt the commitment-signature-proof paradigm, which requires that the adopted signature scheme should have the re-randomization property, such as CL signatures, PS signatures, and structure-preserving signatures (SPS). In practical applications, ECDSA, Schnorr, and SM2 are widely employed for digital identity authentication, but they lack the protection of user identity privacy. Therefore, it is of certain practical significance to construct anonymous credentials compatible with ECDSA, Schnorr, SM2, and other digital signatures, and protect identity privacy during the authentication. This study explores anonymous credentials based on SM2 digital signature. Pedersen commitment is utilized to commit the user attributes in the registration phase. Meanwhile, according to the structural characteristics of SM2, the signed message is H(m), and the equivalence between the Pedersen commitment message and the hash commitment message is proven. This study also employs ZKB++ technology to prove the equivalence of algebraic and non-algebraic statements. The commitment message is transformed to achieve the cross-domain proof and issue the users’ credentials based on the SM2 digital signature. In the showing phase of anonymous credentials, the zero-knowledge proof is combined to prove the possession of an SM2 signature and ensure the anonymity of credentials. This study provides the construction of an anonymous credential protocol based on SM2 digital signature and proves the security of this protocol. Finally, it also verifies the effectiveness and feasibility of the protocol by analyzing the computational complexity of the protocol and testing the algorithm execution efficiency.
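    To make the commitment step above concrete, here is a toy Pedersen commitment over a small prime-order subgroup (for illustration only; a real scheme would use the SM2 elliptic-curve group with secure parameters and a generator h whose discrete logarithm with respect to g is unknown).

```python
# Commit(m, r) = g^m * h^r mod p: hiding (random r) and binding (unknown dlog of h).
import secrets

p, q = 23, 11          # toy group: p = 2q + 1, the quadratic-residue subgroup has order q
g, h = 4, 9            # two generators of that subgroup (toy parameters)

def commit(m, r=None):
    r = secrets.randbelow(q) if r is None else r
    c = (pow(g, m % q, p) * pow(h, r, p)) % p
    return c, r

def open_commitment(c, m, r):
    return c == (pow(g, m % q, p) * pow(h, r, p)) % p

c, r = commit(7)                       # commit to attribute value 7
print(open_commitment(c, 7, r))        # True
print(open_commitment(c, 8, r))        # False: cannot open to a different value
```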
    Available online:  August 23, 2023 , DOI: 10.13328/j.cnki.jos.006930
    Abstract:
    Since the Snowden revelations, threats from backdoor attacks represented by the algorithm substitution attack (ASA) have attracted widespread concern. This kind of attack subverts the algorithms of cryptographic protocol participants in an undetectable manner and embeds backdoors to obtain secrets. Building a cryptographic reverse firewall (CRF) for protocol participants is a well-known and feasible approach against ASA. Identity-based encryption (IBE), as a widely applicable public key infrastructure, is of vital importance and should be protected by appropriate CRF schemes. However, the existing work only realizes CRF re-randomization, ignoring the security risk of sending users’ private keys directly to the third-party CRF. Given the above problem, the formal definitions and security models of the security properties of CRF applicable to IBE are proposed. Then, the formal definition of rerandomizable and key-malleable secure channel free IBE (RKM-SCF-IBE) and the method of transforming traditional IBE into RKM-SCF-IBE are presented. In addition, an approach to increasing anonymity is also given. Finally, a generic provably secure framework of CRF construction for IBE is proposed based on RKM-SCF-IBE, with several instantiations from classic IBE schemes in the standard model and simulation results with optimization methods. Compared with existing work, the proposed scheme is proven secure under a more complete security model with a generic approach to building CRF for IBE schemes, and it clarifies the basic principles of constructing CRF for more expressive encryption schemes.
    Available online:  August 23, 2023 , DOI: 10.13328/j.cnki.jos.006931
    Abstract:
    Accurately extracting two types of information, namely elements and clauses, from contract texts can effectively improve contract review efficiency and provide facilitation services for all trading parties. However, current contract information extraction methods generally train single-task models to extract elements and clauses separately; they do not dig deep into the characteristics of contract texts and ignore the relevance among different tasks. Therefore, this study employs a deep neural network structure to study the correlation between the two tasks of element extraction and clause extraction and proposes a multitask learning method. Firstly, a primary multitask learning model is built for contract information extraction by combining the above two tasks. Then, the model is optimized, and an attention mechanism is adopted to further explore the correlation, yielding an attention-based dynamic multitask learning model. Finally, based on the above two models, a dynamic multitask learning model with lexical knowledge is proposed for the complex semantic environment in contract texts. The experimental results show that the method can fully capture the shared features among tasks and yield better information extraction results than the single-task model. It can handle nested entities among elements and clauses in contract texts and realize the joint extraction of contract elements and clauses. In addition, to verify the robustness of the proposed method, this study conducts experiments on public datasets in various fields, and the results show that the proposed method is superior to baseline methods.
    Available online:  August 23, 2023 , DOI: 10.13328/j.cnki.jos.006932
    Abstract:
    Adversarial texts are malicious samples that can cause deep learning classifiers to make errors. The adversary creates an adversarial text that can deceive the target model by adding subtle perturbations to the original text that are imperceptible to humans. The study of adversarial text generation methods can evaluate the robustness of deep neural networks and contribute to the subsequent robustness improvement of the model. Among the current adversarial text generation methods designed for Chinese text, few attack the robust Chinese BERT model as the target model. For Chinese text classification tasks, this study proposes an attack method against Chinese BERT, namely Chinese BERT Tricker. This method adopts a character-level word importance scoring method, namely important Chinese character positioning. Meanwhile, a word-level perturbation method for Chinese, based on the masked language model and two types of strategies, is designed to achieve the replacement of important words. Experimental results show that for text classification tasks, the proposed method can significantly reduce the classification accuracy of the Chinese BERT model to less than 40% on two real datasets, and it outperforms other baseline methods in terms of multiple attack performance metrics.
    Available online:  August 23, 2023 , DOI: 10.13328/j.cnki.jos.006936
    Abstract:
    As a new learning paradigm to solve the problem of label ambiguity, label distribution learning (LDL) has received much attention in recent years. To further improve the prediction performance of LDL, this study proposes an LDL based on deep forest and heterogeneous ensemble (LDLDF), which uses the cascade structure of deep forest to simulate deep learning models with multi-layer processing structure and combines multiple heterogeneous classifiers in the cascade layer to increase the diversity of ensemble. Compared with other existing LDL methods, LDLDF can process information layer by layer and learn better feature representations to mine rich semantic information in data, and it has better representation learning ability and generalization ability. In addition, by considering the degradation problem of deep models, LDLDF adopts a layer feature reuse mechanism to reduce the training error of the model, which effectively utilizes the prediction ability of each layer in the deep model. Sufficient experimental results show that LDLDF is superior to other methods.
    Available online:  August 23, 2023 , DOI: 10.13328/j.cnki.jos.006937
    Abstract:
    Object detection is widely used in various fields such as autonomous driving, industry, and medical care. Using the object detection algorithm to solve key tasks in different fields has gradually become the main method. However, the robustness of the object detection model based on deep learning is seriously insufficient under the attack of adversarial samples. It is easy to make the model prediction wrong by adding the adversarial samples constructed by small perturbations, which greatly limits the application of the object detection model in key security fields. In practical applications, the models are black-box models. Related research on black-box attacks against object detection models is relatively lacking, and there are many problems such as incomplete robustness evaluation, low attack success rate of black-box, and high resource consumption. To address the aforementioned issues, this study proposes a black-box object detection attack algorithm based on a generative adversarial network. The algorithm uses the generative network fused with an attention mechanism to output the adversarial perturbations and employs the alternative model loss and the category attention loss to optimize the generated network parameters, which can support two scenarios of target attack and vanish attack. A large number of experiments are conducted on the Pascal VOC and the MSCOCO datasets. The results demonstrate that the proposed method has a higher black-box transferable attack success rate and can perform transferable attacks between different datasets.
    Available online:  August 23, 2023 , DOI: 10.13328/j.cnki.jos.006941
    Abstract:
    The transport layer is a key component in the network protocol stack, which is responsible for providing end-to-end services for applications between different hosts. Existing transport layer protocols such as TCP provide users with some basic security protection mechanisms, e.g., error control and acknowledgment, which ensure the consistency of datagrams sent and received by applications between different hosts to a certain extent. However, these security protection mechanisms of the transport layer have serious flaws. For example, the sequence number of TCP datagrams is easy to guess and infer, and the calculation of the datagram checksum depends on the vulnerable one’s complement sum algorithm. As a result, the existing transport layer security mechanisms cannot guarantee the integrity and security of datagrams, which allows a remote attacker to craft a fake datagram and inject it into the target network stream, thus poisoning the target network stream. Attacks against the transport layer occur at the basic layers of the network protocol stack, and they can bypass the security protection mechanisms enforced at the upper application layer and thus cause serious damage to the network infrastructure. After investigating various attacks over network protocols and the related security vulnerabilities in recent years, this study proposes a method for enhancing the security of the transport layer based on lightweight chain verification, namely LightCTL. Based on hash verification, LightCTL enables both sides of a TCP connection to create a mutually verifiable consensus on transport layer datagrams, so as to prevent attackers or middlemen from stealing and forging sensitive information. As a result, LightCTL can successfully foil various attacks against the network protocol stack, including TCP connection reset attacks based on sequence number inference, TCP hijacking attacks, SYN flooding attacks, man-in-the-middle attacks, and datagram replay attacks. Besides, LightCTL does not need to modify the protocol stack of intermediate network devices such as routers. It only needs to modify the checksum and the related parts of the end protocol stack. Therefore, LightCTL can be easily deployed and significantly improves the security of network systems.
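    A minimal sketch of the chained-verification idea (not the LightCTL protocol itself, whose checksum integration and key handling are omitted) might look as follows: each datagram carries a tag computed from the previous tag and the payload, so an injected or modified datagram breaks the chain at the receiver.

```python
# Hash-chained tags over a stream of datagram payloads.
import hashlib

def tag(prev_tag: bytes, payload: bytes) -> bytes:
    return hashlib.sha256(prev_tag + payload).digest()

def send_stream(payloads, seed=b"shared-initial-tag"):
    t, out = seed, []
    for p in payloads:
        t = tag(t, p)
        out.append((p, t))
    return out

def verify_stream(stream, seed=b"shared-initial-tag"):
    t = seed
    for payload, carried_tag in stream:
        t = tag(t, payload)
        if t != carried_tag:
            return False      # forged or tampered datagram breaks the chain
    return True

stream = send_stream([b"GET /", b"Host: example.com"])
print(verify_stream(stream))                                  # True
stream[1] = (b"Host: evil.com", stream[1][1])                 # attacker swaps a payload
print(verify_stream(stream))                                  # False
```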
    Available online:  August 23, 2023 , DOI: 10.13328/j.cnki.jos.006951
    Abstract:
    Fact verification is intended to check whether a textual statement is supported by a given piece of evidence. Due to the structural dependence and implicit content of tables, the task of fact verification with tables as the evidence still faces many challenges. Existing literature has either used logical expressions to parse statements based on tabular evidence or designed table-aware neural networks to encode statement-table pairs and thereby accomplish table-based fact verification tasks. However, these approaches fail to fully utilize the implicit tabular information behind the statements, which leads to the degraded inference performance of the model. Moreover, Chinese statements based on tabular evidence have more complex syntax and semantics, which also adds to the difficulties in model inference. For this reason, the study proposes a method of fact verification with Chinese tabular data based on the capsule heterogeneous graph attention network (CapsHAN). This method can fully understand the structure and semantics of statements. On this basis, the tabular information implied by the statements is mined and utilized to effectively improve the accuracy of table-based fact verification tasks. Specifically, a heterogeneous graph is constructed by performing syntactic dependency parsing and named entity recognition of statements. Subsequently, the graph is learned and understood by the heterogeneous graph attention network and the capsule graph neural network, and the obtained textual representation of the statements is sliced together with the textual representation of the encoded tables. Finally, the result is predicted. Further, this study also attempts to address the problem that the datasets of fact verification based on Chinese tables are scarce and thus unable to support the performance evaluation of table-based fact verification methods. For this purpose, the study transforms the mainstream English table-based fact verification datasets TABFACT and INFOTABS into Chinese and constructs a dataset that is based on the uniform content label (UCL) national standard and specifically tailored to the characteristics of Chinese tabular data. This dataset, namely, UCLDS, takes Wikipedia infoboxes as evidence of manually annotated natural language statements and labels them into three classes: entailed, contradictory, and neutral. UCLDS outperforms the traditional datasets TABFACT and INFOTABS in supporting both single-table and multi-table inference. The experimental results on the above three Chinese benchmark datasets show that the proposed model outperforms the baseline model invariably, demonstrating its superiority for Chinese table-based fact verification tasks.
    Available online:  August 23, 2023 , DOI: 10.13328/j.cnki.jos.006914
    Abstract:
    The training of high-precision federated learning models consumes a large number of users’ local resources. The users who participate in the training can gain illegal profits by selling the jointly trained model without others’ permission. In order to protect the property rights of federated learning models, this study proposes a federated learning watermark based on backdoor (FLWB) by using the feature that deep learning backdoor technology maintains the accuracy of main tasks and only causes misclassification in a small number of trigger set samples. FLWB allows users who participate in the training to embed their own private watermarks in the local model and then map the private backdoor watermarks to the global model through the model aggregation in the cloud as the global watermark for federated learning. Then a stepwise training method is designed to enhance the expression effect of private backdoor watermarks in the global model so that FLWB can accommodate the private watermarks of the users without affecting the accuracy of the global model. Theoretical analysis proves the security of FLWB, and experiments verify that the global model can effectively accommodate the private watermarks of the users who participate in the training by only causing an accuracy loss of 1% of the main tasks through the stepwise training method. Finally, FLWB is tested by model compression and fine-tuning attacks. The results show that more than 80% of the watermarks can be retained when the model is compressed to 30% by FLWB, and more than 90% of the watermarks can be retained under four different fine-tuning attacks, which indicates the excellent robustness of FLWB.
    Available online:  August 16, 2023 , DOI: 10.13328/j.cnki.jos.006904
    Abstract:
    Code review is one of the best practices widely used in modern software development, which is crucial for ensuring software quality and strengthening engineering capability. Code review comments (CRCs) are one of the main and most important outputs of code reviews. CRCs are not only the reviewers’ perceptions of code quality but also the references for authors to fix code defects and improve quality. Nowadays, although a number of software organizations have developed guidelines for performing code reviews, there are still few effective methods for evaluating the quality of CRCs. To provide an explainable and automated quality evaluation of CRCs, this study conducts a series of empirical studies such as literature reviews and case analyses. Based on the results of the empirical studies, the study proposes a multi-label learning-based approach for evaluating the quality of CRCs. Experiments are carried out by using a large software enterprise-specific dataset that includes a total of 17 000 CRCs from 34 commercial projects. The results indicate that the proposed approach can effectively evaluate the quality attributes and grades of CRCs. The study also provides some modeling experiences such as CRC labeling and verification, so as to help software organizations struggling with code reviews better implement the proposed approach.
    Available online:  August 09, 2023 , DOI: 10.13328/j.cnki.jos.006915
    Abstract:
    Driven by mature data mining technologies, the recommendation system has been able to efficiently utilize explicit and implicit information such as rating data and behavior traces and then combine this information with complex and advanced deep learning technologies to achieve sound results. Meanwhile, its application requirements also drive the in-depth mining and utilization of basic data and the reduction of technical requirements to become research hotspots. On this basis, a lightweight recommendation model, namely LG_APIF, is proposed, which uses the graph convolutional network (GCN) method to deeply integrate information. According to behavior memory, the model employs the Ebbinghaus forgetting curve to simulate the users’ interest change process and adopts linear regression and other relatively lightweight traditional methods to mine the adaptive period and other in-depth information of items. In addition, it analyzes users’ current interest distribution and calculates the interest value of each item to obtain users’ potential interest types. It further constructs the graph structure of the user-type-item triplet and uses GCN technology after load reduction to generate the final item recommendation list. Experiments verify the effectiveness of the proposed method. Through the comparison with eight classical models on the Last.fm, Douban, Yelp, and MovieLens datasets, it is found that the Precision, Recall, and NDCG of the proposed method are improved, with average improvements of 2.11% in Precision, 1.01% in Recall, and 1.48% in NDCG.
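    As a small illustration of the forgetting-curve step mentioned above, the sketch below decays a user's interest in an item over time with the retention form R(t) = exp(-t / S); the stability constant S and the multiplicative use of retention are assumptions for the example, not the paper's exact parameters.

```python
# Ebbinghaus-style decay of a user's interest value over elapsed time.
import math

def interest_retention(elapsed_days: float, stability: float = 5.0) -> float:
    return math.exp(-elapsed_days / stability)

def decayed_interest(raw_interest: float, elapsed_days: float) -> float:
    return raw_interest * interest_retention(elapsed_days)

for days in (0, 1, 7, 30):
    print(days, round(decayed_interest(1.0, days), 3))   # 1.0, 0.819, 0.247, 0.002
```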
    Available online:  August 09, 2023 , DOI: 10.13328/j.cnki.jos.006921
    Abstract:
    With the development of modern information technology, people’s demand for high resolution and realistic visual perception of image display devices has increased, which has put forward higher requirements for computer software and hardware and brought many challenges to rendering technology in terms of performance and workload. Using machine learning technologies such as deep neural networks to improve the quality and performance of rendered images has become a popular research method in computer graphics, while upsampling low-resolution images through network inference to obtain clearer high-resolution images is an important way to improve image generation performance and ensure high-resolution details. The geometry buffers (G-buffers) generated by the rendering engine in the rendering process contain much semantic information, which help the network learn scene information and features effectively and then improve the quality of upsampling results. In this study, a super-resolution method for rendered contents in low resolution based on deep neural networks is designed. In addition to the color image of the current frame, the method uses high-resolution G-buffers to assist in the calculation and reconstruct the high-resolution content details. The method also leverages a new strategy to fuse the features of high-resolution buffers and low-resolution images, which implements a multi-scale fusion of different feature information in a specific fusion module. Experiments demonstrate the effectiveness of the proposed fusion strategy and module, and the proposed method shows obvious advantages, especially in maintaining high-resolution details, when compared with other image super-resolution methods.
    Available online:  August 09, 2023 , DOI: 10.13328/j.cnki.jos.006922
    Abstract:
    The SMT solver is an important piece of system software. Bugs in an SMT solver may lead to the failure of software relying on it and even bring security incidents. However, fixing bugs in an SMT solver is time-consuming because developers need to spend a lot of effort in understanding and finding the root cause of the bugs. Although many studies on software bug localization have been proposed, there is no systematic work to automatically locate bugs in SMT solvers. Therefore, this study proposes a bug localization method for SMT solvers based on multi-source spectrums, namely SMTLOC. First, for a given bug in an SMT solver, SMTLOC proposes an enumeration-based algorithm to mutate the formula that triggers the bug, so as to generate a set of witness formulas that do not trigger the bug but have execution traces similar to that of the bug-triggering formula. Then, according to the execution traces of the witness formulas and the source code information of the SMT solver, SMTLOC develops a technique based on the coverage spectrum and the historical spectrum to calculate the suspiciousness of files, thus locating the files that contain the bug. In order to evaluate the effectiveness of SMTLOC, 60 bugs in SMT solvers are collected. Experimental results show that SMTLOC is superior to the traditional spectrum-based bug localization method and can locate 46.67% of the bugs within the Top-5 files, improving the localization effect by 133.33%.
    Available online:  August 09, 2023 , DOI: 10.13328/j.cnki.jos.006912
    Abstract:
    Adaptor signature, also known as scriptless script, is an important cryptographic technique that can be used to solve the problems of poor scalability and low transaction throughput in blockchain applications such as cryptocurrency. An adaptor signature can be seen as an extension of a digital signature on hard relations, and it ties authorization together with witness extraction, which brings many advantages in blockchain applications, such as (1) low on-chain cost; (2) improved fungibility of transactions; (3) advanced functionality beyond the limitation of the blockchain’s scripting language. The SM2 signature is the Chinese national standard signature algorithm and has been widely used in various important information systems. This work designs an efficient SM2-based adaptor signature with batch proofs and gives security proofs under the random oracle model. Based on the structure of the SM2 signature, the scheme avoids generating the zero-knowledge proofs used in the pre-signing phase and is more efficient than existing ECDSA/SM2-based adaptor signatures. Specifically, the efficiency of pre-signature generation is increased by 4 times, and the efficiency of pre-signature verification is increased by 3 times. Then, based on the distributed SM2 signature, this work develops a distributed SM2-based adaptor signature which can avoid the single point of failure and improve the security of the signing key. Finally, for real-world applications, this work gives a secure and efficient batch atomic swap protocol for one-to-many scenarios based on the SM2-based adaptor signature.
    Available online:  August 09, 2023 , DOI: 10.13328/j.cnki.jos.006905
    Abstract:
    Machine learning methods can be well combined with software testing to enhance testing effectiveness, but few scholars have applied them to test data generation. In order to further improve the efficiency of test data generation, a chained model combining support vector machine (SVM) and extreme gradient boosting (XGBoost) is proposed, and multi-path test data generation is realized by a genetic algorithm based on the chained model. Firstly, this study uses certain samples to train several sub-models (i.e., SVM and XGBoost) for predicting the states of path nodes, selects the optimal sub-models based on their prediction accuracy, and links the optimal sub-models in sequence according to the order of the path nodes, so as to form a chained model, namely chained SVM and XGBoost (C-SVMXGBoost). When using the genetic algorithm to generate test cases, the study uses the trained chained model instead of instrumentation to obtain the coverage path of test data (i.e., the predicted path), finds the path set whose predicted paths are similar to the target path, performs instrumentation verification on the predicted paths with similar path sets to obtain accurate paths, and calculates fitness values. In the crossover and mutation process, excellent test cases with a large path level depth in the sample set are introduced for reuse to generate test data covering the target path. Finally, individuals with higher fitness during evolution are saved, and C-SVMXGBoost is updated, so as to further improve the test efficiency. Experiments show that C-SVMXGBoost is more suitable for solving the path prediction problem and improving the test efficiency than other chained models. Moreover, compared with the existing classical methods, the proposed method can increase the coverage rate by up to 15%. The average number of evolutionary generations is also reduced, and the reduction can reach 65% on programs of large size.
    Available online:  July 28, 2023 , DOI: 10.13328/j.cnki.jos.006892
    Abstract:
    The committee consensus and hybrid consensus elect the committee to replace the whole nodes for block validation, which can effectively speed up consensus and improve throughput. However, malicious attacks and bribes can easily lead to committee corruption, affect consensus results, and even cause system paralysis. Although the existing work proposes the reputation mechanism to reduce the possibility of committee corruption, it has high overhead and poor reliability and cannot reduce the impact of corruption on the system. Therefore, this study proposes a dynamic blockchain consensus with pre-validation (DBCP). DBCP realizes reliable reputation evaluation of the committee through pre-validation with little overhead, which can eliminate malicious nodes from the committee in time. If serious corruption has undermined the consensus result, DBCP will transfer the authority of block validation to the whole nodes through dynamic consensus and eliminate the committee nodes that give wrong suggestions to avoid system paralysis. When the committee iterates to the high-credibility state, DBCP will hand over the authority of block validation to the committee, and the whole nodes will accept the consensus result from the committee without verifying the block to speed up the consensus. The experimental results show that the throughput of DBCP is two orders of magnitude higher than that of Bitcoin and similar to that of Byzcoin. In addition, DBCP can quickly deal with committee corruption within a block cycle, demonstrating better security than Byzcoin.
    Available online:  July 26, 2023 , DOI: 10.13328/j.cnki.jos.006918
    Abstract:
    Third-party library (TPL) detection is an upstream task in the domain of Android application security analysis, and its detection accuracy has a significant impact on its downstream tasks including malware detection, repackaged application detection, and privacy leakage detection. To improve detection accuracy and efficiency, this study proposes a package structure and signature-based TPL detection method, named LibPass, by leveraging the idea of pairwise comparison. LibPass combines primary module identification, TPL candidate identification, and fine-grained detection in a streamlined way. The primary module identification aims at improving detection efficiency by distinguishing the binary code of the main program from that of the introduced TPL. On this basis, a two-stage detection method consisting of TPL candidate identification and fine-grained detection is proposed. The TPL candidate identification leverages the stability of package structure features to deal with obfuscation of applications to improve detection accuracy and identifies candidate TPLs by rapidly comparing package structure signatures to reduce the number of pairwise comparisons, so as to improve the detection efficiency. The fine-grained detection accurately identifies the TPL of a specific version by a finer-grained but more costly pairwise comparison among candidate TPLs. In order to validate the performance and the efficiency of the detection method, three benchmark datasets are built to evaluate different detection capabilities, and experiments are conducted on these datasets. The experimental results are deeply analyzed in terms of detection performance, detection efficiency, and obfuscation resistance, and it is found that LibPass has high detection accuracy and efficiency and can deal with various common obfuscation operations.
    Available online:  July 26, 2023 , DOI: 10.13328/j.cnki.jos.006919
    Abstract:
    Memory error vulnerabilities (e.g., buffer overflow) are often caused by the improper use of memory copy functions. Identifying memory copy functions in binary programs is beneficial for finding memory error vulnerabilities. However, current methods for identifying memory copy functions in binary programs mainly rely on static analysis to extract functions’ features, control flow, data flow, and other information, and they suffer from high false positive and false negative rates. This study proposes a novel technique, namely CPSeeker, based on hybrid static and dynamic analysis to improve the effectiveness of identifying memory copy functions. CPSeeker combines the advantages of static analysis and dynamic analysis, collects the global static information and local execution information of functions in stages, and fuses the extracted information to identify memory copy functions in binary programs. The experimental results show that CPSeeker outperforms the state-of-the-art BootStomp, SaTC, CPYFinder, and Gemini in identifying memory copy functions, despite its increased runtime consumption, and its F1 value reaches 0.96. Furthermore, CPSeeker is not affected by the compilation environment (compiler version, compiler type, and compiler optimization level). In addition, CPSeeker performs better in actual firmware tests.
    Available online:  July 26, 2023 , DOI: 10.13328/j.cnki.jos.006920
    Abstract:
    The broad-learning-based dynamic fuzzy inference system (BL-DFIS) can automatically assemble simplified fuzzy rules and achieve high accuracy in classification tasks. However, when BL-DFIS works on large and complex datasets, it may generate too many fuzzy rules to achieve satisfactory identification accuracy, which adversely affects its interpretability. In order to circumvent such a bottleneck, a fuzzy neural network called feature-augmented random vector functional-link neural network (FA-RVFLNN) is proposed in this study to achieve an excellent trade-off between classification performance and interpretability. In the proposed network, an RVFLNN with the original data as input is taken as its primary structure, and BL-DFIS is taken as a performance supplement, which implies that FA-RVFLNN contains direct links to boost the performance of the whole system. The inference mechanism of the primary structure can be explained by a fuzzy logic operator (I-OR), owing to the use of Sigmoid activation functions in the enhancement nodes of this structure. Moreover, the original input data with clear meaning also help to explain the inference rules of the primary structure. With the support of direct links, FA-RVFLNN can learn more useful information through enhancement nodes, feature nodes, and fuzzy nodes. The experimental results indicate that FA-RVFLNN indeed eases the problem of rule explosion caused by excessive enhancement nodes in the primary structure and improves the interpretability of BL-DFIS therein (the average number of fuzzy rules is reduced by about 50%), while remaining competitive in terms of generalization performance and network size.
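    The primary structure described above, an RVFL network with direct links, can be sketched compactly in numpy (FA-RVFLNN itself additionally fuses BL-DFIS fuzzy nodes, which are omitted here): random input weights stay fixed, and only the output weights are solved by ridge regression. Sizes and data are placeholders.

```python
# Random vector functional-link network with direct links from input to output.
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_rvfl(X, y, n_enhancement=50, ridge=1e-2):
    W = rng.normal(size=(X.shape[1], n_enhancement))       # fixed random input weights
    b = rng.normal(size=n_enhancement)                     # fixed random biases
    H = np.hstack([X, sigmoid(X @ W + b)])                 # direct links + enhancement nodes
    beta = np.linalg.solve(H.T @ H + ridge * np.eye(H.shape[1]), H.T @ y)
    return W, b, beta

def predict_rvfl(X, W, b, beta):
    H = np.hstack([X, sigmoid(X @ W + b)])
    return H @ beta

X = rng.normal(size=(200, 4))
y = (X[:, 0] * X[:, 1] > 0).astype(float)                  # toy binary target
W, b, beta = train_rvfl(X, y)
print(np.mean((predict_rvfl(X, W, b, beta) > 0.5) == y))   # training accuracy
```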
    Available online:  July 26, 2023 , DOI: 10.13328/j.cnki.jos.006940
    Abstract:
    How to improve the accuracy of matching between natural language query input and highly structured programming language source code is a fundamental concern in code search. Accurate extraction of code features is one of the key challenges to improving matching accuracy. The semantics expressed by statements in codes is not only relevant to themselves but also to their contexts. The structural model of the code provides rich contextual information for understanding code functions. This study proposes a code search method based on function multigraph embedding. By using an early fusion strategy, the study fuses the data dependencies of code statements into a control flow graph and constructs a function multigraph to represent the code. The multigraph explicitly expresses the dependency relationships of indirect predecessor and successor nodes that are lacking in the control flow graph through data dependencies and enhances the contextual information of statement nodes. At the same time, in view of the edge heterogeneity of the multigraph, this study uses the relational graph convolutional network to extract the features of the code from the function multigraph. Experiments on a public dataset show that the proposed method can improve the MRR by more than 5% compared with the existing methods based on code text and structural models. The ablation experiments also show that the control flow graph contributes more to the search accuracy than the data dependence graph.
    Available online:  July 12, 2023 , DOI: 10.13328/j.cnki.jos.006910
    Abstract:
    Code search is an important research topic in natural language processing and software engineering. Developing efficient code search algorithms can significantly improve the code reuse and the working efficiency of software developers. The task of code search is to retrieve code fragments that meet the requirements from the massive code repository by taking the natural language describing the function of the code fragments as input. Although the sequence model-based code search method, namely DeepCS has achieved promising results, it cannot capture the deep semantics of the code. GraphSearchNet, a code search method based on graph embedding, can alleviate this problem, but it does not perform fine-grained matching on codes and texts and ignores the global relationship between code graphs and text graphs. To address the above limitations, this study proposes a code search method based on a relational graph convolutional network, which encodes the constructed text graphs and code graphs, performs fine-grained matching on text query and code fragments at the node level, and applies neural tensor networks to capture their global relationship. Experimental results on two public datasets show that the proposed method achieves higher search accuracy than state-of-the-art baseline models, namely DeepCS and GraphSearchNet.
    Available online:  July 05, 2023 , DOI: 10.13328/j.cnki.jos.006901
    Abstract:
    Hybrid transactional/analytical processing (HTAP) database systems have gained extensive acknowledgment of users due to their full processing support of the mixed workloads in one system, i.e., transactions and analytical queries. Most HTAP database systems tend to maintain multiple data versions or additional replicas to accomplish online analytical processing (OLAP) without downgrading the write performance of online transactional processing (OLTP). This leads to a consistency problem between the data of TP and AP versions. Meanwhile, HTAP database systems face the core challenge of achieving efficient data sharing under resource isolation, and the data-sharing model integrates the trade-off between business requirements for performance and data freshness. To systematically explain the data-sharing model and optimization strategies of existing HTAP database systems, this study first utilizes the consistency models to define the data-sharing model and classify the consistency models for HTAP data sharing into three categories, namely, linear consistency, sequential consistency, and session consistency, according to the differences between TP generated versions and AP query versions. After that, it takes a deep dive into the whole process of data-sharing models from three core issues, i.e., data-version number distribution, data version synchronization, and data version tracking, and provides the implementation methods of different consistency models. Furthermore, this study takes a dozen of classic and popular HTAP database systems as examples for an in-depth interpretation of the implementation methods. Finally, it summarizes and analyzes the optimization strategies of version synchronization, tracking, and recycling modules involved in the data-sharing process and predicts the optimization directions of the data-sharing models. It is concluded that the self-adaptability of the data synchronization scope, self-tuning of the data synchronization cycle, and freshness-bound constraint control under sequential consistency are the possible means for better performance of HTAP database systems and higher freshness.
    Available online:  July 05, 2023 , DOI: 10.13328/j.cnki.jos.006906
    Abstract:
    Software product line testing is challenging. The similarity-based testing method can improve testing coverage and fault detection rate by increasing the diversity of test suites. Due to its excellent scalability and satisfactory testing effects, the method has become one of the most important test methods for software product lines. How to generate diverse test cases and how to maintain the diversity of test suites are two key issues in this test method. To handle the above issues, this study proposes a software product line test algorithm based on diverse SAT solvers and novelty search (NS). Specifically, the algorithm simultaneously uses two types of diverse SAT solvers to generate diverse test cases. In particular, in order to improve the diversity of stochastic local search SAT solvers, the study proposes a general strategy that is based on a probability vector to generate candidate solutions. Furthermore, two archiving strategies inspired by the idea of the NS algorithm are designed and applied to maintain both the global and local diversity of the test suites. Ablation and comparison experiments on 50 real software product lines verify the effectiveness of both the diverse SAT solvers and the two archiving strategies, as well as the superiority of the proposed algorithm over other state-of-the-art algorithms.
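    The probability-vector strategy mentioned above can be pictured with the following toy sketch (the solver integration and probability update rules are not described in the abstract and are therefore assumptions): each Boolean variable carries a probability of being assigned True, candidate assignments are sampled from that vector, and the best candidate by satisfied-clause count seeds the local search, which keeps the sampled configurations diverse.

```python
# Sample diverse candidate assignments from a per-variable probability vector.
import random

clauses = [[1, -2, 3], [-1, 2], [2, 3]]     # CNF over variables 1..3; negative = negated

def satisfied(assignment, clauses):
    return sum(any((lit > 0) == assignment[abs(lit)] for lit in clause) for clause in clauses)

def sample_candidates(prob, n_vars, k=8):
    return [{v: random.random() < prob[v] for v in range(1, n_vars + 1)} for _ in range(k)]

prob = {1: 0.5, 2: 0.5, 3: 0.5}             # probability of each variable being True
candidates = sample_candidates(prob, n_vars=3)
best = max(candidates, key=lambda a: satisfied(a, clauses))
print(best, satisfied(best, clauses))       # a diverse yet good starting assignment
```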
    Available online:  July 05, 2023 , DOI: 10.13328/j.cnki.jos.006907
    Abstract:
    Business process execution language (BPEL) is an executable web service composition language. Compared with traditional programs, BPEL programs are significantly different in terms of programming models and execution modes. These new features make it challenging to locate and fix faults of BPEL programs detected during the testing process. In addition, fault fixing techniques developed for traditional software cannot be used for BPEL programs directly. This study proposes a fault fixing technique for BPEL programs based on template matching, namely BPELRepair, from the perspective of mutation analysis. In order to overcome the high computational overhead of the mutation analysis-based fault fixing technique, a set of optimization strategies are proposed from three perspectives, namely patch generation, test case selection, and termination condition. A supporting tool is developed to improve the automation and efficiency of fault fixing for BPEL programs. An empirical study is conducted to evaluate the effectiveness of the proposed fault fixing technique and optimization strategies. The experimental results show that the proposed technique can successfully fix about 53% of faults of BPEL programs, and the proposed optimization strategies can significantly reduce the overhead in terms of search matching, patch program verification, test case execution, and fault fixing.
    Available online:  July 04, 2023 , DOI: 10.13328/j.cnki.jos.006836
    Abstract:
    Autonomous driving software based on deep neural networks (DNNs) has become the most popular solution. Like traditional software, DNNs can also produce incorrect outputs or unexpected behaviors, and DNN-based autonomous driving software has caused serious accidents, which seriously threaten life and property safety. Therefore, how to effectively test DNN-based autonomous driving software has become an urgent problem. Since it is difficult to predict and understand the behavior of DNNs, traditional software testing methods are no longer applicable. Existing autonomous driving software testing methods generate test data by adding pixel-level perturbations to original images or modifying the whole image. The generated test data are quite different from real images, and the perturbation-based methods are difficult to understand. To solve the above problem, this study proposes a test data generation method, namely interpretability analysis-based test data generation (IATG). Firstly, it uses an interpretation method for DNNs to generate visual explanations of decisions made by the autonomous driving software and chooses objects in the original images that have significant impacts on the decisions. Then, it generates test data by replacing the chosen objects with other objects with the same semantics. The generated test data are more similar to real images, and the generation process is more understandable. As an important part of the autonomous driving software’s decision-making module, the steering angle prediction model is used to conduct experiments. Experimental results show that the introduction of the interpretation method effectively enhances the ability of IATG to mislead the steering angle prediction model. Furthermore, when the misleading angle is the same, the test data generated by IATG are more similar to real images than those generated by DeepTest; IATG has a stronger misleading ability than semSensFuzz, and its interpretation-analysis-based important object selection method can effectively improve the misleading ability of semSensFuzz.
    Available online:  June 28, 2023 , DOI: 10.13328/j.cnki.jos.006899
    Abstract:
    Knowledge space theory, which uses mathematical language for the knowledge evaluation and learning guide of learners, belongs to the research field of mathematical psychology. Skills and problems are the two basic elements of knowledge space, and an in-depth study of the relationship between them is the inherent requirement of knowledge state description and knowledge structure analysis. In the existing knowledge space theory, no explicit bi-directional mapping between skills and problems has been established, which makes it difficult to put forward a knowledge structure analysis model under intuitive conceptual meanings. Moreover, the partial order relationship between knowledge states has not been clearly obtained, which is not conducive to depicting the differences between knowledge states and planning the learning path of learners. In addition, the existing achievements mainly focus on the classical knowledge space, without considering the uncertainties of data in practical problems. To this end, this study introduces formal concept analysis and fuzzy sets into knowledge space theory and builds the fuzzy concept lattice models for knowledge structure analysis. Specifically, fuzzy concept lattice models of knowledge space and closure space are presented. Firstly, the fuzzy concept lattice of knowledge space is constructed, and it is proved that the extents of all concepts form a knowledge space by the upper bounds of any two concepts. The idea of granule description is introduced to define the skill-induced atomic granules of problems, whose combinations can help determine whether a combination of problems is a state in the knowledge space. On this basis, a method to obtain the fuzzy concepts in the knowledge space from the problem combinations is proposed. Secondly, the fuzzy concept lattice of closure space is established, and it is proved that the extents of all concepts form the closure space by the lower bounds of any two concepts. Similarly, the problem-induced atomic granules of skills are defined, and their combinations can help determine whether a skill combination is the skills required by a knowledge state in the closure space. In this way, a method to obtain the fuzzy concepts in the closure space from the skill combinations is presented. Finally, the effects of the number of problems, the number of skills, the filling factor, and the analysis scale on the sizes of knowledge space and closure space are analyzed by some experiments. The results show that the fuzzy concepts in the knowledge space are different from any existing concept and cannot be derived from other concepts. The fuzzy concepts in the closure space are attribute-oriented one-sided fuzzy concepts in essence. In the formal context of two-valued skills, there is one-to-one correspondence between the states in knowledge space and closure space, but this relationship does not hold in the formal context of fuzzy skills.
    Available online:  June 28, 2023 , DOI: 10.13328/j.cnki.jos.006895
    Abstract:
    The morphological changes in retina boundaries are important indicators of retinal diseases, and the subtle changes can be captured by images obtained by optical coherence tomography (OCT). The retinal layer boundary segmentation based on OCT images can assist in the clinical judgment of related diseases. In OCT images, due to the diverse morphological changes in retina boundaries, the key boundary-related information, such as contexts and saliency boundaries, is crucial to the judgment and segmentation of layer boundaries. However, existing segmentation methods lack the consideration of the above information, which results in incomplete and discontinuous boundaries. To solve the above problems, this study proposes a coarse-to-fine method for the segmentation of retinal layer boundary in OCT images based on the end-to-end deep neural networks and graph search (GS), which avoids the phenomenon of “faults” common in non-end-to-end methods. In coarse segmentation, the attention global residual network (AGR-Net), an end-to-end deep neural network, is proposed to extract the above key information in a more sufficient and effective way. Specifically, a global feature module (GFM) is designed to capture the global context information of OCT images by scanning from four directions of the images. After that, the channel attention module (CAM) and GFM are sequentially combined and embedded in the backbone network to realize saliency modeling of context information of the retina and its boundaries. This effort effectively solves the problem of wrong segmentation caused by retina deformation and insufficient information extraction in OCT images. In fine segmentation, a GS algorithm is adopted to remove isolated areas or holes from the coarse segmentation results obtained by AGR-Net. In this way, the boundary keeps a fixed topology, and it is continuous and smooth, which further optimizes the overall segmentation results and provides a more complete reference for medical clinical diagnosis. Finally, the performance of the proposed method is evaluated from different perspectives on two public datasets, and the method is compared with the latest methods. The comparative experiments show that the proposed method outperforms the existing methods in terms of segmentation accuracy and stability.
    Available online:  June 28, 2023 , DOI: 10.13328/j.cnki.jos.006893
    Abstract:
    Deep neural networks (DNNs) have made remarkable achievements in many fields, but related studies show that they are vulnerable to adversarial examples. The gradient-based attack is a popular adversarial attack and has attracted wide attention. This study investigates the relationship between gradient-based adversarial attacks and numerical methods for solving ordinary differential equations (ODEs). In addition, it proposes a new adversarial attack based on Runge-Kutta (RK) method, a numerical method for solving ODEs. According to the prediction idea in the RK method, perturbations are added to the original examples first to construct predicted examples, and then the gradients of the loss functions with respect to the original and predicted examples are linearly combined to determine the perturbations to be added for the generation of adversarial examples. Different from the existing adversarial attacks, the proposed adversarial attack employs the prediction idea of the RK method to obtain the future gradient information (i.e., the gradient of the loss function with respect to the predicted examples) and uses it to determine the adversarial perturbations to be added. The proposed attack features good extensibility and can be easily applied to all available gradient-based attacks. Extensive experiments demonstrate that in contrast to the state-of-the-art gradient-based attacks, the proposed RK-based attack boasts higher success rates and better transferability.
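    The predictor-corrector idea described above can be sketched as follows; the equal-weight combination of the two gradients, the step sizes, and the toy classifier are assumptions made for illustration rather than the paper's exact Runge-Kutta coefficients.

```python
# Sketch of a second-order Runge-Kutta-style attack (Heun-like): combine the
# gradient at a predicted example with the gradient at the original example.
import torch
import torch.nn.functional as F

def rk2_attack(model, x, y, eps=8 / 255, alpha=8 / 255):
    x = x.clone().detach()

    def grad_at(z):
        z = z.clone().detach().requires_grad_(True)
        loss = F.cross_entropy(model(z), y)
        return torch.autograd.grad(loss, z)[0]

    g1 = grad_at(x)                                   # gradient at the original example
    x_pred = x + alpha * g1.sign()                    # predictor step
    g2 = grad_at(x_pred)                              # "future" gradient at the predicted example
    g = 0.5 * (g1 + g2)                               # linear combination of the two gradients
    return (x + eps * g.sign()).clamp(0.0, 1.0)       # adversarial example

# Toy usage on a random linear classifier.
model = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(3 * 8 * 8, 10))
x = torch.rand(4, 3, 8, 8)
y = torch.randint(0, 10, (4,))
x_adv = rk2_attack(model, x, y)
print((x_adv - x).abs().max())
```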
    Available online:  June 14, 2023 , DOI: 10.13328/j.cnki.jos.006896
    Abstract:
    The heterogeneous many-core architecture with an ultra-high energy efficiency ratio has become an important development trend of supercomputer architecture. However, the complexity of heterogeneous systems puts forward higher requirements for application development and optimization, and they face many technical challenges such as usability and programmability in the development process. The independently developed new-generation Sunway supercomputer is equipped with a homegrown heterogeneous many-core processor, SW26010Pro. To take full advantage of the performance of the new-generation many-core processors and support the development and optimization of emerging scientific computing applications, this study designs and implements an optimized compiler, swLLVM, oriented to the SW26010Pro platform. The compiler supports the Athread and SDAA dual-mode heterogeneous programming models and provides multi-level storage hierarchy description and SIMD extensions for vector-like operations. In addition, it realizes control-flow vectorization, cost-based node combination, and compiler optimization for the multi-level storage hierarchy according to the architecture characteristics of SW26010Pro. The experimental results show that the compiler optimizations designed and implemented in this study achieve significant performance improvements: the average speedups of control-flow vectorization and node combination are 1.23 and 1.11, respectively, and the memory access optimization achieves a maximum performance improvement of 2.49 times. Finally, a comprehensive evaluation of swLLVM is performed from multiple dimensions on the standard benchmark suite SPEC CPU2006. The results show that, compared with SWGCC at the same optimization level, swLLVM achieves an average increase of 9.04% in the performance of floating-point benchmarks, 5.25% in overall performance, and 79.1% in compilation speed, with an average decline of 0.12% in the performance of integer benchmarks and 1.15% in code size.
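    Control-flow vectorization, one of the optimizations listed above, rests on if-conversion: a data-dependent branch is replaced by a masked select so the loop body becomes straight-line code suitable for SIMD execution. The NumPy sketch below only illustrates that transformation; it is not swLLVM output or SW26010Pro intrinsics.

```python
# Illustration of if-conversion, the transformation behind control-flow
# vectorization: the branchy scalar loop and the masked, branch-free form
# compute the same result.
import numpy as np

a = np.random.rand(1024)
b = np.random.rand(1024)

# Scalar loop with control flow inside the body.
out_scalar = np.empty_like(a)
for i in range(a.size):
    if a[i] > 0.5:
        out_scalar[i] = a[i] * b[i]
    else:
        out_scalar[i] = a[i] + b[i]

# If-converted form: evaluate both arms, then select under a mask.
mask = a > 0.5
out_vector = np.where(mask, a * b, a + b)

assert np.allclose(out_scalar, out_vector)
```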
    Available online:  June 07, 2023 , DOI: 10.13328/j.cnki.jos.006897
    Abstract:
    Multi-behavior recommendation aims to utilize interactive data from multiple behaviors of users to improve recommendation performance. Existing multi-behavior recommendation methods generally directly exploit the multi-behavior data for the shared initialized user representations and involve the mining of user preferences and modeling of relationships among different behaviors in the tasks. However, these methods ignore the data imbalance under different interactive behaviors (the amount of interactive data varies greatly among different behaviors) and the information loss caused by the adaptation to the above two tasks. User preferences refer to the interests that users exhibit in different behaviors (e.g., browsing preferences), and the relationship among behaviors indicates a potential conversion from one behavior to another behavior (e.g., the conversion from browsing to purchasing). In multi-behavior recommendation, the mining of user preferences and the modeling of relationships among different behaviors can be regarded as a two-stage task. On the basis of the above considerations, the model of two-stage learning for multi-behavior recommendation (TSL-MBR for short) is proposed, which decouples the above two tasks with a two-stage strategy. In particular, the model retains the end-to-end structure and learns the two tasks by alternating training with fixed parameters. The first stage is to model user preferences under different behaviors. In this stage, the interactive data from all behaviors (without distinction as to behavior type) are first used to model the global preferences of users to alleviate the problem of data sparsity to the greatest extent. Then, the interactive data of each behavior are used to refine the behavior-specific user preference (local preference) and thus lessen the influence of the data imbalance among different behaviors. The second stage is to model the relationships among different behaviors. In this stage, the mining of user preferences and modeling of relationships among different behaviors are decoupled to relieve the information loss problem caused by adaptation to the two tasks. This two-stage model significantly improves the system’s ability to predict target behaviors. Extensive experimental results show that TSL-MBR can substantially outperform the state-of-the-art baseline models, achieving 103.01% and 33.87% of relative gains on average over the best baseline on the Tmall and Beibei datasets, respectively.
    Available online:  May 24, 2023 , DOI: 10.13328/j.cnki.jos.006894
    Abstract:
    Deep learning has achieved great success in image classification, natural language processing, and speech recognition. Data augmentation can effectively increase the scale and diversity of training data, thereby improving the generalization of deep learning models. However, for a given dataset, a well-designed data augmentation strategy relies heavily on expert experience and domain knowledge and requires repeated attempts, which is time-consuming and labor-intensive. In recent years, automated data augmentation has attracted widespread attention from the academic community and the industry through the automated design of data augmentation strategies. To solve the problem that existing automated data augmentation algorithms cannot strike a good balance between prediction accuracy and search efficiency, this study proposes an efficient automated data augmentation algorithm SGES AA based on a self-guided evolution strategy. First, an effective continuous vector representation method is designed for the data augmentation strategy, and then the automated data augmentation problem is converted into a search problem of continuous strategy vectors. Second, a strategy vector search method based on the self-guided evolution strategy is presented. By introducing historical estimation gradient information to guide the sampling and updating of exploration points, it can effectively avoid the local optimal solution while improving the convergence of the search process. The results of extensive experiments on image, text, and speech datasets show that the proposed algorithm is superior to or matches the current optimal automated data augmentation methods without significantly increasing the time consumption of searches.
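    The self-guided evolution strategy can be pictured as guided random search: part of each perturbation is drawn along recent gradient estimates, and antithetic sampling yields a new estimate that updates both the strategy vector and the guidance buffer. The dimensions, mixing weight, and quadratic stand-in objective below are arbitrary illustrative choices, not the configuration of SGES AA.

```python
# Minimal sketch of a self-guided evolution strategy: mix isotropic exploration
# with directions from recent gradient estimates (the "historical" guidance).
import numpy as np

rng = np.random.default_rng(0)

def f(x):                      # toy objective to minimize (stand-in for validation loss)
    return np.sum((x - 1.0) ** 2)

dim, pop, sigma, lr, alpha = 20, 8, 0.1, 0.05, 0.5
x = np.zeros(dim)
history = []                   # buffer of recent gradient estimates

for step in range(200):
    grad_est = np.zeros(dim)
    for _ in range(pop):
        noise = rng.standard_normal(dim)
        if history:            # guide part of the perturbation by past gradients
            guide = history[rng.integers(len(history))]
            noise = alpha * noise + (1 - alpha) * guide / (np.linalg.norm(guide) + 1e-12)
        delta = sigma * noise
        # antithetic (mirrored) sampling for a lower-variance estimate
        grad_est += (f(x + delta) - f(x - delta)) / (2 * sigma) * noise
    grad_est /= pop
    x -= lr * grad_est
    history = (history + [grad_est])[-5:]   # keep the most recent estimates

print(round(f(x), 4))
```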
    Available online:  December 30, 2022 , DOI: 10.13328/j.cnki.jos.006804
    [Abstract] (1233) [HTML] (0) [PDF 5.38 M] (2047)
    Abstract:
    Stochastic configuration network (SCN), as an emerging incremental neural network model, is different from other randomized neural network methods. It can configure the parameters of hidden layer nodes through supervision mechanisms, thereby ensuring the fast convergence performance of SCN. Due to the advantages of high learning efficiency, low human intervention, and strong generalization ability, SCN has attracted a large number of national and international scholars and developed rapidly since it was proposed in 2017. In this study, SCN research is summarized from the aspects of basic theories, typical algorithm variants, application fields, and future research directions of SCN. Firstly, the algorithm principles, universal approximation capacity, and advantages of SCN are analyzed theoretically. Secondly, typical variants of SCN are studied, such as DeepSCN, 2DSCN, Robust SCN, Ensemble SCN, Distributed SCN, Parallel SCN, and Regularized SCN. Then, the applications of SCN in different fields, including hardware implementation, computer vision, medical data analysis, fault detection and diagnosis, and system modeling and prediction are introduced. Finally, the development potential of SCN in convolutional neural network architectures, semi-supervised learning, unsupervised learning, multi-view learning, fuzzy neural network, and recurrent neural network is pointed out.
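    The incremental construction at the heart of SCN can be sketched as follows: candidate hidden nodes are drawn at random, kept only if they satisfy a supervisory inequality on the current residual, and the output weights are then refit by least squares. The tolerance, candidate count, and the simplified single-output form of the inequality below are assumptions for illustration.

```python
# Minimal sketch of stochastic configuration network construction on a 1-D
# regression task; the supervisory inequality is a simplified single-output form.
import numpy as np

rng = np.random.default_rng(1)
X = np.linspace(-1, 1, 200).reshape(-1, 1)
y = np.sin(3 * X[:, 0]) + 0.05 * rng.standard_normal(200)

H = np.empty((200, 0))              # hidden-layer output matrix
residual = y.copy()
r = 0.99                            # contraction parameter of the supervisory mechanism

for node in range(50):
    best = None
    for _ in range(100):            # random candidate configurations
        w, b = rng.uniform(-1, 1, size=(1,)), rng.uniform(-1, 1)
        h = np.tanh(X @ w + b)
        xi = (residual @ h) ** 2 / (h @ h) - (1 - r) * (residual @ residual)
        if xi > 0 and (best is None or xi > best[0]):
            best = (xi, h)
    if best is None:                # no admissible candidate: stop growing
        break
    H = np.column_stack([H, best[1]])
    beta, *_ = np.linalg.lstsq(H, y, rcond=None)   # refit all output weights
    residual = y - H @ beta
    if residual @ residual < 1e-3:
        break

print(H.shape[1], float(residual @ residual))
```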
    Available online:  October 18, 2017 , DOI:
    [Abstract] (2884) [HTML] (0) [PDF 525.21 K] (5056)
    Abstract:
    Recommended by Professor Bai Ying of the CCF Technical Committee on Software Engineering. The paper was published in Proceedings of the 11th Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering (ESEC/FSE), ACM, September 2017, pp. 315-325. Original article: https://doi.org/10.1145/3106237.3106242. Readers who wish to cite this work should cite the original publication.
    Available online:  October 18, 2017 , DOI:
    [Abstract] (2829) [HTML] (0) [PDF 352.38 K] (6119)
    Abstract:
    Recommended by Professor Bai Ying of the CCF Technical Committee on Software Engineering. The paper was published in Proceedings of the 11th Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering (ESEC/FSE), ACM, September 2017, pp. 303-314. Original article: https://doi.org/10.1145/3106237.3106239. Readers who wish to cite this work should cite the original publication.
    Available online:  September 11, 2017 , DOI:
    [Abstract] (3359) [HTML] (0) [PDF 276.42 K] (3304)
    Abstract:
    GitHub, a popular social-software-development platform, has fostered a variety of software ecosystems where projects depend on one another and practitioners interact with each other. Projects within an ecosystem often have complex inter-dependencies that impose new challenges in bug reporting and fixing. In this paper, we conduct an empirical study on cross-project correlated bugs, i.e., causally related bugs reported to different projects, focusing on two aspects: 1) how developers track the root causes across projects; and 2) how the downstream developers coordinate to deal with upstream bugs. Through manual inspection of bug reports collected from the scientific Python ecosystem and an online survey with developers, this study reveals the common practices of developers and the various factors in fixing cross-project bugs. These findings provide implications for future software bug analysis in the scope of ecosystem, as well as shed light on the requirements of issue trackers for such bugs.
    Available online:  June 21, 2017 , DOI:
    [Abstract] (3393) [HTML] (0) [PDF 169.43 K] (3312)
    Abstract:
    Recommended by Professor Bai Ying of the CCF Technical Committee on Software Engineering. The paper has been accepted for publication in IEEE Transactions on Software Engineering, 2017 (to appear). Original article: http://ieeexplore.ieee.org/document/7792694. Readers who wish to cite this work should cite the original publication.
    Available online:  June 13, 2017 , DOI:
    [Abstract] (4603) [HTML] (0) [PDF 174.91 K] (3741)
    Abstract:
    Recommended by Professor Bai Ying of the CCF Technical Committee on Software Engineering. The paper was published in Proceedings of the 39th International Conference on Software Engineering, pp. 27-37, Buenos Aires, Argentina, May 20-28, 2017, IEEE Press, Piscataway, NJ, USA, ©2017, ISBN: 978-1-5386-3868-2. Original article: http://dl.acm.org/citation.cfm?id=3097373. Readers who wish to cite this work should cite the original publication.
    Available online:  January 25, 2017 , DOI:
    [Abstract] (3480) [HTML] (0) [PDF 254.98 K] (3127)
    Abstract:
    Recommended by Professor Bai Ying of the CCF Technical Committee on Software Engineering. The paper was published in Proceedings of the 2016 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering (FSE 2016), ACM, New York, NY, USA, pp. 871-882, DOI: https://doi.org/10.1145/2950290.2950364. Original article: http://dl.acm.org/citation.cfm?id=2950364. Readers who wish to cite this work should cite the original publication.
    Available online:  January 18, 2017 , DOI:
    [Abstract] (3943) [HTML] (0) [PDF 472.29 K] (3197)
    Abstract:
    Recommended by Professor Bai Ying of the CCF Technical Committee on Software Engineering. The paper was published in Proceedings of the 2016 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering, pp. 133-143, Seattle, WA, USA, November 2016. Original article: http://dl.acm.org/citation.cfm?id=2950327. Readers who wish to cite this work should cite the original publication.
    Available online:  January 04, 2017 , DOI:
    [Abstract] (3691) [HTML] (0) [PDF 293.93 K] (2875)
    Abstract:
    Recommended by Professor Bai Ying of the CCF Technical Committee on Software Engineering. The paper was published in Proceedings of the 2016 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering (FSE'16), pp. 810-821, November 13-18, 2016. Original article: https://doi.org/10.1145/2950290.2950310. Readers who wish to cite this work should cite the original publication.
    Available online:  January 04, 2017 , DOI:
    [Abstract] (4046) [HTML] (0) [PDF 244.61 K] (3354)
    Abstract:
    Recommended by Professor Bai Ying of the CCF Technical Committee on Software Engineering. The paper was published at FSE 2016. Original article: http://dl.acm.org/citation.cfm?doid=2950290.2950313. Readers who wish to cite this work should cite the original publication.
    Available online:  December 12, 2016 , DOI:
    [Abstract] (3580) [HTML] (0) [PDF 358.69 K] (3304)
    Abstract:
    Recommended by Professor Bai Ying of the CCF Technical Committee on Software Engineering. The paper was published at FSE'16, in Proceedings of the 2016 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering. Original article: http://dl.acm.org/citation.cfm?id=2950340. Readers who wish to cite this work should cite the original publication.
    Available online:  September 30, 2016 , DOI:
    Abstract:
    Recommended by Professor Bai Ying of the CCF Technical Committee on Software Engineering. The paper was published at ASE 2016 (http://ase2016.org/). Original article: http://dl.acm.org/citation.cfm?id=2970366. Readers who wish to cite this work should cite the original publication.
    Available online:  September 09, 2016 , DOI:
    Abstract:
    Recommended by Professor Bai Ying of the CCF Technical Committee on Software Engineering. Junjie's paper was published at ASE 2016 (http://ase2016.org/). Original article: http://dl.acm.org/citation.cfm?doid=2970276.2970300. Readers who wish to cite this work should cite the original publication.
    Available online:  September 07, 2016 , DOI:
    Abstract:
    Recommended by Professor Bai Xiaoying (Tsinghua University) of the CCF Technical Committee on Software Engineering. The original paper was published in ASE 2016: Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering. Full text: http://dx.doi.org/10.1145/2970276.2970307. Important note: readers who cite this work should cite the original publication.
    Available online:  August 29, 2016 , DOI:
    Abstract:
    Recommended by Professor Bai Xiaoying (Tsinghua University) of the CCF Technical Committee on Software Engineering. The paper was published in ACM Transactions on Software Engineering and Methodology (TOSEM, Vol. 25, No. 2, Article 13, May 2016) and was invited as a "Journal First" presentation at the ICSE 2016 main conference. Full text: http://dl.acm.org/citation.cfm?id=2876443. The authors are Minghui Zhou, Xiujuan Ma, Lu Zhang, and Hong Mei of Peking University, and Audris Mockus of the University of Tennessee. Important note: readers who cite this work should cite the original publication.
  • Most downloaded full texts (overall / annual / by issue)
    Most viewed abstracts (overall / annual / by issue)

    2003,14(7):1282-1291, DOI:
    [Abstract] (37076) [HTML] (0) [PDF 832.28 K] (80039)
    Abstract:
    Sensor networks, formed by the convergence of sensor, micro-electro-mechanical system, and networking technologies, are a novel technology for acquiring and processing information. In this paper, the architecture of wireless sensor networks is briefly introduced. Next, some valuable applications are explained and forecasted. Combining with existing work, research hot spots including power-aware routing and media access control schemes are discussed and presented in detail. Finally, taking account of application requirements, several future research directions are put forward.
    2010,21(3):427-437, DOI:
    [Abstract] (32914) [HTML] (0) [PDF 308.76 K] (38483)
    Abstract:
    Automatic generation of poetry has always been considered a hard problem in natural language generation. This paper reports pioneering research on a genetic algorithm and its use for the automatic generation of SONGCI. In light of the characteristics of Chinese ancient poetry, this paper designs a level-and-oblique-tone-based coding method, a syntactically and semantically weighted fitness function, a selection operator combining elitism and roulette-wheel selection, a partially mapped crossover operator, and a heuristic mutation operator. As shown by tests, the system constructed on the basis of the computing model designed in this paper is basically capable of generating Chinese SONGCI with some aesthetic merit. This work represents progress in the field of automatic generation of Chinese poetry.
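    Stripped of the poetry-specific encoding, the evolutionary loop described above follows the standard genetic-algorithm skeleton sketched below (elitism plus roulette-wheel selection, crossover, and mutation); the bit-string fitness is a placeholder for the syntactic and semantic weighted fitness, and single-point crossover stands in for the partially mapped operator used in the paper.

```python
# Generic GA skeleton with elitism and roulette-wheel selection; the fitness
# function and bit-string encoding are placeholders, not the SONGCI encoding.
import random

random.seed(0)
LENGTH, POP, GENS, PMUT = 30, 40, 60, 0.02

def fitness(ind):                      # placeholder: count of 1-bits
    return sum(ind) + 1e-9             # small offset keeps roulette weights positive

def roulette(pop, fits):
    return random.choices(pop, weights=fits, k=1)[0]

pop = [[random.randint(0, 1) for _ in range(LENGTH)] for _ in range(POP)]
for gen in range(GENS):
    fits = [fitness(ind) for ind in pop]
    elite = max(pop, key=fitness)                      # elitism
    nxt = [elite[:]]
    while len(nxt) < POP:
        p1, p2 = roulette(pop, fits), roulette(pop, fits)
        cut = random.randrange(1, LENGTH)              # single-point crossover
        child = p1[:cut] + p2[cut:]
        child = [1 - g if random.random() < PMUT else g for g in child]  # mutation
        nxt.append(child)
    pop = nxt

print(max(fitness(ind) for ind in pop))
```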
    2011,22(1):71-83, DOI:10.3724/SP.J.1001.2011.03958
    [Abstract] (29824) [HTML] (0) [PDF 781.42 K] (54947)
    Abstract:
    Cloud Computing is the fundamental change happening in the field of Information Technology. It is a representation of a movement towards the intensive, large scale specialization. On the other hand, it brings about not only convenience and efficiency problems, but also great challenges in the field of data security and privacy protection. Currently, security has been regarded as one of the greatest problems in the development of Cloud Computing. This paper describes the great requirements in Cloud Computing, security key technology, standard and regulation etc., and provides a Cloud Computing security framework. This paper argues that the changes in the above aspects will result in a technical revolution in the field of information security.
    2016,27(1):45-71, DOI:10.13328/j.cnki.jos.004914
    [Abstract] (29103) [HTML] (2667) [PDF 880.96 K] (30986)
    Abstract:
    Android is a modern and highly popular software platform for smartphones. According to reports, Android accounted for 81% of all smartphones shipped in 2014, exceeding 1 billion units worldwide for the first time, with Apple, Microsoft, Blackberry, and Firefox trailing a long way behind. At the same time, the increased popularity of Android smartphones has attracted hackers, leading to a massive increase in Android malware applications. This paper summarizes and analyzes the latest advances in Android security from multidimensional perspectives, covering Android architecture, design principles, security mechanisms, major security threats, classification and detection of malware, static and dynamic analyses, machine learning approaches, and security extension proposals.
    2008,19(1):48-61, DOI:
    [Abstract] (28054) [HTML] (0) [PDF 671.39 K] (61318)
    Abstract:
    The research status and recent progress of clustering algorithms are summarized in this paper. First, representative clustering algorithms are analyzed and categorized from several aspects, such as algorithmic ideas, key techniques, and their advantages and disadvantages. Second, several typical clustering algorithms and well-known datasets are selected, and simulation experiments are conducted in terms of both accuracy and running efficiency; the behavior of each algorithm on different datasets is analyzed, as is the clustering of the same dataset under different algorithms. Finally, combining the above two lines of analysis, the research hotspots, difficulties, and shortcomings of data clustering, together with some open problems, are addressed. This work provides a valuable reference for data clustering and data mining.
    2009,20(5):1337-1348, DOI:
    [Abstract] (28049) [HTML] (0) [PDF 1.06 M] (44583)
    Abstract:
    This paper surveys the current technologies adopted in cloud computing as well as the systems in enterprises. Cloud computing can be viewed from two different aspects. One is about the cloud infrastructure which is the building block for the up layer cloud application. The other is of course the cloud application. This paper focuses on the cloud infrastructure including the systems and current research. Some attractive cloud applications are also discussed. Cloud computing infrastructure has three distinct characteristics. First, the infrastructure is built on top of large scale clusters which contain a large number of cheap PC servers. Second, the applications are co-designed with the fundamental infrastructure that the computing resources can be maximally utilized. Third, the reliability of the whole system is achieved by software building on top of redundant hardware instead of mere hardware. All these technologies are for the two important goals for distributed system: high scalability and high availability. Scalability means that the cloud infrastructure can be expanded to very large scale even to thousands of nodes. Availability means that the services are available even when quite a number of nodes fail. From this paper, readers will capture the current status of cloud computing as well as its future trends.
    2009,20(2):271-289, DOI:
    [Abstract] (26974) [HTML] (0) [PDF 675.56 K] (43066)
    Abstract:
    Evolutionary multi-objective optimization (EMO), whose main task is to deal with multi-objective optimization problems by evolutionary computation, has become a hot topic in evolutionary computation community. After summarizing the EMO algorithms before 2003 briefly, the recent advances in EMO are discussed in details. The current research directions are concluded. On the one hand, more new evolutionary paradigms have been introduced into EMO community, such as particle swarm optimization, artificial immune systems, and estimation distribution algorithms. On the other hand, in order to deal with many-objective optimization problems, many new dominance schemes different from traditional Pareto-dominance come forth. Furthermore, the essential characteristics of multi-objective optimization problems are deeply investigated. This paper also gives experimental comparison of several representative algorithms. Finally, several viewpoints for the future research of EMO are proposed.
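    The Pareto dominance relation that the dominance schemes mentioned above build on can be written down in a few lines; the objective vectors below are toy minimization values for illustration.

```python
# Pareto dominance and a non-dominated filter for a minimization problem.
def dominates(a, b):
    """a dominates b: no worse in every objective, strictly better in at least one."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

def non_dominated(points):
    return [p for p in points if not any(dominates(q, p) for q in points if q is not p)]

solutions = [(1.0, 5.0), (2.0, 3.0), (3.0, 4.0), (4.0, 1.0), (2.5, 3.5)]
print(non_dominated(solutions))   # (3.0, 4.0) and (2.5, 3.5) are dominated
```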
    2005,16(1):1-7, DOI:
    [Abstract] (22141) [HTML] (0) [PDF 614.61 K] (20732)
    Abstract:
    The paper offers reflections from the following four aspects: 1) from the general law of development, it reviews the history of software engineering technology; 2) from the perspective of the natural characteristics of software, it analyzes the construction of each abstraction layer of the virtual machine; 3) from the perspective of software development, it proposes the research content of the software engineering discipline and studies the pattern of industrialized software production; 4) based on the emergence of Internet technology, it explores the development trend of software technology.
    2010,21(8):1834-1848, DOI:
    [Abstract] (20589) [HTML] (0) [PDF 682.96 K] (55903)
    Abstract:
    This paper surveys the state of the art of sentiment analysis. First, three important tasks of sentiment analysis are summarized and analyzed in detail, including sentiment extraction, sentiment classification, sentiment retrieval and summarization. Then, the evaluation and corpus for sentiment analysis are introduced. Finally, the applications of sentiment analysis are concluded. This paper aims to take a deep insight into the mainstream methods and recent progress in this field, making detailed comparison and analysis.
    2004,15(3):428-442, DOI:
    [Abstract] (20562) [HTML] (0) [PDF 1009.57 K] (16861)
    Abstract:
    With the rapid development of e-business, Web applications have evolved from localization to globalization, from B2C (business-to-customer) to B2B (business-to-business), and from a centralized to a decentralized fashion. Web services are a new application model for decentralized computing and an effective mechanism for data and service integration on the Web; they have thus become a solution to e-business. It is important and necessary to carry out research on new architectures of Web services, on their combination with other techniques, and on the integration of services. This paper surveys various aspects of Web services research, from the basic concepts to the principal research problems and the underlying techniques, including data integration in Web services, Web service composition, semantic Web services, Web service discovery, Web service security, Web services in the P2P (peer-to-peer) computing environment, and grid services. It also summarizes the state of the art of these techniques, discusses future research topics, and outlines the challenges facing Web services.
    2005,16(5):857-868, DOI:
    [Abstract] (19756) [HTML] (0) [PDF 489.65 K] (30115)
    Abstract:
    Wireless Sensor Networks, a novel technology about acquiring and processing information, have been proposed for a multitude of diverse applications. The problem of self-localization, that is, determining where a given node is physically or relatively located in the networks, is a challenging one, and yet extremely crucial for many applications. In this paper, the evaluation criterion of the performance and the taxonomy for wireless sensor networks self-localization systems and algorithms are described, the principles and characteristics of recent representative localization approaches are discussed and presented, and the directions of research in this area are introduced.
    2009,20(1):54-66, DOI:
    [Abstract] (19552) [HTML] (0) [PDF 1.41 M] (50146)
    Abstract:
    Network community structure is one of the most fundamental and important topological properties of complex networks, within which the links between nodes are very dense, but between which they are quite sparse. Network clustering algorithms which aim to discover all natural network communities from given complex networks are fundamentally important for both theoretical researches and practical applications, and can be used to analyze the topological structures, understand the functions, recognize the hidden patterns, and predict the behaviors of complex networks including social networks, biological networks, World Wide Webs and so on. This paper reviews the background, the motivation, the state of arts as well as the main issues of existing works related to discovering network communities, and tries to draw a comprehensive and clear outline for this new and active research area. This work is hopefully beneficial to the researchers from the communities of complex network analysis, data mining, intelligent Web and bioinformatics.
    2012,23(4):962-986, DOI:10.3724/SP.J.1001.2012.04175
    [Abstract] (18711) [HTML] (0) [PDF 2.09 M] (31738)
    Abstract:
    Considered as the next-generation computing model, cloud computing plays an important role in scientific and commercial computing and draws great attention from both academia and industry. In a cloud computing environment, a data center consists of a large number of computers, often up to millions, and stores petabytes or even exabytes of data, which may easily lead to failures of computers or data. Such scale not only poses great challenges to the scalability of the data center and its storage system, but also results in high hardware and power costs. Therefore, the fault tolerance, scalability, and power consumption of the distributed storage of a data center become key aspects of cloud computing technology for ensuring data availability and reliability. This paper surveys the state of the art of the key technologies of cloud computing in the following respects: design of the data center network, organization and placement of data, strategies to improve fault tolerance, and methods to save storage space and energy. Firstly, several classical topologies of data center networks are introduced and compared. Secondly, current fault-tolerant storage techniques are discussed, with data replication and erasure-code strategies compared in particular. Thirdly, the main current energy-saving technologies are addressed and analyzed. Finally, challenges in distributed storage are reviewed and future research trends are predicted.
    2012,23(1):32-45, DOI:10.3724/SP.J.1001.2012.04091
    [Abstract] (18614) [HTML] (0) [PDF 408.86 K] (30804)
    Abstract:
    In many areas such as science, simulation, the Internet, and e-commerce, the volume of data to be analyzed grows rapidly. Parallel techniques that can be scaled out cost-effectively are needed to deal with such big data. Relational data management techniques have a history of nearly 40 years, but they now face a tough scalability obstacle and cannot handle very large data easily. Meanwhile, non-relational techniques, with MapReduce as a typical representative, have emerged as a new force and expanded their applications from Web search to territories that used to be occupied by relational database systems; they challenge relational techniques with high availability, high scalability, and massive parallel processing capability. The relational community, after losing the big deal of Web search, has begun to learn from MapReduce, and MapReduce in turn borrows valuable ideas from relational techniques to improve performance. As relational techniques and MapReduce compete with and learn from each other, a new data analysis platform and a new data analysis eco-system are emerging. Eventually, the two camps of techniques will find their proper places in the new eco-system of big data analysis.
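    The programming model being contrasted with relational techniques reduces to two user-supplied functions; the canonical word-count example below illustrates the map, shuffle, and reduce phases, run here in a single process rather than on a distributed runtime.

```python
# Word count in the MapReduce style: a map function emits (key, 1) pairs,
# a shuffle groups them by key, and a reduce function aggregates each group.
from collections import defaultdict

def map_fn(document):
    for word in document.split():
        yield word.lower(), 1

def reduce_fn(word, counts):
    return word, sum(counts)

documents = ["Big data needs parallel techniques", "MapReduce handles big data"]

groups = defaultdict(list)                 # shuffle phase
for doc in documents:
    for key, value in map_fn(doc):
        groups[key].append(value)

print(dict(reduce_fn(w, c) for w, c in groups.items()))
```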
    2009,20(3):524-545, DOI:
    [Abstract] (17304) [HTML] (0) [PDF 1.09 M] (22321)
    Abstract:
    Nowadays it has been widely accepted that the quality of software highly depends on the process that is carried out in an organization. As part of the effort to support software process engineering activities, the research on software process modeling and analysis is to provide an effective means to represent and analyze a process and, by doing so, to enhance the understanding of the modeled process. In addition, an enactable process model can provide a direct guidance for the actual development process. Thus, the enforcement of the process model can directly contribute to the improvement of the software quality. In this paper, a systematic review is carried out to survey the recent development in software process modeling. 72 papers from 20 conference proceedings and 7 journals are identified as the evidence. The review aims to promote a better understanding of the literature by answering the following three questions: 1) What kinds of paradigms are existing methods based on? 2) What kinds of purposes does the existing research have? 3) What kinds of new trends are reflected in the current research? After providing the systematic review, we present our software process modeling method based on a multi-dimensional and integration methodology that is intended to address several core issues facing the community.
    2009,20(1):124-137, DOI:
    [Abstract] (16875) [HTML] (0) [PDF 1.06 M] (22207)
    Abstract:
    The appearance of plenty of intelligent devices equipped for short-range wireless communications boosts the fast rise of wireless ad hoc networks application. However, in many realistic application environments, nodes form a disconnected network for most of the time due to nodal mobility, low density, lossy link, etc. Conventional communication model of mobile ad hoc network (MANET) requires at least one path existing from source to destination nodes, which results in communication failure in these scenarios. Opportunistic networks utilize the communication opportunities arising from node movement to forward messages in a hop-by-hop way, and implement communications between nodes based on the "store-carry-forward" routing pattern. This networking approach, totally different from the traditional communication model, captures great interests from researchers. This paper first introduces the conceptions and theories of opportunistic networks and some current typical applications. Then it elaborates the popular research problems including opportunistic forwarding mechanism, mobility model and opportunistic data dissemination and retrieval. Some other interesting research points such as communication middleware, cooperation and security problem and new applications are stated briefly. Finally, the paper concludes and looks forward to the possible research focuses for opportunistic networks in the future.
    2004,15(8):1208-1219, DOI:
    [Abstract] (16387) [HTML] (0) [PDF 948.49 K] (14097)
    Abstract:
    With the explosive growth of network applications and their complexity, the threat of Internet worms to network security becomes increasingly serious. Especially in the Internet environment, the variety of propagation paths and the complexity of application environments result in worms with a much higher frequency of outbreak, much deeper latency, and much wider coverage, and Internet worms have become a primary issue faced by malicious code researchers. In this paper, the concept and research status of Internet worms, their function components, and their execution mechanisms are first presented; then the scanning strategies and propagation models are discussed; and finally the critical techniques of Internet worm prevention are given. Some major problems and research trends in this area are also addressed.
    2009,20(11):2965-2976, DOI:
    [Abstract] (16360) [HTML] (0) [PDF 442.42 K] (15390)
    Abstract:
    This paper studies uncertain graph data mining and especially investigates the problem of mining frequent subgraph patterns from uncertain graph data. A data model is introduced for representing uncertainties in graphs, and an expected support is employed to evaluate the significance of subgraph patterns. By using the apriori property of expected support, a depth-first search-based mining algorithm is proposed with an efficient method for computing expected supports and a technique for pruning search space, which reduces the number of subgraph isomorphism testings needed by computing expected support from the exponential scale to the linear scale. Experimental results show that the proposed algorithm is 3 to 5 orders of magnitude faster than a naïve depth-first search algorithm, and is efficient and scalable.
    2009,20(5):1226-1240, DOI:
    [Abstract] (16245) [HTML] (0) [PDF 926.82 K] (16255)
    Abstract:
    This paper introduces in detail the combination of automated reasoning techniques with planning methods, including planning as satisfiability using propositional logic, conformant planning using modal logic and disjunctive reasoning, planning as nonmonotonic logic, and flexible planning as fuzzy description logic. Based on the experimental results of the International Planning Competition and relevant papers, it concludes that planning methods based on automated reasoning techniques are useful and worth adopting. It also points out the challenges and possible research hotspots.
    2009,20(2):350-362, DOI:
    [Abstract] (16165) [HTML] (0) [PDF 1.39 M] (40503)
    Abstract:
    This paper makes a comprehensive survey of the recommender system research aiming to facilitate readers to understand this field. First the research background is introduced, including commercial application demands, academic institutes, conferences and journals. After formally and informally describing the recommendation problem, a comparison study is conducted based on categorized algorithms. In addition, the commonly adopted benchmarked datasets and evaluation methods are exhibited and most difficulties and future directions are concluded.
    2003,14(10):1717-1727, DOI:
    [Abstract] (16143) [HTML] (0) [PDF 839.25 K] (14806)
    Abstract:
    Sensor networks are integration of sensor techniques, nested computation techniques, distributed computation techniques and wireless communication techniques. They can be used for testing, sensing, collecting and processing information of monitored objects and transferring the processed information to users. Sensor network is a new research area of computer science and technology and has a wide application future. Both academia and industries are very interested in it. The concepts and characteristics of the sensor networks and the data in the networks are introduced, and the issues of the sensor networks and the data management of sensor networks are discussed. The advance of the research on sensor networks and the data management of sensor networks are also presented.
    2015,26(1):62-81, DOI:10.13328/j.cnki.jos.004701
    [Abstract] (15799) [HTML] (3020) [PDF 1.04 M] (26323)
    Abstract:
    Network abstraction has brought about the emergence of software-defined networking (SDN). SDN decouples the data plane and the control plane and simplifies network management. The paper starts with a discussion of the background to the emergence and development of SDN and outlines its architecture, which includes the data layer, control layer, and application layer. The key technologies are then elaborated according to the hierarchical architecture of SDN, with the characteristics of consistency, availability, and tolerance analyzed in particular. Moreover, the latest achievements in typical application scenarios are introduced. Future work is summarized at the end.
    2014,25(4):839-862, DOI:10.13328/j.cnki.jos.004558
    [Abstract] (15388) [HTML] (2366) [PDF 1.32 M] (19508)
    Abstract:
    Batch computing and stream computing are two important forms of big data computing. The research and discussions on batch computing in big data environment are comparatively sufficient. But how to efficiently deal with stream computing to meet many requirements, such as low latency, high throughput and continuously reliable running, and how to build efficient stream big data computing systems, are great challenges in the big data computing research. This paper provides a research of the data computing architecture and the key issues in stream computing in big data environments. Firstly, the research gives a brief summary of three application scenarios of stream computing in business intelligence, marketing and public service. It also shows distinctive features of the stream computing in big data environment, such as real time, volatility, burstiness, irregularity and infinity. A well-designed stream computing system always optimizes in system structure, data transmission, application interfaces, high-availability, and so on. Subsequently, the research offers detailed analyses and comparisons of five typical and open-source stream computing systems in big data environment. Finally, the research specifically addresses some new challenges of the stream big data systems, such as scalability, fault tolerance, consistency, load balancing and throughput.
    2012,23(1):1-20, DOI:10.3724/SP.J.1001.2012.04100
    [Abstract] (14403) [HTML] (0) [PDF 1017.73 K] (31322)
    Abstract:
    Context-Aware recommender systems, aiming to further improve performance accuracy and user satisfaction by fully utilizing contextual information, have recently become one of the hottest topics in the domain of recommender systems. This paper presents an overview of the field of context-aware recommender systems from a process-oriented perspective, including system frameworks, key techniques, main models, evaluation, and typical applications. The prospects for future development and suggestions for possible extensions are also discussed.
    2009,20(10):2729-2743, DOI:
    [Abstract] (14386) [HTML] (0) [PDF 1.12 M] (11033)
    Abstract:
    In a multi-hop wireless sensor network (WSN), the sensors closest to the sink tend to deplete their energy faster than other sensors, which is known as an energy hole around the sink. No more data can be delivered to the sink after an energy hole appears, while a considerable amount of energy is wasted and the network lifetime ends prematurely. This paper investigates the energy hole problem, and based on the improved corona model with levels, it concludes that the assignment of transmission ranges of nodes in different coronas is an effective approach for achieving energy-efficient network. It proves that the optimal transmission ranges for all areas is a multi-objective optimization problem (MOP), which is NP hard. The paper proposes an ACO (ant colony optimization)-based distributed algorithm to prolong the network lifetime, which can help nodes in different areas to adaptively find approximate optimal transmission range based on the node distribution. Furthermore, the simulation results indicate that the network lifetime under this solution approximates to that using the optimal list. Compared with existing algorithms, this ACO-based algorithm can not only make the network lifetime be extended more than two times longer, but also have good performance in the non-uniform node distribution.
    2012,23(5):1148-1166, DOI:10.3724/SP.J.1001.2012.04195
    [Abstract] (14251) [HTML] (0) [PDF 946.37 K] (17273)
    Abstract:
    With the recent development of cloud computing, the importance of cloud databases has been widely acknowledged. Here, the features, influence and related products of cloud databases are first discussed. Then, research issues of cloud databases are presented in detail, which include data model, architecture, consistency, programming model, data security, performance optimization, benchmark, and so on. Finally, some future trends in this area are discussed.
    2000,11(11):1460-1466, DOI:
    [Abstract] (14177) [HTML] (0) [PDF 520.69 K] (11447)
    Abstract:
    Intrusion detection is a highlighted topic of network security research in recent years. In this paper, first the necessity of intrusion detection is presented, and its concepts and models are described. Then, many intrusion detection techniques and architectures are summarized. Finally, the existing problems and the future direction in this field are discussed.
    2013,24(8):1786-1803, DOI:10.3724/SP.J.1001.2013.04416
    [Abstract] (13879) [HTML] (0) [PDF 1.04 M] (17105)
    Abstract:
    Many specific application oriented NoSQL database systems are developed for satisfying the new requirement of big data management. This paper surveys researches on typical NoSQL database based on key-value data model. First, the characteristics of big data, and the key technique issues supporting big data management are introduced. Then frontier efforts and research challenges are given, including system architecture, data model, access mode, index, transaction, system elasticity, load balance, replica strategy, data consistency, flash cache, MapReduce based data process and new generation data management system etc. Finally, research prospects are given.
    2002,13(7):1228-1237, DOI:
    [Abstract] (13876) [HTML] (0) [PDF 500.04 K] (14322)
    Abstract:
    Software architecture (SA) has recently emerged as one of the primary research areas in software engineering and one of the key technologies for developing large-scale software-intensive systems and software product lines. This paper summarizes the history and major directions of SA and, based on analyzing and comparing several classical definitions, puts forward a concept of SA. By summing up the activities involved, two categories of SA research are identified, and recent advances in SA research are then introduced from seven aspects. Additionally, some weaknesses of current SA research are discussed and their causes explained. Finally, the paper concludes with some significantly promising trends in SA research.
    2004,15(4):571-583, DOI:
    [Abstract] (13688) [HTML] (0) [PDF 1005.17 K] (10034)
    Abstract:
    For most peer-to-peer file-swapping applications, sharing is a voluntary action, and peers are not held responsible for their irresponsible bartering history. This indicates that trust between participants cannot be established simply on the traditional trust mechanism. A reasonable approach to trust construction comes from social network analysis, in which trust relations between individuals are built upon the recommendations of other individuals. Current P2P trust models cannot guarantee the convergence of the iteration for trust computation and take no account of model security problems such as sybil attacks and slandering. This paper presents a novel recommendation-based global trust model and gives a distributed implementation method. Mathematical analyses and simulations show that, compared with current global trust models, the proposed model is more robust against trust security problems and more sound in the iteration for computing peer trust.
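    Recommendation-based global trust of this kind is typically computed by iterating over normalized local trust values until a fixed point is reached, in the spirit of EigenTrust; the four-peer local trust matrix below is invented, and the sketch omits the security defenses (pre-trusted peers, sybil and slander handling) that the abstract emphasizes.

```python
# Global trust by power iteration over row-normalized local trust values
# (EigenTrust-style sketch; the local trust matrix is a toy example).
import numpy as np

local = np.array([[0.0, 4.0, 1.0, 0.0],    # local[i, j]: peer i's rating of peer j
                  [2.0, 0.0, 3.0, 1.0],
                  [1.0, 2.0, 0.0, 2.0],
                  [0.0, 1.0, 1.0, 0.0]])

C = local / local.sum(axis=1, keepdims=True)   # normalize each peer's ratings
t = np.full(4, 1 / 4)                          # start from uniform trust

for _ in range(100):
    t_next = C.T @ t                           # aggregate recommendations
    if np.linalg.norm(t_next - t, 1) < 1e-10:
        break
    t = t_next

print(np.round(t, 3))                          # global trust vector (sums to 1)
```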
    2006,17(7):1588-1600, DOI:
    [Abstract] (13648) [HTML] (0) [PDF 808.73 K] (14629)
    Abstract:
    Routing technology at the network layer is pivotal in the architecture of wireless sensor networks. As an active branch of routing technology, cluster-based routing protocols excel in network topology management, energy minimization, data aggregation and so on. In this paper, cluster-based routing mechanisms for wireless sensor networks are analyzed. Cluster head selection, cluster formation and data transmission are three key techniques in cluster-based routing protocols. As viewed from the three techniques, recent representative cluster-based routing protocols are presented, and their characteristics and application areas are compared. Finally, the future research issues in this area are pointed out.
    2011,22(1):115-131, DOI:10.3724/SP.J.1001.2011.03950
    [Abstract] (13634) [HTML] (0) [PDF 845.91 K] (28194)
    Abstract:
    The Internet traffic model is the key issue for network performance management, Quality of Service management, and admission control. The paper first summarizes the primary characteristics of Internet traffic, as well as the metrics of Internet traffic. It also illustrates the significance and classification of traffic modeling. Next, the paper chronologically categorizes the research activities of traffic modeling into three phases: 1) traditional Poisson modeling; 2) self-similar modeling; and 3) new research debates and new progress. Thorough reviews of the major research achievements of each phase are conducted. Finally, the paper identifies some open research issue and points out possible future research directions in traffic modeling area.
    2009,20(1):11-29, DOI:
    [Abstract] (13581) [HTML] (0) [PDF 787.30 K] (14342)
    Abstract:
    Constrained optimization problems (COPs) are mathematical programming problems frequently encountered in the disciplines of science and engineering application. Solving COPs has become an important research area of evolutionary computation in recent years. In this paper, the state of the art of constrained optimization evolutionary algorithms (COEAs) is surveyed from two basic aspects of COEAs (i.e., constraint-handling techniques and evolutionary algorithms). In addition, this paper discusses some important issues of COEAs. More specifically, several typical algorithms are analyzed in detail. Based on the analyses, it is concluded that to obtain competitive results, a proper constraint-handling technique needs to be considered in conjunction with an appropriate search algorithm. Finally, the open research issues in this field are also pointed out.
    2015,26(1):26-39, DOI:10.13328/j.cnki.jos.004631
    [Abstract] (13562) [HTML] (2309) [PDF 763.52 K] (16145)
    Abstract:
    In recent years, transfer learning has attracted a vast amount of attention and research. Transfer learning is a new machine learning method that applies knowledge from related but different domains to target domains. It relaxes the two basic assumptions in traditional machine learning: (1) the training data (also referred to as the source domain) and the test data (also referred to as the target domain) satisfy the independent and identically distributed (i.i.d.) condition; (2) there are enough labeled samples to learn a good classification model. It thus aims to solve the problem that there are few or even no labeled data in target domains. This paper surveys the research progress of transfer learning and introduces the authors' own work, especially on building transfer learning models by applying generative models at the concept level. Finally, the paper introduces the applications of transfer learning, such as text classification and collaborative filtering, and suggests future research directions for transfer learning.
    2008,19(zk):112-120, DOI:
    [Abstract] (13507) [HTML] (0) [PDF 594.29 K] (14714)
    Abstract:
    An ad hoc network is a collection of wireless mobile nodes dynamically forming a temporary network without the use of any existing network infrastructure or centralized administration. Due to bandwidth constraint and dynamic topology of mobile ad hoc networks, multipath supported routing is a very important research issue. In this paper, we present an entropy-based metric to support stability multipath on-demand routing (SMDR). The key idea of SMDR protocol is to construct the new metric-entropy and select the stability multipath with the help of entropy metric to reduce the number of route reconstruction so as to provide QoS guarantee in the ad hoc network whose topology changes continuously. Simulation results show that, with the proposed multipath routing protocol, packet delivery ratio, end-to-end delay, and routing overhead ratio can be improved in most of cases. It is an available approach to multipath routing decision.
    2013,24(1):50-66, DOI:10.3724/SP.J.1001.2013.04276
    [Abstract] (13337) [HTML] (0) [PDF 0.00 Byte] (17025)
    Abstract:
    As an important application of acceleration in the cloud, the distributed caching technology has received considerable attention in industry and academia. This paper starts with a discussion on the combination of cloud computing and distributed caching technology, giving an analysis of its characteristics, typical application scenarios, stages of development, standards, and several key elements, which have promoted its development. In order to systematically know the state of art progress and weak points of the distributed caching technology, the paper builds a multi-dimensional framework, DctAF. This framework is constituted of 6 dimensions through analyzing the characteristics of cloud computing and boundary of the caching techniques. Based on DctAF, current techniques have been analyzed and summarized; comparisons among several influential products have also been made. Finally, the paper describes and highlights the several challenges that the cache system faces and examines the current research through in-depth analysis and comparison.
    2003,14(9):1621-1628, DOI:
    [Abstract] (13153) [HTML] (0) [PDF 680.35 K] (20082)
    Abstract:
    Recommendation systems are one of the most important technologies in e-commerce. With the development of e-commerce, the numbers of users and commodities grow rapidly, resulting in extremely sparse user rating data. Traditional similarity measures work poorly in this situation, and the quality of recommendation decreases dramatically. To address this issue, a novel collaborative filtering algorithm based on item rating prediction is proposed. This method predicts the ratings of items that users have not rated by using item similarity, and then applies a new similarity measure to find the target user's neighbors. Experimental results show that this method can effectively alleviate the extreme sparsity of user rating data and provide better recommendation results than traditional collaborative filtering algorithms.
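    The two-step idea above (first fill missing ratings from item-item similarity, then look for neighbors on the densified matrix) can be sketched as follows; the tiny rating matrix and the plain cosine similarity are illustrative stand-ins, not the paper's exact similarity measure.

```python
# Sketch of item-rating-prediction collaborative filtering: predict missing
# ratings from similar items, then measure user similarity on the filled matrix.
import numpy as np

R = np.array([[5, 3, 0, 1],        # rows: users, columns: items, 0 = unrated
              [4, 0, 0, 1],
              [1, 1, 0, 5],
              [0, 1, 5, 4]], dtype=float)

def cosine(M):
    norm = np.linalg.norm(M, axis=0, keepdims=True) + 1e-12
    return (M / norm).T @ (M / norm)

item_sim = cosine(R)                              # item-item similarity from observed ratings
filled = R.copy()
for u, i in zip(*np.where(R == 0)):               # predict each missing rating
    rated = np.where(R[u] > 0)[0]
    w = item_sim[i, rated]
    filled[u, i] = (w @ R[u, rated]) / (w.sum() + 1e-12)

user_sim = cosine(filled.T)                       # user-user similarity on the dense matrix
print(np.round(filled, 2))
print(np.round(user_sim, 2))
```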
    2008,19(8):1947-1964, DOI:
    [Abstract] (12972) [HTML] (0) [PDF 811.11 K] (10161)
    Abstract:
    Wide-Spread deployment for interactive information visualization is difficult. Non-Specialist users need a general development method and a toolkit to support the generic data structures suited to tree, network and multi-dimensional data, special visualization techniques and interaction techniques, and well-known generic information tasks. This paper presents a model driven development method for interactive information visualization. First, an interactive information visualization interface model (IIVM) is proposed. Then, the development method for interactive information visualization based on IIVM is presented. The Daisy toolkit is introduced, which includes Daisy model builder, Daisy IIV generator and runtime framework with Daisy library. Finally, an application example is given. Experimental results show that Daisy can provide a general solution for development for interactive information visualization.
    2003,14(9):1635-1644, DOI:
    [Abstract] (12969) [HTML] (0) [PDF 622.06 K] (12053)
    Abstract:
    Computer forensics is the technology field that attempts to provide thorough, efficient, and secure means to investigate computer crime. Computer evidence must be authentic, accurate, complete, and convincing to juries. In this paper, the stages of computer forensics are presented, and the theories and the realization of forensics software are described. An example of forensic practice is also given. The deficiencies of computer forensics techniques and anti-forensics are also discussed. It is concluded that, with the improvement of computer science and technology, forensics techniques will become more integrated and thorough.
    2008,19(8):1902-1919, DOI:
    [Abstract] (12954) [HTML] (0) [PDF 521.73 K] (13637)
    Abstract:
    Visual language techniques have exhibited more advantages in describing various software artifacts than one-dimensional textual languages during software development, ranging from the requirement analysis and design to testing and maintenance, as diagrammatic and graphical notations have been well applied in modeling system. In addition to an intuitive appearance, graph grammars provide a well-established foundation for defining visual languages with the power of precise modeling and verification on computers. This paper discusses the issues and techniques for a formal foundation of visual languages, reviews related practical graphical environments, presents a spatial graph grammar formalism, and applies the spatial graph grammar to defining behavioral semantics of UML diagrams and developing a style-driven framework for software architecture design.
    2002,13(10):1952-1961, DOI:
    [Abstract] (12947) [HTML] (0) [PDF 570.96 K] (12268)
    Abstract:
    The crucial technologies related to personalization are introduced in this paper, which include the representation and modification of user profile, the representation of resource, the recommendation technology, and the architecture of personalization. By comparing with some existing prototype systems, the key technologies about how to implement personalization are discussed in detail. In addition, three representative personalization systems are analyzed. At last, some research directions for personalization are presented.
    2012,23(1):82-96, DOI:10.3724/SP.J.1001.2012.04101
    [Abstract] (12857) [HTML] (0) [PDF 394.07 K] (14624)
    Abstract:
    Botnets are one of the most serious threats to the Internet. Researchers have done plenty of work and made significant progress. However, botnets keep evolving and have become more and more sophisticated. Due to the underlying security limitations of current systems and the Internet architecture, and the complexity of botnets themselves, how to effectively counter the global threat of botnets is still a very challenging issue. This paper first introduces the evolution of botnets' propagation, attack, command, and control mechanisms. Then the paper summarizes recent advances in botnet defense research and categorizes them into five areas: botnet monitoring, botnet infiltration, analysis of botnet characteristics, botnet detection, and botnet disruption. The limitations of current botnet defense techniques, the evolving trend of botnets, and some possible directions for future research are also discussed.
    2010,21(2):231-247, DOI:
    [Abstract] (12703) [HTML] (0) [PDF 1.21 M] (16309)
    Abstract:
    In this paper, a framework is proposed for handling faults in service composition by analyzing fault requirements. Petri nets are used in the framework for fault detection and handling, focusing on failures of available services, component failures, and network failures. The corresponding fault models are given. Based on these models, the correctness criterion of fault handling is given, the fault handling model is analyzed, and its correctness is proven. Finally, CTL (computational tree logic) is used to specify the related properties and the enforcement algorithm of fault analysis. The simulation results show that this method can ensure the reliability and consistency of service composition.
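    As a rough, hypothetical illustration of the kind of property such a specification can express (the paper's concrete formulas are not reproduced here), a CTL requirement that every detected fault is eventually handled on all execution paths can be written as AG (fault_detected -> AF fault_handled), where AG reads "on all paths, always" and AF reads "on all paths, eventually".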
    2008,19(7):1565-1580, DOI:
    [Abstract] (12665) [HTML] (0) [PDF 815.02 K] (16422)
    Abstract:
    Software defect prediction has been an active part of software engineering since it emerged in the 1970s. It plays a very important role in the analysis of software quality and the balance of software cost. This paper investigates and discusses the motivation, evolution, solutions, and challenges of software defect prediction technologies, and it also categorizes, analyzes, and compares representative prediction technologies. Some case studies on software defect distribution models are given to aid understanding.
    2017,28(1):1-16, DOI:10.13328/j.cnki.jos.005139
    [Abstract] (12500) [HTML] (2926) [PDF 1.75 M] (9309)
    Abstract:
    The knapsack problem (KP) is a well-known combinatorial optimization problem which includes the 0-1 KP, bounded KP, multi-constraint KP, multiple KP, multiple-choice KP, quadratic KP, dynamic KP, discounted KP, and other types of KPs. KP can be considered as a mathematical model extracted from a variety of real fields and therefore has wide applications. Evolutionary algorithms (EAs) are universally considered an efficient tool for solving KP approximately and quickly. This paper presents a survey on solving KP by EAs over the past ten years. It not only discusses various KP encoding mechanisms and the handling of infeasible individual solutions, but also provides useful guidelines for designing new EAs to solve KPs.
    2010,21(7):1620-1634, DOI:
    [Abstract] (12469) [HTML] (0) [PDF 765.23 K] (19862)
    Abstract:
    As an application of mobile ad hoc networks (MANET) on Intelligent Transportation Information System, the most important goal of vehicular ad hoc networks (VANET) is to reduce the high number of accidents and fatal consequences dramatically. One of the most important factors that would contribute to the realization of this goal is the design of effective broadcast protocols. This paper introduces the characteristics and application fields of VANET briefly. Then, it discusses the characteristics, performance, and application areas with analysis and comparison of various categories of broadcast protocols in VANET. According to the characteristic of VANET and its application requirement, the paper proposes the ideas and breakthrough direction of information broadcast model design of inter-vehicle communication.
    2010,21(5):916-929, DOI:
    [Abstract] (12308) [HTML] (0) [PDF 944.50 K] (17658)
    Abstract:
    Data deduplication technologies can be divided into two categories: a) identical data detection techniques, and b) similar data detection and encoding techniques. This paper presents a systematic survey on these two categories of data deduplication technologies and analyzes their advantages and disadvantages. Besides, since data deduplication technologies can affect the reliability and performance of storage systems, this paper also surveys various kinds of technologies proposed to cope with these two aspects of problems. Based on the analysis of the current state of research on data deduplication technologies, this paper makes several conclusions as follows: a) How to mine data characteristic information in data deduplication has not been completely solved, and how to use data characteristic information to effectively eliminate duplicate data also needs further study; b) From the perspective of storage system design, it still needs further study how to introduce proper mechanisms to overcome the reliability limitations of data deduplication techniques and reduce the additional system overheads caused by data deduplication techniques.
    2006,17(9):1848-1859, DOI:
    [Abstract] (12297) [HTML] (0) [PDF 770.40 K] (20886)
    Abstract:
    In recent years, there have been extensive studies and rapid progress in automatic text categorization, which is one of the hotspots and key techniques in the information retrieval and data mining field. Highlighting the state-of-the-art challenging issues and research trends for content information processing of the Internet and other complex applications, this paper presents a survey on the up-to-date development in text categorization based on machine learning, including models, algorithms, and evaluation. It is pointed out that problems such as nonlinearity, skewed data distribution, labeling bottleneck, hierarchical categorization, scalability of algorithms, and categorization of Web pages are the key problems in the study of text categorization. Possible solutions to these problems are also discussed respectively. Finally, some future directions of research are given.
    2009,20(6):1393-1405, DOI:
    [Abstract] (12133) [HTML] (0) [PDF 831.86 K] (18536)
    Abstract:
    Combinatorial testing can use a small number of test cases to test systems while preserving fault detection ability. However, the complexity of test case generation problem for combinatorial testing is NP-complete. The efficiency and complexity of this testing method have attracted many researchers from the area of combinatorics and software engineering. This paper summarizes the research works on this topic in recent years. They include: various combinatorial test criteria, the relations between the test generation problem and other NP-complete problems, the mathematical methods for constructing test cases, the computer search techniques for test generation and fault localization techniques based on combinatorial testing.
    2008,19(10):2706-2719, DOI:
    [Abstract] (12117) [HTML] (0) [PDF 778.29 K] (11729)
    Abstract:
    Web search engine has become a very important tool for finding information efficiently from the massive Web data. With the explosive growth of the Web data, traditional centralized search engines become harder to catch up with the growing step of people's information needs. With the rapid development of peer-to-peer (P2P) technology, the notion of P2P Web search has been proposed and quickly becomes a research focus. The goal of this paper is to give a brief summary of current P2P Web search technologies in order to facilitate future research. First, some main challenges for P2P Web search are presented. Then, key techniques for building a feasible and efficient P2P Web search engine are reviewed, including system topology, data placement, query routing, index partitioning, collection selection, relevance ranking and Web crawling. Finally, three recently proposed novel P2P Web search prototypes are introduced.
  • Full-text download ranking (overall ranking / annual ranking / per-issue ranking)
    Abstract click ranking (overall ranking / annual ranking / per-issue ranking)

    2003,14(7):1282-1291, DOI:
    [Abstract] (37076) [HTML] (0) [PDF 832.28 K] (80039)
    Abstract:
    Sensor networks, which are formed by the convergence of sensor, micro-electro-mechanical system, and network technologies, are a novel technology for acquiring and processing information. In this paper, the architecture of wireless sensor networks is briefly introduced. Next, some valuable applications are explained and forecasted. Combining with existing work, hot topics including power-aware routing and media access control schemes are discussed and presented in detail. Finally, taking account of application requirements, several future research directions are put forward.
    2008,19(1):48-61, DOI:
    [Abstract] (28054) [HTML] (0) [PDF 671.39 K] (61318)
    Abstract:
    The research status and recent progress in clustering algorithms are summarized in this paper. First, some representative clustering algorithms are analyzed and summarized from several aspects, such as algorithmic ideas, key techniques, advantages, and disadvantages. Second, several typical clustering algorithms and well-known data sets are selected, and simulation experiments are carried out from the two perspectives of accuracy and running efficiency; the behavior of each algorithm on different data sets is analyzed, and the results of different algorithms on the same data set are compared. Finally, the research hotspots, difficulties, and shortcomings of data clustering, as well as some open problems, are addressed by integrating the information from the two aspects above. This work can give a valuable reference for data clustering and data mining.
    2010,21(8):1834-1848, DOI:
    [Abstract] (20589) [HTML] (0) [PDF 682.96 K] (55903)
    Abstract:
    This paper surveys the state of the art of sentiment analysis. First, three important tasks of sentiment analysis are summarized and analyzed in detail, including sentiment extraction, sentiment classification, sentiment retrieval and summarization. Then, the evaluation and corpus for sentiment analysis are introduced. Finally, the applications of sentiment analysis are concluded. This paper aims to take a deep insight into the mainstream methods and recent progress in this field, making detailed comparison and analysis.
    2011,22(1):71-83, DOI:10.3724/SP.J.1001.2011.03958
    [Abstract] (29824) [HTML] (0) [PDF 781.42 K] (54947)
    Abstract:
    Cloud computing is a fundamental change happening in the field of information technology. It represents a movement towards intensive, large-scale specialization. At the same time, it brings not only convenience and efficiency but also great challenges in the field of data security and privacy protection. Currently, security is regarded as one of the greatest problems in the development of cloud computing. This paper describes the important security requirements in cloud computing, key security technologies, standards, regulations, etc., and provides a cloud computing security framework. This paper argues that the changes in the above aspects will result in a technical revolution in the field of information security.
    2009,20(1):54-66, DOI:
    [Abstract] (19552) [HTML] (0) [PDF 1.41 M] (50146)
    Abstract:
    Network community structure is one of the most fundamental and important topological properties of complex networks, within which the links between nodes are very dense, but between which they are quite sparse. Network clustering algorithms which aim to discover all natural network communities from given complex networks are fundamentally important for both theoretical researches and practical applications, and can be used to analyze the topological structures, understand the functions, recognize the hidden patterns, and predict the behaviors of complex networks including social networks, biological networks, World Wide Webs and so on. This paper reviews the background, the motivation, the state of arts as well as the main issues of existing works related to discovering network communities, and tries to draw a comprehensive and clear outline for this new and active research area. This work is hopefully beneficial to the researchers from the communities of complex network analysis, data mining, intelligent Web and bioinformatics.
    2009,20(5):1337-1348, DOI:
    [Abstract] (28049) [HTML] (0) [PDF 1.06 M] (44583)
    Abstract:
    This paper surveys the current technologies adopted in cloud computing as well as the systems in enterprises. Cloud computing can be viewed from two different aspects. One is about the cloud infrastructure which is the building block for the up layer cloud application. The other is of course the cloud application. This paper focuses on the cloud infrastructure including the systems and current research. Some attractive cloud applications are also discussed. Cloud computing infrastructure has three distinct characteristics. First, the infrastructure is built on top of large scale clusters which contain a large number of cheap PC servers. Second, the applications are co-designed with the fundamental infrastructure that the computing resources can be maximally utilized. Third, the reliability of the whole system is achieved by software building on top of redundant hardware instead of mere hardware. All these technologies are for the two important goals for distributed system: high scalability and high availability. Scalability means that the cloud infrastructure can be expanded to very large scale even to thousands of nodes. Availability means that the services are available even when quite a number of nodes fail. From this paper, readers will capture the current status of cloud computing as well as its future trends.
    2009,20(2):271-289, DOI:
    [Abstract] (26974) [HTML] (0) [PDF 675.56 K] (43066)
    Abstract:
    Evolutionary multi-objective optimization (EMO), whose main task is to deal with multi-objective optimization problems by evolutionary computation, has become a hot topic in the evolutionary computation community. After briefly summarizing the EMO algorithms before 2003, the recent advances in EMO are discussed in detail. The current research directions are summarized. On the one hand, more new evolutionary paradigms have been introduced into the EMO community, such as particle swarm optimization, artificial immune systems, and estimation of distribution algorithms. On the other hand, in order to deal with many-objective optimization problems, many new dominance schemes different from traditional Pareto dominance have come forth. Furthermore, the essential characteristics of multi-objective optimization problems are deeply investigated. This paper also gives an experimental comparison of several representative algorithms. Finally, several viewpoints for the future research of EMO are proposed.
    2009,20(2):350-362, DOI:
    [Abstract] (16165) [HTML] (0) [PDF 1.39 M] (40503)
    Abstract:
    This paper makes a comprehensive survey of the recommender system research aiming to facilitate readers to understand this field. First the research background is introduced, including commercial application demands, academic institutes, conferences and journals. After formally and informally describing the recommendation problem, a comparison study is conducted based on categorized algorithms. In addition, the commonly adopted benchmarked datasets and evaluation methods are exhibited and most difficulties and future directions are concluded.
    2004,15(10):1493-1504, DOI:
    [Abstract] (9088) [HTML] (0) [PDF 937.72 K] (39241)
    Abstract:
    The graphics processing unit (GPU) has been developing rapidly in recent years at a speed exceeding Moore's law, and as a result, various applications associated with computer graphics have advanced greatly. At the same time, the high processing power, parallelism, and programmability available on contemporary GPUs provide an ideal platform on which general-purpose computation can be performed. Starting from an introduction to the development history and the architecture of the GPU, the technical fundamentals of the GPU are described in the paper. Then, in the main part of the paper, the development of various applications of general-purpose computation on the GPU is introduced, and among those applications, fluid dynamics, algebraic computation, database operations, and spectrum analysis are introduced in detail. Our experience with work on fluid dynamics is also given, and the development of software tools in this area is introduced. Finally, a conclusion is made, and the future development and the new challenges on both hardware and software in this subject are discussed.
    2010,21(3):427-437, DOI:
    [Abstract] (32914) [HTML] (0) [PDF 308.76 K] (38483)
    Abstract:
    Automatic generation of poetry has always been considered a hard nut to crack in natural language generation. This paper reports some pioneering research on a possible genetic algorithm and its automatic generation of SONGCI. In light of the characteristics of Chinese ancient poetry, this paper designs a coding method based on level and oblique tones, a syntactically and semantically weighted fitness function, a selection operator combining elitism and roulette wheel, a partially mapped crossover operator, and a heuristic mutation operator. As shown by tests, the system constructed on the basis of the computing model designed in this paper is basically capable of generating Chinese SONGCI with some aesthetic merit. This work represents progress in the field of automatic generation of Chinese poetry.
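    For readers unfamiliar with the evolutionary loop sketched above, the following generic Python skeleton (an illustrative sketch, not the paper's system; the tone-based encoding and the weighted fitness function are deliberately left abstract) shows how elitism, roulette-wheel selection, crossover, and mutation typically fit together:
        # Generic genetic-algorithm skeleton with elitism and roulette-wheel selection.
        # fitness, crossover and mutate are problem-specific callables supplied by the user;
        # fitness is assumed to return non-negative values.
        import random

        def evolve(population, fitness, crossover, mutate,
                   generations=100, elite=2, p_mut=0.1):
            for _ in range(generations):
                ranked = sorted(population, key=fitness, reverse=True)
                new_pop = ranked[:elite]                        # elitism: keep the best individuals
                weights = [fitness(ind) for ind in population]  # roulette-wheel weights
                while len(new_pop) < len(population):
                    a, b = random.choices(population, weights=weights, k=2)
                    child = crossover(a, b)
                    if random.random() < p_mut:
                        child = mutate(child)
                    new_pop.append(child)
                population = new_pop
            return max(population, key=fitness)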
    2014,25(9):1889-1908, DOI:10.13328/j.cnki.jos.004674
    [Abstract] (11675) [HTML] (2938) [PDF 550.98 K] (36293)
    Abstract:
    This paper first introduces the key features of big data in different processing modes and their typical application scenarios, as well as corresponding representative processing systems. It then summarizes three development trends of big data processing systems. Next, the paper gives a brief survey on system supported analytic technologies and applications (including deep learning, knowledge computing, social computing, and visualization), and summarizes the key roles of individual technologies in big data analysis and understanding. Finally, the paper lays out three grand challenges of big data processing and analysis, i.e., data complexity, computation complexity, and system complexity. Potential ways for dealing with each complexity are also discussed.
    2013,24(11):2476-2497, DOI:10.3724/SP.J.1001.2013.04486
    [Abstract] (10343) [HTML] (0) [PDF 1.14 M] (34705)
    Abstract:
    Probabilistic graphical models are powerful tools for compactly representing complex probability distributions, efficiently computing (approximate) marginal and conditional distributions, and conveniently learning parameters and hyperparameters in probabilistic models. As a result, they have been widely used in applications that require some sort of automated probabilistic reasoning, such as computer vision and natural language processing, as a formal approach to deal with uncertainty. This paper surveys the basic concepts and key results of representation, inference and learning in probabilistic graphical models, and demonstrates their uses in two important probabilistic models. It also reviews some recent advances in speeding up classic approximate inference algorithms, followed by a discussion of promising research directions.
    2012,23(4):962-986, DOI:10.3724/SP.J.1001.2012.04175
    [Abstract] (18711) [HTML] (0) [PDF 2.09 M] (31738)
    Abstract:
    Considered as the next-generation computing model, cloud computing plays an important role in scientific and commercial computing and draws great attention from both academia and industry. In a cloud computing environment, a data center consists of a large number of computers, usually up to millions, and stores petabytes or even exabytes of data, which may easily lead to failures of computers or data. Such a large number of computers not only poses great challenges to the scalability of the data center and its storage system, but also results in high hardware infrastructure costs and power costs. Therefore, fault tolerance, scalability, and power consumption of the distributed storage of a data center become key parts of cloud computing technology for ensuring data availability and reliability. In this paper, a survey is made on the state of the art of the key technologies in cloud computing in the following aspects: design of the data center network, organization and arrangement of data, strategies to improve fault tolerance, and methods to save storage space and energy. Firstly, several kinds of classical topologies of data center networks are introduced and compared. Secondly, current fault-tolerant storage techniques are discussed, and data replication and erasure code strategies are especially compared. Thirdly, the main current energy-saving technologies are addressed and analyzed. Finally, challenges in distributed storage are reviewed, and future research trends are predicted.
    2012,23(1):1-20, DOI:10.3724/SP.J.1001.2012.04100
    [Abstract] (14403) [HTML] (0) [PDF 1017.73 K] (31322)
    Abstract:
    Context-Aware recommender systems, aiming to further improve performance accuracy and user satisfaction by fully utilizing contextual information, have recently become one of the hottest topics in the domain of recommender systems. This paper presents an overview of the field of context-aware recommender systems from a process-oriented perspective, including system frameworks, key techniques, main models, evaluation, and typical applications. The prospects for future development and suggestions for possible extensions are also discussed.
    2018,29(5):1471-1514, DOI:10.13328/j.cnki.jos.005519
    [Abstract] (5803) [HTML] (3311) [PDF 4.38 M] (31093)
    Abstract:
    Computer-aided detection/diagnosis (CAD) can improve the accuracy of diagnosis, reduce false positives, and provide decision support for doctors. The main purpose of this paper is to analyze the latest developments in computer-aided diagnosis tools. Focusing on the incidence sites of the four most lethal cancers, major recent publications on CAD applications in different medical imaging areas are reviewed in this survey according to different imaging techniques and diseases. Furthermore, a multidimensional analysis is made of the research from the perspectives of image data sets, algorithms, and evaluation methods. Finally, existing problems, research trends, and development directions in the field of medical image CAD systems are discussed.
    2016,27(1):45-71, DOI:10.13328/j.cnki.jos.004914
    [Abstract] (29103) [HTML] (2667) [PDF 880.96 K] (30986)
    Abstract:
    Android is a modern and highly popular software platform for smartphones. According to reports, Android accounted for 81% of all smartphones in 2014 and shipped over 1 billion units worldwide for the first time, with Apple, Microsoft, Blackberry, and Firefox trailing a long way behind. At the same time, the increased popularity of Android smartphones has attracted hackers, leading to a massive increase in Android malware applications. This paper summarizes and analyzes the latest advances in Android security from multidimensional perspectives, covering Android architecture, design principles, security mechanisms, major security threats, classification and detection of malware, static and dynamic analyses, machine learning approaches, and security extension proposals.
    2012,23(1):32-45, DOI:10.3724/SP.J.1001.2012.04091
    [Abstract] (18614) [HTML] (0) [PDF 408.86 K] (30804)
    Abstract:
    In many areas such as science, simulation, the Internet, and e-commerce, the volume of data to be analyzed grows rapidly. Parallel techniques that can be expanded cost-effectively are needed to deal with such big data. Relational data management technology has gone through a history of nearly 40 years. It now encounters the tough obstacle of scalability: relational techniques cannot handle very large data easily. In the meantime, non-relational techniques, with MapReduce as a typical representative, have emerged as a new force and expanded their application from Web search to territories that used to be occupied by relational database systems. They confront relational techniques with high availability, high scalability, and massive parallel processing capability. The relational technique community, after losing the big deal of Web search, has begun to learn from MapReduce, while MapReduce also borrows valuable ideas from the relational community to improve performance. Relational techniques and MapReduce compete with and learn from each other; a new data analysis platform and a new data analysis eco-system are emerging. Finally, the two camps of techniques will find their right places in the new eco-system of big data analysis.
    2005,16(5):857-868, DOI:
    [Abstract] (19756) [HTML] (0) [PDF 489.65 K] (30115)
    Abstract:
    Wireless Sensor Networks, a novel technology about acquiring and processing information, have been proposed for a multitude of diverse applications. The problem of self-localization, that is, determining where a given node is physically or relatively located in the networks, is a challenging one, and yet extremely crucial for many applications. In this paper, the evaluation criterion of the performance and the taxonomy for wireless sensor networks self-localization systems and algorithms are described, the principles and characteristics of recent representative localization approaches are discussed and presented, and the directions of research in this area are introduced.
    2011,22(1):115-131, DOI:10.3724/SP.J.1001.2011.03950
    [Abstract] (13634) [HTML] (0) [PDF 845.91 K] (28194)
    Abstract:
    The Internet traffic model is a key issue for network performance management, quality of service management, and admission control. The paper first summarizes the primary characteristics and metrics of Internet traffic. It also illustrates the significance and classification of traffic modeling. Next, the paper chronologically categorizes the research activities of traffic modeling into three phases: 1) traditional Poisson modeling; 2) self-similar modeling; and 3) new research debates and new progress. Thorough reviews of the major research achievements of each phase are conducted. Finally, the paper identifies some open research issues and points out possible future research directions in the traffic modeling area.
    2021,32(2):349-369, DOI:10.13328/j.cnki.jos.006138
    [Abstract] (7662) [HTML] (6271) [PDF 2.36 M] (27971)
    Abstract:
    Few-shot learning is defined as learning models to solve problems from small samples. In recent years, under the trend of training models with big data, machine learning and deep learning have achieved success in many fields. However, in many real-world application scenarios, there is not a large amount of data or labeled data for model training, and labeling a large number of unlabeled samples costs a lot of manpower. Therefore, how to learn from a small number of samples has become a problem that demands attention. This paper systematically reviews current few-shot learning approaches. It introduces the corresponding models in three categories: fine-tuning based, data augmentation based, and transfer learning based. The data augmentation based approaches are further subdivided into unlabeled data based, data generation based, and feature augmentation based approaches, and the transfer learning based approaches are subdivided into metric learning based, meta-learning based, and graph neural network based methods. The paper then summarizes the few-shot datasets and the experimental results of the aforementioned models. Next, it summarizes the current situation and challenges in few-shot learning. Finally, the future technological development of few-shot learning is prospected.
    2013,24(1):77-90, DOI:10.3724/SP.J.1001.2013.04339
    [Abstract] (11176) [HTML] (0) [PDF 0.00 Byte] (26841)
    Abstract:
    Task parallel programming model is a widely used parallel programming model on multi-core platforms. With the intention of simplifying parallel programming and improving the utilization of multiple cores, this paper provides an introduction to the essential programming interfaces and the supporting mechanism used in task parallel programming models and discusses issues and the latest achievements from three perspectives: Parallelism expression, data management and task scheduling. In the end, some future trends in this area are discussed.
    2015,26(1):62-81, DOI:10.13328/j.cnki.jos.004701
    [Abstract] (15799) [HTML] (3020) [PDF 1.04 M] (26323)
    Abstract:
    Network abstraction brings about the naissance of software-defined networking (SDN). SDN decouples the data plane and the control plane, and simplifies network management. The paper starts with a discussion of the background of the naissance and development of SDN, reviewing its architecture, which includes the data layer, control layer, and application layer. Then the key technologies are elaborated according to the hierarchical architecture of SDN, and the characteristics of consistency, availability, and tolerance are especially analyzed. Moreover, the latest achievements for typical application scenarios are introduced. Future work is summarized in the end.
    2017,28(4):959-992, DOI:10.13328/j.cnki.jos.005143
    [Abstract] (9048) [HTML] (3956) [PDF 3.58 M] (24470)
    Abstract:
    The development of mobile internet and the popularity of mobile terminals produce massive trajectory data of moving objects under the era of big data. Trajectory data has spatio-temporal characteristics and rich information. Trajectory data processing techniques can be used to mine the patterns of human activities and behaviors, the moving patterns of vehicles in the city and the changes of atmospheric environment. However, trajectory data also can be exploited to disclose moving objects' privacy information (e.g., behaviors, hobbies and social relationships). Accordingly, attackers can easily access moving objects' privacy information by digging into their trajectory data such as activities and check-in locations. In another front of research, quantum computation presents an important theoretical direction to mine big data due to its scalable and powerful storage and computing capacity. Applying quantum computing approaches to handle trajectory big data could make some complex problem solvable and achieve higher efficiency. This paper reviews the key technologies of processing trajectory data. First the concept and characteristics of trajectory data is introduced, and the pre-processing methods, including noise filtering and data compression, are summarized. Then, the trajectory indexing and querying techniques, and the current achievements of mining trajectory data, such as pattern mining and trajectory classification, are reviewed. Next, an overview of the basic theories and characteristics of privacy preserving with respect to trajectory data is provided. The supporting techniques of trajectory big data mining, such as processing framework and data visualization, are presented in detail. Some possible ways of applying quantum computation into trajectory data processing, as well as the implementation of some core trajectory mining algorithms by quantum computation are also described. Finally, the challenges of trajectory data processing and promising future research directions are discussed.
    2011,22(6):1299-1315, DOI:10.3724/SP.J.1001.2011.03993
    [Abstract] (11062) [HTML] (0) [PDF 987.90 K] (22647)
    Abstract:
    Attribute-Based encryption (ABE) scheme takes attributes as the public key and associates the ciphertext and user’s secret key with attributes, so that it can support expressive access control policies. This dramatically reduces the cost of network bandwidth and sending node’s operation in fine-grained access control of data sharing. Therefore, ABE has a broad prospect of application in the area of fine-grained access control. After analyzing the basic ABE system and its two variants, Key-Policy ABE (KP-ABE) and Ciphertext-Policy ABE (CP-ABE), this study elaborates the research problems relating to ABE systems, including access structure design for CP-ABE, attribute key revocation, key abuse and multi-authorities ABE with an extensive comparison of their functionality and performance. Finally, this study discusses the need-to-be solved problems and main research directions in ABE.
    2009,20(3):524-545, DOI:
    [Abstract] (17304) [HTML] (0) [PDF 1.09 M] (22321)
    Abstract:
    Nowadays it has been widely accepted that the quality of software highly depends on the process that is carried out in an organization. As part of the effort to support software process engineering activities, the research on software process modeling and analysis is to provide an effective means to represent and analyze a process and, by doing so, to enhance the understanding of the modeled process. In addition, an enactable process model can provide a direct guidance for the actual development process. Thus, the enforcement of the process model can directly contribute to the improvement of the software quality. In this paper, a systematic review is carried out to survey the recent development in software process modeling. 72 papers from 20 conference proceedings and 7 journals are identified as the evidence. The review aims to promote a better understanding of the literature by answering the following three questions: 1) What kinds of paradigms are existing methods based on? 2) What kinds of purposes does the existing research have? 3) What kinds of new trends are reflected in the current research? After providing the systematic review, we present our software process modeling method based on a multi-dimensional and integration methodology that is intended to address several core issues facing the community.
    2009,20(1):124-137, DOI:
    [Abstract] (16875) [HTML] (0) [PDF 1.06 M] (22207)
    Abstract:
    The appearance of plenty of intelligent devices equipped for short-range wireless communications boosts the fast rise of wireless ad hoc networks application. However, in many realistic application environments, nodes form a disconnected network for most of the time due to nodal mobility, low density, lossy link, etc. Conventional communication model of mobile ad hoc network (MANET) requires at least one path existing from source to destination nodes, which results in communication failure in these scenarios. Opportunistic networks utilize the communication opportunities arising from node movement to forward messages in a hop-by-hop way, and implement communications between nodes based on the "store-carry-forward" routing pattern. This networking approach, totally different from the traditional communication model, captures great interests from researchers. This paper first introduces the conceptions and theories of opportunistic networks and some current typical applications. Then it elaborates the popular research problems including opportunistic forwarding mechanism, mobility model and opportunistic data dissemination and retrieval. Some other interesting research points such as communication middleware, cooperation and security problem and new applications are stated briefly. Finally, the paper concludes and looks forward to the possible research focuses for opportunistic networks in the future.
    2004,15(11):1583-1594, DOI:
    [Abstract] (8860) [HTML] (0) [PDF 1.57 M] (21205)
    Abstract:
    Uncertainty exists widely in the subjective and objective world. Among all kinds of uncertainty, randomness and fuzziness are the most important and fundamental. In this paper, the relationship between randomness and fuzziness is discussed. Uncertain states and their changes can be measured by entropy and hyper-entropy respectively. Taking advantage of entropy and hyper-entropy, the uncertainty of chaos, fractals, and complex networks in their various evolutions and differentiations is further studied. A simple and effective way is proposed to simulate uncertainty by means of knowledge representation, which provides a basis for the automation of both logical and image thinking with uncertainty. AI (artificial intelligence) with uncertainty is a new cross-discipline, which covers computer science, physics, mathematics, brain science, psychology, cognitive science, biology, and philosophy, and results in the automation of representation, processing, and thinking for uncertain information and knowledge.
    2006,17(9):1848-1859, DOI:
    [Abstract] (12297) [HTML] (0) [PDF 770.40 K] (20886)
    Abstract:
    In recent years, there have been extensive studies and rapid progress in automatic text categorization, which is one of the hotspots and key techniques in the information retrieval and data mining field. Highlighting the state-of-the-art challenging issues and research trends for content information processing of the Internet and other complex applications, this paper presents a survey on the up-to-date development in text categorization based on machine learning, including models, algorithms, and evaluation. It is pointed out that problems such as nonlinearity, skewed data distribution, labeling bottleneck, hierarchical categorization, scalability of algorithms, and categorization of Web pages are the key problems in the study of text categorization. Possible solutions to these problems are also discussed respectively. Finally, some future directions of research are given.
    2014,25(1):37-50, DOI:10.13328/j.cnki.jos.004497
    [Abstract] (9631) [HTML] (2960) [PDF 929.87 K] (20860)
    Abstract:
    This paper surveys the state of the art of speech emotion recognition (SER), and presents an outlook on the trend of future SER technology. First, the survey summarizes and analyzes SER in detail from five perspectives, including emotion representation models, representative emotional speech corpora, emotion-related acoustic features extraction, SER methods and applications. Then, based on the survey, the challenges faced by current SER research are concluded. This paper aims to take a deep insight into the mainstream methods and recent progress in this field, and presents detailed comparison and analysis between these methods.
    2005,16(1):1-7, DOI:
    [Abstract] (22141) [HTML] (0) [PDF 614.61 K] (20732)
    Abstract:
    The paper offers some reflections from the following four aspects: 1) from the law of development of things, revealing the development history of software engineering technology; 2) from the natural characteristics of software, analyzing the construction of each abstraction layer of the virtual machine; 3) from the perspective of software development, proposing the research content of the software engineering discipline and studying the pattern of industrialized software production; 4) based on the appearance of Internet technology, exploring the development trend of software technology.
    2018,29(10):2966-2994, DOI:10.13328/j.cnki.jos.005551
    [Abstract] (9522) [HTML] (4384) [PDF 610.06 K] (20554)
    Abstract:
    In recent years, the rapid development of Internet technology and Web applications has triggered an explosion of various data on the Internet, which generates a large amount of valuable knowledge. How to organize, represent, and analyze this knowledge has attracted much attention. Knowledge graphs were thus developed to organize this knowledge in a semantic and visualized manner. Knowledge reasoning over knowledge graphs has then become one of the hot research topics and plays an important role in many applications such as vertical search and intelligent question answering. The goal of knowledge reasoning over knowledge graphs is to infer new facts or identify erroneous facts according to existing ones. Unlike traditional knowledge reasoning, knowledge reasoning over knowledge graphs is more diversified, due to the simplicity, intuitiveness, flexibility, and richness of knowledge representation in knowledge graphs. Starting with the basic concept of knowledge reasoning, this paper presents a survey of the recently developed methods for knowledge reasoning over knowledge graphs. Specifically, the research progress is reviewed in detail from two aspects: one-step reasoning and multi-step reasoning, each including rule based reasoning, distributed embedding based reasoning, neural network based reasoning, and hybrid reasoning. Finally, future research directions and an outlook on knowledge reasoning over knowledge graphs are discussed.
    2012,23(8):2058-2072, DOI:10.3724/SP.J.1001.2012.04237
    [Abstract] (10050) [HTML] (0) [PDF 800.05 K] (20286)
    Abstract:
    The Distributed denial of service (DDoS) attack is a major threat to the current network. Based on the attack packet level, the study divides DDoS attacks into network-level DDoS attacks and application-level DDoS attacks. Next, the study analyzes the detection and control methods of these two kinds of DDoS attacks in detail, and it also analyzes the drawbacks of different control methods implemented in different network positions. Finally, the study analyzes the drawbacks of the current detection and control methods, the development trend of the DDoS filter system, and corresponding technological challenges are also proposed.
    2020,31(7):2245-2282, DOI:10.13328/j.cnki.jos.006037
    [Abstract] (2851) [HTML] (3427) [PDF 967.02 K] (20230)
    Abstract:
    Ultrasonography is the first choice of imaging examination and preoperative evaluation for thyroid and breast cancer. However, the ultrasonic characteristics of benign and malignant nodules commonly overlap, and diagnosis relies heavily on the operator's experience rather than on quantitative and stable methods. In recent years, medical image analysis based on computer technology has developed rapidly, and a series of landmark breakthroughs have been made, which provide effective decision support for medical imaging diagnosis. In this work, the research progress of computer vision and image recognition technologies for thyroid and breast ultrasound images is studied, with the key technologies involved in the automatic diagnosis of ultrasound images as the main line. The major algorithms of recent years are summarized and analyzed, such as ultrasound image preprocessing, lesion localization and segmentation, and feature extraction and classification. Moreover, a multi-dimensional analysis is made of the algorithms, data sets, and evaluation methods. Finally, existing problems in the automatic analysis of these two kinds of ultrasound images are discussed, along with research trends and development directions in the field of ultrasound image analysis.
    2003,14(9):1621-1628, DOI:
    [Abstract] (13153) [HTML] (0) [PDF 680.35 K] (20082)
    Abstract:
    Recommendation systems are one of the most important technologies in e-commerce. With the development of e-commerce, the numbers of users and commodities grow rapidly, resulting in extremely sparse user rating data. Traditional similarity measures work poorly in this situation, causing the quality of recommendations to drop dramatically. To address this issue, a novel collaborative filtering algorithm based on item rating prediction is proposed. The method first predicts the ratings of items that users have not rated from the similarity of items, and then uses a new similarity measure to find the target users' neighbors. Experimental results show that this method can effectively alleviate the extreme sparsity of user rating data and provide better recommendations than traditional collaborative filtering algorithms.
    2010,21(7):1620-1634, DOI:
    [Abstract] (12469) [HTML] (0) [PDF 765.23 K] (19862)
    Abstract:
    As an application of mobile ad hoc networks (MANET) on Intelligent Transportation Information System, the most important goal of vehicular ad hoc networks (VANET) is to reduce the high number of accidents and fatal consequences dramatically. One of the most important factors that would contribute to the realization of this goal is the design of effective broadcast protocols. This paper introduces the characteristics and application fields of VANET briefly. Then, it discusses the characteristics, performance, and application areas with analysis and comparison of various categories of broadcast protocols in VANET. According to the characteristic of VANET and its application requirement, the paper proposes the ideas and breakthrough direction of information broadcast model design of inter-vehicle communication.
    2013,24(2):295-316, DOI:10.3724/SP.J.1001.2013.04336
    [Abstract] (9852) [HTML] (0) [PDF 0.00 Byte] (19684)
    Abstract:
    Under the new application mode, the traditional hierarchy data centers face several limitations in size, bandwidth, scalability, and cost. In order to meet the needs of new applications, data center network should fulfill the requirements with low-cost, such as high scalability, low configuration overhead, robustness and energy-saving. First, the shortcomings of the traditional data center network architecture are summarized, and new requirements are pointed out. Secondly, the existing proposals are divided into two categories, i.e. server-centric and network-centric. Then, several representative architectures of these two categories are overviewed and compared in detail. Finally, the future directions of data center network are discussed.
    2005,16(10):1743-1756, DOI:
    [Abstract] (10122) [HTML] (0) [PDF 545.62 K] (19662)
    Abstract:
    This paper presents a survey on the theory of provable security and its applications to the design and analysis of security protocols. It clarifies what the provable security is, explains some basic notions involved in the theory of provable security and illustrates the basic idea of random oracle model. It also reviews the development and advances of provably secure public-key encryption and digital signature schemes, in the random oracle model or the standard model, as well as the applications of provable security to the design and analysis of session-key distribution protocols and their advances.
    2014,25(4):839-862, DOI:10.13328/j.cnki.jos.004558
    [Abstract] (15388) [HTML] (2366) [PDF 1.32 M] (19508)
    Abstract:
    Batch computing and stream computing are two important forms of big data computing. The research and discussion on batch computing in big data environments are comparatively sufficient. However, how to efficiently deal with stream computing so as to meet requirements such as low latency, high throughput, and continuously reliable running, and how to build efficient stream big data computing systems, remain great challenges in big data computing research. This paper surveys the computing architectures and key issues of stream computing in big data environments. Firstly, it gives a brief summary of three application scenarios of stream computing: business intelligence, marketing, and public service. It also shows the distinctive features of stream computing in big data environments, such as real-time processing, volatility, burstiness, irregularity, and infinity. A well-designed stream computing system always optimizes its system structure, data transmission, application interfaces, high availability, and so on. Subsequently, the paper offers detailed analyses and comparisons of five typical open-source stream computing systems in big data environments. Finally, it specifically addresses some new challenges of stream big data systems, such as scalability, fault tolerance, consistency, load balancing, and throughput.
    2010,21(7):1605-1619, DOI:
    [Abstract] (9899) [HTML] (0) [PDF 856.25 K] (18618)
    Abstract:
    The rapid development of the Internet leads to an increase in system complexity and uncertainty. Traditional network management cannot meet these requirements and should evolve toward fusion-based cyberspace situational awareness (CSA). Based on an analysis of functional shortcomings and development requirements, this paper introduces CSA, including its origin, concept, objectives, and characteristics. Firstly, a CSA research framework is proposed and the research history is reviewed, based on which the main aspects and existing issues of the research are analyzed. Meanwhile, assessment methods are divided into three categories: mathematical models, knowledge reasoning, and pattern recognition. Then, this paper discusses CSA from three aspects, namely models, knowledge representation, and assessment methods, and details the main ideas, assessment processes, merits, and shortcomings of recent methods, with comparisons of many typical methods. The current applications of CSA in the fields of security, transmission, survivability, system evaluation, and so on are presented. Finally, this paper points out the development directions of CSA and offers conclusions in terms of the issue system, technical system, and application system.
    2009,20(6):1393-1405, DOI:
    [Abstract] (12133) [HTML] (0) [PDF 831.86 K] (18536)
    Abstract:
    Combinatorial testing can use a small number of test cases to test systems while preserving fault detection ability. However, the complexity of test case generation problem for combinatorial testing is NP-complete. The efficiency and complexity of this testing method have attracted many researchers from the area of combinatorics and software engineering. This paper summarizes the research works on this topic in recent years. They include: various combinatorial test criteria, the relations between the test generation problem and other NP-complete problems, the mathematical methods for constructing test cases, the computer search techniques for test generation and fault localization techniques based on combinatorial testing.
    2017,28(1):160-183, DOI:10.13328/j.cnki.jos.005136
    [Abstract] (8720) [HTML] (4074) [PDF 3.12 M] (18240)
    Abstract:
    Image segmentation is the process of dividing an image into a number of regions with similar properties, and it is a preprocessing step for many image processing tasks. In recent years, domestic and foreign scholars have mainly focused on content-based image segmentation algorithms. Based on extensive research on the existing literature and the latest achievements, this paper categorizes image segmentation algorithms into three types: graph theory based methods, pixel clustering based methods, and semantic segmentation methods. The basic ideas, advantages, and disadvantages of typical algorithms in each category, especially the most recent image semantic segmentation algorithms based on deep neural networks, are analyzed, compared, and summarized. Furthermore, the paper introduces the datasets commonly used as benchmarks in image segmentation and the evaluation criteria for algorithms, and compares several image segmentation algorithms through experiments. Finally, some potential future research directions are discussed.
    2011,22(3):381-407, DOI:10.3724/SP.J.1001.2011.03934
    [Abstract] (10501) [HTML] (0) [PDF 614.69 K] (18221)
    Abstract:
    The popularity of the Internet and the boom of the World Wide Web foster innovative changes in software technology that give birth to a new form of software—networked software, which delivers diversified and personalized on-demand services to the public. With the ever-increasing expansion of applications and users, the scale and complexity of networked software are growing beyond the information processing capability of human beings, which brings software engineers a series of challenges to face. In order to come to a scientific understanding of this kind of ultra-large-scale artificial complex systems, a survey research on the infrastructure, application services, and social interactions of networked software is conducted from a three-dimensional perspective of cyberization, servicesation, and socialization. Interestingly enough, most of them have been found to share the same global characteristics of complex networks such as “Small World” and “Scale Free”. Next, the impact of the empirical study on software engineering research and practice and its implications for further investigations are systematically set forth. The convergence of software engineering and other disciplines will put forth new ideas and thoughts that will breed a new way of thinking and input new methodologies for the study of networked software. This convergence is also expected to achieve the innovations of theories, methods, and key technologies of software engineering to promote the rapid development of software service industry in China.
    2013,24(4):825-842, DOI:10.3724/SP.J.1001.2013.04369
    [Abstract] (8462) [HTML] (0) [PDF 1.09 M] (18163)
    Abstract:
    Honeypot is a proactive defense technology, introduced by the defense side to change the asymmetric situation of a network attack and defensive game. Through the deployment of the honeypots, i.e. security resources without any production purpose, the defenders can deceive attackers to illegally take advantage of the honeypots and capture and analyze the attack behaviors to understand the attack tools and methods, and to learn the intentions and motivations. Honeypot technology has won the sustained attention of the security community to make considerable progress and get wide application, and has become one of the main technical means of the Internet security threat monitoring and analysis. In this paper, the origin and evolution process of the honeypot technology are presented first. Next, the key mechanisms of honeypot technology are comprehensively analyzed, the development process of the honeypot deployment structure is also reviewed, and the latest applications of honeypot technology in the directions of Internet security threat monitoring, analysis and prevention are summarized. Finally, the problems of honeypot technology, development trends and further research directions are discussed.
    2013,24(5):1078-1097, DOI:10.3724/SP.J.1001.2013.04390
    [Abstract] (11825) [HTML] (0) [PDF 1.74 M] (18162)
    Abstract:
    The control and data planes are decoupled in software-defined networking, which provide a new solution for research on new network applications and future Internet technologies. The development status of OpenFlow-based SDN technologies is surveyed in this paper. The research background of decoupled architecture of network control and data transmission in OpenFlow network is summarized first, and the key components and research progress including OpenFlow switch, controller, and SDN technologies are introduced. Moreover, current problems and solutions of OpenFlow-based SDN technologies are analyzed in four aspects. Combined with the development status in recent years, the applications used in campus, data center, network management and network security are summarized. Finally, future research trends are discussed.
    2008,19(11):2803-2813, DOI:
    [Abstract] (9271) [HTML] (0) [PDF 319.20 K] (18145)
    Abstract:
    A semi-supervised clustering method based on the affinity propagation (AP) algorithm is proposed in this paper. AP takes as input measures of similarity between pairs of data points. AP is an efficient and fast clustering algorithm for large datasets compared with existing clustering algorithms, such as K-center clustering. However, for datasets with complex cluster structures, it cannot produce good clustering results. The clustering performance of AP can be improved by using a priori known labeled data or pairwise constraints to adjust the similarity matrix. Experimental results show that this method indeed reaches its goal on complex datasets, and it outperforms the comparative methods when there are a large number of pairwise constraints.
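    A minimal sketch of this similarity-adjustment idea is shown below, using scikit-learn's standard affinity propagation on a precomputed similarity matrix. Clamping constrained pairs to the maximum or minimum similarity is an illustrative assumption rather than the paper's exact adjustment rule:
        # Sketch: constraint-adjusted affinity propagation.
        import numpy as np
        from sklearn.cluster import AffinityPropagation

        def constrained_ap(S, must_link=(), cannot_link=()):
            # S: precomputed (n x n) similarity matrix.
            S = np.array(S, dtype=float)
            hi, lo = S.max(), S.min()
            for i, j in must_link:
                S[i, j] = S[j, i] = hi      # encourage the pair to share an exemplar
            for i, j in cannot_link:
                S[i, j] = S[j, i] = lo      # discourage the pair from sharing an exemplar
            ap = AffinityPropagation(affinity='precomputed', random_state=0)
            return ap.fit_predict(S)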
    2023,34(2):625-654, DOI:10.13328/j.cnki.jos.006696
    [Abstract] (2569) [HTML] (3300) [PDF 3.04 M] (18064)
    Abstract:
    Source code bug (vulnerability) detection is the process of judging whether a program exhibits unexpected behaviors. It is widely used in software engineering tasks such as software testing and software maintenance, and plays a vital role in software functional assurance and application security. Traditional vulnerability detection research is based on program analysis, which usually requires strong domain knowledge and complex calculation rules and faces the problem of state explosion; as a result, detection performance is limited, and there is considerable room for improvement in the false positive and false negative rates. In recent years, the vigorous development of the open source community has accumulated massive amounts of data centered on open source code. In this context, deep learning can automatically learn semantically rich code representations, providing a new way for vulnerability detection. This study collects the latest high-quality papers in this field and systematically summarizes the current methods from two aspects: vulnerability code datasets and deep learning-based vulnerability detection models. Finally, it summarizes the main challenges faced by research in this field and looks ahead to possible future research directions.
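    The overall pipeline described above—map source code to a vector representation, then classify it as vulnerable or not—can be illustrated with a deliberately shallow stand-in. The TF-IDF model below substitutes for the deep representation models actually surveyed, and the snippets and labels are fabricated toy examples, not data from any of the discussed datasets.

```python
# Sketch of the "code -> representation -> classifier" pipeline for vulnerability
# detection. A shallow TF-IDF model stands in for the deep models surveyed;
# snippets and labels are toy examples for illustration only.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

snippets = [
    "strcpy(buf, user_input);",                     # toy "vulnerable" pattern
    "strncpy(buf, user_input, sizeof(buf) - 1);",   # toy "safe" counterpart
    "gets(line);",
    "fgets(line, sizeof(line), stdin);",
]
labels = [1, 0, 1, 0]   # 1 = vulnerable, 0 = safe (toy labels)

clf = make_pipeline(TfidfVectorizer(token_pattern=r"\w+"), LogisticRegression())
clf.fit(snippets, labels)
print(clf.predict(["strcpy(dest, src);"]))
```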
    2018,29(1):42-68, DOI:10.13328/j.cnki.jos.005320
    [Abstract] (9724) [HTML] (3360) [PDF 2.54 M] (18042)
    Abstract:
    The Internet has penetrated into all aspects of human society and has greatly promoted social progress. At the same time, various forms of cybercrime and network theft occur frequently, bringing great harm to society and national security. Cyber security has become a major concern of the public and the government. As a large number of Internet functionalities and applications are implemented in software, software plays a crucial role in cyber security research and practice. In fact, almost all cyberattacks are carried out by exploiting vulnerabilities in system or application software. It is increasingly urgent to investigate the problems of software security in the new era. This paper reviews the state of the art of malware, software vulnerabilities, and software security mechanisms, and analyzes the new challenges and trends that the software ecosystem is currently facing.
    2009,20(8):2241-2254, DOI:
    [Abstract] (6832) [HTML] (0) [PDF 1.99 M] (18029)
    Abstract:
    Inspired by the idea of data fields, a community discovery algorithm based on topological potential is proposed. The basic idea is to introduce a topological potential function that analytically models the virtual interaction among all nodes in a network; by regarding each community as a local high-potential area, the community structure of the network can be uncovered by detecting all local high-potential areas margined by low-potential nodes. Experiments on several real-world networks show that the algorithm requires no input parameters and can discover the intrinsic and even overlapping community structure in networks. The time complexity of the algorithm is O(m+n^(3/γ)) ~ O(n^2), where n is the number of nodes to be explored, m is the number of edges, and 2<γ<3 is a constant.
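    As a rough illustration of the topological potential idea, the sketch below scores each node with a Gaussian-decay potential over shortest-path distances and treats nodes that dominate all of their neighbors as local high-potential centers. The kernel form and the σ value are assumptions made for illustration; the original algorithm's parameter-free setting and margin-based community extraction are not reproduced here.

```python
# Minimal sketch of a topological-potential score on a graph (illustrative only).
# Assumes a Gaussian-decay data-field kernel; sigma is chosen arbitrarily.
import math
import networkx as nx

def topological_potential(G, sigma=1.5):
    # Potential of node v: sum over all other reachable nodes u of
    # exp(-(d(v, u) / sigma)^2), where d is the shortest-path (hop) distance.
    pot = {}
    for v in G:
        dist = nx.single_source_shortest_path_length(G, v)
        pot[v] = sum(math.exp(-(d / sigma) ** 2) for u, d in dist.items() if u != v)
    return pot

G = nx.karate_club_graph()
pot = topological_potential(G)
# Nodes whose potential is no lower than any neighbor act as local high-potential
# "community centers" in this simplified view.
centers = [v for v in G if all(pot[v] >= pot[u] for u in G[v])]
print(sorted(centers))
```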
    2016,27(3):691-713, DOI:10.13328/j.cnki.jos.004948
    [Abstract] (9403) [HTML] (2012) [PDF 2.43 M] (17916)
    Abstract:
    Learning to rank (L2R) techniques solve ranking problems with machine learning methods and have been well studied and widely used in various fields such as information retrieval, text mining, personalized recommendation, and biomedicine. The main task of L2R-based recommendation algorithms is to integrate L2R techniques into recommendation algorithms: to organize the large number of users and item features, to build user models that better match user preferences and requirements, and to improve the performance and user satisfaction of recommendation algorithms. This paper surveys L2R-based recommendation algorithms of recent years, summarizes the problem definition, compares the key technologies, and analyzes the evaluation metrics and their applications. In addition, the paper discusses future development trends of L2R-based recommendation algorithms.
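    As a minimal illustration of how L2R plugs into recommendation, the sketch below trains a linear scoring function with a pairwise hinge ranking loss over user-item feature vectors. The feature layout, loss choice, and hyperparameters are assumptions for illustration, not a specific algorithm from the surveyed literature.

```python
# Minimal pairwise learning-to-rank sketch (illustrative only).
import numpy as np

def train_pairwise_ranker(pairs, dim, lr=0.01, epochs=50):
    """pairs: list of (x_pos, x_neg) feature vectors where x_pos should rank higher."""
    w = np.zeros(dim)
    for _ in range(epochs):
        for x_pos, x_neg in pairs:
            # Hinge loss on the score margin: max(0, 1 - (w·x_pos - w·x_neg)).
            if 1.0 - w @ (x_pos - x_neg) > 0.0:
                w += lr * (x_pos - x_neg)   # subgradient step on violated pairs
    return w

# Toy usage: 3-dimensional user-item features; the first item of each pair is preferred.
rng = np.random.default_rng(0)
pairs = [(rng.normal(1.0, 0.3, 3), rng.normal(0.0, 0.3, 3)) for _ in range(100)]
w = train_pairwise_ranker(pairs, dim=3)
print("ranking weights:", w)
```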
    2019,30(2):440-468, DOI:10.13328/j.cnki.jos.005659
    [Abstract] (7748) [HTML] (4768) [PDF 3.27 M] (17813)
    Abstract:
    In recent years, deep learning (DL) has been widely applied to image semantic segmentation (ISS) owing to its state-of-the-art performance and high-quality results. This paper systematically reviews the contribution of DL to the field of ISS. Different methods of ISS based on DL (ISSbDL) are summarized and divided into ISS based on Regional Classification (ISSbRC) and ISS based on Pixel Classification (ISSbPC) according to the segmentation characteristics and granularity. The methods of ISSbPC are then surveyed from two points of view: ISS based on Fully Supervised Learning (ISSbFSL) and ISS based on Weakly Supervised Learning (ISSbWSL). The representative algorithms of each method are introduced and analyzed, and the basic workflow, frameworks, advantages, and disadvantages of these methods are analyzed and compared in detail. In addition, the related ISS experiments are analyzed and summarized, and the common datasets and performance evaluation metrics used in ISS experiments are introduced. Finally, possible research directions and trends are given and analyzed.
