Institution

National University of Defense Technology

Education•Changsha, China•

About: National University of Defense Technology is a education organization based out in Changsha, China. It is known for research contribution in the topics: Radar & Synthetic aperture radar. The organization has 39430 authors who have published 40181 publications receiving 358979 citations. The organization is also known as: Guófáng Kēxuéjìshù Dàxué & NUDT.

...read moreread less

Topics: Radar, Synthetic aperture radar, Laser, Fiber laser, Radar imaging ...read more

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Multiobjective Reinforcement Learning: A Comprehensive Overview

[...]

Chunming Liu¹, Xin Xu¹, Dewen Hu¹•Institutions (1)

National University of Defense Technology¹

01 Mar 2015

TL;DR: The basic architecture, research topics, and naïve solutions of MORL are introduced at first and several representative MORL approaches and some important directions of recent research are comprehensively reviewed.

...read moreread less

Abstract: Reinforcement learning (RL) is a powerful paradigm for sequential decision-making under uncertainties, and most RL algorithms aim to maximize some numerical value which represents only one long-term objective. However, multiple long-term objectives are exhibited in many real-world decision and control systems, so recently there has been growing interest in solving multiobjective reinforcement learning (MORL) problems where there are multiple conflicting objectives. The aim of this paper is to present a comprehensive overview of MORL. The basic architecture, research topics, and naive solutions of MORL are introduced at first. Then, several representative MORL approaches and some important directions of recent research are comprehensively reviewed. The relationships between MORL and other related research are also discussed, which include multiobjective optimization, hierarchical RL, and multiagent RL. Moreover, research challenges and open problems of MORL techniques are suggested.

...read moreread less

283 citations

Journal Article•DOI•

Functionalization of monolayer MoS2 by substitutional doping: A first-principles study

[...]

Qu Yue¹, Shengli Chang¹, Shiqiao Qin¹, Jingbo Li²•Institutions (2)

National University of Defense Technology¹, Chinese Academy of Sciences²

15 Aug 2013-Physics Letters A

TL;DR: In this article, the structural, electronic and magnetic properties of monolayer MoS2 doped with nonmetal and transition-metal atoms are investigated using first-principles calculations.

...read moreread less

282 citations

Proceedings Article•DOI•

Hoaxy: A Platform for Tracking Online Misinformation

[...]

Chengcheng Shao¹, Giovanni Luca Ciampaglia², Alessandro Flammini², Filippo Menczer²•Institutions (2)

National University of Defense Technology¹, Indiana University²

11 Apr 2016

TL;DR: Haxy is introduced, a platform for the collection, detection, and analysis of online misinformation and its related fact-checking efforts, and a preliminary analysis of a sample of public tweets containing both fake news and fact checking is presented.

...read moreread less

Abstract: Massive amounts of misinformation have been observed to spread in uncontrolled fashion across social media. Examples include rumors, hoaxes, fake news, and conspiracy theories. At the same time, several journalistic organizations devote significant efforts to high-quality fact checking of online claims. The resulting information cascades contain instances of both accurate and inaccurate information, unfold over multiple time scales, and often reach audiences of considerable size. All these factors pose challenges for the study of the social dynamics of online news sharing. Here we introduce Hoaxy, a platform for the collection, detection, and analysis of online misinformation and its related fact-checking efforts. We discuss the design of the platform and present a preliminary analysis of a sample of public tweets containing both fake news and fact checking. We find that, in the aggregate, the sharing of fact-checking content typically lags that of misinformation by 10-20 hours. Moreover, fake news are dominated by very active users, while fact checking is a more grass-roots activity. With the increasing risks connected to massive online misinformation, social news observatories have the potential to help researchers, journalists, and the general public understand the dynamics of real and fake news sharing.

...read moreread less

280 citations

Journal Article•DOI•

Kernel-Based Least Squares Policy Iteration for Reinforcement Learning

[...]

Xin Xu¹, Dewen Hu¹, Xicheng Lu¹•Institutions (1)

National University of Defense Technology¹

01 Jul 2007-IEEE Transactions on Neural Networks

TL;DR: The KLSPI algorithm provides a general RL method with generalization performance and convergence guarantee for large-scale Markov decision problems (MDPs) and can be applied to online learning control by incorporating an initial controller to ensure online performance.

...read moreread less

Abstract: In this paper, we present a kernel-based least squares policy iteration (KLSPI) algorithm for reinforcement learning (RL) in large or continuous state spaces, which can be used to realize adaptive feedback control of uncertain dynamic systems. By using KLSPI, near-optimal control policies can be obtained without much a priori knowledge on dynamic models of control plants. In KLSPI, Mercer kernels are used in the policy evaluation of a policy iteration process, where a new kernel-based least squares temporal-difference algorithm called KLSTD-Q is proposed for efficient policy evaluation. To keep the sparsity and improve the generalization ability of KLSTD-Q solutions, a kernel sparsification procedure based on approximate linear dependency (ALD) is performed. Compared to the previous works on approximate RL methods, KLSPI makes two progresses to eliminate the main difficulties of existing results. One is the better convergence and (near) optimality guarantee by using the KLSTD-Q algorithm for policy evaluation with high precision. The other is the automatic feature selection using the ALD-based kernel sparsification. Therefore, the KLSPI algorithm provides a general RL method with generalization performance and convergence guarantee for large-scale Markov decision problems (MDPs). Experimental results on a typical RL task for a stochastic chain problem demonstrate that KLSPI can consistently achieve better learning efficiency and policy quality than the previous least squares policy iteration (LSPI) algorithm. Furthermore, the KLSPI method was also evaluated on two nonlinear feedback control problems, including a ship heading control problem and the swing up control of a double-link underactuated pendulum called acrobot. Simulation results illustrate that the proposed method can optimize controller performance using little a priori information of uncertain dynamic systems. It is also demonstrated that KLSPI can be applied to online learning control by incorporating an initial controller to ensure online performance.

...read moreread less

279 citations

Journal Article•DOI•

A Survey of Traffic Control With Vehicular Communications

[...]

Li Li¹, Ding Wen², Danya Yao¹•Institutions (2)

Tsinghua University¹, National University of Defense Technology²

01 Feb 2014-IEEE Transactions on Intelligent Transportation Systems

TL;DR: In this article, the authors present a survey of early-stage traffic control technologies and discuss potential benefits that will be gained by using vehicle-to-vehicle (V2V) communications.

...read moreread less

Abstract: During the last 60 years, incessant efforts have been made to improve the efficiency of traffic control systems to meet ever-increasing traffic demands. Some recent works attempt to enhance traffic efficiency via vehicle-to-vehicle communications. In this paper, we aim to give a survey of some research frontiers in this trend, identifying early-stage key technologies and discussing potential benefits that will be gained. Our survey focuses on the control side and aims to highlight that the design philosophy for traffic control systems is undergoing a transition from feedback character to feedforward character. Moreover, we discuss some contrasting preferences in the design of traffic control systems and their relations to vehicular communications. The first pair of contrasting preferences are model-based predictive control versus simulation-based predictive control. The second pair are global planning-based control versus local self-organization-based control. The third pair are control using rich information that may be highly redundant versus control using concise information that is necessary. Both the potentials and drawbacks of these control strategies are explained. We hope these comparisons can shed some interesting light on future traffic control studies.

...read moreread less

278 citations

Collapse

Authors

Showing all 39659 results

Name	H-index	Papers	Citations
Rui Zhang	151	2625	107917
Jian Li	133	2863	87131
Chi Lin	125	1313	102710
Wei Xu	103	1492	49624
Lei Liu	98	2041	51163
Xiang Li	97	1472	42301
Chang Liu	97	1099	39573
Jian Huang	97	1189	40362
Tao Wang	97	2720	55280
Wei Liu	96	1538	42459
Jian Chen	96	1718	52917
Wei Wang	95	3544	59660
Peng Li	95	1548	45198
Jianhong Wu	93	726	36427
Jianhua Zhang	92	415	28085

Network Information

Related Institutions (5)

Harbin Institute of Technology

109.2K papers, 1.6M citations

94% related

Tsinghua University

200.5K papers, 4.5M citations

101K papers, 2.4M citations

90% related

City University of Hong Kong

60.1K papers, 1.7M citations

89% related

Dalian University of Technology

71.9K papers, 1.1M citations

89% related

Performance

Metrics

40,649

Papers

498,930

Citations

No. of papers from the Institution in previous years
Year	Papers
2024	1
2023	97
2022	468
2021	2,986
2020	3,468
2019	3,695