scispace - formally typeset
Q

Quanjun Yin

Researcher at National University of Defense Technology

Publications -  52
Citations -  292

Quanjun Yin is an academic researcher from National University of Defense Technology. The author has contributed to research in topics: Computer science & Reinforcement learning. The author has an hindex of 8, co-authored 46 publications receiving 196 citations.

Papers
More filters
Journal ArticleDOI

Navigation in Unknown Dynamic Environments Based on Deep Reinforcement Learning

TL;DR: Improvements of MK-A3C can efficiently navigate robots in unknown dynamic environments, and satisfy kinetic constraints while handling moving objects, and solve the non-convergence policies problems caused by sparse reward.
Journal ArticleDOI

Scheduling parallel jobs with tentative runs and consolidation in the cloud

TL;DR: The algorithm employs tentative run and workload consolidation under such a two-tier virtual machines architecture to enhance the popular FCFS algorithm, and it can even produce comparable performance to the runtime-estimation-based EASY algorithm.
Journal ArticleDOI

Learning Behavior Trees for Autonomous Agents with Hybrid Constraints Evolution

TL;DR: Preliminary experiments show that the hybrid EBT-HC outperforms other approaches in facilitating the BT design by achieving better behavior performance within fewer generations, and the generated behavior models are human readable and easy to be fine-tuned by domain experts.
Journal ArticleDOI

A Semi-Markov Decision Model for Recognizing the Destination of a Maneuvering Agent in Real Time Strategy Games

TL;DR: A novel semi-Markov decision model (SMDM) is proposed that outperforms another extension of the MDP in terms of precision, recall, and -measure, and Destinations are recognized efficiently by the SMDM no matter whether they are changed or not.
Proceedings ArticleDOI

Bridging the Gap between Observation and Decision Making: Goal Recognition and Flexible Resource Allocation in Dynamic Network Interdiction

TL;DR: A Markov Decision Process-based goal recognition model along with its dynamic Bayesian network representation and the applied goal inference method is proposed to identify the evader’s real goal within the DSPLNI context.