scispace - formally typeset
C

Canlong Zhang

Researcher at Guangxi Normal University

Publications -  64
Citations -  381

Canlong Zhang is an academic researcher from Guangxi Normal University. The author has contributed to research in topics: Computer science & Feature (computer vision). The author has an hindex of 5, co-authored 49 publications receiving 119 citations. Previous affiliations of Canlong Zhang include Guilin University of Electronic Technology.

Papers
More filters
Journal ArticleDOI

Dual Global Enhanced Transformer for image captioning

TL;DR: In this paper , a dual global enhanced transformer (DGET) was proposed to incorporate global information in the encoding and decoding stages of image captioning, where the grid feature was regarded as the visual global information and adaptively fused it into region features in each layer by a novel Global Enhanced Encoder (GEE).
Journal ArticleDOI

A New multi-instance multi-label learning approach for image and text classification

TL;DR: This paper presents a general MIML framework by combining the feature learning technologies with machine learning technologies, and a new approach called CPNMIML which combines the probabilistic latent semantic analysis (PLSA) with the neural networks (NN) is proposed.
Journal ArticleDOI

Integrating Scene Semantic Knowledge into Image Captioning

TL;DR: Zhang et al. as mentioned in this paper proposed an improved visual attention model, which calculated the focus intensity coefficient of the attention mechanism through the context information of the model, and automatically adjusted the attention intensity through the coefficient to extract more accurate visual information.
Journal ArticleDOI

Text Summarization Method Based on Double Attention Pointer Network

TL;DR: In DAPT, the self-attention mechanism collects key information from the encoder, the soft attention and the pointer network generate more coherent core content, and the fusion of both generates accurate and coherent summaries.
Journal ArticleDOI

The synergy of double attention: Combine sentence-level and word-level attention for image captioning

TL;DR: A double attention model is proposed which combines sentence-level attention model with word- level attention model to generate more accurate captions and outperforms many state-of-the-art image captioning approaches in various evaluation metrics.