A robust real-time deep learning based automatic polyp detection system.

doi:10.1016/J.COMPBIOMED.2021.104519

Home
/
Papers
/
A robust real-time deep learning based automatic polyp detection system.

Journal Article•DOI•

A robust real-time deep learning based automatic polyp detection system.

Ishak Pacal¹, Dervis Karaboga²•Institutions (2)

Iğdır University¹, King Abdulaziz University²

29 May 2021-Computers in Biology and Medicine (Pergamon)-Vol. 134, pp 104519-104519

TL;DR: In this article, the authors proposed a new structure for real-time polyp detection by scaling the YOLOv4 algorithm to overcome the obstacles of computer-aided detection systems (CADs) to detect polyps.

read less

About: This article is published in Computers in Biology and Medicine.The article was published on 2021-05-29. It has received 51 citations till now. The article focuses on the topics: Ensemble learning.

...read moreread less

Citations

PDF

Open Access

More filters

Journal Article•DOI•

An efficient real-time colonic polyp detection with YOLO algorithms trained by using negative samples and large datasets

[...]

01 Feb 2022-Computers in Biology and Medicine

TL;DR: Wang et al. as discussed by the authors proposed a cross-stage partial network (CSPNet) for real-time and high-performance automatic polyp detection and applied the proposed methods on the recently published novel datasets, which are SUN polyp database and the PICCOLO database.

...read moreread less

42 citations

Journal Article•DOI•

An efficient real-time colonic polyp detection with YOLO algorithms trained by using negative samples and large datasets

[...]

Ishak Pacal¹, Ahmet Karaman, Dervis Karaboga², Bahriye Akay², Alper Basturk², Ufuk Nalbantoglu², Seymanur Coskun - Show less +3 more•Institutions (2)

Iğdır University¹, Erciyes University²

13 Nov 2021-Computers in Biology and Medicine

TL;DR: Wang et al. as mentioned in this paper proposed a cross-stage partial network (CSPNet) for real-time and high-performance automatic polyp detection and applied the proposed methods on the recently published novel datasets, which are SUN polyp database and the PICCOLO database.

...read moreread less

42 citations

Journal Article•DOI•

Real-time polyp detection model using convolutional neural networks

[...]

Alba Nogueira-Rodríguez¹, Rubén Domínguez-Carbajales, Fernando Campos-Tato¹, Jesús Herrero, Manuel Puga, David Remedios, Laura Rivas, Eloy Sánchez, Águeda Iglesias, Joaquín Cubiella, Florentino Fdez-Riverola¹, Hugo López-Fernández, Miguel Reboiro-Jato¹, Daniel Glez-Peña¹ - Show less +10 more•Institutions (1)

University of Vigo¹

21 Sep 2021-Neural Computing and Applications

TL;DR: A deep learning model for real-time polyp detection based on a pre-trained YOLOv3 (You Only Look Once) architecture and complemented with a post-processing step based on an object-tracking algorithm to reduce false positives is reported, suggesting that the model could be effectively integrated into a CAD system.

...read moreread less

Abstract: Colorectal cancer is a major health problem, where advances towards computer-aided diagnosis (CAD) systems to assist the endoscopist can be a promising path to improvement. Here, a deep learning model for real-time polyp detection based on a pre-trained YOLOv3 (You Only Look Once) architecture and complemented with a post-processing step based on an object-tracking algorithm to reduce false positives is reported. The base YOLOv3 network was fine-tuned using a dataset composed of 28,576 images labelled with locations of 941 polyps that will be made public soon. In a frame-based evaluation using isolated images containing polyps, a general F1 score of 0.88 was achieved (recall = 0.87, precision = 0.89), with lower predictive performance in flat polyps, but higher for sessile, and pedunculated morphologies, as well as with the usage of narrow band imaging, whereas polyp size < 5 mm does not seem to have significant impact. In a polyp-based evaluation using polyp and normal mucosa videos, with a positive criterion defined as the presence of at least one 50-frames-length (window size) segment with a ratio of 75% of frames with predicted bounding boxes (frames positivity), 72.61% of sensitivity (95% CI 68.99–75.95) and 83.04% of specificity (95% CI 76.70–87.92) were achieved (Youden = 0.55, diagnostic odds ratio (DOR) = 12.98). When the positive criterion is less stringent (window size = 25, frames positivity = 50%), sensitivity reaches around 90% (sensitivity = 89.91%, 95% CI 87.20–91.94; specificity = 54.97%, 95% CI 47.49–62.24; Youden = 0.45; DOR = 10.76). The object-tracking algorithm has demonstrated a significant improvement in specificity whereas maintaining sensitivity, as well as a marginal impact on computational performance. These results suggest that the model could be effectively integrated into a CAD system.

...read moreread less

21 citations

Journal Article•DOI•

Deep learning-based decision support system for weeds detection in wheat fields

[...]

Brahim Jabir, Noureddine Falih

01 Feb 2022-International Journal of Electrical and Computer Engineering

TL;DR: A smart system based on object detection models, implemented on a Raspberry, seek to identify the presence of relevant objects in an area in real time and classify those objects for decision support including spot spray with a chosen herbicide in accordance to the weed detected.

...read moreread less

Abstract: In precision farming, identifying weeds is an essential first step in planning an integrated pest management program in cereals. By knowing the species present, we can learn about the types of herbicides to use to control them, especially in non-weeding crops where mechanical methods that are not effective (tillage, hand weeding, and hoeing and mowing). Therefore, using the deep learning based on convolutional neural network (CNN) will help to automatically identify weeds and then an intelligent system comes to achieve a localized spraying of the herbicides avoiding their large-scale use, preserving the environment. In this article we propose a smart system based on object detection models, implemented on a Raspberry, seek to identify the presence of relevant objects (weeds) in an area (wheat crop) in real time and classify those objects for decision support including spot spray with a chosen herbicide in accordance to the weed detected.

...read moreread less

18 citations

Journal Article•DOI•

Investigation Into Recognition Algorithm of Helmet Violation Based on YOLOv5-CBAM-DCN

[...]

01 Jan 2022-IEEE Access

TL;DR: Wang et al. as discussed by the authors presented an enhanced YOLOv5-based method, in which the challenges caused by complicated construction environment backgrounds, dense targets, and the irregular shape of safety helmets are addressed.

...read moreread less

Abstract: Recognition of safety helmets wearing by construction workers is a common target detection topic in applications of deep learning-based image processing. This paper provides a study of an enhanced YOLOv5-based method, in which the challenges caused by complicated construction environment backgrounds, dense targets, and the irregular shape of safety helmets are addressed. In a trunk network, feature extraction is more based on the target shape by using the Deformable Convolution Net instead of the conventional convolution; in the Neck, a Convolutional Block Attention Module is introduced to weaken feature extraction of complex backgrounds by giving weights to enhance the characterization ability of target features; and the original network’s Generalized Intersection over Union Loss is replaced by Distance Intersection over Union Loss to overcome the problem of erroneous location when the population is dense. The dataset for the training network is created by mixing open-source datasets with autonomous collecting to evaluate the effectiveness of the algorithm. We observed that the improved model has a detection accuracy of 91.6%, up 2.3% over the original network model, and a detection speed of 29 frames per second, which is compliant with most security cameras’ capture frame rate.

...read moreread less

12 citations

1
2
3
4
…
5
6
7
8
9
10
11

Collapse

References

PDF

Open Access

More filters

Proceedings Article•DOI•

Deep Residual Learning for Image Recognition

[...]

Kaiming He¹, Xiangyu Zhang¹, Shaoqing Ren¹, Jian Sun¹•Institutions (1)

Microsoft¹

27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

Abstract: Deeper neural networks are more difficult to train. We present a residual learning framework to ease the training of networks that are substantially deeper than those used previously. We explicitly reformulate the layers as learning residual functions with reference to the layer inputs, instead of learning unreferenced functions. We provide comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth. On the ImageNet dataset we evaluate residual nets with a depth of up to 152 layers—8× deeper than VGG nets [40] but still having lower complexity. An ensemble of these residual nets achieves 3.57% error on the ImageNet test set. This result won the 1st place on the ILSVRC 2015 classification task. We also present analysis on CIFAR-10 with 100 and 1000 layers. The depth of representations is of central importance for many visual recognition tasks. Solely due to our extremely deep representations, we obtain a 28% relative improvement on the COCO object detection dataset. Deep residual nets are foundations of our submissions to ILSVRC & COCO 2015 competitions1, where we also won the 1st places on the tasks of ImageNet detection, ImageNet localization, COCO detection, and COCO segmentation.

...read moreread less

123,388 citations

Proceedings Article•

Adam: A Method for Stochastic Optimization

[...]

Diederik P. Kingma¹, Jimmy Ba²•Institutions (2)

University of Amsterdam¹, University of Toronto²

01 Jan 2015

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

Abstract: We introduce Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments. The method is straightforward to implement, is computationally efficient, has little memory requirements, is invariant to diagonal rescaling of the gradients, and is well suited for problems that are large in terms of data and/or parameters. The method is also appropriate for non-stationary objectives and problems with very noisy and/or sparse gradients. The hyper-parameters have intuitive interpretations and typically require little tuning. Some connections to related algorithms, on which Adam was inspired, are discussed. We also analyze the theoretical convergence properties of the algorithm and provide a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework. Empirical results demonstrate that Adam works well in practice and compares favorably to other stochastic optimization methods. Finally, we discuss AdaMax, a variant of Adam based on the infinity norm.

...read moreread less

111,197 citations

Proceedings Article•

Attention is All you Need

[...]

Ashish Vaswani¹, Noam Shazeer¹, Niki Parmar², Jakob Uszkoreit¹, Llion Jones¹, Aidan N. Gomez¹, Lukasz Kaiser¹, Illia Polosukhin¹ - Show less +4 more•Institutions (2)

Google¹, University of Southern California²

12 Jun 2017

TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.

...read moreread less

Abstract: The dominant sequence transduction models are based on complex recurrent orconvolutional neural networks in an encoder and decoder configuration. The best performing such models also connect the encoder and decoder through an attentionm echanisms. We propose a novel, simple network architecture based solely onan attention mechanism, dispensing with recurrence and convolutions entirely.Experiments on two machine translation tasks show these models to be superiorin quality while being more parallelizable and requiring significantly less timeto train. Our single model with 165 million parameters, achieves 27.5 BLEU onEnglish-to-German translation, improving over the existing best ensemble result by over 1 BLEU. On English-to-French translation, we outperform the previoussingle state-of-the-art with model by 0.7 BLEU, achieving a BLEU score of 41.1.

...read moreread less

52,856 citations

Proceedings Article•DOI•

You Only Look Once: Unified, Real-Time Object Detection

[...]

Joseph Redmon¹, Santosh K. Divvala², Ross Girshick³, Ali Farhadi²•Institutions (3)

University of Washington¹, Allen Institute for Artificial Intelligence², Facebook³

27 Jun 2016

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.

...read moreread less

Abstract: We present YOLO, a new approach to object detection. Prior work on object detection repurposes classifiers to perform detection. Instead, we frame object detection as a regression problem to spatially separated bounding boxes and associated class probabilities. A single neural network predicts bounding boxes and class probabilities directly from full images in one evaluation. Since the whole detection pipeline is a single network, it can be optimized end-to-end directly on detection performance. Our unified architecture is extremely fast. Our base YOLO model processes images in real-time at 45 frames per second. A smaller version of the network, Fast YOLO, processes an astounding 155 frames per second while still achieving double the mAP of other real-time detectors. Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background. Finally, YOLO learns very general representations of objects. It outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.

...read moreread less

27,256 citations

Journal Article•DOI•

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

[...]

Shaoqing Ren¹, Kaiming He², Ross Girshick³, Jian Sun²•Institutions (3)

University of Science and Technology of China¹, Microsoft², Facebook³

01 Jun 2017-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: This work introduces a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals and further merge RPN and Fast R-CNN into a single network by sharing their convolutionAL features.

...read moreread less

Abstract: State-of-the-art object detection networks depend on region proposal algorithms to hypothesize object locations. Advances like SPPnet [1] and Fast R-CNN [2] have reduced the running time of these detection networks, exposing region proposal computation as a bottleneck. In this work, we introduce a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals. An RPN is a fully convolutional network that simultaneously predicts object bounds and objectness scores at each position. The RPN is trained end-to-end to generate high-quality region proposals, which are used by Fast R-CNN for detection. We further merge RPN and Fast R-CNN into a single network by sharing their convolutional features—using the recently popular terminology of neural networks with ’attention’ mechanisms, the RPN component tells the unified network where to look. For the very deep VGG-16 model [3] , our detection system has a frame rate of 5 fps ( including all steps ) on a GPU, while achieving state-of-the-art object detection accuracy on PASCAL VOC 2007, 2012, and MS COCO datasets with only 300 proposals per image. In ILSVRC and COCO 2015 competitions, Faster R-CNN and RPN are the foundations of the 1st-place winning entries in several tracks. Code has been made publicly available.

...read moreread less

26,458 citations