Kornia: an Open Source Differentiable Computer Vision Library for PyTorch

Open AccessPosted Content

Kornia: an Open Source Differentiable Computer Vision Library for PyTorch

Edgar Riba, +4 more

- 05 Oct 2019 -

arXiv: Computer Vision and Pattern Recog...

Chats0

TLDR

Kornia as mentioned in this paper is an open source computer vision library which consists of a set of differentiable routines and modules to solve generic computer vision problems, such as image transformations, camera calibration, epipolar geometry, and low level image processing techniques.

Abstract:

This work presents Kornia -- an open source computer vision library which consists of a set of differentiable routines and modules to solve generic computer vision problems. The package uses PyTorch as its main backend both for efficiency and to take advantage of the reverse-mode auto-differentiation to define and compute the gradient of complex functions. Inspired by OpenCV, Kornia is composed of a set of modules containing operators that can be inserted inside neural networks to train models to perform image transformations, camera calibration, epipolar geometry, and low level image processing techniques, such as filtering and edge detection that operate directly on high dimensional tensor representations. Examples of classical vision problems implemented using our framework are provided including a benchmark comparing to existing vision libraries.

Citations

PDF

Open Access

More filters

Posted Content

Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels

Ilya Kostrikov, +2 more

- 28 Apr 2020 -

arXiv: Learning

TL;DR: The addition of the augmentation method dramatically improves SAC's performance, enabling it to reach state-of-the-art performance on the DeepMind control suite, surpassing model-based methods and recently proposed contrastive learning (CURL).

...read moreread less

Journal ArticleDOI

TorchIO: A Python library for efficient loading, preprocessing, augmentation and patch-based sampling of medical images in deep learning

Fernando Pérez-García, +2 more

- 17 Jun 2021 -

Computer Methods and Programs in Biomedi...

TL;DR: Support for 2D, 3D and 4D images such as X-ray, histopathology, CT, ultrasound and diffusion MRI and focus on reproducibility and traceability to encourage open-science practices.

...read moreread less

Posted Content

Kaolin: A PyTorch Library for Accelerating 3D Deep Learning Research

Krishna Murthy Jatavallabhula, +8 more

- 12 Nov 2019 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: Kaolin provides efficient implementations of differentiable 3D modules for use in deep learning systems and curates a comprehensive model zoo comprising many state-of-the-art 3D deep learning architectures to serve as a starting point for future research endeavours.

...read moreread less

Posted ContentDOI

DeepEthogram: a machine learning pipeline for supervised behavior classification from raw pixels

James P Bohnslav, +8 more

- 25 Sep 2020 -

bioRxiv

TL;DR: DeepEthogram is software that takes raw pixel values of videos as input and uses machine learning to output an ethogram, the set of user-defined behaviors of interest present in each frame of a video, which is expected to enable the rapid, automated, and reproducible assignment of behavior labels to every frame of an video, thus accelerating all those studies that quantify behaviors ofinterest.

...read moreread less

Journal ArticleDOI

DeepEthogram, a machine learning pipeline for supervised behavior classification from raw pixels.

James P Bohnslav, +14 more

- 02 Sep 2021 -

eLife

TL;DR: DeepEthogram as discussed by the authors is a software that uses supervised machine learning to convert raw video pixels into an ethogram, the behaviors of interest present in each video frame, which can be used to quantify researcher-defined behaviors to study neural function, gene mutations, and pharmacological therapies.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

David G. Lowe

- 01 Nov 2004 -

International Journal of Computer Vision

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.

...read moreread less

Journal ArticleDOI

Gradient-based learning applied to document recognition

Yann LeCun, +6 more

TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.

...read moreread less

Journal ArticleDOI

Image quality assessment: from error visibility to structural similarity

Zhou Wang, +3 more

- 01 Apr 2004 -

IEEE Transactions on Image Processing

TL;DR: In this article, a structural similarity index is proposed for image quality assessment based on the degradation of structural information, which can be applied to both subjective ratings and objective methods on a database of images compressed with JPEG and JPEG2000.

...read moreread less

Collapse

Kornia: an Open Source Differentiable Computer Vision Library for PyTorch

Citations

Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels

TorchIO: A Python library for efficient loading, preprocessing, augmentation and patch-based sampling of medical images in deep learning

Kaolin: A PyTorch Library for Accelerating 3D Deep Learning Research

DeepEthogram: a machine learning pipeline for supervised behavior classification from raw pixels

DeepEthogram, a machine learning pipeline for supervised behavior classification from raw pixels.

References

Adam: A Method for Stochastic Optimization

ImageNet Classification with Deep Convolutional Neural Networks

Distinctive Image Features from Scale-Invariant Keypoints

Gradient-based learning applied to document recognition

Image quality assessment: from error visibility to structural similarity

Related Papers (5)

Kornia: an Open Source Differentiable Computer Vision Library for PyTorch

An Introduction to 3D Computer Vision Techniques and Algorithms

OpenGV: A unified and generalized approach to real-time calibrated geometric vision

Coding cells of digital spaces: a framework to write generic digital topology algorithms

Designing Camera Networks by Convex Quadratic Programming