Video Denoising, Deblocking, and Enhancement Through Separable 4-D Nonlocal Spatiotemporal Transforms
read more
Citations
Video Enhancement with Task-Oriented Flow
Burst photography for high dynamic range and low-light imaging on mobile cameras
Video Enhancement with Task-Oriented Flow
Burst Denoising with Kernel Prediction Networks
Fast Hyperspectral Image Denoising and Inpainting Based on Low-Rank and Sparse Representations
References
A simplex method for function minimization
Overview of the H.264/AVC video coding standard
Image Denoising by Sparse 3-D Transform-Domain Collaborative Filtering
Convergence Properties of the Nelder--Mead Simplex Method in Low Dimensions
A Review of Image Denoising Algorithms, with a New One
Related Papers (5)
Frequently Asked Questions (17)
Q2. What is the way to control the decay rate of the exponential term?
By setting a proper value of σw the authors can control the decay rate of the exponential term as a function of v or, in other words, how permissive is the window contraction with respect to the velocity of the tracked block.
Q3. What is the effect of the proposed enhancement filter?
The proposed enhancement filter is minimally susceptible to noise even when strong sharpening is performed (i.e., α = 1.25), as shown by the smooth reconstruction of flat areas like the hat of Foreman and the bus roof of Bus.
Q4. What is the common technique used in enhancement?
A critical issue in enhancement is the amplification of the noise together with the sharpening of image details [44], [42], an effect that becomes more severe as the amount of applied sharpening increases.
Q5. What is the value of for the coefficients that do not belong to this 3-D?
the value of α can be decreased for the coefficients that do not belong to this 3-D volume, in order to attenuate the temporal flickering artifacts.
Q6. How is the performance of the V-BM4D metric measured?
the authors measure the performance of V-BM4D by means of the MOVIE index [46], a recently introduced video quality assessment (VQA) metric that is expected to be closer to the human visual judgement than the PSNR, because it concurrently evaluates space, time and jointly space-time video quality.
Q7. What is the optimum value of for a compressed video?
In order to use V-BM4D as a deblocking filter, the authors need to determine a suitable value of σ to handle the artifacts in a compressed video.
Q8. What is the simplest way to reduce the complexity of the grouping phase?
To reduce the complexity of the grouping phase, the authors restrict the search of similar volumes within a NG × NG neighborhood centered around the coordinates of the reference volume, and the authors introduce a step of Nstep ∈ N pixels in both horizontal and vertical directions between each reference volume.
Q9. What is the way to extract a sub-volume of length L0?
Among all the possible criteria for extracting a sub-volume of length L0 = h−0 + h + 0 + 1 from a longer volume, their choice aims at limiting the complexity while maintaining a high correlation within the grouped volumes, because the authors can reasonably assume that similar objects at different positions are represented by similar volumes along time.
Q10. How many arithmetical operations is required to perform the hard-thresholding?
Observe that the hard-thresholding, which is performed via element-wise comparison, requires one arithmetical operation per pixel.
Q11. What is the minimum degree of similarity between volumes?
The parameter τmatch > 0 controls the minimum degree of similarity among volumes with respect to the distance δv, which is typically the `2-norm of the difference between two volumes.
Q12. How are the temporal DC and AC sharpened?
In particular, the temporal DC coefficients are sharpened using αDC = 1.25, and the temporal AC are sharpened using the halved value αAC = 0.625.
Q13. How does the MOVIE index compare to the VBM3D?
From an objective point of view, as reported in Table IV, V-BM4D performs better than VBM3D in every experiment, with PSNR gains of up to 1.5dB.
Q14. How can V-BM4D process the useless blocks?
by skipping the motion estimation of the useless blocks, it is possible to achieve an additional speed-up of ∼12x that allows V-BM4D to process nearly 4 fps without affecting the final reconstruction quality.
Q15. Why is the grouping of volumes different?
in practice, because of the separability of the transform T4D, every group Gz(xi, ti) has to be composed of volumes having the samelength.
Q16. How can the authors eliminate the cost of the Wiener filtering?
Let us observe that this cost can be entirely eliminated when the input video is encoded with a motion-compensated algorithm, such as MPEG-4 or H.264, since the motion vectors required to build the spatiotemporal volumes can be directly extracted from the encoded video.
Q17. What is the result of the hard-thresholding stage?
The outcome of the hard-thresholding stage, ŷht, is obtained by aggregating with a convex combination all the estimated groups Ĝhty (x, t), as defined in (5).