Home
/
Authors
/
Manish Narwaria

Author

Manish Narwaria

Centre national de la recherche scientifique

Other affiliations: Nanyang Technological University, University of Nantes, Dhirubhai Ambani Institute of Information and Communication Technology

Bio: Manish Narwaria is an academic researcher from Centre national de la recherche scientifique. The author has contributed to research in topics: Tone mapping & Human visual system model. The author has an hindex of 19, co-authored 41 publications receiving 1828 citations. Previous affiliations of Manish Narwaria include Nanyang Technological University & University of Nantes.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Toward Better Statistical Validation of Machine Learning-Based Multimedia Quality Estimators

[...]

Manish Narwaria¹•Institutions (1)

Dhirubhai Ambani Institute of Information and Communication Technology¹

17 May 2018-IEEE Transactions on Broadcasting

TL;DR: The main goal of this paper is to shed light on limitations of the current ML-based objective quality predictor approach both from practical and theoretical perspectives wherever applicable, and in the process propose an alternate approach to overcome some of them.

...read moreread less

Abstract: Objective assessment of multimedia quality using machine learning (ML) has been gaining popularity especially in the context of both traditional (e.g., terrestrial and satellite broadcast) and advance (such as over-the-top media services, IPTV) broadcast services. Being data-driven, these methods obviously rely on training to find the optimal model parameters. Therefore, to statistically compare and validate such ML-based quality predictors, the current approach randomly splits the given data into training and test sets and obtains a performance measure (for instance mean squared error, correlation coefficient etc.). The process is repeated a large number of times and parametric tests (e.g., ${t}$ test) are then employed to statistically compare mean (or median) prediction accuracies. However, the current approach suffers from a few limitations (related to the qualitative aspects of training and testing data, the use of improper sample size for statistical testing, possibly dependent sample observations, and a lack of focus on quantifying the learning ability of the ML-based objective quality predictor) which have not been addressed in literature. Therefore, the main goal of this paper is to shed light on the said limitations both from practical and theoretical perspectives wherever applicable, and in the process propose an alternate approach to overcome some of them. As a major advantage, the proposed guidelines not only help in a theoretically more grounded statistical comparison but also provide useful insights into how well the ML-based objective quality predictors exploit data structure for learning. We demonstrate the added value of the proposed set of guidelines on standard datasets by comparing the performance of few existing ML-based quality estimators. A software implementation of the presented guidelines is also made publicly available to enable researchers and developers to test and compare different models in a repeatable manner.

...read moreread less

8 citations

Proceedings Article•DOI•

On improving the pooling in HDR-VDP-2 towards better HDR perceptual quality assessment

[...]

Manish Narwaria¹, Matthieu Perreira Da Silva¹, Patrick Le Callet¹, Romuald Pepion¹•Institutions (1)

Centre national de la recherche scientifique¹

25 Feb 2014-electronic imaging

TL;DR: The HDR Visual Difference Predictor (HDR-VDP-2) is primarily a visibility prediction metric i.e. whether the signal distortion is visible to the eye and to what extent and it also employs a pooling function to compute an overall quality score.

...read moreread less

Abstract: High Dynamic Range (HDR) signals capture much higher contrasts as compared to the traditional 8-bit low dynamic range (LDR) signals. This is achieved by representing the visual signal via values that are related to the real-world luminance, instead of gamma encoded pixel values which is the case with LDR. Therefore, HDR signals cover a larger luminance range and tend to have more visual appeal. However, due to the higher luminance conditions, the existing methods cannot be directly employed for objective quality assessment of HDR signals. For that reason, the HDR Visual Difference Predictor (HDR-VDP-2) has been proposed. HDR-VDP-2 is primarily a visibility prediction metric i.e. whether the signal distortion is visible to the eye and to what extent. Nevertheless, it also employs a pooling function to compute an overall quality score. This paper focuses on the pooling aspect in HDR-VDP-2 and employs a comprehensive database of HDR images (with their corresponding subjective ratings) to improve the prediction accuracy of HDR-VDP-2. We also discuss and evaluate the existing objective methods and provide a perspective towards better HDR quality assessment.

...read moreread less

7 citations

Proceedings Article•DOI•

Video quality assessment using temporal quality variations and machine learning

[...]

Manish Narwaria¹, Weisi Lin¹•Institutions (1)

Nanyang Technological University¹

11 Jul 2011

TL;DR: Experiments conducted using two publicly available video databases show the effectiveness of the proposed full-reference metric in comparison to the relevant existing VQA metrics.

...read moreread less

Abstract: Objective video quality assessment (VQA) is the use of computational models to predict the video quality in line with the perception of the human visual system (HVS). It is challenging due to the underlying complexity, and the relatively limited understanding of the HVS and its intricate mechanisms. There are two important issues regarding VQA: (a) the temporal factors apart from the spatial ones also need to be considered, (b) the contribution of each factor and their interaction to the overall video quality needs to be determined. In this paper, we attempt to tackle the first issue by utilizing the variation of spatial quality along the temporal axis. The second issue is addressed by the use of machine learning; we believe this to be more convincing since the relationship between the factors and the overall quality is derived via training with substantial ground truth (i.e. subjective scores). Experiments conducted using two publicly available video databases show the effectiveness of the proposed full-reference metric in comparison to the relevant existing VQA metrics.

...read moreread less

6 citations

Proceedings Article•DOI•

Rendering of HDR content on LDR displays: an objective approach

[...]

Lukas Krasula¹, Lukas Krasula², Manish Narwaria², Karel Fliegel¹, Patrick Le Callet² - Show less +1 more•Institutions (2)

Czech Technical University in Prague¹, Centre national de la recherche scientifique²

22 Sep 2015-Proceedings of SPIE

TL;DR: This work investigates into a new objective method for TMO parameters optimization based on quantification of contrast reversal and naturalness that does not require any prior knowledge about the input HDR image and works independently on the used TMO.

...read moreread less

Abstract: Dynamic range compression (or tone mapping) of HDR content is an essential step towards rendering it on traditional LDR displays in a meaningful way. This is however non-trivial and one of the reasons is that tone mapping operators (TMOs) usually need content-specific parameters to achieve the said goal. While subjective TMO parameter adjustment is the most accurate, it may not be easily deployable in many practical applications. Its subjective nature can also influence the comparison of different operators. Thus, there is a need for objective TMO parameter selection to automate the rendering process. To that end, we investigate into a new objective method for TMO parameters optimization. Our method is based on quantification of contrast reversal and naturalness. As an important advantage, it does not require any prior knowledge about the input HDR image and works independently on the used TMO. Experimental results using a variety of HDR images and several popular TMOs demonstrate the value of our method in comparison to default TMO parameter settings.

...read moreread less

6 citations

Proceedings Article•DOI•

An automated approach for tone mapping operator parameter adjustment in security applications

[...]

LukáÅ. ¡ Krasula¹, LukáÅ. ¡ Krasula², Manish Narwaria², Patrick Le Callet²•Institutions (2)

Czech Technical University in Prague¹, Centre national de la recherche scientifique²

15 May 2014-Proceedings of SPIE

TL;DR: This paper presents the universal method for TMO parameters tuning, in order to maintain as many details as possible, which is desirable in security applications, and suggests possible increase in privacy intrusion.

...read moreread less

Abstract: High Dynamic Range (HDR) imaging has been gaining popularity in recent years. Different from the traditional low dynamic range (LDR), HDR content tends to be visually more appealing and realistic as it can represent the dynamic range of the visual stimuli present in the real world. As a result, more scene details can be faithfully reproduced. As a direct consequence, the visual quality tends to improve. HDR can be also directly exploited for new applications such as video surveillance and other security tasks. Since more scene details are available in HDR, it can help in identifying/tracking visual information which otherwise might be difficult with typical LDR content due to factors such as lack/excess of illumination, extreme contrast in the scene, etc. On the other hand, with HDR, there might be issues related to increased privacy intrusion. To display the HDR content on the regular screen, tone-mapping operators (TMO) are used. In this paper, we present the universal method for TMO parameters tuning, in order to maintain as many details as possible, which is desirable in security applications. The method’s performance is verified on several TMOs by comparing the outcomes from tone-mapping with default and optimized parameters. The results suggest that the proposed approach preserves more information which could be of advantage for security surveillance but, on the other hand, makes us consider possible increase in privacy intrusion.

...read moreread less

6 citations

1
2
3
…
4
5
6
7
8
9

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

No-Reference Image Quality Assessment in the Spatial Domain

[...]

Anish Mittal¹, Anush K. Moorthy¹, Alan C. Bovik¹•Institutions (1)

University of Texas at Austin¹

01 Dec 2012-IEEE Transactions on Image Processing

TL;DR: Despite its simplicity, it is able to show that BRISQUE is statistically better than the full-reference peak signal-to-noise ratio and the structural similarity index, and is highly competitive with respect to all present-day distortion-generic NR IQA algorithms.

...read moreread less

Abstract: We propose a natural scene statistic-based distortion-generic blind/no-reference (NR) image quality assessment (IQA) model that operates in the spatial domain. The new model, dubbed blind/referenceless image spatial quality evaluator (BRISQUE) does not compute distortion-specific features, such as ringing, blur, or blocking, but instead uses scene statistics of locally normalized luminance coefficients to quantify possible losses of “naturalness” in the image due to the presence of distortions, thereby leading to a holistic measure of quality. The underlying features used derive from the empirical distribution of locally normalized luminances and products of locally normalized luminances under a spatial natural scene statistic model. No transformation to another coordinate frame (DCT, wavelet, etc.) is required, distinguishing it from prior NR IQA approaches. Despite its simplicity, we are able to show that BRISQUE is statistically better than the full-reference peak signal-to-noise ratio and the structural similarity index, and is highly competitive with respect to all present-day distortion-generic NR IQA algorithms. BRISQUE has very low computational complexity, making it well suited for real time applications. BRISQUE features may be used for distortion-identification as well. To illustrate a new practical application of BRISQUE, we describe how a nonblind image denoising algorithm can be augmented with BRISQUE in order to perform blind image denoising. Results show that BRISQUE augmentation leads to performance improvements over state-of-the-art methods. A software release of BRISQUE is available online: http://live.ece.utexas.edu/research/quality/BRISQUE_release.zip for public use and evaluation.

...read moreread less

3,780 citations

Journal Article•DOI•

Gradient Magnitude Similarity Deviation: A Highly Efficient Perceptual Image Quality Index

[...]

Wufeng Xue¹, Lei Zhang, Xuanqin Mou¹, Alan C. Bovik²•Institutions (2)

Xi'an Jiaotong University¹, University of Texas at Austin²

01 Feb 2014-IEEE Transactions on Image Processing

TL;DR: It is found that the pixel-wise gradient magnitude similarity (GMS) between the reference and distorted images combined with a novel pooling strategy-the standard deviation of the GMS map-can predict accurately perceptual image quality.

...read moreread less

Abstract: It is an important task to faithfully evaluate the perceptual quality of output images in many applications, such as image compression, image restoration, and multimedia streaming. A good image quality assessment (IQA) model should not only deliver high quality prediction accuracy, but also be computationally efficient. The efficiency of IQA metrics is becoming particularly important due to the increasing proliferation of high-volume visual data in high-speed networks. We present a new effective and efficient IQA model, called gradient magnitude similarity deviation (GMSD). The image gradients are sensitive to image distortions, while different local structures in a distorted image suffer different degrees of degradations. This motivates us to explore the use of global variation of gradient based local quality map for overall image quality prediction. We find that the pixel-wise gradient magnitude similarity (GMS) between the reference and distorted images combined with a novel pooling strategy-the standard deviation of the GMS map-can predict accurately perceptual image quality. The resulting GMSD algorithm is much faster than most state-of-the-art IQA methods, and delivers highly competitive prediction accuracy. MATLAB source code of GMSD can be downloaded at http://www4.comp.polyu.edu.hk/~cslzhang/IQA/GMSD/GMSD.htm.

...read moreread less

1,211 citations

Journal Article•DOI•

Perceptual visual quality metrics: A survey

[...]

Weisi Lin¹, C.-C. Jay Kuo²•Institutions (2)

Nanyang Technological University¹, University of Southern California²

01 May 2011-Journal of Visual Communication and Image Representation

TL;DR: A systematic, comprehensive and up-to-date review of perceptual visual quality metrics (PVQMs) to predict picture quality according to human perception.

...read moreread less

895 citations

Journal Article•DOI•

VSI: a visual saliency-induced index for perceptual image quality assessment.

[...]

Lin Zhang¹, Ying Shen¹, Hongyu Li¹•Institutions (1)

Tongji University¹

07 Aug 2014-IEEE Transactions on Image Processing

TL;DR: Extensive experiments performed on four largescale benchmark databases demonstrate that the proposed IQA index VSI works better in terms of the prediction accuracy than all state-of-the-art IQA indices the authors can find while maintaining a moderate computational complexity.

...read moreread less

Abstract: Perceptual image quality assessment (IQA) aims to use computational models to measure the image quality in consistent with subjective evaluations. Visual saliency (VS) has been widely studied by psychologists, neurobiologists, and computer scientists during the last decade to investigate, which areas of an image will attract the most attention of the human visual system. Intuitively, VS is closely related to IQA in that suprathreshold distortions can largely affect VS maps of images. With this consideration, we propose a simple but very effective full reference IQA method using VS. In our proposed IQA model, the role of VS is twofold. First, VS is used as a feature when computing the local quality map of the distorted image. Second, when pooling the quality score, VS is employed as a weighting function to reflect the importance of a local region. The proposed IQA index is called visual saliency-based index (VSI). Several prominent computational VS models have been investigated in the context of IQA and the best one is chosen for VSI. Extensive experiments performed on four large-scale benchmark databases demonstrate that the proposed IQA index VSI works better in terms of the prediction accuracy than all state-of-the-art IQA indices we can find while maintaining a moderate computational complexity. The MATLAB source code of VSI and the evaluation results are publicly available online at http://sse.tongji.edu.cn/linzhang/IQA/VSI/VSI.htm.

...read moreread less

823 citations

Posted Content•

Gradient Magnitude Similarity Deviation: A Highly Efficient Perceptual Image Quality Index

[...]

Wufeng Xue¹, Lei Zhang, Xuanqin Mou¹, Alan C. Bovik²•Institutions (2)

Xi'an Jiaotong University¹, University of Texas at Austin²

14 Aug 2013-arXiv: Computer Vision and Pattern Recognition

TL;DR: In this article, a gradient magnitude similarity deviation (GMSD) method was proposed for image quality assessment, where the pixel-wise GMS between the reference and distorted images was combined with a novel pooling strategy to predict accurately perceptual image quality.

...read moreread less

Abstract: It is an important task to faithfully evaluate the perceptual quality of output images in many applications such as image compression, image restoration and multimedia streaming. A good image quality assessment (IQA) model should not only deliver high quality prediction accuracy but also be computationally efficient. The efficiency of IQA metrics is becoming particularly important due to the increasing proliferation of high-volume visual data in high-speed networks. We present a new effective and efficient IQA model, called gradient magnitude similarity deviation (GMSD). The image gradients are sensitive to image distortions, while different local structures in a distorted image suffer different degrees of degradations. This motivates us to explore the use of global variation of gradient based local quality map for overall image quality prediction. We find that the pixel-wise gradient magnitude similarity (GMS) between the reference and distorted images combined with a novel pooling strategy the standard deviation of the GMS map can predict accurately perceptual image quality. The resulting GMSD algorithm is much faster than most state-of-the-art IQA methods, and delivers highly competitive prediction accuracy.

...read moreread less

742 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse