Institution
Dolby Laboratories
Company•Amsterdam, Netherlands•
About: Dolby Laboratories is a company organization based out in Amsterdam, Netherlands. It is known for research contribution in the topics: Audio signal & Audio signal flow. The organization has 956 authors who have published 1726 publications receiving 29456 citations.
Papers published on a yearly basis
Papers
More filters
••
06 Jun 2021TL;DR: In this paper, a semi-supervised learning approach was proposed for automatic speech quality assessment, combining available annotations with programmatically generated data, and using three different optimization criteria together with five complementary auxiliary tasks.
Abstract: Automatic speech quality assessment is an important, transversal task whose progress is hampered by the scarcity of human annotations, poor generalization to unseen recording conditions, and a lack of flexibility of existing approaches. In this work, we tackle these problems with a semi-supervised learning approach, combining available annotations with programmatically generated data, and using 3 different optimization criteria together with 5 complementary auxiliary tasks. Our results show that such a semi-supervised approach can cut the error of existing methods by more than 36%, while providing additional benefits in terms of reusable features or auxiliary outputs. Improvement is further corroborated with an out-of-sample test showing promising generalization capabilities.
18 citations
•
21 Mar 2013TL;DR: In this paper, a method of outputting audio in a teleconferencing environment includes receiving audio streams, processing audio streams according to information regarding effective spatial positions, and outputting, by at least three speakers arranged in more than one dimension, the audio streams having been processed.
Abstract: A method of outputting audio in a teleconferencing environment includes receiving audio streams, processing the audio streams according to information regarding effective spatial positions, and outputting, by at least three speakers arranged in more than one dimension, the audio streams having been processed. The information regarding the plurality of effective spatial positions corresponds to a perceived spatial scene that extends beyond the speakers in at least two dimensions. In this manner, participants in the teleconference perceive the audio from the remote participants as originating at different positions in the teleconference room.
18 citations
••
TL;DR: In this paper, a plurality of discrete photo sensors examine small fractional portions of the sound track width to convert annoying quantizing noise to more tolerable random noise while using a practical number of sensors.
Abstract: Apparatus for scanning variable area optical sound tracks wherein a plurality of discrete photo sensors examine small fractional portions of the sound track width. Selection of the number of sensors in relation to photographic grain dimensions results in converting annoying quantizing noise to more tolerable random noise while using a practical number of sensors. Electronic processing of the sensors' outputs reduces impulse noise from dirt and scratches.
18 citations
••
TL;DR: In this paper, it was proved that every integer N that satisfies N ≡ 5(mod24) can be written as $$¯¯N = p^{2}_{1} + p^{ 2}_{2} + 1 ≤ i ≤ 5, be prime numbers.
Abstract: Let P
i
, 1 ≤ i ≤ 5, be prime numbers. It is proved that every integer N that satisfies
N ≡ 5(mod24) can be written as $$
N = p^{2}_{1} + p^{2}_{2} + p^{2}_{3} + p^{2}_{4} + p^{2}_{5} ,{\text{where}}{\left| {{\sqrt N }5 - p_{i} } \right|} \leqslant N^{{\frac{1}
{2} - \frac{{19}}
{{850}} + \in }}
$$
.
18 citations
•
05 Dec 2012TL;DR: In this paper, a visual dynamic range (VDR) coding system creates a sequence of VDR prediction images using corresponding standard dynamic range images and a prediction function, and an encoder identifies one or more areas within the prediction image suitable for post-prediction filtering.
Abstract: A visual dynamic range (VDR) coding system creates a sequence of VDR prediction images using corresponding standard dynamic range (SDR) images and a prediction function. For each prediction image, an encoder identifies one or more areas within the prediction image suitable for post-prediction filtering. For each identified post-prediction area, a post-prediction filtering mode is selected among one or more post-prediction filtering modes. The selected post-prediction filtering mode is applied to output a filtered prediction image. Information related to the post-prediction filtering areas and the selected corresponding post-prediction filtering modes may be communicated to a receiver (e.g., as metadata) for guided post-prediction filtering. Example post-prediction filtering modes that use low-pass averaging filtering or adaptive linear interpolation are also described.
18 citations
Authors
Showing all 959 results
Name | H-index | Papers | Citations |
---|---|---|---|
Wolfgang Heidrich | 64 | 312 | 15854 |
Rabab K. Ward | 56 | 549 | 14364 |
Lorne A. Whitehead | 42 | 232 | 6661 |
Scott J. Daly | 41 | 230 | 5543 |
Michael E. Miller | 40 | 225 | 5264 |
Alireza Marandi | 39 | 140 | 6116 |
Wolfgang Stuerzlinger | 35 | 230 | 5192 |
Lars Villemoes | 33 | 180 | 2815 |
Joan Serrà | 31 | 139 | 4046 |
Dong Tian | 31 | 116 | 3621 |
Peng Yin | 30 | 133 | 2454 |
Ning Xu | 28 | 117 | 2705 |
Nicolas R. Tsingos | 28 | 110 | 2749 |
Panos Nasiopoulos | 27 | 271 | 3706 |
Zhibo Chen | 27 | 344 | 3385 |