AI Gets Creative

doi:10.1145/3347449.3357490

Proceedings ArticleDOI

AI Gets Creative

Marta Mrak

- pp 1-2

Chats0

TLDR

Looking forward, this penetration of AI opens new challenges, such as interpretability of deep learning (to enable use AI in an accountable way as well as to enable AI-inspired low-complexity algorithms) and applicability in systems which require low- complexity solutions and/or do not have enough training data.

Abstract:

Numerous breakthroughs in multimedia signal processing are being enabled thanks to applications of machine learning in tasks such as multimedia creation, enhancement, classification and compression [1]. Notably, in the context of production and distribution of television programmes, it has been successfully demonstrated how Artificial Intelligence (AI) can support innovation in the creative sector. In the context of delivering TV programmes of stunning visual quality, the applications of deep learning have enabled significant advances when the original content is of poor quality / resolution, or when delivery channels are very limited. Examples when the enhancement of originally poor quality is needed include new content forms (e.g. user generated content) and historical content (e.g. archives), while limitations of delivery channels can, first of all, be addressed by improving content compression. As a state-of-the-art example, the benefits of deep-learning solutions have been recently demonstrated within an end-to-end platform for management of user generated content [2], where deep learning is applied to increase video resolution, evaluate video quality and enrich the video by providing automatic metadata. Within this particular application space where large amount of user generated content is available, the progress has also been made in addressing visual story editing using social media data in automatic ways, making programmes from large amount of content faster [3]. Broadcasters are also interested in restauration of historical content more cheaply. For example, adding colour to "black and white" content has until now been an expensive and time-consuming task. However, recently new algorithms have been developed to perform the task more efficiently. Generative Adversarial Networks (GANs) have become the baseline for many image-to-image translation tasks, including image colourisation. Aiming at the generation of more naturally coloured images from "black and white" sources, newest algorithms are capable of generalisation of the colour of natural images, producing realistic and plausible results [4]. In the context of content delivery, new generations of compression standards enable significant reduction of required bandwidth [5], however, with a cost of increased computational complexity. This is another area where AI can be utilised for better efficiency - either in its simple forms as decision trees [6,7] or more advanced deep convolutional neural networks [8]. Looking forward, this penetration of AI opens new challenges, such as interpretability of deep learning (to enable use AI in an accountable way as well as to enable AI-inspired low-complexity algorithms) and applicability in systems which require low-complexity solutions and/or do not have enough training data. However, overall further benefits of these new approaches include automatization of many traditional production tasks which has the potential to transform the way content providers make their programmes in cheaper and more effective ways.

AI Gets Creative

Citations

Towards Transparent Application of Machine Learning in Video Processing.

Scene-Adaptive Temporal Stabilisation for Video Colourisation Using Deep Video Priors

References

End-to-End Conditional GAN-based Architectures for Image Colourisation

Decision Trees for Complexity Reduction in Video Compression

Estimation of Rate Control Parameters for Video Coding Using CNN

Decision Trees for Complexity Reduction in Video Compression

Fast Inter-prediction Based on Decision Trees for AV1 Encoding

Related Papers (5)

The Technological Elements of Artificial Intelligence

Emerging Computer Vision Based Machine Learning Issues for Smart Cities

Content without context is meaningless

SIGIR 2018 Workshop on Learning from Limited or Noisy Data for Information Retrieval

Overcoming Barriers to Progress in Multimodal Fusion Research.

Trending Questions (3)