DeepStyle: Multimodal Search Engine for Fashion and Interior Design

Open AccessPosted Content

DeepStyle: Multimodal Search Engine for Fashion and Interior Design

Ivona Tautkute, +4 more

- 08 Jan 2018 -

arXiv: Computer Vision and Pattern Recog...

Chats0

TLDR

DeepStyle as mentioned in this paper proposes a multimodal search engine that combines visual and textual cues to retrieve items from a multimedia database aesthetically similar to the query by using a joint neural network architecture.

Abstract:

In this paper, we propose a multimodal search engine that combines visual and textual cues to retrieve items from a multimedia database aesthetically similar to the query. The goal of our engine is to enable intuitive retrieval of fashion merchandise such as clothes or furniture. Existing search engines treat textual input only as an additional source of information about the query image and do not correspond to the real-life scenario where the user looks for 'the same shirt but of denim'. Our novel method, dubbed DeepStyle, mitigates those shortcomings by using a joint neural network architecture to model contextual dependencies between features of different modalities. We prove the robustness of this approach on two different challenging datasets of fashion items and furniture where our DeepStyle engine outperforms baseline methods by 18-21% on the tested datasets. Our search engine is commercially deployed and available through a Web-based application.

Citations

PDF

Open Access

More filters

Posted Content

POG: Personalized Outfit Generation for Fashion Recommendation at Alibaba iFashion

Wen Chen, +9 more

- 06 May 2019 -

arXiv: Information Retrieval

TL;DR: This paper proposes a Personalized Outfit Generation (POG) model, which connects user preferences regarding individual items and outfits with Transformer architecture, and releases a large-scale dataset, which is the largest, publicly available, fashion related dataset, and the first to provide user behaviors relating to both outfits and fashion items.

...read moreread less

Posted Content

Outfit Generation and Style Extraction via Bidirectional LSTM and Autoencoder.

Takuma Nakamura, +1 more

- 29 Jun 2018 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: An unsupervised style extraction module is incorporated into a model to learn outfits using the style information of an outfit as a whole to generate outfits more flexibly without requiring additional information.

...read moreread less

Proceedings ArticleDOI

GANs-based Clothes Design: Pattern Maker Is All You Need to Design Clothing

Natsumi Kato, +4 more

TL;DR: This paper proposes a method of generation of clothing images for pattern makers using Progressive Growing of GANs (P-GANs) and conducts a user study to investigate whether the different image quality factors such as epoch and resolution affect the participants' confidence score.

...read moreread less

Journal ArticleDOI

FashionFit: Analysis of Mapping 3D Pose and Neural Body Fit for Custom Virtual Try-On

Mohammad Farukh Hashmi, +4 more

- 11 May 2020 -

IEEE Access

TL;DR: The authors propose a novel architecture which facilitates the combining outfits provided by the retailers and visualize it on the users themselves using Neural Body Fit, and creates a benchmark in disentangling the custom generation of cloth outfits using GANs and virtually trying it onThe users to ensure a virtual-photorealistic appearance and results to create a great customer experience by using AI.

...read moreread less

Posted Content

Reducing catastrophic forgetting with learning on synthetic data

Wojciech Masarczyk, +1 more

- 29 Apr 2020 -

arXiv: Learning

TL;DR: This work answers the question: Is it possible to generate data synthetically which learned in sequence does not result in catastrophic forgetting and proposes a method to generate such data in two-step optimisation process via meta-gradients.

...read moreread less

References

PDF

Open Access

More filters

Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.

...read moreread less

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.

...read moreread less

Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

David G. Lowe

- 01 Nov 2004 -

International Journal of Computer Vision

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.

...read moreread less

Journal ArticleDOI

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky, +11 more

- 01 Dec 2015 -

International Journal of Computer Vision

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) as mentioned in this paper is a benchmark in object category classification and detection on hundreds of object categories and millions of images, which has been run annually from 2010 to present, attracting participation from more than fifty institutions.

...read moreread less