scispace - formally typeset
Open AccessPosted Content

DeepStyle: Multimodal Search Engine for Fashion and Interior Design

Reads0
Chats0
TLDR
DeepStyle as mentioned in this paper proposes a multimodal search engine that combines visual and textual cues to retrieve items from a multimedia database aesthetically similar to the query by using a joint neural network architecture.
Abstract: 
In this paper, we propose a multimodal search engine that combines visual and textual cues to retrieve items from a multimedia database aesthetically similar to the query. The goal of our engine is to enable intuitive retrieval of fashion merchandise such as clothes or furniture. Existing search engines treat textual input only as an additional source of information about the query image and do not correspond to the real-life scenario where the user looks for 'the same shirt but of denim'. Our novel method, dubbed DeepStyle, mitigates those shortcomings by using a joint neural network architecture to model contextual dependencies between features of different modalities. We prove the robustness of this approach on two different challenging datasets of fashion items and furniture where our DeepStyle engine outperforms baseline methods by 18-21% on the tested datasets. Our search engine is commercially deployed and available through a Web-based application.

read more

Citations
More filters
Posted Content

POG: Personalized Outfit Generation for Fashion Recommendation at Alibaba iFashion

TL;DR: This paper proposes a Personalized Outfit Generation (POG) model, which connects user preferences regarding individual items and outfits with Transformer architecture, and releases a large-scale dataset, which is the largest, publicly available, fashion related dataset, and the first to provide user behaviors relating to both outfits and fashion items.
Posted Content

Outfit Generation and Style Extraction via Bidirectional LSTM and Autoencoder.

TL;DR: An unsupervised style extraction module is incorporated into a model to learn outfits using the style information of an outfit as a whole to generate outfits more flexibly without requiring additional information.
Proceedings ArticleDOI

GANs-based Clothes Design: Pattern Maker Is All You Need to Design Clothing

TL;DR: This paper proposes a method of generation of clothing images for pattern makers using Progressive Growing of GANs (P-GANs) and conducts a user study to investigate whether the different image quality factors such as epoch and resolution affect the participants' confidence score.
Journal ArticleDOI

FashionFit: Analysis of Mapping 3D Pose and Neural Body Fit for Custom Virtual Try-On

TL;DR: The authors propose a novel architecture which facilitates the combining outfits provided by the retailers and visualize it on the users themselves using Neural Body Fit, and creates a benchmark in disentangling the custom generation of cloth outfits using GANs and virtually trying it onThe users to ensure a virtual-photorealistic appearance and results to create a great customer experience by using AI.
Posted Content

Reducing catastrophic forgetting with learning on synthetic data

TL;DR: This work answers the question: Is it possible to generate data synthetically which learned in sequence does not result in catastrophic forgetting and proposes a method to generate such data in two-step optimisation process via meta-gradients.
References
More filters
Proceedings Article

Adam: A Method for Stochastic Optimization

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.
Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Journal ArticleDOI

ImageNet Large Scale Visual Recognition Challenge

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) as mentioned in this paper is a benchmark in object category classification and detection on hundreds of object categories and millions of images, which has been run annually from 2010 to present, attracting participation from more than fifty institutions.
Related Papers (5)