Apparel-Recommendation

Extracts apparel details from the Amazon API and recommends similar products using NLTK and Keras.

DOMAIN BACKGROUND

Personalized product recommendations offer an alternative way of navigating an online shop: more people find the products they need, even ones they had not thought of. The goal is to build a recommendation engine that suggests products similar to a given product on any e-commerce website, e.g. Amazon.com or myntra.com.

In this project we extract product data for over 180,000 apparel images from a JSON file and recommend similar apparel using content-based search.

PROBLEM STATEMENT

Given a JSON file describing 180k apparel images from amazon.com, we need to recommend similar apparel based on a document ID (i.e. the product ID) and the number of items to recommend at a time. Recommendations are based on the following fields:

1. asin (Amazon Standard Identification Number)

2. brand (brand to which the product belongs)

3. color (color information of the apparel; a value can contain several colors, e.g. red and black stripes)

4. product_type_name (type of the apparel, e.g. SHIRT/TSHIRT)

5. medium_image_url (URL of the image)

6. title (title of the product)

7. formatted_price (price of the product)
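As a minimal sketch of how these fields could be pulled out of the JSON data, the snippet below parses a list of product records and keeps only the seven fields listed above. It assumes the file is a JSON array of objects; the helper name `load_products` and the sample records (including the `EXAMPLE...` ASINs and `example.com` URL) are hypothetical and only illustrate the shape of the data.

```python
import json

# The seven fields named in the problem statement.
FIELDS = ["asin", "brand", "color", "product_type_name",
          "medium_image_url", "title", "formatted_price"]

def load_products(json_text):
    """Parse a JSON array of product records and keep only the seven fields.

    Assumption: the data is a JSON array of objects. Missing fields become None.
    """
    records = json.loads(json_text)
    return [{field: record.get(field) for field in FIELDS} for record in records]

# Hypothetical sample records, not taken from the real dataset.
sample = json.dumps([
    {"asin": "EXAMPLE0001", "brand": "ExampleBrand", "color": "Red",
     "product_type_name": "SHIRT",
     "medium_image_url": "https://example.com/1.jpg",
     "title": "red striped cotton shirt", "formatted_price": "$19.99",
     "extra_field": "dropped"},
    {"asin": "EXAMPLE0002", "title": "blue cotton tshirt"},
])

products = load_products(sample)
print(products[0]["title"], products[1]["brand"])
```

Records with missing values (here the second record has no brand, color, or price) are kept with `None` so that later steps can decide whether to drop them.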

We use a total of seven approaches for recommending apparel:

1. Bag of words model

2. TF-IDF model

3. IDF model

4. Word2vec model

5. IDF-weighted word2vec model

6. Weighted similarity using brand and color

7. Visual features using convolutional neural networks
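To make the first two approaches concrete, here is a small standard-library sketch of title-based recommendation: titles are turned into bag-of-words count vectors, optionally reweighted by TF-IDF, and the nearest items by Euclidean distance are returned for a given document ID. This is an illustration under stated assumptions, not the project's actual code: the function names, the whitespace tokenizer, the `log(N / df)` IDF formula, and the toy titles are all hypothetical, and a real pipeline would more likely use library vectorizers.

```python
import math
from collections import Counter

def bag_of_words(titles):
    """Return the sorted vocabulary and one term-count vector per title."""
    tokenized = [title.lower().split() for title in titles]
    vocab = sorted({word for tokens in tokenized for word in tokens})
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append([counts[word] for word in vocab])
    return vocab, vectors

def tfidf(vectors):
    """Reweight count vectors by idf = log(N / document frequency)."""
    n_docs = len(vectors)
    n_terms = len(vectors[0])
    df = [sum(1 for v in vectors if v[j] > 0) for j in range(n_terms)]
    idf = [math.log(n_docs / d) for d in df]
    return [[v[j] * idf[j] for j in range(n_terms)] for v in vectors]

def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def recommend(vectors, doc_id, num_results):
    """Indices of the num_results items nearest to doc_id, excluding itself."""
    distances = [(euclidean(vectors[doc_id], v), i)
                 for i, v in enumerate(vectors) if i != doc_id]
    distances.sort()
    return [i for _, i in distances[:num_results]]

# Toy titles for illustration only.
titles = [
    "red striped cotton shirt",
    "blue striped cotton shirt",
    "black leather jacket",
    "red cotton tshirt",
]
vocab, bow = bag_of_words(titles)
print(recommend(bow, 0, 2))          # nearest neighbours by raw counts
print(recommend(tfidf(bow), 0, 2))   # nearest neighbours by TF-IDF weights
```

The same `recommend` function works for any of the text-based representations, since each approach only changes how a title is mapped to a vector.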

DATASETS AND INPUTS

Software Requirements

Execution and Running Code

Observation

  • We trained the seven models and, for each model, recommended the top 20 similar apparel items with the smallest Euclidean distance.

  • We calculated the average Euclidean distance for each model and compared the models using a line plot and a bar graph.
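The comparison step described above can be sketched as follows: take the distances of each model's recommended items, average them, and order the models by that average. The function names and the numbers are placeholders for illustration only; they are not the project's measured distances, and the model names are deliberately generic.

```python
from statistics import mean

def average_distances(distances_by_model):
    """Map each model name to the mean distance of its recommendations."""
    return {model: mean(dists) for model, dists in distances_by_model.items()}

def rank_models(distances_by_model):
    """Model names ordered from smallest to largest average distance."""
    averages = average_distances(distances_by_model)
    return sorted(averages, key=averages.get)

# Placeholder distances, NOT results from this project.
distances_by_model = {
    "model_a": [1.0, 2.0, 3.0],
    "model_b": [0.5, 1.5],
}
averages = average_distances(distances_by_model)
ranking = rank_models(distances_by_model)
print(averages)
print(ranking)
```

The resulting dictionary of averages is the kind of input a line plot or bar graph would take, with one value per model.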

Result

We found that the models rank as follows in recommendation performance, best first:

1. TF-IDF

2. AVERAGE WORD2VEC

3. BAG OF WORDS

4. BRAND AND COLOR

5. WEIGHTED WORD2VEC

6. IDF

7. CNN