IMDB Coding Challenge

Note

Make sure the correct host is selected in the configuration file. Also, either set the data paths for Movies Title, Actors Details, and Principals in the configuration file, or place the files in the folders already created for them.

Getting Started

The solution is containerized and automated using Docker and Make. Use the following commands to run it.

First, build and start the containers:

make build

Execute the solution:

make run

Execute test cases:

make run-testcases

Prune/Delete containers:

make stop

Execution time of the solution is approximately 2 minutes with 4 cores and 12 GB of memory.
The distribution graph is stored in resources/graphs.

Without Docker

To run the solution locally, open the config file, comment out line 7, and uncomment line 8. Then run:

python runner.py --Remote False
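
Because the flag value is passed as the text `False`, the script has to convert that string to a boolean itself. The sketch below shows one common way to do this with `argparse`. It is only an illustration: how `runner.py` actually parses `--Remote` is not shown here, and the `str_to_bool` helper, the accepted spellings, and the default value are assumptions.

```python
import argparse


def str_to_bool(value: str) -> bool:
    """Convert a command-line string such as 'False' into a real boolean."""
    lowered = value.strip().lower()
    if lowered in {"true", "1", "yes"}:
        return True
    if lowered in {"false", "0", "no"}:
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical parser; the default of True is an assumption.
    parser = argparse.ArgumentParser(description="Hypothetical runner CLI")
    parser.add_argument(
        "--Remote",
        type=str_to_bool,
        default=True,
        help="Run against the remote (containerized) host or locally.",
    )
    return parser


# Equivalent to: python runner.py --Remote False
args = build_parser().parse_args(["--Remote", "False"])
print(args.Remote)
```

Using `type=bool` directly would not work here, since any non-empty string (including `"False"`) is truthy in Python.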

Approach Used

A top-down approach is used in combination with divide-and-conquer. First, the larger data frame is selected and reduced using the filters, then the calculations are performed on the reduced data to achieve better performance.
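
The filter-first idea described above can be sketched in plain Python. This is a minimal illustration and not the code in this repository: the sample rows, the column names (`tconst`, `nconst`, `title_type`, `start_year`), and the function `count_titles_per_person` are assumptions chosen for the example.

```python
from collections import defaultdict

# Hypothetical, tiny stand-ins for the titles and principals data sets.
titles = [
    {"tconst": "tt1", "title_type": "movie", "start_year": 2015},
    {"tconst": "tt2", "title_type": "short", "start_year": 2016},
    {"tconst": "tt3", "title_type": "movie", "start_year": 1990},
]
principals = [
    {"tconst": "tt1", "nconst": "nm1"},
    {"tconst": "tt1", "nconst": "nm2"},
    {"tconst": "tt2", "nconst": "nm1"},
    {"tconst": "tt3", "nconst": "nm3"},
]


def count_titles_per_person(titles, principals, title_type, min_year):
    # Step 1: reduce the titles with filters before touching other data.
    kept = {
        t["tconst"]
        for t in titles
        if t["title_type"] == title_type and t["start_year"] >= min_year
    }
    # Step 2: aggregate only over rows that reference a surviving title.
    counts = defaultdict(int)
    for p in principals:
        if p["tconst"] in kept:
            counts[p["nconst"]] += 1
    return dict(counts)


result = count_titles_per_person(titles, principals, "movie", 2000)
print(result)
```

Filtering before the aggregation keeps the second step proportional to the reduced data instead of the full data set, which is where the performance gain comes from.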

For more samples of my work, please visit GitHub

Email: abdul.mateen59@yahoo.com
