- Client queries the load-balancing server for an inference server address.
- Load balancer returns a serving address chosen by the DRL agent.
- Client sends the actual request to the inference server.
- Artificial network overhead is added upon receiving the inference response.
- Overall result is returned to the evaluator to generate a reward for the DRL agent.
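A minimal sketch of how the artificial-overhead step could work, assuming a hypothetical static latency table keyed by (client region, server region); the region names and values are placeholders, not measurements:

```python
import time

# Hypothetical one-way latency (seconds) between regions; placeholder values.
REGION_LATENCY = {
    ("us-east", "us-east"): 0.005,
    ("us-east", "eu-west"): 0.045,
    ("eu-west", "eu-west"): 0.005,
    ("eu-west", "us-east"): 0.045,
}

def artificial_overhead(client_region: str, server_region: str) -> float:
    """Return the simulated round-trip overhead for one request."""
    one_way = REGION_LATENCY.get((client_region, server_region), 0.050)
    return 2 * one_way  # request + response

def apply_overhead(client_region: str, server_region: str) -> float:
    """Sleep to simulate the overhead; return the delay that was added."""
    delay = artificial_overhead(client_region, server_region)
    time.sleep(delay)
    return delay
```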
- Make test request series and the evaluator.
- Implement a random load balancer (see the baseline sketch after this list).
- Implement a region-aware load balancer.
- Run the DRL load balancer for one hour.
- Compare the results of the three load balancers.
- Automate deployment.
  - Include a Dockerfile for the inference server. Probably also include a bash script to launch multiple Docker containers, each serving a single model.
  - Scripts to install prerequisites.
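For reference, the two baseline policies are simple enough to sketch directly. A minimal illustration, assuming a hypothetical server registry; the addresses and regions are placeholders:

```python
import random

# Hypothetical server registry; addresses and regions are placeholders.
SERVERS = [
    {"address": "10.0.0.1:8000", "region": "us-east"},
    {"address": "10.0.0.2:8000", "region": "eu-west"},
]

def random_balancer(servers=SERVERS) -> str:
    """Pick a serving address uniformly at random."""
    return random.choice(servers)["address"]

def region_aware_balancer(client_region: str, servers=SERVERS) -> str:
    """Prefer a server in the client's region; fall back to random."""
    local = [s for s in servers if s["region"] == client_region]
    return random.choice(local or servers)["address"]
```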
client.py:
- Load ImageNet validation images.
- Query loadbalancer.py for an inference server address.
- Send the request to the inference server.
- Calculate the artificial network overhead.
- Send the result to evaluater.py.
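A rough sketch of the client loop tying these steps together. The endpoints, payload shapes, and the `artificial_overhead` helper (stubbed here; see the overhead sketch above for a table-based version) are all assumptions:

```python
import time
import requests

# Hypothetical endpoints; actual hosts/ports are deployment configuration.
LOADBALANCER_URL = "http://localhost:5000/route"
EVALUATOR_URL = "http://localhost:5001/report"

def artificial_overhead(client_region: str, server_region: str) -> float:
    """Stub; see the table-based overhead sketch above."""
    return 0.0

def send_one_request(image_bytes: bytes, image_id: str, client_region: str) -> None:
    # 1. Ask the load balancer which inference server to use.
    route = requests.get(LOADBALANCER_URL, params={"region": client_region}).json()
    server = route["address"]

    # 2. Send the image to the chosen inference server and time it.
    start = time.time()
    prediction = requests.post(f"http://{server}/predict", data=image_bytes).json()
    latency = time.time() - start

    # 3. Add the artificial network overhead based on the two regions.
    latency += artificial_overhead(client_region, route["region"])

    # 4. Report the outcome so evaluater.py can compute the DRL reward.
    requests.post(EVALUATOR_URL, json={
        "image_id": image_id,
        "prediction": prediction,
        "latency": latency,
        "server": server,
    })
```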
loadbalancer.py:
- Generate an 'observation' for the DRL agent upon a client query for a serving address.
- Return the DRL agent's action (a serving address) to the client.
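One possible shape for loadbalancer.py, sketched with Flask. The server list, the `latest_server_state` feed from servermonitor.py, and the `agent_act` policy interface to drl.py are stubs for illustration, not the actual wiring:

```python
from flask import Flask, request, jsonify

app = Flask(__name__)

# Placeholder server list; the real one comes from deployment config.
SERVERS = ["10.0.0.1:8000", "10.0.0.2:8000"]

def latest_server_state():
    """Stub for the state gathered by servermonitor.py (e.g. CPU load)."""
    return [0.3, 0.7]

def agent_act(observation):
    """Stub for drl.py's policy; returns an index into SERVERS."""
    return 0

@app.route("/route", methods=["GET"])
def route():
    # Build the DRL observation from the client query and the server states.
    observation = {
        "client_region": request.args.get("region", "unknown"),
        "server_state": latest_server_state(),
    }
    action = agent_act(observation)  # the action is a server index
    return jsonify({"address": SERVERS[action]})

if __name__ == "__main__":
    app.run(port=5000)
```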
evaluater.py:
- Load the ImageNet validation labels.
- Listen for client response reports; parse them and feed the resulting reward to drl.py.
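A sketch of how one client report could be turned into a reward, assuming a correctness-minus-latency shape; the weighting is a placeholder, not a chosen hyperparameter:

```python
# Hypothetical reward: +1 for a correct prediction, minus a latency penalty.
# The weight is a placeholder; the real trade-off is a tuning decision.
LATENCY_WEIGHT = 0.1

def compute_reward(report: dict, labels: dict) -> float:
    """Turn one client report into a scalar reward for the DRL agent."""
    correct = report["prediction"] == labels[report["image_id"]]
    return (1.0 if correct else 0.0) - LATENCY_WEIGHT * report["latency"]
```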
servermonitor.py:
- Listen for state reports from reportstate.py.
- Send the gathered state reports to drl.py.
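A minimal Flask sketch of the monitor, assuming reportstate.py POSTs JSON reports keyed by server address; the endpoint path and storage format are assumptions:

```python
from flask import Flask, request, jsonify

app = Flask(__name__)

# Latest report per server address; reportstate.py posts into this.
latest_state = {}

@app.route("/state", methods=["POST"])
def receive_state():
    report = request.get_json()
    latest_state[report["server"]] = report
    return jsonify({"ok": True})

@app.route("/state", methods=["GET"])
def read_state():
    # loadbalancer.py / drl.py can poll this to build observations.
    return jsonify(latest_state)

if __name__ == "__main__":
    app.run(port=5002)
```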
reportstate.py:
- Collect server's state.
- Send it to servermonitor.py.
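A sketch of the reporter using psutil for CPU and memory; the monitor URL, server ID, and reporting interval are placeholders:

```python
import time
import psutil
import requests

# Hypothetical monitor endpoint; host/port are deployment configuration.
MONITOR_URL = "http://localhost:5002/state"
SERVER_ID = "10.0.0.1:8000"

def collect_state() -> dict:
    """Sample this server's resource usage with psutil."""
    return {
        "server": SERVER_ID,
        "cpu_percent": psutil.cpu_percent(interval=None),
        "memory_percent": psutil.virtual_memory().percent,
    }

if __name__ == "__main__":
    while True:
        requests.post(MONITOR_URL, json=collect_state())
        time.sleep(1)  # report once per second; the interval is a placeholder
```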
drl.py:
- Use the request and server_state as the observation (state).
- Use a DQN to approximate the Q-function.
- Return a server/model choice as the action.
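A minimal PyTorch sketch of the Q-network and epsilon-greedy action selection, assuming the observation has already been flattened into a numeric vector; the replay buffer and training loop are omitted:

```python
import random
import torch
import torch.nn as nn

class QNetwork(nn.Module):
    """Small MLP mapping an observation vector to one Q-value per server."""
    def __init__(self, obs_dim: int, n_servers: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim, 64), nn.ReLU(),
            nn.Linear(64, 64), nn.ReLU(),
            nn.Linear(64, n_servers),
        )

    def forward(self, obs: torch.Tensor) -> torch.Tensor:
        return self.net(obs)

def select_action(qnet: QNetwork, obs: torch.Tensor, epsilon: float) -> int:
    """Epsilon-greedy: explore randomly, otherwise pick the best server."""
    if random.random() < epsilon:
        return random.randrange(qnet.net[-1].out_features)
    with torch.no_grad():
        return int(qnet(obs).argmax().item())
```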