PFA-DDQN_Experience_Replay

This repository provides code and resources for a Double Deep Q-Network (DDQN) with Experience Replay, a deep reinforcement learning (DRL) solution for load optimization in edge Kubernetes clusters. It aims to enhance Kubernetes efficiency and performance by dynamically managing and redistributing workloads based on real-time memory usage across nodes.

Key Features

Double Deep Q-Network (DDQN): Uses a DDQN architecture, which reduces the overestimation bias often found in standard DQNs and so makes learning more stable.
Experience Replay: Stores past experiences and reuses them during training, which breaks the correlation between consecutive experiences and improves learning efficiency.
Kubernetes Environment Simulation: Includes training environment code that simulates a Kubernetes cluster, providing a realistic setting for training the DRL agent.
Load Optimization: Balances memory usage across the nodes in the cluster to prevent memory bottlenecks and improve overall cluster performance.
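The first two features can be illustrated with a small, framework-free sketch. It is not taken from the notebook: the `Transition` fields, the `ReplayBuffer` class, the function names, and the default `gamma` are all assumptions made for illustration, and Q-values are passed in as plain lists rather than produced by a TensorFlow model.

```python
import random
from collections import deque, namedtuple

# Hypothetical transition record; the field names are illustrative only.
Transition = namedtuple("Transition", "state action reward next_state done")


class ReplayBuffer:
    """Fixed-size buffer that returns uniformly sampled mini-batches."""

    def __init__(self, capacity):
        # A bounded deque evicts the oldest transition once it is full.
        self.memory = deque(maxlen=capacity)

    def push(self, state, action, reward, next_state, done):
        self.memory.append(Transition(state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform sampling breaks the correlation between consecutive steps.
        return random.sample(self.memory, batch_size)

    def __len__(self):
        return len(self.memory)


def double_dqn_target(reward, done, q_online_next, q_target_next, gamma=0.99):
    """Double DQN target for a single transition.

    The online network selects the next action and the target network
    evaluates it, which is what reduces the overestimation bias.
    """
    if done:
        return reward
    best_action = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    return reward + gamma * q_target_next[best_action]


def dqn_target(reward, done, q_target_next, gamma=0.99):
    """Standard DQN target, shown only for comparison."""
    if done:
        return reward
    # The same network both selects and evaluates the action.
    return reward + gamma * max(q_target_next)
```

With next-state Q-values of `[0.2, 0.9]` from the online network and `[0.5, 0.1]` from the target network, the standard target bootstraps from 0.5 (the target network's own maximum), while the Double DQN target bootstraps from 0.1 (the target network's value for the action the online network prefers).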

Notebook Architecture

1. Pod Model Description: Details the modeling of jobs and pods within the Kubernetes environment.
2. Define Imports: Contains the imports and dependencies required for the project.
3. Cluster Simulation: Simulates the Kubernetes cluster environment used to train the DRL agent. It includes:

3.1 Pod class

3.2 Node class

3.3 Cluster class

3.4 Reward Functions: Defines the reward functions used to guide the agent's learning process.
4. Agent Class: Implements the DDQN agent, encapsulating the core logic of the learning algorithm.
5. Experience Replay: Implements the replay buffer that stores past experiences for reuse during training.
6. Environment Class: Defines the environment that the DDQN agent interacts with, simulating the Kubernetes cluster's state and transitions.
7. Main Class: Orchestrates the training process, integrating all components and executing the training loop.
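To make the cluster simulation part of this outline more concrete, here is a minimal sketch of how Pod, Node, and Cluster classes and a reward function could fit together. Only the class names come from the outline above. The attributes (`memory_mb`, `capacity_mb`), the methods, and the reward definition (negative standard deviation of per-node memory utilization, with a fixed penalty for a failed placement) are assumptions and may differ from what the notebook does.

```python
from dataclasses import dataclass, field
from statistics import pstdev


@dataclass
class Pod:
    name: str
    memory_mb: int  # memory requested by the pod (assumed unit)


@dataclass
class Node:
    name: str
    capacity_mb: int
    pods: list = field(default_factory=list)

    @property
    def used_mb(self):
        return sum(pod.memory_mb for pod in self.pods)

    @property
    def utilization(self):
        # Fraction of the node's memory currently in use, in [0, 1].
        return self.used_mb / self.capacity_mb

    def can_fit(self, pod):
        return self.used_mb + pod.memory_mb <= self.capacity_mb


class Cluster:
    def __init__(self, nodes):
        self.nodes = list(nodes)

    def state(self):
        """Observation: memory utilization of every node."""
        return [node.utilization for node in self.nodes]

    def place(self, pod, node_index):
        """Action: schedule the pod on a node. Returns False if it does not fit."""
        node = self.nodes[node_index]
        if not node.can_fit(pod):
            return False
        node.pods.append(pod)
        return True


def balance_reward(cluster, placed, penalty=-1.0):
    """Assumed reward: a smaller spread of utilization gives a higher reward."""
    if not placed:
        return penalty
    return -pstdev(cluster.state())
```

In this sketch, an agent's action is the index of the node that receives the next pod, and the reward approaches zero as memory usage evens out across nodes.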

Prerequisites

Python 3.10.14
Jupyter Notebook
TensorFlow 2.10.1
NumPy 1.26.4
Kubernetes cluster (simulated or real) for testing and deployment
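One way to confirm that an environment matches the versions listed above is a short check with the standard library's `importlib.metadata`. This is only a convenience sketch: the `EXPECTED` mapping and the `check_versions` helper are hypothetical names, and the distribution names `tensorflow` and `numpy` are assumed to be the ones installed.

```python
from importlib import metadata

# Versions taken from the prerequisites list; distribution names are assumed.
EXPECTED = {"tensorflow": "2.10.1", "numpy": "1.26.4"}


def check_versions(expected):
    """Return {package: (installed_or_None, wanted)} for every mismatch."""
    mismatches = {}
    for package, wanted in expected.items():
        try:
            installed = metadata.version(package)
        except metadata.PackageNotFoundError:
            installed = None  # the package is not installed at all
        if installed != wanted:
            mismatches[package] = (installed, wanted)
    return mismatches


if __name__ == "__main__":
    for package, (installed, wanted) in check_versions(EXPECTED).items():
        print(f"{package}: found {installed}, expected {wanted}")
```

An empty result means the installed packages match the listed versions exactly.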

Kubernetes Simulation

For information about the Kubernetes simulation, see the separate "PFA" repository.

Repository Files

PFA @ 7f1a0fb
.gitmodules
README.md
simulation_DQN.ipynb

GHARBIyasmine/PFA-DDQN_Experience_Replay_Kubernetes_MemoryUsageOptimization

Folders and files

Latest commit

History

Repository files navigation

PFA-DDQN_Experience_Replay

Key Features

Notebook Architecture

Prerequisites

Kubernetes Simulation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages