Skip to content

Latest commit

 

History

History
165 lines (149 loc) · 7.28 KB

miner.md

File metadata and controls

165 lines (149 loc) · 7.28 KB

Setup for miner

Make sure that you have a registered hotkey to Subnet 23. If you haven't done so, please refer to https://docs.bittensor.com/subnets/register-validate-mine

Incentive Distribution

Category Incentive Distribution Timeout (s)
RealitiesEdgeXL 9% 12
AnimeV3 9% 12
JuggernautXL 7% 12
GoJourney 4% 12
Llama3_70b 0% 128
Llama3_3_70b 7% 128
Gemma7b 3% 64
StickerMaker 3% 64
SUPIR 8% 180
FluxSchnell 12% 24
Kolors 10% 32
OpenGeneral 8% 32
OpenDigitalArt 8% 32
OpenTraditionalArt 8% 32
Pixtral_12b 4% 64

Guide Fixed Category

  1. Git clone and install requirements
apt-get update && apt-get install ffmpeg libsm6 libxext6  -y
git clone https://github.com/NicheTensor/NicheImage
cd NicheImage
python -m venv main_env
source main_env/bin/activate
pip install -e .
pip uninstall uvloop -y
git submodule update --init --recursive
. generation_models/custom_pipelines/scripts/download_antelopev2.sh
. generation_models/custom_pipelines/scripts/setup_supir.sh
  1. Select miner based on incentive distribution and subnet state at https://nicheimage.streamlit.app/
  2. Setup prequisites if needed
  • For StickerMaker & FaceToMany, find the guide here
  • For Gemma7b, Llama3_70b, Llama3_3_70b, Pixtral_12B, find the guide here
  • For GoJourney, register here and get the GOJOURNEY_API_KEY
  1. Start the endpoint

For Image Generation Category

  • Important notes
    • For the DallE, GoJourney model, you need to set --num_gpus 0 and --num_replicas equal to your max_concurrent_requests to allow the miner to handle multiple requests concurrently.
source main_env/bin/activate
pip install xformers==0.0.28.dev863 # run if you selected SUPIR model.
export GOJOURNEY_API_KEY=<your-gojourney-api-key> # set if you use GoJourney model.
export OPENAI_API_KEY=<your-openai-api-key> # set if you use DallE model.
export RAY_SERVE_QUEUE_LENGTH_RESPONSE_DEADLINE_S=1.0
export PROCESS_MODE=<your-task-type> # set if you use GoJourney model
pm2 start python --name "miner_endpoint" -- -m services.miner_endpoint.app \
--model_name <selected-model-name> \
--num_replicas X --num_gpus Y \ # num_gpus * num_replicas = your_total_gpus_count
--port 10006 # default port

For Text Generation Category

source main_env/bin/activate
HF_TOKEN=<your-huggingface-token> \
pm2 start python --name "miner_endpoint" -- -m services.miner_endpoint.text_app --model_name <selected-model-name> --num_replicas X --num_gpus Y \
--port 10006 # default port

For Multimodal Generation Category

source main_env/bin/activate
pm2 start python --name "miner_endpoint" -- -m services.miner_endpoint.multimodal_app --model_name <selected-model-name> \
--port 10006 # default port
  1. Start miner
  • Basic Guide
pm2 start python --name "miner" \
-- \
-m neurons.miner.miner \
--netuid 23 \
--wallet.name <wallet_name> --wallet.hotkey <wallet_hotkey> \
--subtensor.network <network> \ # default is finney
--axon.port <your_public_port> \
--generate_endpoint http://127.0.0.1:10006/generate \ # change if you use different port or host
--info_endpoint http://127.0.0.1:10006/info \ # change if you use different port or host
--miner.total_volume <your-generation-volume> \ # default is 40. Change based on your model timeout value and GPU capacity
--miner.max_concurrent_requests <your-max-concurrent-requests> # default is 4. Change based on your model timeout value and GPU capacity

Guide Open Category

  1. Git clone and install requirements
git clone https://github.com/NicheTensor/NicheImage
cd NicheImage
python -m venv main_env
source main_env/bin/activate
pip install -e .
pip uninstall uvloop -y
git submodule update --init --recursive
. generation_models/custom_pipelines/scripts/download_antelopev2.sh
. generation_models/custom_pipelines/scripts/setup_supir.sh
  1. Setup generation endpoint based on provided template
  • We provide a template for the open category. You can find the template here. Basically, this endpoint will receive a request with a payload and return the generated image as base64 string.
  • With provided template, you can run a miner by select diffusion model on huggingface. Example:
source main_env/bin/activate
pm2 start python --name "miner_endpoint" -- \
-m services.miner_endpoint.open_category_app \
--model_name "black-forest-labs/FLUX.1-dev" \
--category OpenCategory \ # default is OpenCategory, change if you use different category
--num_gpus 1 --port 10006 --num_inference_steps 30 --guidance_scale 3.0 # inference params for diffusion model
  1. Start miner
  • Basic Guide
pm2 start python --name "miner" \
-- \
-m neurons.miner.miner \
--netuid 23 \
--wallet.name <wallet_name> --wallet.hotkey <wallet_hotkey> \
--subtensor.network <network> \ # default is finney
--axon.port <your_public_port> \
--generate_endpoint http://localhost:10006/generate \ # change if you use different port or host
--info_endpoint http://localhost:10006/info \ # change if you use different port or host
--miner.total_volume <your-generation-volume> # default is 40. Change based on your model timeout value and GPU capacity

Benchmark Your Setup

Fixed Category

You can benchmark your setup by running the following command:

python tests/benchmark_sdxl.py \
--max_concurrent_requests <your-max-concurrent-requests> \ # should equal to your miner.max_concurrent_requests
--n_times <number-of-iterations> \ # n_times * max_concurrent_requests should be less than or equal to your miner.total_volume
--model_name <selected-model-name>

This script will run the miner with the specified number of concurrent requests and measure the average latency and throughput.

Output

  • Console Print:
    • report (dict): A dictionary with keys are status_code. Values are list of latencies for each request. Example: {200: [0.1, 0.2, 0.3], 408: [12, 12, 12]}
  • Plot latency histogram:
    • x-axis: latency in seconds
    • y-axis: number of requests

Example Plot

  • [with ControlNet] RealitiesEdgeXL model with 3 concurrent requests and 100 iterations w ControlNet Latency Histogram
  • [without ControlNet] RealitiesEdgeXL model with 3 concurrent requests and 100 iterations w/o ControlNet Latency Histogram

Open Category

You can benchmark your setup by running the following command. Remember to spin up the generation endpoint before running this script.

python tests/benchmark_open_category_distributed.py