A high-throughput and memory-efficient inference and serving engine for LLMs
Large-scale LLM inference engine
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
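A minimal sketch of launching a job through SkyPilot's Python API, assuming `pip install skypilot` and configured cloud credentials; the accelerator string "tpu-v2-8" and the cluster name are illustrative.

```python
# Sketch: provision a cluster and run a task with SkyPilot (illustrative values).
import sky

task = sky.Task(
    name="hello-tpu",
    setup="pip install torch",  # runs once when the cluster is provisioned
    run="python -c 'print(\"hello from the cloud\")'",
)
# Accelerator string is an assumption; adjust to what your account/region offers.
task.set_resources(sky.Resources(accelerators="tpu-v2-8"))

# Provision (or reuse) a cluster and run the task on it.
sky.launch(task, cluster_name="demo-cluster")
```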
JetStream is a throughput- and memory-optimized engine for LLM inference on XLA devices, starting with TPUs (GPUs in the future; PRs welcome).
A native Mac App for Troplo's TPU made with SwiftUI
PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference
DECIMER Image Transformer is a deep-learning-based tool designed for automated recognition of chemical structure images. Leveraging transformer architectures, the model converts chemical images into SMILES strings, enabling the digitization of chemical data from scanned documents, literature, and patents.
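A hedged sketch of converting a structure image to a SMILES string with DECIMER, assuming `pip install decimer` and that the package exposes a `predict_SMILES` function taking an image path (as shown in the project README); the input filename is hypothetical.

```python
# Sketch: chemical structure image -> SMILES string with DECIMER (assumed API).
from DECIMER import predict_SMILES

smiles = predict_SMILES("caffeine_structure.png")  # hypothetical input image
print(smiles)  # e.g. "CN1C=NC2=C1C(=O)N(C(=O)N2C)C" for caffeine
```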
cBLUE is a tool to calculate the total propagated uncertainty of bathymetric lidar data.
Pre-training GPT-2 model from scratch using GPUs and TPUs.
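A minimal single-device PyTorch/XLA sketch of one GPT-2 pre-training step on a TPU, assuming `torch`, `torch_xla`, and `transformers` are installed on a TPU VM; the random token batch stands in for a real tokenized corpus.

```python
# Sketch: one GPT-2 training step on a single TPU core via PyTorch/XLA.
import torch
import torch_xla.core.xla_model as xm
from transformers import GPT2Config, GPT2LMHeadModel

device = xm.xla_device()  # the TPU core visible to this process
model = GPT2LMHeadModel(GPT2Config()).to(device)
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-4)

# Dummy batch: 8 sequences of 128 token ids (replace with real data loading).
input_ids = torch.randint(0, 50257, (8, 128), device=device)

outputs = model(input_ids=input_ids, labels=input_ids)  # LM loss via shifted labels
outputs.loss.backward()
xm.optimizer_step(optimizer, barrier=True)  # step optimizer and flush the XLA graph
```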
Everything we actually know about the Apple Neural Engine (ANE)
Differentiable Fluid Dynamics Package
Testing framework for deep learning models (TensorFlow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)
TPU accelerated traffic lane segmentation engine for your Raspberry Pi
A simple and efficient TPU (Transaction Processing Unit) client for Solana, utilizing the QUIC protocol for data transmission.
Solana TpuClient TypeScript implementation
Everything you want to know about Google Cloud TPU
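A quick, hedged check that a Cloud TPU VM's chips are visible from JAX, assuming `jax[tpu]` is installed on the TPU VM; device counts vary by TPU type.

```python
# Sketch: verify TPU visibility from JAX on a Cloud TPU VM.
import jax

print(jax.devices())       # e.g. a list of TpuDevice objects
print(jax.device_count())  # number of TPU cores visible to this host
```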