intel / llm-on-ray Public

Notifications You must be signed in to change notification settings
Fork 30
Star 103

Code
Issues 30
Pull requests 14
Actions
Projects
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Wiki
Security
Insights

Issues: intel/llm-on-ray

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

30 Open 52 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

Inference Mixtral on Gaudi

#249 opened Jun 12, 2024 by Deegue

Finetuning on Ray and CPU causes Runtime error

#242 opened May 31, 2024 by premdass

A question about finetune dataset processing

#234 opened May 22, 2024 by KepingYan

Revise README.md in examples directory

#224 opened May 15, 2024 by xwu99

expected scalar type BFloat16 but found Float

#223 opened May 14, 2024 by darmenliu

Calculate correct input length for every prompt in a single batch

#222 opened May 14, 2024 by kira-lin

Output some debug info in CI when Internal Server Error

#218 opened May 13, 2024 by xwu99

Define simple_protocol.py and define pydantic SimpleRequest and SimpleModelResponse classes to encapsulate current json format

#217 opened May 13, 2024 by xwu99

Issue about using ipex on cpu

#197 opened Apr 19, 2024 by KepingYan

Openai API not allow temperature=0.0 for llama-2-7b-chat-hf

#139 opened Mar 12, 2024 by yutianchen666

Add ipex extra in pyproject.toml to use restricted transformers version

#127 opened Feb 29, 2024 by jiafuzha

Enable mllm in CI

#126 opened Feb 28, 2024 by carsonwang

Support functions/tools in OpenAI API enhancement

New feature or request

#121 opened Feb 23, 2024 by carsonwang

Support and validate model Mixtral-8x7B enhancement

New feature or request

#119 opened Feb 23, 2024 by carsonwang

[Quantization] Support loading AWQ, GPTQ, GGUF/GGML quantized models

#85 opened Jan 26, 2024 by xwu99

[Benchmark] Load config from yaml and output results with multiple formats

#82 opened Jan 24, 2024 by xwu99

[Serving] Example to chat from command line

#74 opened Jan 22, 2024 by carsonwang

Getting dependencies issues while installing on CPU

#67 opened Jan 18, 2024 by nkanike07

Getting error while executing query_openai_sdk.py to test the inference

#66 opened Jan 18, 2024 by dkiran1

Not able to run inference server for mistral 7b model, mpt-7b model on Ray

#65 opened Jan 18, 2024 by dkiran1

Gettign error while running start_ui.py

#64 opened Jan 18, 2024 by dkiran1

Add type hints to core interfaces

#55 opened Jan 12, 2024 by xwu99

Model evaluation for finetuning and quantization.

#54 opened Jan 11, 2024 by xwu99

[Doc] Document serve simple protocol of request/response

#53 opened Jan 11, 2024 by xwu99

[Serving] Add a table of models and corresponding supported parameters

#51 opened Jan 11, 2024 by KepingYan

Previous 1 2 Next

Previous Next

ProTip! Mix and match filters to narrow down what you’re looking for.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly