This repository has been archived by the owner on Dec 6, 2024. It is now read-only.
Issues: QwenLM/qwen.cpp
Crash when compiling in debug mode; everything works in release mode
#75
opened Jan 31, 2024 by
feixyz10
How can a model quantized with qwen.cpp be benchmarked with optimum-benchmark? Following the README only produces a build folder, and it is unclear how to proceed with testing
#72
opened Jan 18, 2024 by
suyu-zhang
[BUG] Qwen-1.8-Chat quantized to f16 with llama.cpp gives garbled answers at inference — is Qwen-1.8 not yet supported in llama.cpp?
#69
opened Dec 26, 2023 by
Lyzin
💡 [Question] - How can a BaseStreamer be constructed from a std::string in qwen-cpp? The C++ code is missing this constructor
Label: question
#62
opened Dec 18, 2023 by
micronetboy
How can a BaseStreamer be constructed from a std::string in qwen-cpp? The C++ code is missing this constructor
#61
opened Dec 18, 2023 by
micronetboy
💡 [Question] - With qwen-cpp on CPU only, how much faster is it with CPU BLAS acceleration enabled versus disabled (no GPU in either case)? My tests show no difference
Label: question
#63
opened Dec 15, 2023 by
micronetboy
💡 [Question] - How can the QwenCPP Python binding enable BLAS CPU acceleration?
Label: question
#64
opened Dec 15, 2023 by
micronetboy
💡 [REQUEST] - How can CPU-only qwen-cpp be wrapped as an HTTP service?
Label: question
#65
opened Dec 14, 2023 by
micronetboy
CUDA error 2 at /home/qwen.cpp/third_party/ggml/src/ggml-cuda.cu:7196: out of memory
#55
opened Dec 8, 2023 by
youngallien