🔥 Playground

Prompt Engineering aims to carefully curate input prompts that can extract the best possible results from Large language models(LLMs).

🌀 As a prominent example of LLMs, ChatGPT has received widespread attention and skyrocketed in popularity. Nonetheless, in recent years, a significant number of LLMs have emerged, typically several tens of gigabytes in size and trained on massive amounts of textual data. Therefore, there are several alternatives available that we can use to practice prompt techniques using these models.

📜 Table of Contents

TrustGPT From EgoAlpha
Directly Usage of LLMs
Providing the Pre-train Weights
Without Opensource Till Now
LLMs in Coding
Dataset in LLMs area
LLMs from China

TrustGPT

🌟 TrustGPT can also serve as a playground for everyone's convenience to learn and practice advanced prompt techniques. You can also commit your issues from TrustGPT to this repo page. Thanks a lot.

We will gradually release the following features:

Prompt example
Question answering over your own document
Autonomous agent
Access to various LLMs

As resources are limited, we suggest using this playground for learning and practicing prompt techniques rather than for work. This will help more people access prompt engineering.

Directly Usage of LLMs

🤩 These models in the table below are directly accessible via links, The page contains the usage guide and API interface of the model for the convenience of all developers and researchers to explore and experience. The Checkpoints can also obtained by corresponding links.

Model	Type	Lab	Playgrounds	Params(B)	Blog/Paper/Github	Checkpoints	Announced Time
Gemma	Decoder	Google	🔗	2,7	Github	Gemma-2B/Gemma-7B	Feb-24
Yi series	Decoder	01.Ai	🔗	6,34	Github	Yi-34B/Yi-6B	Nov-23
InternLM	Decoder	Shanghai Artificial Intelligence Laboratory	🔗	20	Github	InternLM-20B	Aug-23
Mistral 7B	Decoder		🔗	7	Paper/Blog	Mistral-7B-v0.1	Oct-23
Llama-2	Decoder	Meta	🔗	7,13,70	Github/Paper/Blog	Llama-7B, Llama-13B, Llama-70B	Jul-23
TigerBot	Decoder	-	🔗	70	Github	TigerBot-70B	Jun-23
Falcon	Decoder	TII	🔗	1,7,40	Blog	Falcon-40B-instruct, Falcon-7B-instruct,Falcon-RW-1B,Falcon-RW-7B	May-23
GPT-J-6B	Decoder	EleutherAI	🔗	6	Blog	GPT-J-6B, GPT4All-J	May-23
DLite	Decoder	EleutherAI	🔗	0.124-1.5	Blog	dlite-v2-1_5b	May-23
OpenLLaMA	Decoder	H2O.AI	🔗	3,7	Github	OpenLLaMA-7b-preview-300bt	May-23
RedPajama-INCITE	Decoder	Together	🔗	3-7	Blog	RedPajama-INCITE	May-23
MPT-7B	Decoder	mosaic	🔗	7	Blog	MPT-7B, MPT-7B-Instruct	May-23
h2oGPT	Decoder	EleutherAI	🔗	12-20	Blog	h2oGPT	May-23
Dolly	Decoder	EleutherAI	🔗	3,7,12	Blog/Github	dolly-v2-12b	Apr-23
Pythia	Decoder	EleutherAI	🔗	0.07-12	Paper/Github	pythia 70M - 12B	Apr-23
FastChat-T5	Decoder	EleutherAI	🔗	3	Blog	fastchat-t5-3b-v1.0	Apr-23
StableLM-Alpha	Decoder	EleutherAI	🔗	3-65	Github	StableLM-Alpha	Apr-23
oasst-sft-6-llama-30b	Decoder	HuggingFace	🔗	30	Github	-	Apr-23
Cerebras-GPT	Decoder	HuggingFace	🔗	0.111-13	Paper	Cerebras-GPT: A Family of Open, Compute-efficient, Large Language Models	Mar-23
OpenAssistant(Pythia family)	Decoder	LAION AI	🔗	12	Paper/Github	OA-Pythia-12B-SFT-8, OA-Pythia-12B-SFT-4, OA-Pythia-12B-SFT-1	Apr-23
GPT-4	Decoder	OpenAI	🔗	20	Paper	-	Mar-23
OpenChatKit	Decoder	Together	🔗	20	Github	-	Mar-23
Alpaca	Decoder	Stanford	🔗	7	Github	-	Mar-23
ChatGPT	Decoder	OpenAI	🔗	175	Paper	-	Nov-22
GPT-JT	Decoder	Together	🔗	6	Github	-	Nov-22
Flan-T5	Encoder-Decoder	Google Research	🔗	11	Paper/Github	Flan-T5	Oct-22
Flan-UL2	Encoder-Decoder	Google Research	🔗	20	Paper/Github	Flan-UL2	Oct-22
CodeGeeX	Decoder	Tsinghua	🔗	13	Github	CodeGeeX register path	Sep-22
GLM-130B	Encoder-Decoder	Tsinghua & Zhipu	🔗	130	Paper/Github	-	Aug-22
BLOOM(tr11-176B-ml)	Decoder	BigScience	🔗	176	Github	BLOOM	Jul-22
PaLM	Decoder	Google Research	🔗	540	Paper	-	Apr-22
GPT-NeoX-20B	Decoder	EleutherAI	🔗	20	Paper	GPT-NEOX-20B	Apr-22
CodeT5	Encoder-Decoder	Salesforce Research Asia	🔗	small:0.06,base:0.22	Paper	-	Mar-22
ERNIE3.0	Encoder-Decoder	Baidu	🔗	10	Paper	-	Dec-21
CodeX	Decoder	OpenAI	🔗	12	Paper	-	Aug-21
RWKV	Decoder	OpenAI	🔗	0.1-14	Github	RWKV, ChatRWKV	Aug-21
GPT-3	Decoder	OpenAI	🔗	175	Paper	-	May-20
T5	Encoder-Decoder	Google	🔗	11	Paper	T5	Oct-19
RoBERTa	Encoder	MetaAI	🔗	0.355	Paper	roberta-series	Jul-19
GPT-2	Decoder	OpenAI	🔗	1.5	Paper	GPT_2 Series	Feb-19
BERT	Encoder	Google	🔗	0.3	Paper	BERT Series	Oct-18
GPT-1	Decoder	OpenAI	🔗	0.117	Paper	GPT_1_seriers	Jun-18

Providing the Pre-train weights

🤨 The models in the table below all provide pre-trained weights on which developers can fine-tune (without changing the original backbone architecture), and people can visually see the work of a good team of researchers by using the pre-trained weights of the models directly for a good Demo.

Model	Type	Lab	Github	Params(B)	Paper/Code	Announced Time
Gorilla-OpenFunctions series	Decoder	Gorilla LLM	🔗	-	Paper/Github	-
LLaMA-65B	Decoder	MetaAI	🔗	65	Paper/Code	Feb-23
OPT-IML	Decoder	MetaAI	🔗	175	Paper/-	Dec-22
ERNIE-Code	Encoder-Decoder	Baidu	🔗	0.56	Paper/-	Dec-22
Galactica	Decoder	MetaAI	🔗	120	Paper/-	Nov-22
mT0	Encoder-Decoder	BigScience	🔗	13	Paper/-	Nov-22
BLOOMZ	Decoder	BigScience	🔗	176	Paper/-	Nov-22
Atlas	Encoder-Decoder	MetaAI	🔗	11	Paper/-	Aug-22
OPT-175B	Decoder	MetaAI	🔗	175	Paper/-	May-22
RETRO	Encoder-Decoder	DeepMind	🔗	7.5	Paper/-	Dec-21
FLAN	Encoder-Decoder	Google	🔗	137	Paper/-	Sep-21

Without Opensource Till Now

😣 The following table show that the related models and codes are not open-source till now.

Model	Type	Lab	Report	Params(B)	Paper/Code	Announced Time
Med-PaLM	Encoder	Google & DeepMind	🔗	540	Paper/-	Dec-22
GLaM	Encoder	Google Inc	🔗	1200	Paper/-	Dec-22
RL-CAI	Encoder	Anthropic	🔗	52	Paper/-	Dec-22
Sparrow	Decoder	DeepMind	🔗	70	Paper/-	Sep-22
PaLI	Encoder-Decoder	Google	🔗	17	Paper/-	Sep-22
Gato(Cat)	Encoder-Decoder	DeepMind	🔗	1	Paper/-	May-22
Chinchilla	Encoder	DeepMind	🔗	70	Paper/-	Mar-22
Gopher	Encoder	DeepMind	🔗	280	Paper/-	Dec-21
LaMDA	Decoder	GoogleAI	🔗	137	Paper/-	Jun-21

LLMs in Coding

🎭 The following table shows the LLMs for Coding.

Model	Checkpoints	Paper/Blog	Params (B)	Announced Time
StarCoder	starcoder	Blog	15	May-23
StarChat Alpha	starchat-alpha	Blog	16	May-23
Replit Code	replit-code-v1-3b	Blog	2.7	May-23
CodeT5+	CodeT5+	Paper	0.22 - 16	May-23
CodeGen2	codegen2 1B-16B	Paper	1 - 16	Apr-23
SantaCoder	santacoder	Paper	1.1	Jan-23

Dataset in LLMs Area

📈 The following table shows the Dataset of the LLM area, with instruction-tunning and alignment-tuning.

Dataset	Paper/Blog	Dataset	Samples (K)	Announced Time	Type
MPT-7B-Instruct	Blog	dolly_hhrlhf	59	May-23	instruction-tuning
databricks-dolly-15k	Blog	databricks-dolly-15k	15	Apr-23	instruction-tuning
OpenAssistant Conversations Dataset	Blog	oasst1	161	Apr-23	alignment-tuning
OIG (Open Instruction Generalist)	Blog	OIG	44,000	Mar-23	instruction-tuning

LLMs from China

🇨🇳 The following table shows the LLMs from China, including the research lab, firms, and some universities.

Note： The part of contents of the list are from here, and we have made appropriate modifications and supplements, hereby noted.

Source	Model & Link	Description
复旦大学	MOSS	Playground
贝壳	BELLE	基于BLOOMZ或LLaMA系列的多个模型
哈尔滨工业大学	本草	医学；基于LLaMA；另有基于 ChatGLM 的Med-ChatGLM
云知声	山海	通用大模型
百度	文心一言	申请账号
科大讯飞	星火	申请账号
清华大学	ChatGLM,NowcastNet	开源6B，ChatGLM2-6B, 智谱AI,气象,临近预报大模型
华为	盘古,盘古气象,盘古-Σ	华为+鹏城,华为云盘古
达观数据	曹植	试用需账号
阿里云	通义千问	试用需账号
浙江大学	启真,PromptProtein	医学大模型提供基于LLaMA-7B、CaMA-13B和ChatGLM-6B 三个版本,用于PromptProtein的模型
百川智能	baichuan-7B,Baichuan-13B	模型下载：Baichuan-13B-Base,Baichuan-13B-Chat,Baichuan-7B,开源可商用
上海人工智能实验室	书生·浦语, OpenMEDLab浦医	技术报告,开源的InternLM-7B,HuggingFace下载模型权重
OpenBMB	CPM,CPM-Bee	面壁智能,CPM-Bee-10B
港中文深圳	华佗，凤凰	香港中文大学（深圳）和深圳市大数据研究院，医学,Demo,华佗和凤凰都基于BLOOMZ
中国科学院自动化研究所	紫东·太初	紫东太初2.0号称100B参数，全模态
虎博科技	TigerBot	基于BLOOM
东北大学	TechGPT,PICA	TechGPT->BELLE->LLaMA，图谱构建和阅读理解问答;PICA->ChatGLM2-6B情感大模型
上海交通大学	K2,白玉兰	Demo，GeoLLaMA，基于LLaMA，HuggingFace
IDEA研究院	封神榜MindBot	姜子牙系列模型
智源人工智能研究院	悟道·天鹰,悟道·EMU	悟道3.0,视界视觉，AQUILA天鹰座，Aquila-7B,AquilaChat-7B,AquilaCode-7B-NV,AquilaCode-7B-TS,HuggingFace,EMU基于LLaMA
度小满	轩辕	基于BLOOM
23	360	智脑,一见
艾写科技	Anima	基于Guanaco->基于LLaMA，使用QLoRA
西湖心辰	西湖	通用大模型
晓多科技+国家超算成都中心	晓模型XPT	试用需要账号，位置
稀宇科技	MiniMax	GLOW虚拟社交
北京语言大学	桃李	基于LLaMA,北语+清华+东北、北京交大
商汤科技	SenseNova日日新	商汤科技版ChatGPT
国家超级计算天津中心	天河天元	目前官网查询不到
星环科技	无涯、求索	无涯——金融；求索——大数据分析
慧言科技+天津大学	海河·谛听	-
恒生电子	LightGPT	-
电信智科	星河	通用视觉，中国电信
左手医生	左医GPT	医疗，试用需Key
智慧眼	砭石	医疗领域
好未来	MathGPT	学而思
数慧时空	长城	自然资源，遥感
理想科技	大道Dao	运维大模型
硅基智能	炎帝	旅游行业大模型
中工互联	智工	与复旦NLP实验室联合，工业领域
创业黑马	天启	创业黑马与360合作,科创服务行业
追一科技	博文Bowen	-
上海科技大学	DoctorGLM	医学大模型，论文
华东师范大学	EmoGPT,EduChat	EmoGPT是上海市心理健康与危机干预重点实验室与镜象科技公司合作完成, 教学教育大模型EduChat基于BELLE（BELLE基于LLaMA）
昆仑万维	天工	与奇点智源联合研发
智媒开源研究院	智媒	基于LLaMA，面向自媒体
医疗算网	Uni-talk	上海联通+华山医院+上海超算中心+华为
蚂蚁集团	贞仪	据传语言和多模态两个
香港科技大学	罗宾Robin	基于LLaMA,港科大开源LMFlow
腾讯	混元	-
拓尔思	拓天	中文通用大模型
乐言科技	乐言	TRSGPT
清博智能	先问	基于结构化数据
智子引擎	元乘象	手机号快速登录，使用方便
拓世科技	拓世	数万亿参数量，通用领域
循环智能	盘古	循环智能,清华大学,华为
印象笔记	大象GPT	AGI智能化产品
第四范式	式说	以生成式AI重构企业软件（AI-Generated Software），提升企业软件的体验和开发效率。
字节跳动	Grace	内部代号
出门问问	序列猴子	AI写作助理大模型
数说故事	SocialGPT	聚焦社交对话大模型
云从科技	从容	通用大模型
浪潮信息	源	论文支撑——源
中国农业银行	小数ChatABC	金融行业大模型
麒麟合盛	天燕AiLMe	需要账号登录，登录位置
台智云	福尔摩斯FFM	华硕子公司
医联科技	medGPT	国内首款AI医生
理想汽车	MindGPT	-
深思考人工智能	Dongni	登录需要账号
长虹	长虹超脑	-
孩子王	KidsGPT	-
中科闻歌	雅意	媒体、金融、宣传等领域的大模型应用
中国联通	鸿湖	-
思必驰	DFM-2	通用大模型
中科创达	魔方Rubik	-
电科太极	小可	党政企行业应用
中国移动	九天	-
中国电信	TeleChat	-
容联云	赤兔	客服，营销
云天励飞	天书	-
维智科技	CityGPT	城市大模型
澜舟科技	孟子	自研大规模预训练语言模型
京东	言犀	面向不同过产业大模型
智臻智能	华藏	小i机器人
新华三H3C	百业灵犀	-
鹏城实验室	鹏城·脑海	Peng Cheng Mind
宇视科技	梧桐	AIoT行业
网易有道	子曰	-
美亚柏科	天擎	公共安全
赛灵力科技	达尔文	赛灵力,清华珠三角研究院,赛业生物,大湾区科技创新服务中心
佳都科技	佳都知行	交通领域
知乎	知海图	知乎和面壁科技合作
实在智能	塔斯	TARS
网易伏羲	玉言	-
北京大学信息工程学院	ChatLaw	ChatLaw-13B基于Ziya-LLaMA-13B-v1->LLaMA,ChatLaw-33B基于Anima33B->Guanaco->LLaMA
华南理工大学	扁鹊,灵心SoulChat	医疗大模型
中国科学院计算技术研究所	百聆	基于 LLaMA，权重Diff下载7B和13B,demo
沪渝人工智能研究院	兆言	也称：上海交通大学重庆人工智能研究院
企查查	知彼阿尔法	-
超对称技术公司	乾元	BBT-1-1B金融模型，BBT-2-12B-TF金融模型，BBT-2-12B-TC代码模型，BBT-2-12B-Image文生图模型，BBT-2-12B-Science科学论文模型，BBT-2.5-13B-Text中英双语基础模型
清睿智能	ArynGPT	英语智能对话口语老师
微盟	WAI	-
蜜度	文修	智能校对
中国电子云	星智	政务大模型
西北工业大学+华为	秦岭·翱翔	流体力学大模型,湍流+流场
奇点智源	Singularity OpenAPI	瑶光和天枢
联汇科技	欧姆	OmModel欧姆多模态（视觉语言）大模型
阅文集团	网文大模型	国内首个网文行业大模型
北京交通大学	TransGPT	国内首个综合交通领域的大模型

Please keep adding relevant information, we greatly appreciate your contributions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Playground.md

Playground.md

🔥 Playground

📜 Table of Contents

TrustGPT

Directly Usage of LLMs

🤩 These models in the table below are directly accessible via links, The page contains the usage guide and API interface of the model for the convenience of all developers and researchers to explore and experience. The Checkpoints can also obtained by corresponding links.

Providing the Pre-train weights

🤨 The models in the table below all provide pre-trained weights on which developers can fine-tune (without changing the original backbone architecture), and people can visually see the work of a good team of researchers by using the pre-trained weights of the models directly for a good Demo.

Without Opensource Till Now

😣 The following table show that the related models and codes are not open-source till now.

LLMs in Coding

🎭 The following table shows the LLMs for Coding.

Dataset in LLMs Area

📈 The following table shows the Dataset of the LLM area, with instruction-tunning and alignment-tuning.

LLMs from China

🇨🇳 The following table shows the LLMs from China, including the research lab, firms, and some universities.

Please keep adding relevant information, we greatly appreciate your contributions.

Files

Playground.md

Latest commit

History

Playground.md

File metadata and controls

🔥 Playground

📜 Table of Contents

TrustGPT

Directly Usage of LLMs

🤩 These models in the table below are directly accessible via links, The page contains the usage guide and API interface of the model for the convenience of all developers and researchers to explore and experience. The Checkpoints can also obtained by corresponding links.

Providing the Pre-train weights

🤨 The models in the table below all provide pre-trained weights on which developers can fine-tune (without changing the original backbone architecture), and people can visually see the work of a good team of researchers by using the pre-trained weights of the models directly for a good Demo.

Without Opensource Till Now

😣 The following table show that the related models and codes are not open-source till now.

LLMs in Coding

🎭 The following table shows the LLMs for Coding.

Dataset in LLMs Area

📈 The following table shows the Dataset of the LLM area, with instruction-tunning and alignment-tuning.

LLMs from China

🇨🇳 The following table shows the LLMs from China, including the research lab, firms, and some universities.

Please keep adding relevant information, we greatly appreciate your contributions.