openai-o1 · GitHub Topics

#Large Language Models#A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

chain-of-thought Code large-language-models math mcts openai-o1 strawberry reinforcement-learning

6.64 k

2 days ago

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

transformers vllm large-language-models raylib reinforcement-learning-from-human-feedback reinforcement-learning openai-o1 proximal-policy-optimization

Python 6.21 k

13 hours ago

refly-ai / refly

🎨 Refly is an open-source AI-native creation engine. Its intuitive free-form canvas interface combines multi-threaded dialogues, artifacts, AI knowledge base integration, chrome extension clip & sav...

agent artificial-intelligence ai-memory anthropic artifacts content-creation documentation gemini gpt-4 knowledge-base qwen rag workflow blogs Canvas deep-research deepseek-r1 openai-o1 artifact manus

TypeScript 3.51 k

2 days ago

atfortes / Awesome-LLM-Reasoning

#Large Language Models#Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓

language-models reasoning prompt in-context-learning ChatGPT chain-of-thought prompt-engineering cot Awesome Lists gpt mllm multimodal papers gpt-4o openai-o1 strawberry deepseek deepseek-r1

2.96 k

25 days ago

yaotingwangofficial / Awesome-MCoT

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

chain-of-thought cot deepseek-r1 instruction-tuning large-vision-language-model multimodal multimodal-chain-of-thought multimodal-large-language-models openai-o1 reasoning survey mcts

425

7 days ago

jxhuang0508 / Awesome-LLM-Reasoning-OpenAI-o1

#Large Language Models#Awesome LLM papers, news, and projects about learning to reason with LLMs: OpenAI o1, reasoning techniques, chain-of-thought (CoT), large language models, Strawberry

artificial-intelligence ChatGPT large-language-model openai-o1 chain-of-thought Code large-language-models math mcts reinforcement-learning strawberry

6 months ago

tsinghua-fib-lab / SmartAgent

The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".

embodied-ai large-language-model llm-agent multi-modal personalization human-computer-interaction chain-of-thought openai-o1

23 days ago

MaxiDonkey / DelphiGenAI

#Large Language Models#The GenAI API wrapper for Delphi is designed to integrate OpenAI’s latest models (GPT-4o, O1, O3 and GPT-4.5) seamlessly, offering robust features for chat interactions, text generation, vision proces...

ChatGPT delphi gpt openai openai-api openai-o1 assistant batch fine-tuning wrapper

Pascal 21

14 days ago

inngest / vercel-ai-o1-preview-crm-agent

A demo of OpenAI o1 and Next.js, used to automate the import of a user-provided contacts file.

agentic-workflow openai openai-o1 Vercel vercel-ai-sdk Next

TypeScript 21

3 months ago

zcccccz / Awesome-LLM-Implicit-Reasoning

Papers of Implicit Reasoning in LLMs.

Awesome Lists chain-of-thought hidden implicit latent-space llm-inference reasoning deepseek-r1 openai-o1

1 month ago

remember00000 / Awesome-DeepSeek-R1-Resources

#Awesome#Explore DeepSeek R1 🚀: reproduction guides, papers, and insightful tweets and blogs to learn from. 🌟

Awesome Lists deepseek-r1 reasoning deepseek papers chain-of-thought cot gpt language-models openai-o1 prompt-engineering

3 months ago

Joshue2006 / LLM-Reasoner

Make any LLM think like OpenAI o1 and DeepSeek R1

agent agi command-line-interface GitHub large-language-model large-language-models llamacpp llamaindex math openai-o1 prompt-engineering reasoning vllm

20 days ago

chensnathan / LLMo1Wrapper

A Python wrapper that enables large language models (LLMs) to simulate the step-by-step thinking process of OpenAI’s o1 model, providing users with detailed reasoning and comprehensive answers.

agents cot llms openai-o1 reflection

Python 3

7 months ago