#Large Language Model#A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
#Large Language Model#Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓
🎨 Refly is an open-source AI-native creation engine. Its intuitive free-form canvas interface combines multi-threaded dialogues, artifact, AI knowledge base integration, chrome extension clip & save...
#Large Language Model#Awesome LLM papers, news, and projects about learning to reason with LLMs, OpenAI o1, reasoning techniques, chain-of-thought (CoT), Large Language Model, Strawberry
The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".
A demo of OpenAI o1 and Next.js, used to automate the import of a user-provided contacts file.
#Large Language Model#The GenAI API wrapper for Delphi is designed to integrate OpenAI’s latest models (GPT-4o, o1, o3 and GPT-4.5) seamlessly, offering robust features for chat interactions, text generation, vision proces...
#Awesome#Explore DeepSeek R1 🚀: reproduction guides, papers, insightful tweets & blogs to explore and learn. 🌟
A Python wrapper that enables large language models (LLMs) to simulate the step-by-step thinking process of OpenAI’s o1 model, providing users with detailed reasoning and comprehensive answers.
Explore DeepSeek R1 🚀: reproduction guides, papers, insightful tweets & blogs to explore and learn. 🌟
Make any LLM think like OpenAI o1 and DeepSeek R1
Papers of Implicit Reasoning in LLMs.
#Large Language Model#A Challenging Multi-Modal Mathematical Reasoning Benchmark
OpenAI o1 is a new series of reasoning models for solving hard problems. Available starting...
#Large Language Model#A roadmap to reproduce OpenAI o1.
#Large Language Model#OpenAI-powered tool for bulk processing of Google Ads search queries to identify and filter irrelevant keywords