Web Scraping with GPT-4 Vision API and Puppeteer
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
Lightweight GPT-4 Vision processing over the Webcam
Early Alpha Release: Chat with Your Image - Leveraging GPT-4 Vision and Function Calls for AI-Powered Image Analysis and Description
A collection of awesome GPT4 vision use cases
GPT-4 Vision Chrome Extension
GPT-4 Vision Chatbot examples
Create browser automation as if you were teaching a human using GPT-4 Vision.
Conversational AI with GPT-4 Vision, OpenAI Whisper, and TTS
streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL
#大语言模型#OpenAI ChatGPT/GPT-4/GPT-3 SDK Go Client to Interact with the GPT-4/GPT-3 APIs.
#大语言模型#Instruction Tuning with GPT-4
#大语言模型#OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go
DB-GPT WebUI,LLM to vision.
How to stream GPT-4, ChatGPT & GPT-3.5 model responses (gpt-4, gpt-3.5-turbo & text-davinci-003)?
Desktop AI Assistant powered by o1, GPT-4, GPT-4 Vision, Gemini, Claude, Llama 3, Bielik, DALL-E, Langchain, Llama-index, chat, vision, voice control, image generation and analysis, agents, command ex...
使用 GPT-4 自动化您的浏览器
LiveQuery GPT-4: chatbot with GPT-4-powered convos & Google-powered real-time search
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)