foundation-model · GitHub Topics

guardrails-ai / guardrails

#大语言模型#Adding guardrails to large language models.

人工智能 foundation-model gpt-3 大语言模型 openai

Python 4.78 k

6 天前

OpenGVLab / InternGPT

#大语言模型#InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, ...

ChatGPT foundation-model gpt gpt-4 gradio husky image-captioning langchain 大语言模型 multimodal vqa llama vicuna video-generation sam segment-anything click draggan

Python 3.22 k

8 个月前

OpenGVLab / InternImage

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Backbone.js deformable-convolution object-detection semantic-segmentation foundation-model

Python 2.63 k

19 天前

hyp1231 / awesome-llm-powered-agent

#Awesome#Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

Awesome Lists embodied-agent embodied-ai foundation-model foundation-models generative-agents generative-ai generative-model generative-models large-language-model large-language-models llms 大语言模型 ChatGPT gpt-4

1.96 k

17 天前

bowang-lab / scGPT

foundation-model gpt

Jupyter Notebook 1.2 k

13 天前

FoundationVision / GLEE

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

foundation-model object-detection open-world tracking open-vocabulary-detection open-vocabulary-segmentation open-vocabulary-video-segmentation referring-expression-comprehension referring-expression-segmentation video-instance-segmentation video-object-segmentation zero-shot-object-detection referring-video-object-segmentation interactive-segmentation segment-anything

Python 1.12 k

6 个月前

IDEA-Research / Grounding-DINO-1.5-API

Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

grounding-dino object-detection open-set foundation-model open-vocabulary-detection open-world zero-shot-object-detection

Python 930

3 个月前

OpenDriveLab / DriveAGI

[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving & Foundation Models in Autonomous System

foundation-model autonomous-driving embodied-ai policy-learning video-generation world-models

Python 708

3 个月前

OpenGVLab / VideoMAEv2

[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

cvpr2023 foundation-model self-supervised-learning video-understanding action-detection action-recognition temporal-action-detection

Python 617

6 个月前

AILab-CVC / SEED

Official implementation of SEED-LLaMA (ICLR 2024).

foundation-model multimodal vision-language

Python 608

7 个月前

mahmoodlab / UNI

Pathology Foundation Model - Nature Medicine

foundation foundation-model histopathology pathology uni computational-pathology digital-pathology

Jupyter Notebook 441

18 天前

ViTAE-Transformer / Remote-Sensing-RVSA

#计算机科学#The official repo for [TGRS'22] "Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model"

深度学习 foundation-model object-detection remote-sensing self-supervised-learning semantic-segmentation transfer-learning foundation-models PyTorch vision-transformer

Python 440

2 个月前

Clay-foundation / model

The Clay Foundation Model - An open source AI model and interface for Earth

earth-observation foundation-model embeddings

Python 431

5 天前

cambridgeltl / visual-med-alpaca

Visual Med-Alpaca is an open-source, multi-modal foundation model designed specifically for the biomedical domain, built on the LLaMa-7B.

biomedical biomedical-image-processing foundation-model large-language-models multimodal

Python 383

1 年前

OpenDriveLab / OpenScene

3D Occupancy Prediction Benchmark in Autonomous Driving

autonomous-driving foundation-model

Python 351

1 年前

westlake-repl / Recommendation-Systems-without-Explicit-ID-Features-A-Literature-Review

#大语言模型#Paper List of Pre-trained Foundation Recommender Models

ChatGPT foundation-model 大语言模型 multimodal pre-training recommender-system transfer-learning chatgpt3 language-model multimodal-deep-learning recommendation-system large-language-model

347

8 个月前

mahmoodlab / CONCH

Vision-Language Pathology Foundation Model - Nature Medicine

foundation-model histopathology Medical imaging 自然语言处理 pathology computational-pathology digital-pathology

Python 342

18 天前

spotify-research / llark

Code for the paper "LLark: A Multimodal Instruction-Following Language Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, and Rachel Bittner.

foundation-model multimodal music-information-retrieval

Jupyter Notebook 341

10 个月前