A framework for few-shot evaluation of language models.
Evaluation of Deep Learning Frameworks
#大语言模型#A unified evaluation framework for large language models
SLAM performance evaluation framework
A framework for the evaluation of autoregressive code generation language models.
PROJECT DELTA: SDN SECURITY EVALUATION FRAMEWORK
ParlAI 是一个用于训练和对话人工智能研究的 Python 框架
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Benchmark Framework for fair evaluation of rPPG
Python Framework for Saliency Modeling and Evaluation
#计算机科学#🐢 Open-Source Evaluation & Testing for ML & LLM systems
Well tested & Multi-language evaluation framework for text summarization.
A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)
GSM Assessment Toolkit - A security evaluation framework for GSM networks
Evaluation Framework for DAVIS 2017 Semi-supervised and Unsupervised used in the DAVIS Challenges
An evaluation framework for machine learning models simulating high-throughput materials discovery.
Arbitrary expression evaluation for golang
翻译 - golang的任意表达式求值
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image mod...