contextual-bandits · GitHub Topics

#计算机科学#Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learni...

翻译 - Vowpal Wabbit是一个机器学习系统，它通过在线，哈希，减少，归约，learning2search，主动和交互式学习等技术来推动机器学习的前沿。

C++机器学习 online-learning contextual-bandits reinforcement-learning active-learning learning-to-search

C++ 8.55 k

6 个月前

tensorflow / agents

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

翻译 - TF-Agents是TensorFlow中的强化学习库

reinforcement-learning Tensorflow contextual-bandits dqn

Python 2.9 k

1 个月前

david-cortes / contextualbandits

Python implementations of contextual bandits algorithms

contextual-bandits reinforcement-learning exploration-exploitation

Python 770

3 个月前

st-tech / zr-obp

#数据仓库#Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation

数据集 contextual-bandits research

Python 663

10 个月前

fidelity / mabwiser

#计算机科学#[IJAIT 2021] MABWiser: Contextual Multi-Armed Bandits Library

contextual-bandits 机器学习 recsys

Python 230

7 个月前

alison-carrera / onn

Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)

神经网络 neural-architecture-search pytorch-implementation 机器学习 contextual-bandits reinforcement-learning-algorithms reinforcement-learning PyTorch

Python 186

5 年前

alison-carrera / mabalgs

#算法刷题#👤 Multi-Armed Bandit Algorithms Library (MAB) 👮‍♂️

arm Simulation ucb 算法 ranking-algorithm contextual-bandits reinforcement-learning reinforcement-learning-algorithms

Python 133

3 年前

Nth-iteration-labs / contextual

#计算机科学#Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies

bandit Simulation 统计 contextual-bandits bandit-learning reinforcement-learning exploitation exploration evaluation 机器学习

R 80

5 年前

instadeepai / catx

#计算机科学#🐈‍⬛ Contextual bandits library for continuous action trees with smoothing in JAX

contextual-bandits jax 深度学习 Python

Python 66

3 年前

banditml / banditml

A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.

contextual-bandits PyTorch personalization neural-networks reinforcement-learning

Python 66

4 年前

lil-lab / blocks

#自然语言处理#Blocks World -- Simulator, Code, and Models (Misra et al. EMNLP 2017)

自然语言处理机器学习 natural-language-understanding reinforcement-learning contextual-bandits

Python 40

6 年前

pemami4911 / sinkhorn-policy-gradient.pytorch

#计算机科学#Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"

深度学习 combinatorial-optimization reinforcement-learning contextual-bandits

Python 39

7 年前

Heewon-Hailey / multi-armed-bandits-for-recommendation-systems

implement basic and contextual MAB algorithms for recommendation system

Python scikit-learn NumPy matplotlib recommendation-system contextual-bandits

Jupyter Notebook 36

3 年前

thunfischtoast / LinUCB

Contextual bandit algorithm called LinUCB / Linear Upper Confidence Bounds as proposed by Li, Langford and Schapire

Java bandit-learning contextual-bandits

Java 29

2 年前

doerlbh / MiniVox

Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".

speaker-diarization Bukkit speaker-recognition online-learning contextual-bandits self-supervised-learning

Cuda 27

4 年前

mmalekzadeh / privacy-preserving-bandits

#计算机科学#Privacy-Preserving Bandits (MLSys'20)

differential-privacy 机器学习 online-machine-learning reinforcement-learning contextual-bandits privacy-preserving-machine-learning federated-learning recommender-system recommendation bandit-learning

Jupyter Notebook 22

2 年前

RonyAbecidan / Neural-Thompson-Sampling

Study of the paper 'Neural Thompson Sampling' published in October 2020

神经网络 contextual-bandits

Jupyter Notebook 21

3 年前

improve-ai / python-ranker

#计算机科学#Contextual Multi-Armed Bandit Platform for Scoring, Ranking & Decisions

ab-testing 人工智能 contextual-bandits 机器学习 personalization Python recommender-system xgboost reinforcement-learning multivariate-testing

Python 21

2 年前

thoughtworks / simplebandit

lightweight contextual bandit library for ts/js

contextual-bandits recommender personalization recommendation-system recommender-systems

TypeScript 18

1 年前

jtcho / FairMachineLearning

#计算机科学#Implementation of provably Rawlsian fair ML algorithms for contextual bandits.

机器学习 contextual-bandits Python Jupyter Notebook NumPy

Jupyter Notebook 14

8 年前