#计算机科学#Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learni...
翻译 - Vowpal Wabbit是一个机器学习系统,它通过在线,哈希,减少,归约,learning2search,主动和交互式学习等技术来推动机器学习的前沿。
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
翻译 - TF-Agents是TensorFlow中的强化学习库
Python implementations of contextual bandits algorithms
#数据仓库#Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation
#计算机科学#[IJAIT 2021] MABWiser: Contextual Multi-Armed Bandits Library
Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)
#算法刷题#👤 Multi-Armed Bandit Algorithms Library (MAB) 👮♂️
#计算机科学#Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies
#计算机科学#🐈⬛ Contextual bandits library for continuous action trees with smoothing in JAX
A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.
#自然语言处理#Blocks World -- Simulator, Code, and Models (Misra et al. EMNLP 2017)
#计算机科学#Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"
implement basic and contextual MAB algorithms for recommendation system
Contextual bandit algorithm called LinUCB / Linear Upper Confidence Bounds as proposed by Li, Langford and Schapire
Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".
#计算机科学#Privacy-Preserving Bandits (MLSys'20)
Study of the paper 'Neural Thompson Sampling' published in October 2020
#计算机科学#Contextual Multi-Armed Bandit Platform for Scoring, Ranking & Decisions
lightweight contextual bandit library for ts/js
#计算机科学#Implementation of provably Rawlsian fair ML algorithms for contextual bandits.