#

mab

Here are 24 public repositories matching this topic...

avorozhtsov / shipit

Exploitation vs Exploration problem stated as A/B-testing with maximum profit per unit time.

continuous-testing ab-testing mab exploration-exploitation peaking

Updated Oct 4, 2023
Mathematica

jiseongHAN / reinforcement

My Little Reinforcement Learning

reinforcement-learning pytorch dqn reinforce ddqn mab ppo-pytorch

Updated Jul 13, 2021
Python

DURUII / Replica-EUWR

🐯REPLICA of "Combinatorial Multi-Armed Bandit Based Unknown Worker Recruitment in Heterogeneous Crowdsensing"

crowdsourcing multi-armed-bandits online-learning crowdsensing mab mobile-crowdsensing worker-recruitment

Updated Dec 24, 2023
Jupyter Notebook

DURUII / Replica-AUCB

🐯REPLICA of "Auction-based combinatorial multi-armed bandit mechanisms with strategic arms"

multi-armed-bandit bandits mab cmab bandit-algorithms aution aucb

Updated Dec 17, 2023
Python

vmarchaud / ts-mab

Typescript implementation of a multi-armed bandit

typescript thompson-sampling mab

Updated May 17, 2020
TypeScript

sshaplygin / abcs

Adaptive bandit cache selection

golang statistics lru-cache arc-cache lfu-cache mab 2q-cache lfuda-cache

Updated Apr 14, 2024
Go

pko89403 / Recommender

Implementation of recommender ( Pytorch & Keras )

python keras pytorch matrix-factorization bst doc2vec cf mf mab item2vec bert4rec dlrm widendeep

Updated Nov 15, 2021
Jupyter Notebook

aijunbai / bandit

Algorithms for multi-armed bandit (MAB) problems

mab

Updated Oct 1, 2015
C++

JoelJa835 / MAB_Algorithms

Implementation of Multi-Armed Bandit (MAB) algorithms UCB and Epsilon-Greedy. MAB is a class of problems in reinforcement learning where an agent learns to choose actions from a set of arms, each associated with an unknown reward distribution. UCB and Epsilon-Greedy are popular algorithms for solving MAB problems.

reinforcement-learning-algorithms ucb bandits mab e-greedy

Updated Mar 26, 2023
Python

tuhinsharma121 / pybandit-archive

A Python library for all popular multi-armed bandit algorithms.

optimization-algorithms mab

Updated Apr 28, 2023
Jupyter Notebook

Bachfischer / COMP90051-StatML-Assignment-2

Source code for Assignment 2 of COMP90051 (Semester 2 2020)

ucb multi-armed-bandit mab

Updated Oct 21, 2020
Jupyter Notebook

juliennonin / multiplayer-bandits

Multi-Player Bandits Revisited [L. Besson & É. Kaufmann]

reinforcement-learning multi-armed-bandit mab

Updated Jan 21, 2021
Python

duchuyle108 / SDN-EgressNode-Selection

The work in paper "A Reinforcement Learning-Based Solution for Intra-Domain Egress Selection" - Duc-Huy LE, Hai Anh TRAN

Updated Sep 11, 2022
Python

abhinavcreed13 / Multi-armed-bandits-MAB

This project implements famous MAB algorithms and evaluates them on the basis of their performance - EpsilonGreedy, UCB, BetaThompson, LinUCB, LinThompson.

algorithms evaluation python3 multi-armed-bandits mab gridsearch

Updated Mar 20, 2020
Jupyter Notebook

pm3310 / mab-covid19

Multi-Armed-Bandit solutions on AWS to deliver Covid-19 test kits efficiently and effectively

python aws multi-armed-bandits mab sagemaker coronavirus covid-19

Updated Mar 25, 2020
Jupyter Notebook

vmam

MatteoGuadrini / vmam

VLAN Mac-address Authentication Manager

ldap radius python3 ldap-authentication ldap-server vlan mac-address network-architecture ldap-manager radius-server nac mab 80211 pywinrm 8021x ldap3 rfc-3579 ldap-group ieee8021x

Updated Apr 5, 2021
Python

VladMarianCimpeanu / OLA_project

Reinforcement learning techniques applied to solve pricing problems in e-commerce applications. Final project for "Online learning applications" course (2021-2022)

reinforcement-learning pricing thompson-sampling multi-armed-bandit montecarlo-simulation mab ucb1 online-learning-applications

Updated Oct 30, 2022
Jupyter Notebook

aldente0630 / multi_armed_bandit

Experiment results using MAB algorithms in Yahoo! Front Page Today Module User Click Log dataset

contextual-bandits mab striatum

Updated Jan 2, 2020
Jupyter Notebook

v-i-s-h / MAB.jl

A Julia Package for providing Multi Armed Bandit Experiments

reinforcement-learning julia julia-language thompson-sampling reinforcement-learning-algorithms multi-arm-bandits ucb julia-package exp julialang mab bandit-experiments

Updated Jul 19, 2018
Julia

jacksonpradolima / coleman4hcs

COLEMAN (Combinatorial VOlatiLE Multi-Armed BANdit) - and strategies for HCS context

tcp continuous-integration ci multi-armed-bandit hcs coleman mab test-case-prioritization tcpci highly-configurable-system

Updated Jul 22, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the mab topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the mab topic, visit your repo's landing page and select "manage topics."