Git repositories - public and private Git repositories


Language: All

Topic: reinforcement-learning

rnekrasov/
xlang-paper-reading
Paper collection on building and evaluating language model agents via executable language grounding
large-language-models
agent
reinforcement-learning
tool-use
code-generation
web-grounding
complex-reasoning
language-agent
llm-robotics
neural-symbolic
Markdown
0
1
Updated 7 months ago
rnekrasov/
wandb
🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.
machine-learning
pytorch
deep-learning
tensorflow
mlops
jax
data-science
keras
ml-platform
model-versioning
reinforcement-learning
reproducibility
collaboration
data-versioning
experiment-track
hyperparameter-optimization
hyperparameter-search
hyperparameter-tuning
Python
0
0
Updated 7 months ago
AIRI-Institute/
him-agent
The Hierarchical Intrinsically Motivated Agent (HIMA) is an algorithm intended to exhibit adaptive goal-directed behavior using neurophysiological models of the neocortex, basal ganglia, and thalamus.
reinforcement-learning
biologically-plausible-learning
hierarchical-temporal-memory
intrinsic-motivation
model-based-reinforcement-learning
sparse-distributed-representations
Python
0
0
Updated 2 months ago
AIRI-Institute/
pogema
POGEMA stands for Partially-Observable Grid Environment for Multiple Agents. This is a grid-based environment that was specifically designed to be flexible, tunable and scalable. It can be tailored to a variety of PO-MAPF settings.
reinforcement-learning
pathfinding
simulation
po-mapf
gym-environment
mapf
marl
Python
0
0
Updated 2 months ago
AIRI-Institute/
learn-to-follow
[AAAI-2024] Follower: This study addresses the challenging problem of decentralized lifelong multi-agent pathfinding. The proposed Follower approach utilizes a combination of a planning algorithm for constructing a long-term plan and reinforcement learning for resolving local conflicts.
reinforcement-learning
pathfinding
mapf
aaai-2024
grid
lifelong
pogema
C++
0
0
Updated 2 months ago
AIRI-Institute/
when-to-switch
"When to Switch" Implementation: Addressing the PO-MAPF challenge with RePlan & EPOM policies. This repo includes search-based re-planning, reinforcement learning techniques, and three mixed policies for pathfinding in partially observable multi-agent environments. 🤖🛤️
reinforcement-learning
pathfinding
mapf
grid
pogema
Python
0
0
Updated 2 months ago

