David Golchinfar
PRO
DavidGF
65 followers · 47 following
https://vago-solutions.ai
DavidGFar
dgolchin
AI & ML interests
Fine-tuning LLMs; improving German language understanding and text generation in LLMs
Recent Activity
Reacted to anakin87's post with ❤️ 1 day ago
Let LLMs wander - Engineering RL Environments

Reinforcement Learning Environments are little worlds where models can act, get rewards, and learn. I've been exploring how to design them, figuring out what works and what doesn't. If you want to learn how to build them, I recorded a practical intro video. You'll also see how to turn Liquid AI LFM2-2.6B into a Tic-tac-toe master.

Engineering RL Environments video: https://www.youtube.com/watch?v=71V3fTaUp2Q
LLM RL Environments Lil Course: https://github.com/anakin87/llm-rl-environments-lil-course
Play against the trained model: https://huggingface.co/spaces/anakin87/LFM2-2.6B-mr-tictactoe
HF collection (datasets + models): https://huggingface.co/collections/anakin87/lfm2-26b-mr-tic-tac-toe
Liked a model 2 days ago: AIDC-AI/Marco-Mini-Instruct
Liked a model 4 days ago: LiquidAI/LFM2.5-VL-450M
Organizations
DavidGF's datasets
None public yet