Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
1
12
2
Suchir Salhan
suchirsalhan
Follow
yulongchen's profile picture
Moibe's profile picture
snklp's profile picture
13 followers
·
27 following
https://www.suchirsalhan.com/
suchirsalhan
suchirsalhan
ssalhan
AI & ML interests
Multilinguality and Cognitively-Inspired AI. Tokenization, Pretraining, Interpretability & Alignment.
Recent Activity
updated
a dataset
7 minutes ago
Beetle-Data/es-raw-28B
published
a dataset
8 minutes ago
Beetle-Data/es-raw-28B
updated
a dataset
about 1 hour ago
Beetle-Data/en-raw-28B
View all activity
Organizations
suchirsalhan
's datasets
9
Sort: Recently updated
suchirsalhan/kidalign-llama-filterable
Viewer
•
Updated
1 day ago
•
97.6k
•
16
suchirsalhan/kidalign-llama-3.1-8B-Instruct
Updated
1 day ago
•
4
suchirsalhan/babylm-detox
Viewer
•
Updated
8 days ago
•
11.6M
•
25
suchirsalhan/gptbert-tokenised
Updated
Jul 24, 2025
•
5
suchirsalhan/Phonemized-UD
Viewer
•
Updated
May 30, 2025
•
1.19M
•
136
suchirsalhan/BabyLM-Pretokenised
Viewer
•
Updated
Jan 31, 2025
•
1.64M
•
6
suchirsalhan/MAO-CHILDES
Viewer
•
Updated
Apr 11, 2024
•
3.81M
•
5
suchirsalhan/CLiMP
Preview
•
Updated
Apr 2, 2024
•
30
•
1
suchirsalhan/SLING
Viewer
•
Updated
Apr 2, 2024
•
40k
•
31