Svoboda
|
Graniru
|
BBC Russia
|
Golosameriki
|
Facebook
Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
3
5
5
Jie Liu
jieliu
Follow
sefira32's profile picture
jizhongpeng's profile picture
xiao-lin's profile picture
8 followers
·
0 following
yifan123
AI & ML interests
Reinforcement Learning, Large Language Model
Organizations
Papers
5
arxiv:
2407.16154
arxiv:
2406.11817
arxiv:
2402.12343
arxiv:
2310.03708
Expand 5 papers
models
2
Sort: Recently updated
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-beta0.5
Updated
16 days ago
jieliu/Storm-7B
Text Generation
•
Updated
Jun 18
•
72
•
39
datasets
None public yet