Changhao's picture

1 5

Changhao

lichangh20

·

https://lichangh20.github.io/

lichangh20

AI & ML interests

RL, Agent, Efficient ML

Recent Activity

upvoted an article about 1 month ago

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

upvoted a paper about 2 months ago

Matryoshka: Learning to Drive Black-Box LLMs with LLMs

upvoted a paper 3 months ago

MLE-Smith: Scaling MLE Tasks with Automated Multi-Agent Pipeline

View all activity

Organizations

Papers 4

arxiv:2507.16782

arxiv:2505.07782

arxiv:2410.20749

arxiv:2306.11987

models 2

lichangh20/s1k_format_filtered_bf16

Feature Extraction • 7B • Updated Mar 25 • 8

lichangh20/s1k_format_filtered

Feature Extraction • 7B • Updated Mar 25 • 9

datasets 15

lichangh20/s1K_initial_filtered_for_llama8b

Viewer • Updated May 2 • 1k • 6

lichangh20/olympiadbench

Viewer • Updated Apr 22 • 674 • 15

lichangh20/minervamath

Viewer • Updated Apr 22 • 272 • 8

lichangh20/s1K_simplified_filtered_for_adapter

Viewer • Updated Mar 24 • 927 • 14

lichangh20/s1K_initial_filtered_for_qwen7b_simplified_summarized

Viewer • Updated Mar 15 • 997 • 7

lichangh20/s1K_initial_filtered_for_qwen7b_summarized

Viewer • Updated Mar 12 • 997 • 11

lichangh20/s1K_filtered_for_qwen7b_sft

Viewer • Updated Mar 11 • 899 • 13

lichangh20/s1k_eval_sampled_1of12

Viewer • Updated Mar 9 • 77 • 9

lichangh20/s1k_train_sampled_1of12

Viewer • Updated Mar 9 • 77 • 8

lichangh20/gpqa_sampled_1of3

Viewer • Updated Mar 9 • 66 • 12

View 15 datasets