All checkpoints for "Implicit Cross-Lingual Rewarding for Efficient Multilingual Preference Alignment", https://arxiv.org/abs/2503.04647
WenYang
James-WYang
AI & ML interests
None yet
Organizations
None yet
Language Imbalance Driven Rewarding
All checkpoints for our work "Language Imbalance Driven Rewarding for Multilingual Self-improving", https://arxiv.org/abs/2410.08964
-
James-WYang/LIDR_M0_Meta-Llama-3-8B-Instruct_en_es_ru_de_fr
8B • Updated • 6 -
James-WYang/LIDR_M0_Meta-Llama-3-8B-Instruct_en_th_bn_sw
8B • Updated • 4 -
James-WYang/LIDR_M0_Meta-Llama-3-8B-Instruct_translate_by_system_en_th_bn_sw
8B • Updated • 5 -
James-WYang/LIDR_M0_Qwen2-7B-Instruct_en_es_ru_de_fr
8B • Updated • 5
X-Instruction
X-Instruction: Aligning Language Model in Low-resource Languages with Self-curated Cross-lingual Instruction, https://arxiv.org/abs/2405.19744
Implicit Cross-lingual Reward
All checkpoints for "Implicit Cross-Lingual Rewarding for Efficient Multilingual Preference Alignment", https://arxiv.org/abs/2503.04647
Language Imbalance Driven Rewarding
All checkpoints for our work "Language Imbalance Driven Rewarding for Multilingual Self-improving", https://arxiv.org/abs/2410.08964
-
James-WYang/LIDR_M0_Meta-Llama-3-8B-Instruct_en_es_ru_de_fr
8B • Updated • 6 -
James-WYang/LIDR_M0_Meta-Llama-3-8B-Instruct_en_th_bn_sw
8B • Updated • 4 -
James-WYang/LIDR_M0_Meta-Llama-3-8B-Instruct_translate_by_system_en_th_bn_sw
8B • Updated • 5 -
James-WYang/LIDR_M0_Qwen2-7B-Instruct_en_es_ru_de_fr
8B • Updated • 5
BigTranslate
BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages, https://arxiv.org/abs/2305.18098
X-Instruction
X-Instruction: Aligning Language Model in Low-resource Languages with Self-curated Cross-lingual Instruction, https://arxiv.org/abs/2405.19744