1 30 3

Jinming Wu

kimingng

https://kimingng.notion.site/Jinming-Kimmy-Wu-b22c1682d48d47939dcd7c41bf6a6bab?source=copy_link

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Meta-RL Induces Exploration in Language Agents

upvoted a paper 4 days ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

upvoted a paper 18 days ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

View all activity

Organizations

upvoted a paper 3 days ago

Meta-RL Induces Exploration in Language Agents

Paper • 2512.16848 • Published 9 days ago • 10

upvoted a paper 4 days ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published 5 days ago • 60

upvoted a paper 18 days ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published 19 days ago • 36

upvoted a paper 24 days ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published about 1 month ago • 141

upvoted a paper 26 days ago

LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

Paper • 2511.20785 • Published Nov 25 • 166

upvoted a paper about 1 month ago

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20 • 91

upvoted a paper about 2 months ago

Uniform Discrete Diffusion with Metric Path for Video Generation

Paper • 2510.24717 • Published Oct 28 • 40

upvoted a paper 2 months ago

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

Paper • 2510.14979 • Published Oct 16 • 66

upvoted 2 papers 3 months ago

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Paper • 2510.09608 • Published Oct 10 • 50

VChain: Chain-of-Visual-Thought for Reasoning in Video Generation

Paper • 2510.05094 • Published Oct 6 • 37

upvoted a collection 3 months ago

LLaVA-OneVision

Collection

a model good at arbitrary types of visual input • 17 items • Updated Sep 17 • 31

upvoted 5 papers 4 months ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2 • 83

4DNeX: Feed-Forward 4D Generative Modeling Made Easy

Paper • 2508.13154 • Published Aug 18 • 62

updated a dataset 5 months ago

lmms-lab/FVQA

Viewer • Updated Aug 9 • 6.66k • 439 • 7

liked a dataset 5 months ago

WHB139426/Grounded-VideoLLM

Updated Apr 10 • 3.93k • 9

updated a collection 5 months ago

MMSearch-R1

Collection

MMSearch-R1 is a solution designed to train LMMs to perform on-demand multimodal search in real-world environment. • 4 items • Updated Aug 8 • 1

Jinming Wu

AI & ML interests

Recent Activity

Organizations

kimingng's activity