A unified inference and post-training framework for accelerated video generation.
Updated Dec 8, 2025 - Python
Awesome Reasoning LLM Tutorial/Survey/Guide
Mental health large language models (LLM x Mental Health): pre- and post-training, datasets, evaluation, deployment, and RAG, with InternLM / Qwen / Baichuan / DeepSeek / Mixtral / Llama / GLM series models
Explore the Multimodal “Aha Moment” on 2B Model
"LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"
A unified model for 4D human-scene reconstruction
Train a Language Model with GRPO to create a schedule from a list of events and priorities
[NeurIPS 2025] Flow x RL. "ReinFlow: Fine-tuning Flow Policy with Online Reinforcement Learning". Supports VLAs, e.g., pi0 and pi0.5. Fully open-sourced.
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
A brief and partial summary of RLHF algorithms.
Post-training scripts and samples for NVIDIA Cosmos ecosystem
RapidFire AI: Rapid AI Customization from RAG to Fine-Tuning
A collection of vision-language-action model post-training methods.
MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning
Official repo of the paper "SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models". A post-training framework that creates a cost-effective, self-iterative optimization loop.
A High-Efficiency System of Large Language Model Based Search Agents
[NeurIPS 2025 Oral] Exploring Diffusion Transformer Designs via Grafting
[CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis
A Practitioner's Guide to M(eow)ti Turn Agentic ReinfOrcement learning
A comprehensive collection of work on learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward models and learning strategies across the training, inference, and post-inference stages.