
LeanModels/.github


LeanModels — Making Foundation Models Leaner and Meaner

Welcome to LeanModels, an organization founded by Tianyi Zhang and dedicated to making foundation models, such as LLMs and diffusion models, more memory- and compute-efficient through practical compression and inference optimization techniques.

Explore our key projects:

  • DFloat11: A lossless LLM compression framework enabling efficient GPU inference
  • Bagel-DFloat11: DFloat11-compressed version of Bagel, a unified multimodal model
  • LeanQuant: Scalable, loss-error-aware quantization for LLMs

We welcome contributors, collaborators, and feedback! If you're working on model compression or efficient inference, feel free to reach out.
