Commercially usable open-source LLMs.
Optimized for fast training and inference.
with full data transparency
for the possibility of commercial use
on your data for your use case
for cost-effective deployments
to understand long documents
for pretraining, finetuning, evaluation
“Using the MosaicML platform, we were able to train and deploy our Ghostwriter 2.7B LLM for code generation with our own data within a week and achieve leading results.”
A new standard for open-source, commercially usable LLMs. Train, finetune, and deploy your own private models.
Unleash the power of generative AI and maximize the potential of your data with our purpose-built platform.
Customize models on your data for your specific use cases
Create your own domain-specific LLMs from scratch for maximum customization and security
“The foundation series by MosaicML, including MPT-7B/30B (and an efficient training repo), makes high-quality pre-trained language models available to anyone for commercial use.”
Use our optimized codebase for training, finetuning, evaluating, and deploying LLMs.
Our codebase was built with Composer and integrates with the MosaicML platform.
Easy to use, efficient, and flexible, this codebase is designed to enable rapid experimentation with the latest techniques.
The Model Gauntlet is our standardized method of evaluating LLMs in a holistic manner.
It encompasses 34 different benchmarks collected from a variety of sources and organized into several broad categories of competency.
Check out our leaderboard to compare eval scores across popular open-source LLMs!
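To make the idea of category-level evaluation concrete, here is a minimal sketch of how per-benchmark scores could be rolled up into category scores and a single overall number. It assumes a plain unweighted mean at both levels; the actual aggregation used by the Model Gauntlet is not described here and may differ. The benchmark names, category names, and scores are hypothetical placeholders, not real results.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical results for one model: benchmark -> (category, accuracy).
# All names and numbers are illustrative placeholders.
RESULTS = {
    "bench_a": ("world_knowledge", 0.50),
    "bench_b": ("world_knowledge", 0.70),
    "bench_c": ("reasoning", 0.40),
    "bench_d": ("reading_comprehension", 0.80),
}


def category_scores(results):
    """Average the benchmark scores within each category (assumed: simple mean)."""
    by_category = defaultdict(list)
    for category, score in results.values():
        by_category[category].append(score)
    return {category: mean(scores) for category, scores in by_category.items()}


def overall_score(results):
    """Average the category scores so that each category counts equally."""
    return mean(list(category_scores(results).values()))


if __name__ == "__main__":
    for category, score in sorted(category_scores(RESULTS).items()):
        print(f"{category}: {score:.2f}")
    print(f"overall: {overall_score(RESULTS):.2f}")
```

Averaging by category first keeps a category with many benchmarks from dominating the overall number, which is one reasonable way to compare models holistically.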
“MPT-30B is an open-source and commercially licensed decoder-based LLM that is more powerful than GPT-3-175B.”