Megatron-LM is a GPU-optimized deep learning framework from NVIDIA designed to train extremely large transformer-based language models efficiently at scale. The repository provides both a reference training implementation and Megatron Core, a composable library of high-performance building blocks for custom large-model pipelines. It supports advanced parallelism strategies including tensor, pipeline, data, expert, and context parallelism, enabling training across massive multi-GPU and multi-node clusters. The framework includes mixed-precision training options such as FP16, BF16, FP8, and FP4 to maximize performance and memory efficiency on modern hardware. Megatron-LM is widely used in research and industry for pretraining GPT-, BERT-, T5-, and multimodal-style models, with tooling for checkpoint conversion and interoperability with Hugging Face. Overall, it is a production-grade system for organizations pushing the limits of large-scale language model training.

Features

  • GPU-optimized transformer training
  • Advanced parallelism strategies
  • Mixed precision training support
  • Composable Megatron Core library
  • Hugging Face checkpoint conversion
  • Multi-node scalable training pipelines

Project Samples

Project Activity

See All Activity >

Categories

Research

License

MIT License

Follow Megatron-LM

Megatron-LM Web Site

Other Useful Business Software
Ditto Edge Server is a lightweight standalone server for resource-constrained edge environments, based on the core Ditto Edge SDK. Icon
Ditto Edge Server is a lightweight standalone server for resource-constrained edge environments, based on the core Ditto Edge SDK.

With Ditto Edge Server, you can join devices as small as a Raspberry Pi to a local mesh network and synchronize data across edge environments.

Ditto's Edge SDK is the only thing your edge devices need to ensure your application is operational in any environment, regardless of network conditions.
Learn More
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Megatron-LM!

Additional Project Details

Programming Language

Python

Related Categories

Python Research Software

Registered

2026-02-25