Braina Logo

AI Library

DBRX Model Information & Download

Model Tag:
dbrx 📋
DBRX is a versatile, open-source large language model developed by Databricks. It is designed for a wide range of applications.
Model File Size
74 GB
Quantization
Q4
License
Databricks Open Model License
Last Updated
2024-05-04 (6 months ago)

DBRX Models

DBRX is an open, general-purpose language model developed by Databricks. Designed to enhance the capabilities of AI applications, DBRX stands out with its state-of-the-art architecture and robust training methodologies.

DBRX Model

What is DBRX?

DBRX is a transformer-based decoder-only large language model (LLM) trained specifically for next-token prediction. With its sophisticated implementation of a fine-grained mixture-of-experts (MoE) architecture, DBRX boasts a remarkable 132 billion total parameters. Notably, it can actively engage 36 billion parameters on any given input, ensuring high performance across various tasks.

Training and Performance

One of the key features that set DBRX apart from its competitors is the extensive data set used during its pre-training phase. DBRX was trained on a staggering 12 trillion tokens of text and code, providing it with a broad understanding of language nuances, programming concepts, and contextual relevance. As a result, DBRX excels particularly in coding tasks, outperforming even specialized models like CodeLLaMA-70B in programming-related applications.

Key Features of DBRX

← Back to Model Library