The demand for tools that enhance productivity and accelerate development has never been greater. Stable Code 3B, developed by Stability AI, is a coding model that offers capabilities on par with larger models such as Code Llama 7B while being significantly more efficient. This article delves into the features, architecture, and applications of Stable Code 3B.
Stable Code 3B is a 3 billion parameter Large Language Model (LLM) that excels in code completion. Its sophisticated design ensures high accuracy and responsiveness, making it an invaluable tool for developers across various programming languages.
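To make the code-completion workflow concrete, the sketch below shows a small, generic post-processing helper that trims a raw completion at the first stop sequence, together with a hedged outline (in comments) of how the model itself might be invoked. The model identifier `stabilityai/stable-code-3b` and the generation call are assumptions based on common Hugging Face conventions, not an excerpt from official documentation.

```python
def trim_completion(raw: str, stop_sequences=("\n\n", "<|endoftext|>")) -> str:
    """Cut a raw model completion at the earliest stop sequence, if any."""
    cut = len(raw)
    for stop in stop_sequences:
        idx = raw.find(stop)
        if idx != -1:
            cut = min(cut, idx)
    return raw[:cut]

# Hedged sketch of calling the model (requires the `transformers` library
# and downloaded weights; the model id is an assumption):
#
#   from transformers import AutoModelForCausalLM, AutoTokenizer
#   tok = AutoTokenizer.from_pretrained("stabilityai/stable-code-3b")
#   model = AutoModelForCausalLM.from_pretrained("stabilityai/stable-code-3b")
#   inputs = tok("def fibonacci(n):", return_tensors="pt")
#   out = model.generate(**inputs, max_new_tokens=48)
#   print(trim_completion(tok.decode(out[0])))

print(trim_completion("    return a + b\n\ndef unrelated():"))
```

Trimming at a stop sequence keeps only the completion for the current scope, which is how most completion front ends present single-suggestion results.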
When compared to other models, Stable Code 3B demonstrates competitive performance across a variety of programming languages:
| Model | Size | Python | C++ | JavaScript | Java | PHP | Rust |
|---|---|---|---|---|---|---|---|
| Stable Code | 3B | 32.4% | 30.9% | 32.1% | 32.1% | 24.2% | 23.0% |
| CodeLlama | 7B | 30.0% | 28.2% | 32.5% | 31.1% | 25.7% | 26.3% |
| Deepseek Coder | 1.3B | 28.6% | 29.2% | 28.7% | 29.0% | 23.6% | 18.5% |
| Wizard Coder | 3B | 31.6% | 25.6% | 26.2% | 25.8% | 25.3% | 20.4% |
| StarCoder | 3B | 21.6% | 19.8% | 21.5% | 20.5% | 19.0% | 16.9% |
Stable Code 3B is built on a decoder-only transformer architecture similar to the LLaMA architecture. The model uses Rotary Position Embeddings to improve throughput, and a modified version of the GPT-NeoX tokenizer extended with special tokens for Fill-in-the-Middle (FIM) capabilities.
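FIM (Fill in the Middle) lets the model generate code given both the text before and the text after the gap, which is what editors need for in-place completion. The sketch below assembles such a prompt; the token strings `<fim_prefix>`, `<fim_suffix>`, and `<fim_middle>` follow the FIM convention common to several code models and should be verified against the tokenizer's actual special tokens.

```python
def build_fim_prompt(prefix: str, suffix: str) -> str:
    """Assemble a Fill-in-the-Middle prompt: the model is asked to
    generate the code that belongs between `prefix` and `suffix`."""
    return f"<fim_prefix>{prefix}<fim_suffix>{suffix}<fim_middle>"

prompt = build_fim_prompt(
    "def average(values):\n    total = ",
    "\n    return total / len(values)",
)
print(prompt)
```

The model's output after `<fim_middle>` is then spliced between the prefix and suffix in the editor buffer.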
The training dataset for Stable Code 3B is a carefully curated mixture of high-quality, open-source datasets available on the Hugging Face Hub.
The model is trained on a diverse range of programming languages, including Python, C++, JavaScript, Java, PHP, and Rust.
Stable Code 3B serves as a foundational model designed for application-specific fine-tuning. Developers must evaluate and adjust the model to ensure its safe and effective performance in real-world applications.
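Application-specific fine-tuning typically begins by formatting domain examples into plain text sequences the model can be trained on. The helper below is a minimal, hypothetical preprocessing sketch; the field names, template, and end-of-text token are illustrative assumptions, not part of Stability AI's documented pipeline.

```python
def format_example(prompt: str, completion: str, eos: str = "<|endoftext|>") -> str:
    """Join a prompt/completion pair into a single training sequence,
    terminated by an end-of-text token so examples can be packed."""
    return f"{prompt}{completion}{eos}"

# Hypothetical domain examples for a fine-tuning corpus.
records = [
    {"prompt": "# add two numbers\ndef add(a, b):\n",
     "completion": "    return a + b\n"},
    {"prompt": "# greet the user\ndef greet(name):\n",
     "completion": "    return f\"Hello, {name}!\"\n"},
]
corpus = [format_example(r["prompt"], r["completion"]) for r in records]
print(len(corpus))
```

From here, a standard causal-language-modeling training loop (for example, via a library such as `transformers`) can consume the formatted corpus.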
As with any foundational model, users should be aware of inherent limitations and potential biases. The model may produce unreliable or inappropriate outputs based on its training data. Careful evaluation and adjustment are crucial before deploying the model in mission-critical or sensitive applications.
Stable Code 3B represents a significant advancement in coding models, providing developers with a robust tool for efficient coding and instruction comprehension. Its ability to operate locally via Braina AI enhances its accessibility, making it a practical choice for modern developers.