gm.nn#
Gemma models.
Symbols#
Module#
Symbols needed to build new |
Class#
Wrapper around a model to compute policy and anchor outputs. |
|
Output of the |
|
Attention module. |
|
Transformer block. |
|
Einsum is a convenience module for parameterized tensor multiplication. |
|
Embedder module. |
|
Feed forward module. |
|
Gemma2 transformer architecture. |
|
Gemma2 transformer architecture. |
|
Gemma2 transformer architecture. |
|
Gemma3 transformer architecture. |
|
Gemma3 transformer architecture. |
|
Gemma3 transformer architecture. |
|
Gemma3 transformer architecture. |
|
Gemma3 transformer architecture. |
|
Gemma3n E2B transformer architecture. |
|
Gemma3n E4B transformer architecture. |
|
Gemma 4 26B_A4B MoE model. |
|
Gemma 4 31B model. |
|
Gemma 4 E2B model. |
|
Gemma 4 E4B model. |
|
Wrapper around a Gemma model to enable int4 inference. |
|
Wrapper around a Gemma model to enable LoRA. |
|
Output of the Gemma model. |
|
Wrapper around a Gemma model to enable quantization aware training. |
|
RMSNorm layer. |
|
SigLIP vision encoder forward pass from PatchifiedMedia. |
|
Base transformer class. |
|
Protocol for a transformer model to be used with a Sampler. |