Skip to content

Inference Layers

Model layers optimized for inference.

Components

  • Attention - Attention mechanisms
  • Activation - Activation functions
  • Linear - Linear layers
  • LayerNorm - Layer normalization
  • RotaryEmbedding - Rotary position embeddings
  • Sampler - Token sampling
  • EmbedHead - Embedding and head layers