Conversation

@sanowl commented on Apr 4, 2024

This pull request implements attention-based building blocks for neural networks using the tch-rs library. The implemented components are listed below, followed by illustrative sketches of the feed-forward and attention paths:

GeGlu: GELU-gated linear unit (GeGLU) activation, a GLU variant that gates with GELU.
FeedForward: A feed-forward layer with GeGlu activation.
CrossAttention: Cross-attention layer for query-key-value attention.
BasicTransformerBlock: A basic Transformer block composed of cross-attention and feed-forward layers.
SpatialTransformer: A spatial transformer model (also known as Transformer2DModel) that applies a series of BasicTransformerBlock layers.
AttentionBlock: An attention block that performs self-attention on the input tensor.
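
For a concrete feel of the feed-forward path, here is a minimal sketch of GeGlu and FeedForward with tch-rs. It is an illustration under assumptions, not the PR's code: the field names, the variable-store layout, and the `mult` expansion factor are all hypothetical.

```rust
use tch::{nn, nn::Module, Device, Kind, Tensor};

/// GeGLU: project the input to 2 * dim_out, split in half along the last
/// dimension, and gate one half with GELU applied to the other.
#[derive(Debug)]
struct GeGlu {
    proj: nn::Linear,
}

impl GeGlu {
    fn new(vs: &nn::Path, dim_in: i64, dim_out: i64) -> Self {
        let proj = nn::linear(vs / "proj", dim_in, dim_out * 2, Default::default());
        Self { proj }
    }
}

impl Module for GeGlu {
    fn forward(&self, xs: &Tensor) -> Tensor {
        // Split the projection into a value half and a gate half.
        let chunks = xs.apply(&self.proj).chunk(2, -1);
        // NOTE: recent tch versions take an approximation argument for
        // `gelu`; older versions take none. Adjust for your tch version.
        &chunks[0] * chunks[1].gelu("none")
    }
}

/// Feed-forward block: GeGLU expansion followed by a projection back to `dim`.
#[derive(Debug)]
struct FeedForward {
    geglu: GeGlu,
    out: nn::Linear,
}

impl FeedForward {
    fn new(vs: &nn::Path, dim: i64, mult: i64) -> Self {
        let inner = dim * mult;
        Self {
            geglu: GeGlu::new(&(vs / "geglu"), dim, inner),
            out: nn::linear(vs / "out", inner, dim, Default::default()),
        }
    }
}

impl Module for FeedForward {
    fn forward(&self, xs: &Tensor) -> Tensor {
        xs.apply(&self.geglu).apply(&self.out)
    }
}

fn main() {
    let vs = nn::VarStore::new(Device::Cpu);
    let ff = FeedForward::new(&vs.root(), 64, 4);
    let x = Tensor::randn(&[2, 16, 64], (Kind::Float, Device::Cpu));
    // Shape is preserved: [batch, seq, dim] -> [batch, seq, dim].
    assert_eq!(ff.forward(&x).size(), vec![2, 16, 64]);
}
```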

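The attention components (CrossAttention, AttentionBlock) share a scaled dot-product core. Below is a hedged sketch of just that kernel; `scaled_dot_attention` is a hypothetical helper name, and the query/key/value projections, multi-head reshaping, and any masking that the PR's layers presumably add are omitted.

```rust
use tch::{Kind, Tensor};

/// Scaled dot-product attention: softmax(Q K^T / sqrt(d)) V.
/// `q`, `k`, and `v` are assumed to have shape [batch, seq, dim]
/// (or [batch * heads, seq, head_dim] after head splitting).
fn scaled_dot_attention(q: &Tensor, k: &Tensor, v: &Tensor) -> Tensor {
    let d = *q.size().last().unwrap() as f64;
    // Attention scores, scaled by 1/sqrt(d) to keep logits well-conditioned.
    let scores = q.matmul(&k.transpose(-2, -1)) / d.sqrt();
    // Normalize into attention weights along the key dimension.
    let weights = scores.softmax(-1, Kind::Float);
    weights.matmul(v)
}
```

For self-attention (as in AttentionBlock) the three inputs are projections of the same tensor; for cross-attention the keys and values come from the conditioning context instead.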