This project is designed for learning the principles and implementation of the Transformer architecture. It includes various demos that explore different components and use cases of Transformer models.
The project is divided into several parts, each focusing on a specific aspect of the Transformer model:
- A simple sequence-to-sequence (seq2seq) model implemented using the Transformer architecture.
- Demonstrates the core functionality of the Transformer in solving sequence-based tasks (see the seq2seq training-step sketch after this list).
- A step-by-step breakdown of the key components of a Transformer model.
- Includes demos of attention mechanisms, positional encoding, and other foundational parts of the architecture (see the attention and positional-encoding sketch after this list).
- An encoder-decoder Transformer model, which is typically used for tasks like machine translation and text summarization.
- Shows how the encoder and decoder work together to process the input and generate output sequences (see the encoder-decoder sketch after this list).
- A decoder-only Transformer model, often used for autoregressive tasks like language modeling or text generation.
- Focuses on how the model generates output step by step, conditioned on the previously generated tokens (see the autoregressive generation sketch after this list).
- Demos showcasing how to use various distributed frameworks (e.g., PyTorch Distributed, TensorFlow, DeepSpeed) for training and inference with Transformer models.
- Focuses on scaling up training and optimizing inference performance (see the DistributedDataParallel sketch after this list).
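
The following is a minimal sketch of a Transformer-based seq2seq training step using `torch.nn.Transformer`; the vocabulary size, model dimensions, and random tensors are illustrative placeholders, not the project's actual code.

```python
# Minimal seq2seq training step with torch.nn.Transformer (illustrative sizes).
import torch
import torch.nn as nn

vocab_size, d_model = 1000, 128
embed = nn.Embedding(vocab_size, d_model)
model = nn.Transformer(d_model=d_model, nhead=8, batch_first=True)
out_proj = nn.Linear(d_model, vocab_size)

src = torch.randint(0, vocab_size, (2, 10))    # (batch, src_len)
tgt = torch.randint(0, vocab_size, (2, 12))    # (batch, tgt_len)
tgt_in, tgt_out = tgt[:, :-1], tgt[:, 1:]      # teacher forcing: shift target by one

tgt_mask = nn.Transformer.generate_square_subsequent_mask(tgt_in.size(1))
hidden = model(embed(src), embed(tgt_in), tgt_mask=tgt_mask)
loss = nn.CrossEntropyLoss()(out_proj(hidden).reshape(-1, vocab_size), tgt_out.reshape(-1))
loss.backward()
```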
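
Below is a minimal sketch of the two foundational components called out above, scaled dot-product attention and sinusoidal positional encoding; the function names are illustrative and may differ from the project's demos.

```python
# Scaled dot-product attention and sinusoidal positional encoding (illustrative).
import math
import torch

def scaled_dot_product_attention(q, k, v, mask=None):
    # q, k, v: (batch, heads, seq_len, d_k); mask uses 0 for blocked positions
    d_k = q.size(-1)
    scores = q @ k.transpose(-2, -1) / math.sqrt(d_k)
    if mask is not None:
        scores = scores.masked_fill(mask == 0, float("-inf"))
    weights = torch.softmax(scores, dim=-1)
    return weights @ v, weights

def sinusoidal_positional_encoding(seq_len, d_model):
    # PE[pos, 2i] = sin(pos / 10000^(2i/d_model)), PE[pos, 2i+1] = cos(...)
    pos = torch.arange(seq_len).unsqueeze(1).float()
    div = torch.exp(torch.arange(0, d_model, 2).float() * (-math.log(10000.0) / d_model))
    pe = torch.zeros(seq_len, d_model)
    pe[:, 0::2] = torch.sin(pos * div)
    pe[:, 1::2] = torch.cos(pos * div)
    return pe
```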
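
A rough sketch of how the encoder and decoder interact: the encoder turns the source sequence into a "memory", which the decoder cross-attends to while running masked self-attention over the target. Layer counts and sizes here are assumptions for illustration.

```python
# Encoder produces memory; decoder combines masked self-attention and cross-attention.
import torch
import torch.nn as nn

d_model = 128
encoder = nn.TransformerEncoder(
    nn.TransformerEncoderLayer(d_model, nhead=8, batch_first=True), num_layers=2)
decoder = nn.TransformerDecoder(
    nn.TransformerDecoderLayer(d_model, nhead=8, batch_first=True), num_layers=2)

src = torch.randn(2, 10, d_model)              # embedded source sequence
tgt = torch.randn(2, 12, d_model)              # embedded (shifted) target sequence

memory = encoder(src)                          # self-attention over the source
tgt_mask = nn.Transformer.generate_square_subsequent_mask(tgt.size(1))
out = decoder(tgt, memory, tgt_mask=tgt_mask)  # masked self-attn + cross-attn to memory
print(out.shape)                               # torch.Size([2, 12, 128])
```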
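
A minimal sketch of decoder-only, autoregressive generation: a stack of self-attention blocks with a causal mask predicts the next token from the tokens generated so far. The tiny model and greedy decoding loop are illustrative only.

```python
# GPT-style decoder-only generation: causal mask plus greedy next-token loop (illustrative).
import torch
import torch.nn as nn

vocab_size, d_model = 1000, 128
embed = nn.Embedding(vocab_size, d_model)
block = nn.TransformerEncoderLayer(d_model, nhead=8, batch_first=True)  # self-attn block, no cross-attn
decoder = nn.TransformerEncoder(block, num_layers=2)
head = nn.Linear(d_model, vocab_size)

tokens = torch.tensor([[1]])                   # start token
for _ in range(10):
    mask = nn.Transformer.generate_square_subsequent_mask(tokens.size(1))
    hidden = decoder(embed(tokens), mask=mask)                         # causal mask hides future tokens
    next_token = head(hidden[:, -1]).argmax(dim=-1, keepdim=True)      # greedy pick of the next token
    tokens = torch.cat([tokens, next_token], dim=1)                    # append and continue
print(tokens)
```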
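
As one example of distributed training, below is a minimal PyTorch DistributedDataParallel sketch. The script name, backend, and dummy loss are assumptions; it would typically be launched with `torchrun --nproc_per_node=N train.py`.

```python
# Minimal DistributedDataParallel wrap of a Transformer model (illustrative; launch via torchrun).
import torch
import torch.distributed as dist
import torch.nn as nn
from torch.nn.parallel import DistributedDataParallel as DDP

def main():
    dist.init_process_group(backend="gloo")    # use "nccl" on GPUs
    model = nn.Transformer(d_model=128, nhead=8, batch_first=True)
    model = DDP(model)                         # gradients are all-reduced across ranks
    optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

    src = torch.randn(2, 10, 128)
    tgt = torch.randn(2, 12, 128)
    loss = model(src, tgt).pow(2).mean()       # dummy loss just to show one step
    loss.backward()
    optimizer.step()
    print(f"rank {dist.get_rank()} finished a step")
    dist.destroy_process_group()

if __name__ == "__main__":
    main()
```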
- Python 3
- PyTorch
- Other dependencies listed in `requirements.txt`
To install the necessary dependencies, run the following command:
```bash
pip install -r requirements.txt
```