File:Full GPT architecture.png

Original file (863 × 1,038 pixels, file size: 129 KB, MIME type: image/png)

Summary

Description
English: The full architecture of a generative pre-trained transformer (GPT) model.
Date
Source Own work
Author Marxav
Other versions

Licensing

I, the copyright holder of this work, hereby publish it under the following license:
This file is made available under the Creative Commons CC0 1.0 Universal Public Domain Dedication.
The person who associated a work with this deed has dedicated the work to the public domain by waiving all of their rights to the work worldwide under copyright law, including all related and neighboring rights, to the extent allowed by law. You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission.

Captions

The full architecture of a GPT model.

Items portrayed in this file

Inception 27 December 2022
Media type image/png

File history


Date/Time | Dimensions | User | Comment
09:00, 2 January 2023 (current) | 863 × 1,038 (129 KB) | Marxav | added a dropout module
16:10, 1 January 2023 | 863 × 987 (129 KB) | Marxav | alignement of "head ..."
15:23, 27 December 2022 | 863 × 987 (128 KB) | Marxav | update color
13:31, 27 December 2022 | 928 × 987 (132 KB) | Marxav | Uploaded own work with UploadWizard