Encoder-Decoder Structure in RNNs

7086 (My keyboard is a piano keyboard, my code is lines of poetry)

2025-02-02 20:16:55, edited, Shaanxi

The first few stages of the network are used to absorb the input sequence, and the associated output vectors are simply ignored. This part of the network can be viewed as an 'encoder' in which the entire input sentence has been compressed into the state z* of the hidden variable. The remaining network stages function as the 'decoder', which generates the translated sentence as output one word at a time. Notice that each output word is fed as input to the next stage of the network, and so this approach has an autoregressive structure analogous to (12.31). (Quoted from page 381.)
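
The two phases described in the quotation can be sketched in a few lines of plain Python. This is only an illustrative sketch, not the book's implementation: the tanh recurrence, the one-hot inputs, the greedy choice of each output word, the start and end token ids, and every name here (`init_params`, `rnn_step`, `encode`, `decode`) are assumptions made for the example, and the weights are random rather than trained, so the "translation" it produces is meaningless.

```python
import math
import random


def make_matrix(rows, cols, rng):
    """Small random weight matrix (untrained, for illustration only)."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Matrix-vector product on plain lists."""
    return [sum(m * x for m, x in zip(row, vector)) for row in matrix]


def one_hot(index, size):
    """One-hot encoding of a token id (an assumed input representation)."""
    return [1.0 if i == index else 0.0 for i in range(size)]


def init_params(vocab, hidden, seed=0):
    """Hypothetical parameter set: separate encoder and decoder weights."""
    rng = random.Random(seed)
    return {
        "vocab": vocab,
        "hidden": hidden,
        "enc_in": make_matrix(hidden, vocab, rng),
        "enc_rec": make_matrix(hidden, hidden, rng),
        "dec_in": make_matrix(hidden, vocab, rng),
        "dec_rec": make_matrix(hidden, hidden, rng),
        "dec_out": make_matrix(vocab, hidden, rng),
    }


def rnn_step(w_in, w_rec, x, z):
    """One recurrent stage: z_new = tanh(W_in x + W_rec z)."""
    a = matvec(w_in, x)
    b = matvec(w_rec, z)
    return [math.tanh(ai + bi) for ai, bi in zip(a, b)]


def encode(params, source_ids):
    """Absorb the input sequence; per-step outputs are never computed or used."""
    z = [0.0] * params["hidden"]
    for token in source_ids:
        x = one_hot(token, params["vocab"])
        z = rnn_step(params["enc_in"], params["enc_rec"], x, z)
    return z  # the compressed state z* of the whole input sentence


def decode(params, z_star, start_id, end_id, max_len):
    """Generate one word at a time, feeding each output back in as the next input."""
    z = z_star
    previous = start_id
    output = []
    for _ in range(max_len):
        x = one_hot(previous, params["vocab"])
        z = rnn_step(params["dec_in"], params["dec_rec"], x, z)
        scores = matvec(params["dec_out"], z)
        # Greedy choice of the highest-scoring word (an assumption of this sketch).
        previous = max(range(len(scores)), key=scores.__getitem__)
        if previous == end_id:
            break
        output.append(previous)
    return output


params = init_params(vocab=6, hidden=4)
z_star = encode(params, [2, 3, 4])
translation = decode(params, z_star, start_id=0, end_id=1, max_len=5)
print(len(z_star), translation)
```

The only thing passed from `encode` to `decode` is `z_star`, which is the sense in which the input sentence has been compressed into a single hidden state, and the reuse of `previous` as the next input is the autoregressive feedback the quotation points out.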


