230530-论文整理-课题组2-Toy模板网

这篇具有很好参考价值的文章主要介绍了230530-论文整理-课题组2。希望对大家有所帮助。如果存在错误或未考虑完全的地方，请大家不吝赐教，您也可以点击"举报违法"按钮提交疑问。

对这些研究有点兴趣颇微。

Rethinking Dense Retrieval’s Few-Shot Ability

我们定制了一个标准的FewDR数据集和评估协议，用于少量密集的检索。该数据集是在维基百科语料库上构建的，包含41,420个样本，有60个细粒度的类别。
具体内容上，和其他的dense retrieval方法，没有感觉到有太大的不同。
230530-论文整理-课题组2

Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder

传统上，大部分seq2seq任务是由编码器-解码器框架解决的，它需要一个编码器来编码源序列，一个解码器来生成目标文本。

This paper aims to address this gap by conducting a detailed comparison between the encoder-decoder architecture and the decoder-only language model framework through the analysis of a regularized encoder-decoder structure.

问题矛盾点：
1.encoder-decoder模型结构相比于decoder-ONLY结构，哪个更有优势？
2.我们揭示了语言模型中的注意力退化问题，即随着生成步骤数的增加，越来越少的注意力被集中在源序列上。

230530-论文整理-课题组2
traditional ED structure named as Regularized Encoder-Decoder (RED) framework

230530-论文整理-课题组2

1.为了避免注意力退化的问题，提出了单向交叉注意，单向的交叉注意同时关注源矩阵和目标矩阵；
2.连续位置编码，在target序列中的位置编码和source序列中的位置编码是连续，而不是在target中从头开始排序。

PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction

语音和视觉相似性知识对这项任务很重要。 PLOME 利用 GRU 网络根据字符的语音和笔画对此类知识进行建模。

230530-论文整理-课题组2
所提出的模型将每个字符的笔画和拼音作为输入，这使得 PLOME 能够对任意字符之间的相似性进行建模。
PLOME 通过联合恢复掩码标记的真实字符和语音来学习字符和语音级别的拼写错误知识。
模型结构图
230530-论文整理-课题组2

we randomly mask some percentage of the input tokens and then recover them
mask 15% of tokens in the corpus. In addition, we use dynamic masking strategy
the final embedding of each character is the sum of character embedding, position embedding, phonic embedding and shape embedding

The probability of the character predicted for the i-th token in a given
sentence is defined as

230530-论文整理-课题组2

The probability of pronunciation prediction
is defined as:

230530-论文整理-课题组2
损失函数：

Read, Listen, and See: Leveraging Multimodal Information Helps Chinese Spell Checking

汉字中常见的错误类型如上文所述，一个是拼音，一个是字形。
230530-论文整理-课题组2
模型结构图

The Semantic Encoder

The input tokens X = (x1, . . . , xN ) are first
projected into Ht0
through the input embedding.
Then the computation of Transformer (Vaswani
et al., 2017) encoder layers can be formulated as:

230530-论文整理-课题组2

The Phonetic Encoder（拼音encoder）

 The 5 kinds of tones (take
the final “a” as an example, { a,¯ a,´ a,ˇ a, a ` }) can be
mapped into numbers {1, 2, 3, 4, 0}

The Character-level Encoder

a single-layer
uni-directional GRU (Cho et al., 2014), which encodes the pinyin of the i-th character xi as:

230530-论文整理-课题组2
The Graphic Encoder

**fused module **
采用的gate机制实现的embedding的融合。

230530-论文整理-课题组2 文章来源地址https://www.toymoban.com/news/detail-465090.html

到了这里，关于230530-论文整理-课题组2的文章就介绍完了。如果您还想了解更多内容，请在右上角搜索TOY模板网以前的文章或继续浏览下面的相关文章，希望大家以后多多支持TOY模板网！

230530-论文整理-课题组2

Rethinking Dense Retrieval’s Few-Shot Ability

Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder

PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction

Read, Listen, and See: Leveraging Multimodal Information Helps Chinese Spell Checking

觉得文章有用就打赏一下文章作者

支付宝扫一扫打赏

微信扫一扫打赏

支付宝扫一扫领取红包，优惠每天领

二维码1

二维码2