A Simple but Tough-to-Beat Baseline for Sentence Embeddings阅读笔记

文章目录

概述
算法
实验

1. Textual Similarity Tasks
2. Supervised Tasks

概述

一篇17年的论文, 采用无监督的方法.
主要思想可以概括为两步:

利用词嵌入方法，通过词向量的线性的加权组合对一个句子进行编码
利用奇异向量求出最终的句向量。

算法

A Simple but Tough-to-Beat Baseline for Sentence Embeddings阅读笔记

实验

1. Textual Similarity Tasks

数据集

all the datasets from SemEval semantic textual similarity (STS) tasks (2012-2015)
the SemEval 2015 Twitter task
the SemEval 2014 Semantic Relatedness task

实验设置
词向量分别采用了无监督的GloVe和弱监督的PSL.
$\alpha$ 固定为 $10^{-3}$ , 词频利用commoncrawl dataset进行统计.

实验结果
A Simple but Tough-to-Beat Baseline for Sentence Embeddings阅读笔记

2. Supervised Tasks

the SICK similarity task
the SICK entailment task
the Stanford Sentiment Treebank (SST) binary classification task

实验结果
A Simple but Tough-to-Beat Baseline for Sentence Embeddings阅读笔记

相关文章：

2021-12-11
2021-07-28
2021-07-17
2021-05-28
2021-04-24
2021-09-15
2021-04-30
2021-06-10

猜你喜欢

2021-08-24
2021-10-19
2021-04-21
2021-08-06
2021-11-22
2021-04-28
2021-06-01

相关资源

下载 2021-06-07
下载 2023-03-18
下载 2022-12-30

相似解决方案

热门标签

Java Python linux javascript Mysql C# Docker 算法前端 SpringBoot Redis Vue spring 设计模式 .net core .net kubernetes c++ 数据库数据结构大数据 js 机器学习微服务 Android Go 程序员面试 JVM ASP.net core 云原生人工智能后端 PHP git CSS golang k8s Nginx Django mybatis 深度学习多线程 React 架构 devops 爬虫云计算 Spring Boot LeetCode