Indexing plain text files in Solr

【Question title】: Indexing plain text files in Solr
【Posted】: 2019-01-01 00:37:51
【Question】:

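It is hard to find well-structured manuals and information on how to index plain text (.txt) files in Solr.

I understand how to work with Solr's standard data formats, such as .xml or .json, but so far I have not found a single structured, complete guide to indexing plain text (especially when the files contain no id, only words and whitespace).

I would appreciate any resources or code examples that could help me solve this.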

【Comments】:

Tags: indexing solr plaintext

【Answer 1】:

You should still be able to use the extract endpoint (which uses Apache Tika behind the scenes). Field values can be supplied through the query string, as seen in the example for the techproducts data set:

/solr/techproducts/update/extract?literal.id=doc1&commit=true

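The literal.id=doc1 parameter supplies the actual value for a field that cannot be extracted from the submitted document. This is how a plain text file that contains no id of its own still gets a unique key.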
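As a minimal sketch of how such a URL could be assembled for files that carry no id, the helper below builds the query string from literal.* parameters. The function name build_extract_url, the host http://localhost:8983, and the fallback of using a SHA-1 hash of the content as the id are all assumptions made for illustration; they are not part of the original answer.

```python
import hashlib
from urllib.parse import urlencode


def build_extract_url(base_url, core, text, doc_id=None, **literals):
    """Build an /update/extract URL whose field values travel as literal.* parameters.

    Hypothetical helper: if no doc_id is given, a SHA-1 hash of the text is
    used as the id (an assumption, any unique string would do).
    """
    if doc_id is None:
        doc_id = hashlib.sha1(text.encode("utf-8")).hexdigest()

    # literal.id and commit mirror the parameters shown in the answer's URL.
    params = {"literal.id": doc_id, "commit": "true"}

    # Any extra keyword argument becomes literal.<field>=<value>.
    for field, value in literals.items():
        params[f"literal.{field}"] = value

    return f"{base_url}/solr/{core}/update/extract?{urlencode(params)}"


if __name__ == "__main__":
    print(build_extract_url("http://localhost:8983", "techproducts", "some words", doc_id="doc1"))
```

Calling it with doc_id="doc1" reproduces the path and query string from the answer. Leaving doc_id out gives every distinct file content its own id, at the cost that two files with identical content would overwrite each other.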

Make sure to set the Content-Type header to text/plain when you're submitting (unless you are submitting the file as a regular HTML form upload).
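The request described above might look roughly like this with Python's standard library. This is a sketch under assumptions: the function names, the host http://localhost:8983, and the sample text are hypothetical, and send_request only works against a running Solr instance with the extract handler enabled.

```python
import urllib.request


def build_plain_text_request(extract_url, text):
    """Build a POST request that submits raw text with Content-Type: text/plain."""
    return urllib.request.Request(
        extract_url,
        data=text.encode("utf-8"),
        headers={"Content-Type": "text/plain"},
        method="POST",
    )


def send_request(request):
    """Send the request and return the HTTP status code (requires a running Solr)."""
    with urllib.request.urlopen(request) as response:
        return response.status


if __name__ == "__main__":
    # Assumed local Solr URL; literal.id and commit are taken from the answer.
    url = "http://localhost:8983/solr/techproducts/update/extract?literal.id=doc1&commit=true"
    request = build_plain_text_request(url, "only words and spaces")
    print(request.get_method(), request.full_url)
    # send_request(request)  # uncomment when a Solr instance is available
```

Sending the file body directly, rather than as a multipart form, is what makes the explicit Content-Type header necessary.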

【Comments】:
