【Posted】: 2017-09-18 18:20:19
【Question】:
I have a text file containing data like the following:
History
The term "data science" (originally used interchangeably with "datalogy") has existed for over thirty years and was used initially as a substitute for computer science by Peter Naur in 1960. In 1974, Naur published Concise Survey of Computer Methods, which freely used the term data science in its survey of the contemporary data processing methods that are used in a wide range of application
Application
In the 2010–2011 time frame, data science software reached an inflection point where open source software started supplanting proprietary software.[30] The use of open source software enables modifying and extending the software, and it allows sharing of the resulting algorithms
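Now I want to extract the paragraphs or sections that contain a specific set of words, for example {"Software", "opensource"}.
I tried regexp and an if loop, but could not get the desired output. Can anyone help me?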
【Discussion】:
Tags: python grep information-extraction
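One possible approach, offered as a minimal sketch rather than a definitive answer: treat every non-empty line of the file as a paragraph and keep the ones that mention the keywords, compared case-insensitively. The function name `find_paragraphs`, the `require_all` flag, and the shortened `sample` text are all hypothetical and chosen only for illustration. The sketch also assumes the keyword is spelled the way it appears in the file ("open source" with a space, not "opensource").

```python
def find_paragraphs(text, keywords, require_all=True):
    """Return the paragraphs (non-empty lines) of `text` that mention the keywords.

    Matching is a case-insensitive substring test. With require_all=True a
    paragraph must contain every keyword; otherwise any one keyword is enough.
    """
    wanted = [k.strip().lower() for k in keywords]
    combine = all if require_all else any
    hits = []
    for paragraph in text.splitlines():
        lowered = paragraph.lower()
        if paragraph.strip() and combine(k in lowered for k in wanted):
            hits.append(paragraph.strip())
    return hits


# Shortened stand-in for the file contents (an assumption for this sketch).
# For a real file, read it first, e.g.:
#     with open("data.txt", encoding="utf-8") as f:   # hypothetical file name
#         text = f.read()
sample = """History
The term "data science" has existed for over thirty years.
Application
In the 2010-2011 time frame, open source software started supplanting proprietary software."""

matches = find_paragraphs(sample, ["software", "open source"])
for paragraph in matches:
    print(paragraph)
```

Because this is plain substring matching, "software" would also match inside a longer word. If whole-word matches are needed, the `in` test could be swapped for `re.search` with `\b` word boundaries and `re.IGNORECASE`.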