【发布时间】:2014-08-24 12:47:33
【问题描述】:
I have several .dat, containing information about hotel reviews as below
/*
<Author> simmotours
<Content> review......goes here
<Date>Nov 18, 2008
<No. Reader>-1
<No. Helpful>-1
<Overall>4`enter code here`
<Value>4
<Rooms>3
<Location>4
<Cleanliness>4
<Check in / front desk>4
<Service>4
<Business service>-1
*/ 我想将评论分为两个 pos 和 neg ,即有两个文件夹 pos 和 neg 包含几个文件,其中评论高于 3 归类为正面,低于 3 归类为负面。
How can I quickly and efficiently automate this process?
【问题讨论】:
-
您拥有的文件是否与您所指示的一样,还是一个正确的 XML 文件?
-
按照我的指示!
标签: python-3.x notepad++ classification text-processing sentiment-analysis