【发布时间】:2018-05-09 03:08:33
【问题描述】:
我正在逐字阅读文件,我想首先通过将所有字符转换为小写并删除除标点符号之外的所有非字母字符来预处理单词:连字符 (-) 和撇号 ('),然后相应地显示。我已完成将单词转换为lowercase,但现在我需要删除除连字符(-)和撇号(')之外的非字母字符。我不知道该怎么做。有人能解决这个问题吗?
void WordStats::ReadTxtFile()
{
std::ifstream ifile(Filename);
if(!ifile)
{
std::cerr << "Error Opening file " << Filename << std::endl;
exit(1);
}
for (std::string word; ifile >> word; )
{
transform (word.begin(), word.end(), word.begin(), ::tolower);
WordMap & Words = (Dictionary.count(word) ? KnownWords :
UnknownWords);
Words[word].push_back(ifile.tellg());
}
std::cout << KnownWords.size() << " known words read." << std::endl;
std::cout << UnknownWords.size() << " unknown words read." << std::endl;
}
【问题讨论】:
标签: c++ stl formatting string-formatting read-write