【问题标题】:Wordcloud + corpus error in RR中的Wordcloud +语料库错误
【发布时间】:2014-11-25 15:27:14
【问题描述】:

我想使用 Wordcloud 功能对 Twitter 数据进行云计算。我已经安装了 twitter 包并使用了 api。之后我会执行以下操作。

bigdata <- searchTwitter("#bigdata", n=20)

bigdata_list <- sapply(bigdata, function(x) x$getText())
bigdata_corpus <- Corpus(VectorSource(bigdata_list))
bigdata_corpus <- tm_map(bigdata_corpus, content_transformer(tolower), lazy=TRUE)
bigdata_corpus <- tm_map(bigdata_corpus, removePunctuation, lazy=TRUE)
bigdata_corpus <- tm_map(bigdata_corpus, 
                           function(x)removeWords(x,stopwords()), lazy=TRUE)
wordcloud(bigdata_corpus)

这会产生 Wordcloud 命令的错误消息:

Error in UseMethod("meta", x) : 
  no applicable method for 'meta' applied to an object of class "try-error"
In addition: Warning messages:
1: In mclapply(x$content[i], function(d) tm_reduce(d, x$lazy$maps)) :
  all scheduled cores encountered errors in user code
2: In mclapply(unname(content(x)), termFreq, control) :
  all scheduled cores encountered errors in user code

我尝试了不同的语料库命令,但似乎无法正确执行。 有什么想法吗?

【问题讨论】:

    标签: r twitter corpus


    【解决方案1】:

    你可以试试这个:

    library("tm")
    # Transform your corpus in a term document matrix
    bigdata_tdm <- as.matrix(TermDocumentMatrix(bigdata_corpus))
    # Get the frequency by words
    bigdata_freq <- data.frame(Words = rownames(bigdata_tdm), Freq = rowSums(bigdata_tdm), stringsAsFactors = FALSE)
    # sort
    bigdata_freq <- bigdata_freq[order(bigdata_freq$Freq, decreasing = TRUE), ]
    # keep the 50 most frequent words
    bigdata_freq <- bigdata_freq[1:50, ]
    
    # Draw the wordcloud
    library("wordcloud")
    wordcloud(words = bigdata_freq$Words, freq = bigdata_freq$Freq)
    

    tm_0.6wordcloud_2.5 两种方式都有效。

    【讨论】:

      猜你喜欢
      • 2016-10-15
      • 1970-01-01
      • 1970-01-01
      • 2018-03-31
      • 2014-08-03
      • 1970-01-01
      • 1970-01-01
      • 2017-04-17
      • 2020-04-14
      相关资源
      最近更新 更多