【发布时间】:2021-04-29 00:00:14
【问题描述】:
Word2Vec中的vectorSize和HashingTF中的numFeatures有什么区别?我指的是pyspark中的类Word2Vec和HashingTF:
WORD2VEC:类 pyspark.ml.feature.Word2Vec(*, vectorSize=100, minCount=5, numPartitions=1, stepSize=0.025, maxIter=1,种子=None, inputCol=None, outputCol=None, windowSize=5, maxSentenceLength=1000)
HashingTF:类 pyspark.ml.feature.HashingTF(*, numFeatures=262144, binary=False, inputCol=None, outputCol=None)
【问题讨论】:
标签: python gensim word2vec tf-idf