【发布时间】:2020-08-24 09:09:42
【问题描述】:
我想加载从here借来的文本文件,其中每一行代表一个json字符串,如下所示:
{"overall": 2.0, "verified": true, "reviewTime": "02 4, 2014", "reviewerID": "A1M117A53LEI8", "asin": "7508492919", "reviewerName": "Sharon Williams", "reviewText": "DON'T CARE FOR IT. GAVE IT AS A GIFT AND THEY WERE OKAY WITH IT. JUST NOT WHAT I EXPECTED.", "summary": "CASE", "unixReviewTime": 1391472000}
我想使用 tensorflow 从数据集中仅提取 reviewText 和 overall 特征,但遇到以下错误。
AttributeError: in user code:
<ipython-input-4-419019a35c5e>:9 None *
line_dataset = line_dataset.map(lambda row: transform(row))
<ipython-input-4-419019a35c5e>:2 transform *
str_example = example.numpy().decode("utf-8")
AttributeError: 'Tensor' object has no attribute 'numpy'
我的代码 sn-p 如下所示:
def transform(example):
str_example = example.numpy().decode("utf-8")
json_example = json.loads(str_example)
overall = json_example.get('overall', None)
text = json_example.get('reviewText', None)
return (overall, text)
line_dataset = tf.data.TextLineDataset(filenames = [file_path])
line_dataset = line_dataset.map(lambda row: transform(row))
for example in line_dataset.take(5):
print(example)
我使用的是 tensorflow 2.3.0。
【问题讨论】:
标签: python tensorflow tensorflow2.0