【发布时间】:2019-09-04 19:06:12
【问题描述】:
我每条推文都有 twitter 帐户时间线数据以 .json 格式保存,我无法将数据保存到 mongodb 中
示例:获取一条推文的数据。
{
"created_at": "Fri Apr 12 05:13:35 +0000 2019",
"id": 1116570031511359489,
"id_str": "1116570031511359489",
"full_text": "@jurafsky How can i get your video lectures related to Sentiment Analysis",
"truncated": false,
"display_text_range": [0, 73],
"entities": {
"hashtags": [],
"symbols": [],
"user_mentions": [
{
"screen_name": "jurafsky",
"name": "Dan Jurafsky",
"id": 14968475,
"id_str": "14968475",
"indices": [0, 9]
}
],
"urls": []
}
它还包含 url 和其他丢失的信息
我试过下面的代码。
from pymongo import MongoClient
import json
client=MongoClient('localhost',27107)
db=client.test
coll=db.dataset
with open('tweets.json') as f:
file_data=json.loads(f.read())
coll.insert(file_data)
client.close()
【问题讨论】:
标签: json python-3.x mongodb twitter