【Posted】: 2020-05-24 11:11:45
【Question】:
I have a query like this:
test = spark.sql("select mg.moviegenreid, sum(quantity) as total \
from moviegenre mg \
join movie m on m.moviegenreid = mg.moviegenreid \
join detailtransaction dt on dt.movieid = m.movieid \
join headertransaction ht on ht.transactionid = dt.transactionid \
group by mg.moviegenreid \
having sum(quantity) \
order by total desc \
limit 5")
Then I convert the result to a pandas DataFrame:
data = test.toPandas()
I just want to make a bar chart using:
x = data[{"moviegenreid"}]
y = data[{"total"}]
val = pd.DataFrame(data=y,index=x)
val.plot.bar()
but I always get this error:
ValueError: Index data must be 1-dimensional
【Discussion】:
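A likely cause: the braces in data[{"moviegenreid"}] build a Python set, and in pandas versions that accept a set as an indexer it is treated like a list of column labels, so x and y come back as 2-D DataFrames rather than 1-D Series. Passing a DataFrame as index= would then trigger "Index data must be 1-dimensional". Below is a minimal sketch of one way around it, assuming the two columns are named as in the query. The sample rows are made up and only stand in for test.toPandas().

```python
import pandas as pd

# Hypothetical stand-in for test.toPandas(); these values are invented for illustration.
data = pd.DataFrame(
    {
        "moviegenreid": ["MG001", "MG002", "MG003"],
        "total": [30, 20, 10],
    }
)

# A plain string key returns a 1-D Series; a set or list key returns a 2-D DataFrame.
# set_index moves the genre ids into the index, then ["total"] picks the values.
val = data.set_index("moviegenreid")["total"]

# val.plot.bar() would then draw one bar per genre (requires matplotlib).
```

Calling data.plot.bar(x="moviegenreid", y="total") directly on the original DataFrame should give the same chart without building val at all.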
Tags: python pandas apache-spark-sql