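【Posted】: 2015-10-29 18:56:27
【Question】:
How do I load a Parquet file into a Vertica database using Spark?
Link (http://www.sparkexpert.com/2015/04/17/save-apache-spark-dataframe-to-database/)
I used the link above to load a DataFrame (read from a Parquet file) into MySQL, and it worked. But when I try to load the same DataFrame into a Vertica database, I get the error below. It seems to occur because Vertica does not support the data type (string) of the columns in the DataFrame (Parquet file). I don't want to type-cast the columns, because that would be a performance problem: we need to load about 280 million rows. Could you suggest the best way to load the data into Vertica?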
Exception in thread "main" java.sql.SQLSyntaxErrorException: [Vertica][VJDBC](5108) ERROR: Type "TEXT" does not exist
at com.vertica.util.ServerErrorData.buildException(Unknown Source)
at com.vertica.io.ProtocolStream.readExpectedMessage(Unknown Source)
at com.vertica.dataengine.VDataEngine.prepareImpl(Unknown Source)
at com.vertica.dataengine.VDataEngine.prepare(Unknown Source)
at com.vertica.dataengine.VDataEngine.prepare(Unknown Source)
at com.vertica.jdbc.common.SPreparedStatement.<init>(Unknown Source)
at com.vertica.jdbc.jdbc4.S4PreparedStatement.<init>(Unknown Source)
at com.vertica.jdbc.VerticaJdbc4PreparedStatementImpl.<init>(Unknown Source)
at com.vertica.jdbc.VJDBCObjectFactory.createPreparedStatement(Unknown Source)
at com.vertica.jdbc.common.SConnection.prepareStatement(Unknown Source)
at org.apache.spark.sql.DataFrameWriter.jdbc(DataFrameWriter.scala:275)
at org.apache.spark.sql.DataFrame.createJDBCTable(DataFrame.scala:1611)
at com.sparkread.SparkVertica.JdbctoVertica.main(JdbctoVertica.java:51)
Caused by: com.vertica.support.exceptions.SyntaxErrorException: [Vertica][VJDBC](5108) ERROR: Type "TEXT" does not exist
… 13 more
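The trace fails inside createJDBCTable, that is, while Spark is issuing its own CREATE TABLE statement, which by default declares string columns as TEXT. One possible way around this without casting any data is to create the Vertica table yourself with VARCHAR columns and let Spark only append rows. The following is a minimal sketch of generating such a statement; the class VerticaDdlSketch, its methods, the table name events, the column names, and the VARCHAR length of 256 are all hypothetical, and the type mapping is an assumption that should be checked against the real schema.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.StringJoiner;

/** Hypothetical helper (not part of Spark or Vertica): builds a CREATE TABLE statement. */
final class VerticaDdlSketch {

    /** Maps a Spark SQL type name (as returned by DataType.typeName()) to a Vertica type. */
    static String toVerticaType(String sparkType, int varcharLength) {
        switch (sparkType) {
            case "string":    return "VARCHAR(" + varcharLength + ")"; // used instead of TEXT
            case "integer":   return "INTEGER";
            case "long":      return "BIGINT";
            case "double":    return "DOUBLE PRECISION";
            case "boolean":   return "BOOLEAN";
            case "timestamp": return "TIMESTAMP";
            case "date":      return "DATE";
            default:
                // Fail loudly rather than guess a mapping for other types.
                throw new IllegalArgumentException("Unmapped Spark type: " + sparkType);
        }
    }

    /** Builds the DDL; column order follows the iteration order of the map. */
    static String createTable(String table, Map<String, String> columns, int varcharLength) {
        StringJoiner cols = new StringJoiner(", ");
        for (Map.Entry<String, String> e : columns.entrySet()) {
            cols.add(e.getKey() + " " + toVerticaType(e.getValue(), varcharLength));
        }
        return "CREATE TABLE " + table + " (" + cols + ")";
    }

    public static void main(String[] args) {
        // Assumed example schema; in practice it would come from the DataFrame's schema.
        Map<String, String> schema = new LinkedHashMap<>();
        schema.put("id", "long");
        schema.put("name", "string");
        System.out.println(createTable("events", schema, 256));
    }
}
```

After running the generated statement against Vertica once (for example through a plain JDBC Statement), the DataFrame could be written with df.write().mode(SaveMode.Append).jdbc(url, table, properties) so that Spark does not try to create the table itself. The VARCHAR length has to be large enough for the longest string value in each column.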
【Comments】:
Tags: apache-spark apache-spark-sql vertica