【发布时间】:2020-05-10 10:15:39
【问题描述】:
我是 SQLAlchemy ORM 的新手。我正在尝试构建一个 AWS S3 摄取程序,该程序将通过 ORM 将任何 CSV 文件从 S3 存储桶摄取到 Postgres。我正在尝试读取 CSV 文件的第一行并将结果存储到列表(columns_names)中。代码报错:
无法为映射表组装任何主键列。
只有在声明 PRIMARY KEY 列后才会在数据库中创建表。通过 ORM 创建表时必须使用主键吗?另外,如何从列表 columns_names 动态创建列?
这是我的代码:
import boto
import boto3
import botocore
import os
from datetime import datetime
import s3fs
import pandas as pd
import configparser
import re
from sqlalchemy import create_engine
from sqlalchemy import MetaData, Table, Column, Integer, String
from sqlalchemy.orm.session import sessionmaker
from sqlalchemy.orm import relationship
from sqlalchemy.ext.declarative import declarative_base
config = configparser.ConfigParser(allow_no_value=True)
config.read('IngestionConfig.config')
table_name = config.get('db-settings','table_name')
S3Bucket = config.get('AWS-settings','BucketName')
S3Key = config.get('AWS-settings','filename')
s3_client = boto3.client('s3')
response = s3_client.get_object(Bucket = S3Bucket, Key= S3Key)
file = response["Body"]
filedata = file.read()
contents = filedata.decode('utf-8')
first_line = contents.split('\n',1)[0]
col_names = re.sub(r"\s+", '_', first_line).replace('"', r'')
columns_names= []
columns_names = col_names.split(',')
postgresql_db = create_engine('postgresql://ayan.putatunda@localhost/postgres',echo = True)
Base = declarative_base()
class test(Base):
__tablename__ = table_name
for name in columns_names:
name = Column(String)
Base.metadata.create_all(postgresql_db)
【问题讨论】:
标签: python postgresql amazon-s3 sqlalchemy