SQL Datetime WHERE 子句返回错误的月份
SQL Datetime WHERE clause returning wrong month
我正在使用 pyathena 库和以下函数从 AWS Athena 中提取数据:
def import_ben_datalake(ACCESS_KEY, SECRET_KEY, S3_DIR, REGION, start, end):
conn = pyathena.connect(aws_access_key_id = ACCESS_KEY,
aws_secret_access_key = SECRET_KEY,
s3_staging_dir = S3_DIR,
region_name = REGION)
sql = f"""SELECT columns
FROM table
WHERE column_datetime BETWEEN PARSE_DATETIME('{start.strftime("%Y-%m-%d")}', 'YYYY-MM-DD')
AND PARSE_DATETIME('{end.strftime("%Y-%m-%d")}', 'YYYY-MM-DD')"""
df = pd.read_sql(sql, conn)
conn.close()
return df
开始和结束参数是 datetime.date
变量:
start_test = datetime.date(2020, 11, 22)
end_test = datetime.date(2020, 11, 28)
两者都是今年 11 月的日期,但是当我调用该函数时,它返回 2020-Jan-22 和 2020-Jan-28 之间的所有值。
任何帮助都会很好地解决这个问题!
按照解决我的问题的参数化查询示例进行操作:
def import_ben_datalake(ACCESS_KEY, SECRET_KEY, S3_DIR, REGION, start, end):
conn = pyathena.connect(aws_access_key_id = ACCESS_KEY,
aws_secret_access_key = SECRET_KEY,
s3_staging_dir = S3_DIR,
region_name = REGION)
sql = """SELECT columns
FROM table
WHERE column.datetime BETWEEN %(start)s AND %(end)s"""
df = pd.read_sql(sql, conn, params = {"start": start, "end": end})
conn.close()
return df
我正在使用 pyathena 库和以下函数从 AWS Athena 中提取数据:
def import_ben_datalake(ACCESS_KEY, SECRET_KEY, S3_DIR, REGION, start, end):
conn = pyathena.connect(aws_access_key_id = ACCESS_KEY,
aws_secret_access_key = SECRET_KEY,
s3_staging_dir = S3_DIR,
region_name = REGION)
sql = f"""SELECT columns
FROM table
WHERE column_datetime BETWEEN PARSE_DATETIME('{start.strftime("%Y-%m-%d")}', 'YYYY-MM-DD')
AND PARSE_DATETIME('{end.strftime("%Y-%m-%d")}', 'YYYY-MM-DD')"""
df = pd.read_sql(sql, conn)
conn.close()
return df
开始和结束参数是 datetime.date
变量:
start_test = datetime.date(2020, 11, 22)
end_test = datetime.date(2020, 11, 28)
两者都是今年 11 月的日期,但是当我调用该函数时,它返回 2020-Jan-22 和 2020-Jan-28 之间的所有值。
任何帮助都会很好地解决这个问题!
按照解决我的问题的参数化查询示例进行操作:
def import_ben_datalake(ACCESS_KEY, SECRET_KEY, S3_DIR, REGION, start, end):
conn = pyathena.connect(aws_access_key_id = ACCESS_KEY,
aws_secret_access_key = SECRET_KEY,
s3_staging_dir = S3_DIR,
region_name = REGION)
sql = """SELECT columns
FROM table
WHERE column.datetime BETWEEN %(start)s AND %(end)s"""
df = pd.read_sql(sql, conn, params = {"start": start, "end": end})
conn.close()
return df