Rows not being found in Lambda function reading CSV file

Question

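I have a Lambda function that reads a CSV file and adds each row to a DynamoDB table. I am using a print statement to log each row of the CSV to CloudWatch. The problem is that only 51 of the 129 rows are printed.

In addition, only a few of the rows that are actually found get added to the DynamoDB tables.

The Lambda function: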

# ChronojumpDataProcessor Lambda function
#
# This function is triggered by an object being created in an Amazon S3 bucket.
# The file is downloaded and each line is inserted into DynamoDB tables.

from __future__ import print_function
import json, urllib, boto3, csv

# Connect to S3 and DynamoDB
s3 = boto3.resource('s3')
dynamodb = boto3.resource('dynamodb')

# Connect to the DynamoDB tables
athleteTable = dynamodb.Table('Athlete');
countermovementTable = dynamodb.Table('CMJ');
depthTable = dynamodb.Table('DepthJump');

# This handler is executed every time the Lambda function is triggered
def lambda_handler(event, context):

    # Show the incoming event in the debug log
    #print("Event received by Lambda function: " + json.dumps(event, indent=2))

    # Get the bucket and object key from the Event
    bucket = event['Records'][0]['s3']['bucket']['name']
    key = urllib.parse.unquote_plus(event['Records'][0]['s3']['object']['key'], encoding='utf-8')
    localFilename = '/tmp/session.csv'

    # Download the file from S3 to the local filesystem
    try:
        s3.meta.client.download_file(bucket, key, localFilename)
    except Exception as e:
        print(e)
        print('Error getting object {} from bucket {}. Make sure they exist and your bucket is in the same region as this function.'.format(key, bucket))
        raise e

    # Read the Session CSV file. Delimiter is the ',' character
    with open(localFilename) as csvfile:
        reader = csv.DictReader(csvfile, delimiter=',')

        # Read each row in the file
        rowCount = 0
        for row in reader:
            rowCount += 1

            # Show the row in the debug log
            print(row['athlete_id'], row['athlete_name'], row['jump_id'], row['date_time'], row['jump_type'], row['jump_tc'], row['jump_height'], row['jump_RSI'])

            # Insert Athlete ID and Name into Athlete DynamoDB table
            athleteTable.put_item(
                Item={
                    'AthleteID':       row['athlete_id'],
                    'AthleteName':     row['athlete_name']})

            # Insert CMJ details into Countermovement Jump DynamoDB table
            if ((row['jump_type'] == "CMJ") | (row['jump_type'] == "Free")) :
                countermovementTable.put_item(
                    Item={
                        'AthleteID':           row['athlete_id'],
                        'AthleteName':         row['athlete_name'],
                        'DateTime':            row['date_time'],
                        'JumpType':            row['jump_type'],
                        'JumpID':               row['jump_id'],
                        'Height':              row['jump_height']})
            else :
                # Insert Depth Jump details into Depth Jump DynamoDB table
                depthTable.put_item(
                    Item={
                        'AthleteID':            row['athlete_id'],
                        'AthleteName':          row['athlete_name'],
                        'DateTime':             row['date_time'],
                        'JumpType':             row['jump_type'],
                        'JumpID':               row['jump_id'],
                        'ContactTime':          row['jump_tc'],
                        'Height':               row['jump_height'],
                        'RSI':                  row['jump_RSI']})

                # Finished!
                return "%d data inserted" % rowCount

I set a 2-minute timeout on the Lambda function because I thought it might not be getting enough time to read every row, but that did not fix the problem.

Answer 1

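Your return statement is indented under the else, which means the function exits as soon as the if evaluates to False, that is, on the first row whose jump_type is neither "CMJ" nor "Free".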

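It should be indented to match the indentation of the with line, so that it runs only after the loop has processed every row.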
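
To see the effect of the indentation in isolation, here is a minimal sketch that does not touch S3 or DynamoDB. The sample data and the function names count_rows_buggy and count_rows_fixed are hypothetical, made up for illustration; the put_item calls are replaced by placeholders, and an in-memory string stands in for /tmp/session.csv.

```python
import csv
import io

# Hypothetical sample data: only the column needed to show the control flow.
SAMPLE = """athlete_id,jump_type
1,CMJ
2,Free
3,DJ
4,CMJ
5,DJ
"""

def count_rows_buggy(text):
    """Mirrors the question: return is nested inside the else branch."""
    with io.StringIO(text) as csvfile:
        row_count = 0
        for row in csv.DictReader(csvfile, delimiter=','):
            row_count += 1
            if row['jump_type'] in ('CMJ', 'Free'):
                pass  # placeholder for countermovementTable.put_item(...)
            else:
                pass  # placeholder for depthTable.put_item(...)
                # Exits the whole function on the first non-CMJ/Free row
                return row_count

def count_rows_fixed(text):
    """Same loop, with return moved to the indentation of the with line."""
    with io.StringIO(text) as csvfile:
        row_count = 0
        for row in csv.DictReader(csvfile, delimiter=','):
            row_count += 1
            if row['jump_type'] in ('CMJ', 'Free'):
                pass  # placeholder for countermovementTable.put_item(...)
            else:
                pass  # placeholder for depthTable.put_item(...)
    # Runs once, after every row has been read
    return row_count

print(count_rows_buggy(SAMPLE))  # stops at the third data row
print(count_rows_fixed(SAMPLE))  # reads all five data rows
```

In the handler from the question, the equivalent change is to dedent the line return "%d data inserted" % rowCount until it lines up with with open(localFilename) as csvfile:.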

csv

amazon-web-services

amazon-dynamodb

boto3

aws-lambda