如何在 Python 代码中使用 Twitter API 中的 "referenced_tweets.type" 响应字段来尝试提取 Twitter 标签数据?

How do I use the "referenced_tweets.type" response field in the Twitter API in Python code where I'm trying to extract Twitter hashtag data?

我使用的代码运行良好。我已经添加了从 geeks for geeks 中获取的整个代码。但是我想修改它以添加referenced_tweets.type。我是 API 的新手,真的很想了解如何解决这个问题。

import pandas as pd
import tweepy


# function to display data of each tweet
def printtweetdata(n, ith_tweet):
    print()
    print(f"Tweet {n}:")
    print(f"Username:{ith_tweet[0]}")
    print(f"likes:{ith_tweet[1]}")
    print(f"Location:{ith_tweet[2]}")
    print(f"Following Count:{ith_tweet[3]}")
    print(f"Follower Count:{ith_tweet[4]}")
    print(f"Total Tweets:{ith_tweet[5]}")
    print(f"Retweet Count:{ith_tweet[6]}")
    print(f"Tweet Text:{ith_tweet[7]}")
    print(f"Hashtags Used:{ith_tweet[8]}")


# function to perform data extraction
def scrape(words, date_since, numtweet):

    # Creating DataFrame using pandas
    db = pd.DataFrame(columns=['username', 'likes', 'location', 'following',
                            'followers', 'totaltweets', 'retweetcount', 'text', 'hashtags'])

    # We are using .Cursor() to search through twitter for the required tweets.
    # The number of tweets can be restricted using .items(number of tweets)
    tweets = tweepy.Cursor(api.search, q=words, lang="en",
                        since=date_since, tweet_mode='extended').items(numtweet)

    # .Cursor() returns an iterable object. Each item in
    # the iterator has various attributes that you can access to
    # get information about each tweet
    list_tweets = [tweet for tweet in tweets]

    # Counter to maintain Tweet Count
    i = 1

    # we will iterate over each tweet in the list for extracting information about each tweet
    for tweet in list_tweets:
        username = tweet.user.screen_name
        likes = tweet.favorite_count
        location = tweet.user.location
        following = tweet.user.friends_count
        followers = tweet.user.followers_count
        totaltweets = tweet.user.statuses_count
        retweetcount = tweet.retweet_count
        hashtags = tweet.entities['hashtags']

        # Retweets can be distinguished by a retweeted_status attribute,
        # in case it is an invalid reference, except block will be executed
        try:
            text = tweet.retweeted_status.full_text
        except AttributeError:
            text = tweet.full_text
        hashtext = list()
        for j in range(0, len(hashtags)):
            hashtext.append(hashtags[j]['text'])

        # Here we are appending all the extracted information in the DataFrame
        ith_tweet = [username, likes, location, following,
                    followers, totaltweets, retweetcount, text, hashtext]
        db.loc[len(db)] = ith_tweet

        # Function call to print tweet data on screen
        printtweetdata(i, ith_tweet)
        i = i+1
    filename = 'etihad.csv'

    # we will save our database as a CSV file.
    db.to_csv(filename)


if __name__ == '__main__':

    # Enter your own credentials obtained
    # from your developer account
    consumer_key = 
    consumer_secret = 
    access_key = 
    access_secret = 
    auth = tweepy.OAuthHandler(consumer_key, consumer_secret)
    auth.set_access_token(access_key, access_secret)
    api = tweepy.API(auth)

    # Enter Hashtag and initial date
    print("Enter Twitter HashTag to search for")
    words = input()
    print("Enter Date since The Tweets are required in yyyy-mm--dd")
    date_since = input()

    # number of tweets you want to extract in one run
    numtweet = 100
    scrape(words, date_since, numtweet)
    print('Scraping has completed!')

我现在想添加 referenced_tweets.type 以了解推文是否为转推,但我不确定该怎么做。有人可以帮忙吗?

API.search uses the standard search API,部分 Twitter API v1.1.
referenced_tweets 是可以为 tweet.fields 设置的值,一个 Twitter API v2 fields 参数。

目前,如果您想通过 Tweepy 使用 Twitter API v2,则必须在主分支及其 Client class 上使用 Tweepy 的开发版本。否则,您需要等到 Tweepy v4.0 发布。

或者,如果您的唯一目标是确定 Status/Tweet object 是否为转推,您可以简单地检查 retweeted_status 属性。