I am having two dataframes named tweetsDF and HashtagsDF. tweet_status_id of both the dataframes are equal and I want to retrieve hashtags count for a single tweet. This is the query I am using which inturns throws
ERROR : SyntaxError: keyword can't be an expression
tweet_hashtags_count_DF = tweetsDF.join(HashtagsDF,sum('tweetsDF.*'),tweetsDF.tweet_status_id == HashtagsDF.tweet_status_id & tweetsDF.tweet_status_id='636984052600274944').show()
Where I am wrong in the query ?
Try tweetsDF.tweet_status_id == '636984052600274944'
(==
instead of =
).
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With