Python Pandas sort by Time and group by user ID

Tags: python, pandas, csv, pandas-groupby

I am loading a CSV file with pandas. It has three columns: a date and time, a user ID, and a campaign ID. Example rows:

date                 user_id              campaign_id
2018-01-10 0:21:09   151312395            GOOGLE
2018-01-10 0:21:19   151312395            GOOGLE
2018-01-10 0:21:32   151312395            GOOGLE

I want to group the data by user ID and then, within each user ID, order the rows by time alongside the campaign ID. The result should look as follows:

user_id              date                           ad_campaign
151312395            2018-01-10 0:21:09             GOOGLE
                     2018-01-10 0:21:19             GOOGLE
                     2018-01-10 0:21:32             GOOGLE

This is what I have so far:

import pandas as pd
import numpy as np
import datetime

def dateparse(time_in_secs):
    return datetime.datetime.fromtimestamp(float(time_in_secs))

columnnames = ['date', 'user_id', 'ad_campaign']
df = pd.read_csv(r'C:\Users\L\Desktop\Data.csv',
                 sep='\t', names=columnnames, usecols=[0, 1, 3],
                 parse_dates=True, date_parser=dateparse)
df.date = pd.to_datetime(df.date)
df = df.sort_values(by='date')
g = df.groupby('user_id')['ad_campaign']
print(g)

This gives the following output:

<pandas.core.groupby.SeriesGroupBy object at 0x04EF26F0>
[Finished in 0.6s]

Why doesn't print show the sorted columns?

asked Apr 26 '18 14:04

Laila Van Ments

1 Answer

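First, print(g) shows only the GroupBy object because groupby is lazy: nothing is computed until you aggregate or iterate over the groups. Second, if you include date in the groupby keys, you don't need to sort the column explicitly, because groupby sorts by the group keys by default (sort=True).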
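To see why print(g) shows only an object, and why the group keys need no explicit sorting, here is a minimal sketch on a small hand-made frame. The second user id (100000001) and the 'BING' value are made up for illustration and are not from the question's data.

```python
import pandas as pd

# Hand-made sample; the second user id and 'BING' are illustrative only.
df = pd.DataFrame({
    'date': pd.to_datetime(['2018-01-10 00:21:32', '2018-01-10 00:21:09',
                            '2018-01-10 00:21:19', '2018-01-10 00:20:00']),
    'user_id': [151312395, 151312395, 151312395, 100000001],
    'ad_campaign': ['GOOGLE', 'GOOGLE', 'GOOGLE', 'BING'],
})

g = df.groupby('user_id')['ad_campaign']
# g is a lazy SeriesGroupBy object; printing it shows only its repr.
print(type(g).__name__)

# Iterating (or aggregating) is what materialises the groups.
group_keys = [key for key, _ in g]
print(group_keys)   # keys come out sorted because groupby defaults to sort=True

counts = g.count()  # an aggregation returns a regular Series
print(counts)
```

Note that only the group keys are sorted; rows inside each group keep the order they had in df, which is why date has to be either sorted beforehand or included among the keys.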

You can do:

Method 1:

df.date = pd.to_datetime(df.date)
g = df.groupby(['user_id','date'])['ad_campaign']
print(g.first())

Method 2:

print(df.set_index(['user_id', 'date']).sort_index())
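As a rough check, both methods can be run on the three sample rows from the question. This sketch assumes the rows are tab-separated and feeds them through io.StringIO instead of the original file; the rows are deliberately shuffled so the sorting is visible.

```python
import io
import pandas as pd

# The three sample rows from the question, shuffled, as tab-separated text.
raw = (
    "2018-01-10 0:21:32\t151312395\tGOOGLE\n"
    "2018-01-10 0:21:09\t151312395\tGOOGLE\n"
    "2018-01-10 0:21:19\t151312395\tGOOGLE\n"
)
df = pd.read_csv(io.StringIO(raw), sep='\t',
                 names=['date', 'user_id', 'ad_campaign'])
df['date'] = pd.to_datetime(df['date'])

# Method 1: group on both keys; first() keeps one row per (user_id, date) pair.
method1 = df.groupby(['user_id', 'date'])['ad_campaign'].first()

# Method 2: move both columns into a MultiIndex and sort it; every row is kept.
method2 = df.set_index(['user_id', 'date']).sort_index()

print(method1)
print(method2)
```

The two differ when a user has several rows with the same timestamp: Method 1 collapses them to the first campaign, while Method 2 keeps them all.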

answered Nov 15 '22 06:11

YOLO
