Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others


python - averaging every five minutes data as one datapoint in pandas dataframe

I have a pandas DataFrame with timestamps like this:

2013-10-09 09:00:05
2013-10-09 09:01:00
2013-10-09 09:02:00
............
............
............
2013-10-10 09:15:05
2013-10-10 09:16:00
2013-10-10 09:17:00

I would like to reduce the size of the DataFrame by averaging the data over every 5 minutes, so that each 5-minute window becomes one data point, like this:

2013-10-09 09:05:00
2013-10-09 09:10:00
2013-10-09 09:15:00

Can someone help me with this?



1 Reply


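You may want to look at the pandas resample method:

df['Data'].resample('5min').mean()

In pandas versions before 0.18 the aggregation was passed through the how argument, and because how='mean' was the default it could be left out:

df['Data'].resample('5Min', how='mean')
df['Data'].resample('5Min')

In current pandas, resample returns a Resampler object and how is no longer accepted, so the aggregation has to be called explicitly with .mean().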

For example:

>>> import numpy as np
>>> import pandas as pd
>>> rng = pd.date_range('1/1/2012', periods=10, freq='min')
>>> df = pd.DataFrame({'Data': np.random.randint(0, 500, len(rng))}, index=rng)
>>> df
                     Data
2012-01-01 00:00:00   488
2012-01-01 00:01:00   172
2012-01-01 00:02:00   276
2012-01-01 00:03:00     5
2012-01-01 00:04:00   233
2012-01-01 00:05:00   266
2012-01-01 00:06:00   103
2012-01-01 00:07:00    40
2012-01-01 00:08:00   274
2012-01-01 00:09:00   494
>>>
>>> df['Data'].resample('5min').mean()
2012-01-01 00:00:00    234.8
2012-01-01 00:05:00    235.4
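
The output above labels each window by its start (00:00, 00:05), while the question asks for labels at the end of each window (09:05, 09:10, ...). A minimal sketch of one way to get that, assuming the label and closed arguments of resample are acceptable for your data; the timestamps and values below are made up for illustration:

```python
import pandas as pd

# Hypothetical readings; timestamps mimic the irregular ones in the question.
idx = pd.to_datetime([
    "2013-10-09 09:00:05",
    "2013-10-09 09:01:00",
    "2013-10-09 09:02:00",
    "2013-10-09 09:03:00",
    "2013-10-09 09:04:00",
    "2013-10-09 09:06:00",
    "2013-10-09 09:07:00",
    "2013-10-09 09:09:00",
])
s = pd.Series([10.0, 20.0, 30.0, 40.0, 50.0, 1.0, 2.0, 3.0], index=idx, name="Data")

# closed="right": each window is (start, end], so a reading at exactly 09:05:00
# would fall into the window that ends at 09:05:00.
# label="right": each window is labeled with its end time.
right_labeled = s.resample("5min", label="right", closed="right").mean()
print(right_labeled)
```

With these values the two windows are labeled 09:05:00 and 09:10:00. If you only want end-of-window labels but prefer the default [start, end) windows, passing label="right" alone should do it.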

More examples can be found in the resampling section of the pandas time series documentation.
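
Note that resample needs datetime values to work on. If the timestamps in your DataFrame live in an ordinary column (possibly as strings) rather than in the index, something like the following may help. This is only a sketch: the column names Time and Data and the values are assumptions, since the question does not show the column layout.

```python
import pandas as pd

# Hypothetical frame with timestamps stored as strings in a regular column.
df = pd.DataFrame({
    "Time": [
        "2013-10-09 09:00:05",
        "2013-10-09 09:01:00",
        "2013-10-09 09:02:00",
        "2013-10-09 09:06:00",
        "2013-10-09 09:08:00",
    ],
    "Data": [4, 8, 12, 20, 30],
})

# Convert the strings to real datetimes first.
df["Time"] = pd.to_datetime(df["Time"])

# Option 1: resample directly on the column via the `on` argument.
averaged = df.resample("5min", on="Time")["Data"].mean()

# Option 2: move the column into the index, then resample as in the answer.
averaged_alt = df.set_index("Time")["Data"].resample("5min").mean()

print(averaged)
```

Both options give the same two averages here, one for the window starting at 09:00 and one for the window starting at 09:05.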

