Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others


pandas - Python: looping ttest_ind over the columns of a DataFrame

My dataframe is composed of accounting variables and a dummy variable that allows me to identify two types of company. I would like to perform a t-test for every column of my dataframe in order to compare the means of the variables between the two types of company.

For the moment I have split my DataFrame into two DataFrames based on the dummy variable and run the following code:

for column_type1, column_type2 in zip(df_type1.columns[1:],df_type2.columns[1:]):
    print(ttest_ind(column_type1,column_type2, equal_var=False, nan_policy='omit'))

However, I'm getting the following error:

TypeError: cannot perform reduce with flexible type

If you know how to solve this, or have a better way to do it, your help is more than welcome!

Thanks

**** EDIT & SOLUTION ****

I've worked out my issue; here is the code that fixes it.

for column_type1, column_type2 in zip(df_type1,df_type2):
    print(ttest_ind(df_type1[column_type1],df_type2[column_type2], equal_var=False, nan_policy='omit'))
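
For anyone hitting the same TypeError, here is a small sketch of what seems to have gone wrong in the first loop: iterating over `df.columns` yields the column labels (strings), so `ttest_ind` was handed two strings instead of two columns of numbers. The data and the column names `dummy` and `assets` below are made up for illustration and are not from my real dataset.

```python
import pandas as pd
from scipy.stats import ttest_ind

# Hypothetical data: the column names and values are assumptions for illustration.
df_type1 = pd.DataFrame({"dummy": [1, 1, 1, 1], "assets": [10.0, 12.0, 11.0, 13.0]})
df_type2 = pd.DataFrame({"dummy": [0, 0, 0, 0], "assets": [20.0, 19.0, 22.0, 21.0]})

# Iterating over .columns gives labels (strings), not the data itself.
labels = list(df_type1.columns[1:])
print(labels)

# Indexing each frame with the label gives the numeric Series that ttest_ind needs.
for label in labels:
    result = ttest_ind(df_type1[label], df_type2[label],
                       equal_var=False, nan_policy="omit")
    print(label, result.statistic, result.pvalue)
```

Passing `df_type1[label]` rather than `label` is the only real change compared with the failing loop.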

1 Reply

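from scipy.stats import ttest_ind

# Iterating over a DataFrame yields its column labels; index each frame with them to get the data.
for column_type1, column_type2 in zip(df_type1, df_type2):
    print(ttest_ind(df_type1[column_type1], df_type2[column_type2], equal_var=False, nan_policy='omit'))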
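
As for a "better way", one possible alternative is to keep a single DataFrame, build a boolean mask from the dummy, and collect the results in a table instead of printing them. This is only a sketch: the helper name `ttest_by_dummy`, the column names, and the assumption that the dummy is coded 0/1 are all hypothetical and would need adapting to the real data.

```python
import numpy as np
import pandas as pd
from scipy.stats import ttest_ind


def ttest_by_dummy(df, dummy_col):
    """Welch t-test on every numeric column, split on a 0/1 dummy column.

    Hypothetical helper: assumes dummy_col is coded 1 for one company type
    and 0 for the other.
    """
    is_type1 = df[dummy_col] == 1
    numeric_cols = [c for c in df.select_dtypes(include="number").columns
                    if c != dummy_col]
    rows = {}
    for col in numeric_cols:
        result = ttest_ind(df.loc[is_type1, col], df.loc[~is_type1, col],
                           equal_var=False, nan_policy="omit")
        rows[col] = {"statistic": float(result.statistic),
                     "pvalue": float(result.pvalue)}
    # One row per tested column, with the statistic and p-value as columns.
    return pd.DataFrame.from_dict(rows, orient="index")


# Made-up example data; the NaN is there to exercise nan_policy="omit".
df = pd.DataFrame({
    "dummy": [1, 1, 1, 1, 0, 0, 0, 0],
    "assets": [10.0, 12.0, 11.0, 13.0, 20.0, 19.0, 22.0, 21.0],
    "debt": [5.0, np.nan, 6.0, 5.5, 5.2, 5.8, 6.1, 5.4],
})
results = ttest_by_dummy(df, "dummy")
print(results)
```

Skipping the dummy column avoids testing a column that is constant within each group, and returning a DataFrame makes it easy to sort or filter by p-value afterwards.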
