python - How could I detect subtypes in pandas object columns?

Question

Welcome To Ask or Share your Answers For Others

python - How could I detect subtypes in pandas object columns?

posted Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points)

python - How could I detect subtypes in pandas object columns?

I have the next DataFrame:

df = pd.DataFrame({'a': [100, 3,4], 'b': [20.1, 2.3,45.3], 'c': [datetime.time(23,52), 30,1.00]})

and I would like to detect subtypes in columns without explicit programming a loop, if possible.

I am looking for the next output:

column a = [int]
column b = [float]
column c = [datetime.time, int, float]

See Question&Answers more detail:os

与恶龙缠斗过久,自身亦成为恶龙；凝视深渊过久,深渊将回以凝视…

1 Reply

深蓝 · Answer 1 · 2021-10-23T19:39:42+0000

You should appreciate that with Pandas you can have 2 broad types of series:

Optimised structures: Usually numeric data, this includes np.datetime64 and bool.
object dtype: Used for series with mixed types or types which cannot be held natively in a NumPy array. The series is structured as a sequence of pointers to arbitrary Python objects and is generally inefficient.

The reason for this preamble is you should only ever need to apply element-wise logic to the second type. Data in the first category is homogeneous by nature.

So you should separate your logic accordingly.

Regular dtypes

Use pd.DataFrame.dtypes:

print(df.dtypes)

a      int64
b    float64
c     object
dtype: object

`object` dtype

Isolate these series via pd.DataFrame.select_dtypes and then use a dictionary comprehension:

obj_types = {col: set(map(type, df[col])) for col in df.select_dtypes(include=[object])}

print(obj_types)

{'c': {int, datetime.time, float}}

You will need to do a little more work to get the exact format you require, but the above should be your plan of attack.

Categories

python - How could I detect subtypes in pandas object columns?

python - How could I detect subtypes in pandas object columns?

Please log in or register to add a comment.

Please log in or register to reply this article.

1 Reply

Regular dtypes

`object` dtype

Please log in or register to add a comment.

Just Browsing Browsing

Most popular tags

Categories

python - How could I detect subtypes in pandas object columns?

python - How could I detect subtypes in pandas object columns?

Please log in or register to add a comment.

Please log in or register to reply this article.

1 Reply

Regular dtypes

object dtype

Please log in or register to add a comment.

Just Browsing Browsing

Most popular tags

`object` dtype