Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others


dataset - How to find if a dataset is affected by another dataset?

I am pretty new to data mining.

I have two datasets, the Human Development Index (HDI) and the Life Expectancy Index, and I want to know whether the Life Expectancy dataset affects the values in the HDI dataset.

What is the best way to find that out?

Thank you

The HDI dataset looks like this: [image not included]

The Life Expectancy dataset looks like this: [image not included]

  asked by Muhammad Naimullah, translated from Stack Overflow


1 Reply


You will need to join these two datasets.

That may involve gathering additional data to fill in missing values, or discarding data points that are present in only one of the datasets.

Because the join keys may differ between the two datasets (for example, the same country spelled two different ways), you may also need to construct a mapping between the two keys.
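As a rough illustration of the join described above, here is a minimal sketch in plain Python. The column names (`country`, `hdi`, `life_index`), the sample values, and the `KEY_MAP` entries are all made up for the example, since the real datasets are not shown. It maps differing keys onto a common one, keeps only the rows present in both datasets, and then computes a Pearson correlation as one simple way to check how strongly the two joined columns move together.

```python
from math import sqrt

# Hypothetical sample rows; the real column names and values are not shown in the question.
hdi_rows = [
    {"country": "Norway", "hdi": 0.95},
    {"country": "Viet Nam", "hdi": 0.70},
    {"country": "Chad", "hdi": 0.40},
    {"country": "Atlantis", "hdi": 0.50},  # present only in the HDI data
]
life_rows = [
    {"country": "Norway", "life_index": 0.96},
    {"country": "Vietnam", "life_index": 0.85},  # same country, different spelling
    {"country": "Chad", "life_index": 0.52},
]

# Hypothetical mapping from keys used in the life expectancy data
# to the keys used in the HDI data.
KEY_MAP = {"Vietnam": "Viet Nam"}


def inner_join(hdi_rows, life_rows, key_map):
    """Return (country, hdi, life_index) tuples for countries present in both datasets."""
    life_by_country = {
        key_map.get(row["country"], row["country"]): row["life_index"]
        for row in life_rows
    }
    return [
        (row["country"], row["hdi"], life_by_country[row["country"]])
        for row in hdi_rows
        if row["country"] in life_by_country  # rows found in only one dataset are dropped
    ]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


joined = inner_join(hdi_rows, life_rows, KEY_MAP)
r = pearson([hdi for _, hdi, _ in joined], [life for _, _, life in joined])
print(joined)
print(round(r, 3))
```

With real data you would more likely load both files into pandas and use `DataFrame.merge`, which performs the same kind of join. A coefficient close to 1 or -1 only indicates that the two columns are strongly associated; it does not by itself show that one causes the other.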

