Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

0 votes
75 views
in Technique[技术] by (71.8m points)

python - How to delete very close values to a numpy array?

I have a numpy array which looks like this

array([ 1219,  1220,  2215,  2216,  3459,  3460,  4686,  4687,  5920,
        5921,  7200,  7201,  8498,  8499,  9834,  9835, 10046, 11138,
       11139, 12520, 12521, 12522, 13812, 13813, 14033, 15099, 15100,
       16375, 16376, 17576, 17577, 18634, 18635, 19849, 19850])

And I want to delete the elements which are very close. For example I don't want both 2215 and 2216, I want to keep only the first one 2215. Or for the 4686 and 4687, I want to keep only 4686. How can I do it using only numpy commands?

question from:https://stackoverflow.com/questions/65905055/how-to-delete-very-close-values-to-a-numpy-array

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome To Ask or Share your Answers For Others

1 Reply

0 votes
by (71.8m points)

One solution I came up with is to calculate the difference of the array, and remove those whose forward difference values are small. Taking advantage of the fact that your array is sorted, the following code works for me.

import numpy as np

arr = np.array([ 1219,  1220,  2215,  2216,  3459,  3460,  4686,  4687,  5920,
    5921,  7200,  7201,  8498,  8499,  9834,  9835, 10046, 11138,
    11139, 12520, 12521, 12522, 13812, 13813, 14033, 15099, 15100,
    16375, 16376, 17576, 17577, 18634, 18635, 19849, 19850])

threshold = 1
diff = np.empty(arr.shape)
diff[0] = np.inf  # always retain the 1st element
diff[1:] = np.diff(arr)
mask = diff > threshold

new_arr = arr[mask]

print(new_arr)

You can adjust the threshold value to play with different levels of tolerance.


与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
OGeek|极客中国-欢迎来到极客的世界,一个免费开放的程序员编程交流平台!开放,进步,分享!让技术改变生活,让极客改变未来! Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Click Here to Ask a Question

...