Numpy (42)

Notice

Recent Posts

Recent Comments

Link

« 2024/11 »
일	월	화	수	목	금	토
					1	2
3	4	5	6	7	8	9
10	11	12	13	14	15	16
17	18	19	20	21	22	23
24	25	26	27	28	29	30

Tags more

Archives

Today

Total

관리 메뉴

Note

Numpy (42) 본문

Numpy

Numpy (42)

알 수 없는 사용자 2022. 9. 6. 18:53

728x90

How to do probabilistic sampling in numpy?

# Import iris keeping the text column intact
url = 'https://archive.ics.uci.edu/ml/machine-learning-databases/iris/iris.data'
iris = np.genfromtxt(url, delimiter=',', dtype='object')

# Solution
# Get the species column
species = iris[:, 4]

# 1: Generate Probablistically
np.random.seed(100)
a = np.array(['Iris-setosa', 'Iris-versicolor', 'Iris-virginica'])
species_out = np.random.choice(a, 150, p=[0.5, 0.25, 0.25])

# 2: Probablistic Sampling (preferred)
np.random.seed(100)
probs = np.r_[np.linspace(0, 0.500, num=50), np.linspace(0.501, .750, num=50), np.linspace(.751, 1.0, num=50)]
index = np.searchsorted(probs, np.random.random(150))
species_out = species[index]
print(np.unique(species_out, return_counts=True))

# output

(array([b'Iris-setosa', b'Iris-versicolor', b'Iris-virginica'], dtype=object), array([77, 37, 36]))

저작자표시 비영리

'Numpy' 카테고리의 다른 글

Numpy (44) (0)	2022.09.10
Numpy (43) (0)	2022.09.07
Numpy (41) (0)	2022.09.04
Numpy (40) (2)	2022.09.03
Numpy (39) (0)	2022.08.31

'Numpy' Related Articles

Comments

Note

Numpy (42) 본문

Numpy (42)

How to do probabilistic sampling in numpy?

'Numpy' 카테고리의 다른 글

티스토리툴바