๐Ÿ† ์ž๊ฒฉ์ฆ, ์–ดํ•™

[๋น…๋ฐ์ดํ„ฐ ๋ถ„์„๊ธฐ์‚ฌ] ์‹ค๊ธฐ 3ํšŒ - 1์œ ํ˜• ์กฐ๊ฑด ์ธ๋ฑ์Šค, set

๋ฐ์ดํ„ฐํŒ์Šค 2024. 8. 19. 16:50

 

Problem

**Find the number of countries whose GDP was at or below that year's average in 1990, but at or above that year's average in 2010.**

cond1=df[(df['Year']==1990)]
cond2=df[(df['Value']<=75508617.9)]
df[cond1&cond2]
 

And this throws an error.. But no matter how much I thought about it, I felt I had written the code correctly, so I looked at the answer key.

df_1990 = df[df.Year ==1990]
df_2010 = df[df.Year ==2010]


df_1990_filter = df_1990[df_1990.Value <= df_1990.Value.mean()]
df_2010_filter = df_2010[df_2010.Value >= df_2010.Value.mean()]

result = len(set(df_2010_filter['Country Code']) & set(df_1990_filter['Country Code']))
print(result)
 

It barely differs from mine, so why doesn't mine work?? Then I saw it: I meant cond1 to be a condition, but I had wrapped it in df[], so it was a filtered DataFrame rather than a boolean condition..

mean1=df[(df['Year']==1990)]['Value'].mean()
mean2=df[(df['Year']==2010)]['Value'].mean()

cond1=(df['Year']==1990)
cond2=(df['Value']<=mean1)
df1=df[cond1&cond2]

cond3=(df['Year']==2010)
cond4=(df['Value']>=mean2)  # 2010 must be at or above that year's mean
df2=df[cond3&cond4]
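To make the difference concrete, here is a minimal sketch on made-up data (the rows, values, and the threshold 30.0 are hypothetical, not the exam dataset). A bare comparison gives a boolean Series that can be combined with `&`, while wrapping it in `df[...]` gives the filtered rows, which is not a usable mask.

```python
import pandas as pd

# Hypothetical toy data, not the exam dataset
df = pd.DataFrame({
    "Country Code": ["AAA", "BBB", "CCC", "AAA", "BBB", "CCC"],
    "Year": [1990, 1990, 1990, 2010, 2010, 2010],
    "Value": [10.0, 20.0, 60.0, 50.0, 10.0, 60.0],
})

# A comparison returns a boolean Series (a mask): one True/False per row
mask_year = df["Year"] == 1990
mask_value = df["Value"] <= 30.0  # 30.0 is an arbitrary threshold
print(type(mask_year).__name__)   # Series
print(mask_year.dtype)            # bool

# Wrapping the comparison in df[...] returns the filtered rows instead
filtered = df[df["Year"] == 1990]
print(type(filtered).__name__)    # DataFrame

# Masks combine with &, and the combined mask then selects rows
subset = df[mask_year & mask_value]
print(subset["Country Code"].tolist())  # ['AAA', 'BBB']
```

So the conditions should stay as bare comparisons, and `df[...]` is applied only once, at the end.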
 

Anyway, I got this far. But then another problem came up.

How do I look at the rows that satisfy cond1&cond2 and the rows that satisfy cond3&cond4 within a single DataFrame???

To find the union, intersection, and so on of a column's values that satisfy certain conditions in two DataFrames, use set.

# ๊ต์ง‘ํ•ฉ : set( ) & set( ), ํ•ฉ์ง‘ํ•ฉ : set( ) | set( ), ์ฐจ์ง‘ํ•ฉ : set( ) - set( )

df1conlist = df1[condi1]['A'].values,   df2conlist = df2[condi2]['A'].values
print(set(df2conlist) & set(df1conlist))
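As a small illustration of the three operators, here is a sketch with two made-up frames (the column names `A` and `B`, the values, and the thresholds are all hypothetical). The results are sorted before printing only because sets have no fixed order.

```python
import pandas as pd

# Hypothetical frames; columns "A"/"B" and the thresholds are made up
df1 = pd.DataFrame({"A": ["x", "y", "z"], "B": [1, 5, 9]})
df2 = pd.DataFrame({"A": ["y", "z", "w"], "B": [2, 6, 10]})

# Values of column A in the rows that pass each frame's condition
left = set(df1[df1["B"] >= 5]["A"])   # y and z
right = set(df2[df2["B"] >= 6]["A"])  # z and w

print(sorted(left & right))  # intersection -> ['z']
print(sorted(left | right))  # union        -> ['w', 'y', 'z']
print(sorted(left - right))  # difference   -> ['y']
```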
 

 

So, using this code gives the following:

 

df1 ๋ฐ์ดํ„ฐ ์™ผ์ชฝ

df2 ๋ฐ์ดํ„ฐ ์˜ค๋ฅธ์ชฝ

All I need to do is find the 'countries' that appear on both the left and the right.

The point I missed: I was trying to find rows where the left and right Value were equal, but that is not what is being asked.

The problem requires that a country satisfying the 1990 condition also satisfies the 2010 condition >> only the country has to match.

set(df1['Country Code']) & set(df2['Country Code'])
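Putting the whole thing together, here is a minimal end-to-end sketch on made-up data (the rows and the names `low_1990`, `high_2010`, and `countries` are hypothetical; the real exam dataset is not reproduced). Since the problem asks for a count, the intersection is wrapped in `len()`, as in the answer key.

```python
import pandas as pd

# Hypothetical toy data, not the exam dataset
df = pd.DataFrame({
    "Country Code": ["AAA", "BBB", "CCC", "AAA", "BBB", "CCC"],
    "Year": [1990, 1990, 1990, 2010, 2010, 2010],
    "Value": [10.0, 20.0, 60.0, 50.0, 10.0, 60.0],
})

# Each year's mean is computed from that year's rows only
mean_1990 = df.loc[df["Year"] == 1990, "Value"].mean()  # (10+20+60)/3 = 30
mean_2010 = df.loc[df["Year"] == 2010, "Value"].mean()  # (50+10+60)/3 = 40

# 1990: at or below the mean; 2010: at or above the mean
low_1990 = df[(df["Year"] == 1990) & (df["Value"] <= mean_1990)]
high_2010 = df[(df["Year"] == 2010) & (df["Value"] >= mean_2010)]

# Only the country has to match, so intersect the country codes
countries = set(low_1990["Country Code"]) & set(high_2010["Country Code"])
print(len(countries))  # 1 (only AAA meets both conditions)
```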
 

 

 

Functions I don't know just keep showing up, lol..