๐Ÿ† ์ž๊ฒฉ์ฆ, ์–ดํ•™ 42

[๋น…๋ฐ์ดํ„ฐ ๋ถ„์„๊ธฐ์‚ฌ] ์‹ค๊ธฐ 3ํšŒ - 1์œ ํ˜• ์กฐ๊ฑด ์ธ๋ฑ์Šค, set

๋ฌธ์ œ**1990๋…„๋„๋Š” ํ•ด๋‹น๋…„๋„ ํ‰๊ท  ์ดํ•˜ GDP๋ฅผ ๊ฐ€์ง€์ง€๋งŒ, 2010๋…„๋„์—๋Š” ํ•ด๋‹น๋…„๋„ ํ‰๊ท  ์ด์ƒ GDP๋ฅผ ๊ฐ€์ง€๋Š” ๊ตญ๊ฐ€์˜ ์ˆซ์ž๋ฅผ ๊ตฌํ•˜์—ฌ๋ผ**cond1=df[(df['Year']==1990)]cond2=df[(df['Value'] ๋Œ€ํ‘œ์‚ฌ์ง„ ์‚ญ์ œ์‚ฌ์ง„ ์„ค๋ช…์„ ์ž…๋ ฅํ•˜์„ธ์š”.์ด๋ ‡๊ฒŒ ์˜ค๋ฅ˜๊ฐ€ ๋‚˜๋Š”๊ฑฐ๋‹ค.. ๊ทผ๋ฐ ์•„๋ฌด๋ฆฌ ์ƒ๊ฐํ•ด๋„ ์ฝ”๋“œ๋ฅผ ์ œ๋Œ€๋กœ ์ผ๋‹ค ์ƒ๊ฐ ๋“ค์–ด์„œ ๋‹ต์•ˆ์„ ๋ดค๋‹คdf_1990 = df[df.Year ==1990]df_2010 = df[df.Year ==2010]df_1990_filter = df_1990[df_1990.Value = df_2010.Value.mean()]result = len(set(df_2010_filter['Country Code']) & set(df_1990_filter['Country Cod..

[๋น…๋ฐ์ดํ„ฐ ๋ถ„์„๊ธฐ์‚ฌ] ์‹ค๊ธฐ 3ํšŒ - 1์œ ํ˜• reset_index, iloc์™€ loc ์ฐจ์ด

๋ฌธ์ œ```{admonition} 1-1**๊ฒฐ์ธก์น˜๊ฐ€ ํ•˜๋‚˜๋ผ๋„ ์กด์žฌํ•˜๋Š” ํ–‰์˜ ๊ฒฝ์šฐ ๊ฒฝ์šฐ ํ•ด๋‹น ํ–‰์„ ์‚ญ์ œํ•˜๋ผ. ๊ทธํ›„ ๋‚จ์€ ๋ฐ์ดํ„ฐ์˜ ์ƒ์œ„ 70%์— ํ•ด๋‹นํ•˜๋Š” ๋ฐ์ดํ„ฐ๋งŒ ๋‚จ๊ฒจ๋‘” ํ›„ median_income ์ปฌ๋Ÿผ์˜ 1๋ถ„์œ„์ˆ˜๋ฅผ ๋ฐ˜์˜ฌ๋ฆผํ•˜์—ฌ ์†Œ์ˆซ์ ์ดํ•˜ 2์งธ์ž๋ฆฌ๊นŒ์ง€ ๊ตฌํ•˜์—ฌ๋ผ**```df=df.dropna(axis=0).reset_index(drop=True)df=df.loc[:upper]q1=df['median_income'].quantile(0.25)round(q1,2) ๋‚ด๊ฐ€ ํ‘ผ ์ฝ”๋“œdf = df.dropna().reset_index(drop=True)df_filter = df.iloc[:int(len(df)*0.7)]result = df_filter['median_income'].quantile(0.25).round(2)..

[๋น…๋ฐ์ดํ„ฐ ๋ถ„์„๊ธฐ์‚ฌ] ์‹ค๊ธฐ 4ํšŒ - 1์œ ํ˜• str.contains, strftime, astype('datetime64[ns]')

๋ฌธ์ œ**date_added๊ฐ€ 2018๋…„ 1์›” ์ด๋ฉด์„œ country๊ฐ€ United Kingdom ๋‹จ๋… ์ œ์ž‘์ธ ๋ฐ์ดํ„ฐ์˜ ๊ฐฏ์ˆ˜** ๋Œ€ํ‘œ์‚ฌ์ง„ ์‚ญ์ œ์‚ฌ์ง„ ์„ค๋ช…์„ ์ž…๋ ฅํ•˜์„ธ์š”.๋ฐ์ดํ„ฐ๊ฐ€ ์ด๋ ‡๊ฒŒ ์ƒ๊ฒจ๋จน์—ˆ์Œ date_added์˜ '์›”'์ด ์ˆซ์ž๊ฐ€ ์•„๋‹Œ ์˜์–ด๋กœ ๋˜์–ด์žˆ์Œใ…‹ใ…‹ใ…‹ใ…‹๋ณด์ž๋งˆ์ž ์—‡.. str.contains() ๋“œ๋””์–ด ์จ๋จน์„์ˆ˜ ์žˆ๋‚˜ ์ƒ๊ฐํ•จ ์‹ค๊ธฐ 5ํšŒ - 1์œ ํ˜• str ๊ธ€ ์ฐธ๊ณ  ์ด๋ฏธ์ง€ ์ธ๋„ค์ผ ์‚ญ์ œ๋น…๋ฐ์ดํ„ฐ ๋ถ„์„๊ธฐ์‚ฌ ์‹ค๊ธฐ 5ํšŒ - 1์œ ํ˜• str์›ํ•˜๋Š” ์นผ๋Ÿผ๋งŒ ์„ ํƒํ•˜๊ณ  ์‹ถ์€๋ฐ ํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ๋ชฐ๋ผ์„œ ์ผ๋‹จ ๋…ธ๊ฐ€๋‹ค๋กœ ๊ตฌํ•จ... ํ’€์ด๋ฅผ ๋ดค๋‹ค ??? for๋ฌธ์ด ์™œ ๋˜...blog.naver.comcond1=(df['date_added'].str.contains('January'))cond2=(df['date_added'].str.contai..

[๋น…๋ฐ์ดํ„ฐ ๋ถ„์„๊ธฐ์‚ฌ] ์‹ค๊ธฐ 4ํšŒ - replace

๋ฌธ์ œ **Temperature์ปฌ๋Ÿผ์—์„œ ์ˆซ์ž๊ฐ€ ์•„๋‹Œ ๋ฌธ์ž๋“ค์„ ์ œ๊ฑฐํ›„ ์ˆซ์ž ํƒ€์ž…์œผ๋กœ ๋ฐ”๊พธ๊ณ  3๋ถ„์œ„์ˆ˜์—์„œ 1๋ถ„์œ„์ˆ˜์˜ ์ฐจ์ด๋ฅผ ์†Œ์ˆซ์  ์ดํ•˜ 2์ž๋ฆฌ๊นŒ์ง€ ๊ตฌํ•˜์—ฌ๋ผ**```df['Temperature'].astype(float) ์ด๋ ‡๊ฒŒ ํ–ˆ๋”๋‹ˆ ์˜ค๋ฅ˜๋‚จ.. *77.22 ๋ผ๋Š” ๊ฐ’์ด Temperature์— ์žˆ์—ˆ์Œ*์„ ์—†์• ์ค˜์•ผํ•จ ๊ทธ๋Ÿด ๋•Œ ์“ฐ๋Š”๊ฒŒ str.replace() ์˜ˆ๋ฅผ ๋“ค์–ด ์ˆซ์žํ˜• ๋ฐ์ดํ„ฐ๋ฅผ ๊ฐ€์žฅํ•œ ๋ฌธ์žํ˜• ๋ฐ์ดํ„ฐ์˜ ๊ฒฝ์šฐ ์ฒœ๋‹จ์œ„ ์ฝค๋งˆ๊ฐ€ ์กด์žฌํ•ด์„œ ์ˆซ์žํ˜•์œผ๋กœ ๋ณ€ํ™˜ํ•ด์•ผ ํ•จ10,000 ์ด๋ผ๋Š” ๋ฌธ์žํ˜• ๋ฐ์ดํ„ฐ๊ฐ€ ์กด์žฌํ•œ๋‹ค๋ฉด ์ด๋ฅผ ์ˆซ์žํ˜•์œผ๋กœ ๋ณ€ํ™˜ํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” "," ๋ฅผ ์—†์• ์„œ 10000 ์œผ๋กœ ๋ณ€๊ฒฝํ•˜์‹  ๋’ค ์ˆซ์žํ˜• ๋ฐ์ดํ„ฐ๋กœ ๋ณ€ํ™˜ํ•ด์•ผํ•จdf['Temperature'] = df['Temperature'].str.replace('*','')df[๋ณ€๊ฒฝ..