๐Ÿ† ์ž๊ฒฉ์ฆ, ์–ดํ•™

[๋น…๋ฐ์ดํ„ฐ ๋ถ„์„๊ธฐ์‚ฌ] ์‹ค๊ธฐ - 3์œ ํ˜• ๋…๋ฆฝ๊ฒ€์ •(๋ชจ์ง‘๋‹จ 2๊ฐœ) ์˜ˆ์ œ

๋ฐ์ดํ„ฐํŒ์Šค 2024. 8. 20. 18:00

 

import pandas as pd
import numpy as np
import scipy.stats as stats
import scipy.stats as shaprio
 

 

#์ •๊ทœ์„ฑ ๊ฒ€์ •
sA, pA = stats.shapiro(df['A'])
sB, pB = stats.shapiro(df['B'])
print(sA,pA)
print(sB,pB)
 

๋Œ€์‘ ํ‘œ๋ณธ์˜ ์ •๊ทœ์„ฑ ๊ฒ€์ • : ๋‘ ์ง‘๋‹จ์˜ ์ฐจ์ด๋ฅผ shapiro

๋…๋ฆฝ ํ‘œ๋ณธ์˜ ์ •๊ทœ์„ฑ ๊ฒ€์ • : ์ง‘๋‹จ์„ ๊ฐ๊ฐ shapiro > ๋ชจ๋‘ ๋งŒ์กฑํ•ด์•ผ ์ •๊ทœ์„ฑ O

#๋“ฑ๋ถ„์‚ฐ์„ฑ ๊ฒ€์ •
statistic, pvalue = stats.bartlett(df['A'],df['B'])
print(statistic,pvalue)
 

๋…๋ฆฝ๊ฒ€์ •์€ ๋“ฑ๋ถ„์‚ฐ์„ฑ๋„ ๊ฒ€์ •ํ•ด์ค˜์•ผ ํ•œ๋‹ค

๋“ฑ๋ถ„์‚ฐ์„ฑ ๊ฒ€์ • ํ•จ์ˆ˜ bartlett

statistic, pvalue = stats.ttest_ind(df['A'],df['B'],equal_var=True, alternative='greater')
print(pvalue)
 

๋“ฑ๋ถ„์‚ฐ์„ฑ ๋งŒ์กฑ ์—ฌ๋ถ€ equal_var์„ ์žŠ์ง€ ๋ง๊ฒƒ

๋Œ€์‘ํ‘œ๋ณธ์—์„œ๋Š” stats.ttest_rel ํ•จ์ˆ˜

๋…๋ฆฝํ‘œ๋ณธ์—์„œ๋Š” stats.ttest_ind ํ•จ์ˆ˜

statistic, pvalue = stats.ranksums(df['A'],df['B'],alternative='greater')
print(pvalue)
 

์ •๊ทœ์„ฑ ๋งŒ์กฑ ๋ชป ํ• ๋•Œ๋Š” ranksums()