【发布时间】:2018-09-03 20:35:46
【问题描述】:
我有 2 个 csv 文件与此类似:
date,high,low,precip
1-Jan,43,41,0
2-Jan,50,25,0
3-Jan,51,25,0
4-Jan,44,25,0
5-Jan,36,21,0
6-Jan,39,20,0
7-Jan,47,21,0.04
8-Jan,30,14,0
9-Jan,30,12,0
10-Jan,35,12,0
11-Jan,42,15,0
12-Jan,55,29,0
13-Jan,57,29,0
14-Jan,61,33,0
15-Jan,52,46,0.1
我需要对“高”列进行 T 检验,但我还没有找到很多方法来做到这一点。我已经使用这个导入了总和:
import pandas as pd
import re, csv
from scipy.stats import ttest_ind
high_mean = round(pd.read_csv(r'2010-Jan-June.csv', usecols=['high'], squeeze=True).mean(), 1)
high_mean17 = round(pd.read_csv(r'2017-Jan-June.csv', usecols=['high'], squeeze=True).mean(), 1)
但我不确定如何使用我那里的数据来运行 t 检验。
【问题讨论】:
标签: python pandas csv statistics