回答編集履歴

コード追加

2017/11/12 14:01

投稿

スコア1170

answer CHANGED Viewed

@@ -1,1 +1,29 @@
+```python
+import pandas as pd
+from pprint import pprint
+df = pd.read_csv("adult.data", header=None, delimiter=r"\s+", )
+df.columns = ['age', 'workclass', 'fnlwgt', 'education', 'education-num',
+				  'marital-status', 'occupation', 'relationship', 'race',
+				  'sex', 'capital-gain', 'capital-loss', 'hours-per-week',
+				  'native-country', 'income_class']
+df["income_class"] = df["income_class"].map({"<=50K": 0, ">50K": 1})
+pprint(df["income_class"].head(10))
+"""
+0    0
+1    0
+2    0
+3    0
+4    0
+5    0
+6    0
+7    1
+8    1
+9    1
+Name: income_class, dtype: int64
+"""
+```
 [Analysis of the Adult data set from UCI Machine Learning Repository](http://blog.pangyanhan.com/posts/2017-02-15-analysis-of-the-adult-data-set-from-uci-machine-learning-repository.ipynb.html)