Commit

Merge pull request #22 from XRubberDuck/main
label
luweizheng authored Sep 23, 2023
2 parents 9a71932 + efa3ec9 commit 572f674
Showing 5 changed files with 3 additions and 8 deletions.
3 changes: 1 addition & 2 deletions ch-pandas/data-preprocessing.ipynb
@@ -6,7 +6,6 @@
"metadata": {},
"source": [
"# Data Processing\n",
":label:`data-preprocessing`\n",
"\n",
"Data processing includes handling duplicate values, missing values, and outliers, as well as generating new columns or rows."
]
@@ -3423,4 +3422,4 @@
},
"nbformat": 4,
"nbformat_minor": 5
}
}
1 change: 0 additions & 1 deletion ch-pandas/dataframe-groupby.ipynb
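@@ -6,7 +6,6 @@
"metadata": {},
"source": [
"# Grouping and Aggregation\n",
":label:`dataframe-groupby`\n",
"\n",
"In real-world data analysis, we often need to compute statistics over one category of data. For example, suppose we have height and weight data for everyone in the country and want to group it by province and compute each province's average height and average weight; this calls for a grouping operation. pandas provides the `groupby` function for this kind of group-and-aggregate operation. :numref:`groupby-img` shows the group-and-aggregate workflow for computing average height, which has two parts: grouping and aggregation. The grouping stage gathers items of the same category into the same group; the aggregation stage computes over the data of interest, for example a sum or a mean.\n",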
"\n",
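The two stages described in the notebook text above (grouping, then aggregation) can be sketched as follows. This is a minimal illustration, not code from the notebook: the column names `province`, `height`, and `weight` and all values are invented for the example.

```python
import pandas as pd

# Hypothetical data: column names and values are assumptions for illustration.
df = pd.DataFrame({
    "province": ["A", "A", "B", "B"],
    "height": [170.0, 180.0, 160.0, 164.0],
    "weight": [60.0, 70.0, 50.0, 54.0],
})

# Grouping stage: rows that share a province fall into the same group.
grouped = df.groupby("province")

# Aggregation stage: compute the mean of the selected columns within each group.
avg = grouped[["height", "weight"]].mean()
print(avg)
```

Selecting the numeric columns before calling `mean()` keeps the aggregation explicit about which data is being summarized.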
1 change: 0 additions & 1 deletion ch-pandas/dataframe-merge-concat.ipynb
@@ -6,7 +6,6 @@
"metadata": {},
"source": [
"# Multi-Table Operations\n",
":label:`dataframe-merge-concat`\n",
"\n",
"The operations so far have mainly worked on a single `DataFrame`; in practice, we often need to analyze several `DataFrame` objects together. pandas provides operations for merging and concatenating multiple `DataFrame` objects: the `merge()` and `concat()` functions, respectively. For example, we can combine two `DataFrame` objects into one while keeping all of the columns.\n",
"\n",
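As a rough sketch of the difference between the two functions named in the notebook text above, the example below merges two small tables on a shared key and then concatenates the same tables row-wise. The frames, the `id` key, and the values are hypothetical and do not come from the notebook.

```python
import pandas as pd

# Hypothetical tables that share an "id" key; all values are invented.
left = pd.DataFrame({"id": [1, 2, 3], "height": [170, 165, 180]})
right = pd.DataFrame({"id": [2, 3, 4], "weight": [55, 75, 60]})

# merge: align rows by key. how="outer" keeps ids found in either table,
# and the result carries the columns of both tables.
merged = pd.merge(left, right, on="id", how="outer")

# concat: stack the tables along the row axis. The result has the union of
# the columns, and cells with no source value are filled with NaN.
stacked = pd.concat([left, right], ignore_index=True)

print(merged)
print(stacked)
```

`merge()` suits tables that describe the same entities through a common key, while `concat()` suits tables that simply need to be stacked together.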
3 changes: 1 addition & 2 deletions ch-pandas/dataframe-slicing.ipynb
@@ -6,7 +6,6 @@
"metadata": {},
"source": [
"# Data Slicing\n",
":label:`dataframe-slicing`\n",
"\n",
"In practice, we often analyze not the entire dataset but a subset of it. This section focuses on how to retrieve the data we need according to specific conditions."
]
@@ -2453,4 +2452,4 @@
},
"nbformat": 4,
"nbformat_minor": 5
}
}
3 changes: 1 addition & 2 deletions ch-pandas/series-dataframe.ipynb
@@ -6,7 +6,6 @@
"metadata": {},
"source": [
"# Series and DataFrame\n",
":label:`series-dataframe`\n",
"\n",
"pandas has two core data structures: Series and DataFrame."
]
@@ -1218,4 +1217,4 @@
},
"nbformat": 4,
"nbformat_minor": 5
}
}