Debian 上 Python 数据分析入门指南
一 环境准备与版本确认
sudo apt update && sudo apt upgradesudo apt install python3 python3-pippython3 --version、pip3 --version二 两种常用环境方案
python3 -m venv ~/venvs/data310source ~/venvs/data310/bin/activatepip install -U pippip install numpy pandas matplotlib seaborn jupyter scikit-learndeactivatewget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.shconda create -n datasci python=3.11conda activate datasciconda install numpy pandas matplotlib seaborn scikit-learn jupyterconda deactivate三 入门流程与核心库
四 五分钟上手示例
source ~/venvs/data310/bin/activate(或 conda activate datasci)pip install pandas matplotlib seaborn scikit-learn jupyterjupyter notebookimport pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
# 内置示例数据
tips = sns.load_dataset("tips")
# 基本统计
print(tips.describe())
# 可视化:总账单与消费额关系
plt.figure(figsize=(8,5))
sns.scatterplot(data=tips, x="total_bill", y="tip", hue="sex", style="smoker")
plt.title("Total Bill vs Tip")
plt.show()
五 常见问题与优化建议
%matplotlib inline(Notebook)或 Agg 后端:import matplotlib; matplotlib.use('Agg')openpyxl 或 xlrd;读取 SQL 需安装对应驱动(如 psycopg2、pymysql)。