{
  "cells": [
    {
      "cell_type": "markdown",
      "metadata": {},
      "source": [
        "# Descriptive Statistics in Python\n",
        "\n",
        "Use pandas to summarize wage, education, and experience before estimating any model."
      ]
    },
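    {
      "cell_type": "code",
      "execution_count": null,
      "metadata": {},
      "outputs": [],
      "source": [
        "import pandas as pd\n",
        "\n",
        "# Load the wage sample; the CSV must be in the notebook's working directory.\n",
        "df = pd.read_csv(\"wage_sample.csv\")\n",
        "\n",
        "# describe() reports count, mean, std, min, quartiles, and max for each numeric column.\n",
        "print(df[[\"wage\", \"education\", \"experience\"]].describe())"
      ]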
    },
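    {
      "cell_type": "code",
      "execution_count": null,
      "metadata": {},
      "outputs": [],
      "source": [
        "# Central tendency (mean, median) and spread (standard deviation) of wages.\n",
        "# pandas' std() returns the sample standard deviation (ddof=1) by default.\n",
        "print(\"Average wage:\", round(df[\"wage\"].mean(), 2))\n",
        "print(\"Median wage:\", round(df[\"wage\"].median(), 2))\n",
        "print(\"Wage standard deviation:\", round(df[\"wage\"].std(), 2))"
      ]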
    },
    {
      "cell_type": "code",
      "execution_count": null,
      "metadata": {},
      "outputs": [],
      "source": [
        "# Mean wage at each education level.\n",
        "grouped = df.groupby(\"education\")[\"wage\"].mean()\n",
        "print(grouped)"
      ]
    },
    {
      "cell_type": "markdown",
      "metadata": {},
      "source": [
        "## Interpretation prompt\n",
        "\n",
        "Write one sentence describing the typical wage, using the mean or the median, and one sentence describing the spread of wages, using the standard deviation."
      ]
    }
  ],
  "metadata": {
    "kernelspec": {
      "display_name": "Python 3",
      "language": "python",
      "name": "python3"
    },
    "language_info": {
      "name": "python",
      "version": "3.11"
    }
  },
  "nbformat": 4,
  "nbformat_minor": 5
}
