NREC4107: Applied Intermediate Econometrics

This coursebook supports NREC4107 Applied Intermediate Econometrics for Natural Resource Economics students. It focuses on applied data analysis, visualization, regression, diagnostics, prediction, machine learning basics, and empirical project writing using Python.

Tip

Use the chapters for learning concepts and the appendices for quick reference while completing assignments, exams, and the empirical project.

How to use this coursebook

  • Start with Part I if you are new to Python or data work.
  • Use Part II before modeling to understand the data.
  • Use Part III for the core regression material.
  • Use Part IV when checking model problems.
  • Use Part V for prediction and machine learning extensions.
  • Use Part VI for the empirical project.
  • Use Appendices for formulas, code, project checklists, and dataset integrity.

Course map

Part Focus Use when
Part I. Data and Python Data foundations, Python workflow, and cleaning You need to load, inspect, prepare, or document a dataset
Part II. Visualization Descriptive statistics and graphs You need to understand patterns before modeling
Part III. Regression OLS, inference, functional forms, and model selection You need the core econometrics material
Part IV. Econometric Problems Heteroskedasticity, autocorrelation, multicollinearity, endogeneity, and diagnostics You need to check whether model results are reliable
Part V. Machine Learning Train-test splits, prediction, trees, boosting, and feature importance You need prediction and machine learning extensions
Part VI. Empirical Project Research question, data, methodology, results, and final article You are preparing the semester empirical project
Appendices Formulas, Python templates, project checks, and dataset integrity You need a quick reference or checklist

What students should be able to do

  • clean and document an applied dataset
  • create descriptive statistics and graphs
  • estimate and interpret regression models
  • test hypotheses and compare models
  • diagnose common econometric problems
  • distinguish prediction from causal interpretation
  • prepare a short empirical project report

Software and data

The course uses Python for applied data analysis and Quarto for the coursebook. Students can work in Google Colab or a local Python environment. The main teaching dataset is Milk_Data_S2025n.csv.

Note

Most examples use the milk dataset so that data cleaning, visualization, regression, diagnostics, and prediction build on a common applied dataset.