Weeks 6
Spark SQL/RDD
Important
2 Sessions
- Thursday 20 February 2025 Olympes de Gouges 358 10h45-12h45
- Friday 21 February 2025 Sophie Germain 0014 15h45-17h45
- Calendar
Lecture : slides
We may come back to several parts of
Notebooks
We shall spend most of the lectures on
and possibly compare with:
You shall have gone through (on your own)
References
Logistics
pyspark
To work the jupyter
notebooks, install python 3
, and modules related to jupyter
: jupyter-cache
, jupyter_client
, jupyter_core
, jupyterlab_widgets
(this induces the installation of dependencies).
Download the jupyter notebooks from notebooks listings.
If you do not already have an ENT account, follow instructions on Moodle to get one. You shall need this account to connect to PostGres cluster.
Back to Agenda ⏎