Week 9

A deeper dive into Spark

Important
  • Friday 21 March 2025 Sophie Germain 1005 13h30-15h30
  • Calendar

Lecture : slides

We may come back to several parts of

Notebooks

If time allows, we shall go back to

You shall have gone through (on your own)

References

Logistics

pyspark

To work the jupyter notebooks, install python 3, and modules related to jupyter: jupyter-cache, jupyter_client, jupyter_core, jupyterlab_widgets (this induces the installation of dependencies).

https://jupyter.org

Download the jupyter notebooks from notebooks listings.

If you do not already have an ENT account, follow instructions on Moodle to get one. You shall need this account to connect to PostGres cluster.



Back to Agenda