IFEBY310 Syllabus

Organization

We will have one weekly lecture. Each lecture is organized around slides and notebooks. We will switch from blackboard to laptop and back. You are invited to bring your laptop to the lectures.

	Day	Hour	Room	Dates
Lecture	Monday	10:45 - 12:45	Halle aux Farines 166 E	2026-01-12 to 2026-04-13

We will not attempt to complete the notebooks during the sessions. You are expected to complete the notebooks on your own time. Solutions (at least partial ones) are available on the course website.

You can fork the course repository and post issues, comments, and corrections.

Objectives

During this course, you will learn to:

Handle medium-sized data using the Python data stack: NumPy/SciPy/pandas
Scale up and down with Dask
Handle Big Data with Spark (PySpark)
Manage and store data using dedicated formats (columnar: Parquet, ORC, Arrow; row-oriented: Avro)

Communication

Course material: s-v-b.github.io/IFEBY310. Fork the repo and use GitHub issues to send feedback (no email, please).

Announcements are posted on Moodle.

Register on the Moodle portal to receive updates.

Computing environment

We use the PostgreSQL server from UFR de Mathématiques. To obtain an ENT account, follow the procedure described on Moodle.

You may install and use

References

Courses

Slides

Notebooks (HTML and ipynb)

Evaluation

Two homework assignments/projects
Grading

Tips

Have a look at the slides before each lecture
Don’t jump straight to the solutions
Use online help (StackOverflow, ChatGPT, Copilot, …)
Read error messages

Code of conduct

TL;DR: No cheating!

Save the dates!

January 12: Course kick-off
March 2: Winter holidays
April 6: Easter Monday
April 13: Last session

Université Paris Cité calendar

M1 MIDS calendar

Université Paris Cité

Useful links: