Plan du cours
Python Fundamentals for Data Tasks
- Installing Python and setting up the development environment
- Language fundamentals: variables, data types, control structures
- Writing and running simple Python scripts
File Handling: CSV and Excel
- Reading and writing CSV files using the csv module and Pandas
- Working with Excel files using openpyxl/xlrd and Pandas
- Practical exercises: automating file conversions
Introduction to Pandas
- DataFrame basics: creation, indexing, selection, and filtering
- Aggregation and grouping operations
- Common cleaning operations: missing values, duplicates, and type conversions
Introduction to Polars
- Polars concepts and performance characteristics compared to Pandas
- Basic DataFrame operations in Polars
- Use-case example: when to choose Polars over Pandas
Advanced Data Transformation (Intermediate)
- Complex joins, window functions, and pivot operations in Pandas
- Efficient data processing patterns with Polars
- Chaining operations and optimizing memory usage
Process Automation with Python
- Writing scripts to automate repetitive data tasks and ETL steps
- Scheduling scripts with OS schedulers or task schedulers
- Logging, error handling, and notifications
Packaging Scripts and Best Practices
- Creating executables with PyInstaller or similar tools
- Project structuring, virtual environments, and dependency management
- Version control basics and documenting workflows
Hands-on Mini-Project
- End-to-end task: read raw files, clean and transform data, produce outputs
- Automate the workflow and package as a runnable script or executable
- Review and improvements based on peer feedback
Summary and Next Steps
Pré requis
- Basic familiarity with programming concepts or willingness to learn
- Comfort using command-line or terminal for package installation
- Experience working with spreadsheets (CSV/Excel)
Audience
- Data analysts and operations staff automating data tasks
- Analytical engineers seeking lightweight ETL scripting
- Professionals interested in practical Python-based data workflows
Nos clients témoignent (5)
Le fait d'avoir plus d'exercices pratiques utilisant des données plus proches de ce que nous utilisons dans nos projets (images satellites en format raster)
Matthieu - CS Group
Formation - Scaling Data Analysis with Python and Dask
Traduction automatique
J'ai trouvé que le formateur était très compétent et a répondu aux questions avec assurance pour clarifier la compréhension.
Jenna - TCMT
Formation - Machine Learning with Python – 2 Days
Traduction automatique
Une très bonne préparation et expertise de la part du formateur, une communication parfaite en anglais. Le cours était pratique (exercices + partage d'exemples de cas d'utilisation)
Monika - Procter & Gamble Polska Sp. z o.o.
Formation - Developing APIs with Python and FastAPI
Traduction automatique
La explanation
Wei Yang Teo - Ministry of Defence, Singapore
Formation - Machine Learning with Python – 4 Days
Traduction automatique
Formateur développe la formation selon le rythme des participants
Farris Chua
Formation - Data Analysis in Python using Pandas and Numpy
Traduction automatique