Python Training

Python for Predictive Data Analytics

This is a course for data analysts, quants, statisticians, software developers, and other technical staff interested in learning to use Python for analysing and visualising data and performing powerful predictive analytics. Includes a thorough introduction to machine learning for regression and classification.


Some familiarity with programming concepts (in any language) is assumed.

Expected Outcomes

By the end of the course, you will have all the knowledge you need to start using Python competently for processing, analysing, modelling, and visualising various kinds of data, with a focus on time series. You will have had experience with using Python for various scripting, data-manipulation and plotting tasks with data in a variety of formats, including CSV, Excel spreadsheets, SQL databases, JSON, and API endpoints. You will have applied powerful tools for optimisation, regression, classification, and clustering, in useful practical settings on a variety of data sets. You will understand the elegance and power of the Python language and its powerful ecosystem of packages for data analytics, and you will be well-placed to continue learning more as you use it day-to-day.

Course Syllabus

Day 1: Python Basics

Day 1 covers how to use Python for basic scripting and automation tasks, including tips and tricks for making this easy. The syllabus is as follows:

  • Why use Python for predictive analytics? What’s possible? Python versus other languages …
  • Setting up your Python development environment (IDE, Jupyter)
  • Modules and packages
  • Python concepts: an introduction through examples
  • Essential data types: strings, tuples, lists, dicts, sets
  • Worked example: fetching and ranking real-time temperature data for global cities
  • Raising and handling exceptions
  • Handling CSV data: introduction to Pandas

Day 2: Handling, Analysing, and Presenting Data in Python

The Pandas package is an amazingly productive tool for working with and analysing data in Python. Day 2 gives a thorough introduction to analysing data with Pandas and visualising it easily:

  • Reading and writing essential data formats: CSV, Excel, SQL databases, JSON, time-series
  • Indexing and selecting data in Pandas
  • Data fusion: joining & merging datasets
  • Summarisation with “group by” operations; pivot tables
  • Visualisation and statistical graphics with Seaborn
  • Worked example: creating automated reports with Jupyter, Pandas, and nbconvert

Day 3: Time-series, simulation, inference and modelling

Day 3 demonstrates more advanced features of Pandas for working with data, including time-series data. It then describes Monte Carlo simulation methods and walks you through using powerful methods of inference and modelling for different kinds of data in Python:

  • Time-series analysis: parsing dates, resampling, handling time-zones
  • Introduction to NumPy for manipulating vector and matrix data: data types, powerful indexing, reshaping, ufuncs
  • Classical statistics: Monte Carlo simulation and linear regression with scipy.stats and statsmodels
  • Density estimation with scikit-learn
  • Dimensionality reduction with PCA

Day 4: Machine learning

Day 4 introduces a more automated approach to modelling real-world data with several powerful machine learning algorithms using scikit-learn. The datasets are selected from a range of industries: financial, geospatial, medical, and social sciences. The syllabus is:

  • Concepts of machine learning (ML)
  • Overview of the ML package ecosystem in Python
  • Classification with scikit-learn: Naive Bayes, logistic regression, SVMs, random forests, with application to diagnosis, AI systems, and time-series prediction
  • Nonlinear regression, with application to forecasting
  • Clustering data with DBScan, with application to outlier detection
  • Dimensionality reduction with PCA
  • Validation and model selection; feature selection
  • Deploying machine learning models in production

Supplemental materials

We will supply you with printed course notes, cheat sheets, and a USB stick containing kitchen-sink Python installers for multiple platforms, solutions to the programming exercises, several written tutorials, and reference documentation on Python and the third-party packages covered in the course.

Other information

Personal help:
Your trainer(s) will be available after the course each day for you to ask any one-on-one questions you like — whether about the course material and exercises or about specific problems you face in your work and how to use Python to solve them.
Food and drink:
We will provide lunch, morning and afternoon tea, and drinks.
The course will run from 9:00 to roughly 17:00 each day, with a breaks of 45 minutes for lunch and 20 minutes each for morning and afternoon tea.

Upcoming Courses


Python for Predictive Data Analytics:
23 Mar – 26 Mar 2020

Level 1, 440 Collins Street, Melbourne CBD

Brochure Book Now


Python for Predictive Data Analytics:
20 Apr – 23 Apr 2020

Level 13, 60 Margaret Street, Sydney CBD

Brochure Book Now


Python for Predictive Data Analytics:
01 Jun – 04 Jun 2020

Level 1, 440 Collins Street, Melbourne CBD

Brochure Book Now


Python for Predictive Data Analytics:
22 Jun – 25 Jun 2020

Level 2, 10 Moore Street, Canberra CBD

Brochure Book Now


Python for Predictive Data Analytics:
06 Jul – 09 Jul 2020

Level 13, 60 Margaret Street, Sydney CBD

Brochure Book Now

Other Locations

Python for Predictive Data Analytics:
Dates TBA

Be notified


“The course was delivered by trainers who were extremely knowledgeable in their field. It was really good to learn from the best!”

- Marius Roman

“Great course. Enjoyed it and will follow up with some practical implementation of some of the work.”

- Adam Grace

“One of the best training courses I've been on.”

- David Scurrah

“I loved it. Ed was inspiring.”

- Onoriode Coast

“Really impressed by Python's capability and excited to use as alternative to MatLab, as is free and better supported.”

- Carsten Hofmann

“Very comprehensive intro to every aspect of python. Highly qualified trainer. Beyond my expectation on every aspect.”

- Baichuan Sun

“The VM setup and USB is great. Ed is an excellent instructor - he presents well and welcomes any questions. He is clearly a super smart guy who has a great grasp on what he is teaching - able to just prototype on the fly and the course overall really opened my eyes to python.”

- Jack Hendy

“Course content was well presented and easily digested. Practical exercises were an essential part of the course – a good ratio of lecture/play was achieved. Well done Ed and Henry!”

- Steve Zegelin

“Simply awesome!!”

- James Park

“Both Ed and Henry presented well…. The course structure was adjusted to suit the participants quickly and easily.”

- Jenet Austin

“Excellent training course, excellently presented. Perhaps the best that I have had in the area of IT / programming.”

- George Grozev

“One of the best programming courses I have attended - thanks!”

- Giant Billen

“This course has shown me how I could have done the work I was doing just last week 10x more efficiently in Python.”

- Maruf Rahman

“It was a pleasure ... Shared feedback from all involved was that it’s been one of the most beneficial and well delivered training courses we’ve been a part of.”

- Dylan Matthews

“Was the most fulfilling and rewarding class I have taken since "general relativity" at uni. Was extremely well run. Excellent all round!”

- Dr Millicent Maier

“Very impressed with the course, delivery. And depth of knowledge of Ed and Henry. Far exceeded my expectations and has greatly improved my core skills as well as inspired so many new ideas for my current work / projects. Thank you!”

- Kelsey Druken