This course is for data analysts, financial analysts, statisticians, software developers, and other technical staff who want to use Python as a unified language for automating and sharing data analysis and for performing powerful predictive analytics. It is intended for people who already program in another language such as C#, Java, R, or Matlab.
Some familiarity with programming concepts (in any language) is assumed.
By the end of the course, you will have all the knowledge you need to start using Python competently for automating processes that involve analysing, modelling, visualising, and predicting many kinds of data. You will have gained experience with using Python for practical data-manipulation tasks with data in a variety of formats, including time-series data, CSV files, Excel spreadsheets, and SQL databases. You will have applied powerful tools for clustering, classification, and regression in practical settings on small and large data sets. You will understand the elegance and power of the Python language and its ecosystem of packages for data analytics, and you will be well placed to continue learning more as you use it day-to-day.
Day 1 covers how to use Python for basic scripting and automation tasks, including tips and tricks for making this easy. The syllabus is as follows:
Day 2 introduces further important concepts for real-world scripting in Python. The syllabus is as follows:
The Pandas package is an amazingly productive tool for working with and analysing data in Python. Day 3 gives a thorough introduction to Pandas and related tools for working with different kinds of data, including spreadsheets, time-series data, and SQL databases. The syllabus is:
Day 4 introduces three of the most fundamental and powerful techniques for analysing many kinds of real-world data in Python: clustering, classification, and regression. The datasets are selected from a range of fields: finance, geospatial analysis, medicine, and the social sciences. The syllabus is:
We will supply you with printed course notes and a USB stick containing a complete Python environment based on VirtualBox. This saves time in the course and allows us to focus on using Python rather than installing it. The USB stick also contains kitchen-sink Python installers for multiple platforms, solutions to the programming exercises, several written tutorials, and reference documentation on Python and the third-party packages covered in the course.