Bootstrapping Loss Ratios
Let's take a look at loss ratios and how to use bootstrapping (aka bootstrap resampling, sampling with replacement) to quickly and easily calculate empirical probability distributions to qualify these popular point-estimate portfolio performance metrics.
Contagious Coughs & Colds 2019 Commute Calculator
A naive interactive calculator to illustrate of effect of population mixing in a contagious environment
Build You a Library
If you run a data science team, buy yourselves a technical reference library
PyMC3 Examples: GLM with Custom Likelihood for Outlier Classification
A worked example of a novel generative model to filter out noisy / erroneous datapoints in a set of observations, compared to alternative methods. Implemented in the probabilistic programming language `pymc3` in a fully reproducible Notebook, open-sourced and submitted to the examples documentation for the PyMC3 project
On contractor day rates
How does an annual salary convert to a day-rate?
Don't call it a comeback
I've been here for years
Approaches to Data Anonymisation
Using Bloom filters, hash tables, obfuscation and aggregation to ensure the information present in datasets is available only to the appropriate level of access
Delivering Value Throughout the Analytical Process
Data science doesn't just lead to insights and products: here we define SPEACS, a generalised analytical process that illustrates the business benefit at every stage.
Survival Analysis: Part1 - A Brief Overview
Survival analysis is long-established within life actuarial work but infrequently used in general data science projects. This series of posts investigates why it's so useful for time-dependent effects, with worked examples.
Tools of the trade (an overview)
We use a variety of software tools for preparing, exploring and modelling data; usually scientific, lightweight and flexible, allowing bespoke insight.