๐ผ Career
Leading multi-disciplinary teams of data engineers and scientists across Customer Service and Fraud domains.
Lead Data Engineer/Architect for a talented team of data engineers and analysts, providing mentorship whilst actively building the wider Data & AI practice and advocating for DevOps best practices.
Key Achievements:
- Successfully delivered the RBA (Reception Baseline Assessment) Data Platform for the Department for Education
- Managed key stakeholders to grow the account commercially whilst ensuring successful delivery
- Enabled and advocated for DevOps best practices across the team
Technologies: Microsoft Azure, Databricks, Spark, Terraform, Power BI

Operational and technical lead for the data engineering team at Penguin. Managed a talented team of 5 engineers with specialisations across analytics, machine learning, platform, and test engineering.
Technologies: AWS, Dagster, Airflow, Terraform, DBT, Snowflake, FiveTran, Valohai, Power BI
Developed and optimised data pipelines for the NHS in response to Covid-19.
Key Responsibilities:
- Developed data pipelines using Python, Spark & SQL on the Foundry platform
- Developed technical framework for healthcare data pipelines under Palantir’s NHS contract
- Liaised with technical and non-technical teams to understand client requirements
- Implemented Python-based ETL with Git and CI/CD to enable data science workflows
Technologies: Palantir Foundry, Spark, Python, SQL
Implemented various proof-of-concepts to help optimise cost and increase revenue across payments, quantitative risk, financial crime, and retail/corporate banking.
Key Projects:
Mortgage Churn Model - Built predictive model using Python and Plotly Dash to identify customers likely to default, improving customer retention by up to 3%
Chatbot Optimisation - Applied NLP to improve customer satisfaction and identify where agents/chatbots could improve responses, with automated alerts for negative sentiment thresholds
Entity Resolution - Automated client remediation within the data lake using API integrations with DueDil, presenting results to C-Suite stakeholders
BBB Reporting - Created ETL scripts to automate invoicing reports to the British Business Bank for COVID business loans, processing 150,000 contract accruals daily and generating GBP 28M revenue in 2020
Credit Risk Modelling - Developed capital models (economic + regulatory) for the corporate bank using SAS
Infrastructure - Automated Data Lake ETL, CI/CD, and model builds using Docker, Cloudera CDSW, and AWS (Redshift, S3, SageMaker, ECR)
Technologies: AWS, Docker, Cloudera, Impala, HiveQL, Python (Sklearn, XGBoost, Scipy, Plotly Dash), SAS