Shaan Aucharagram

Shaan Aucharagram

Data Engineering Manager

About Me

Data Engineering Manager at Penguin Random House UK πŸ§πŸ“š, leading a team of 5 talented data engineers. Combining Data Science and Engineering expertise to build robust pipelines and drive growth πŸš€πŸ”₯. Passionate about unlocking the power of data πŸ§ πŸ’‘ to inform data-driven strategies πŸ’ΌπŸ”‘. Prior experience at Palantir and at Santander.

πŸ† BIMA100 (Young Trailblazer in Tech)

Interests

  • Data Science
  • Data Engineering
  • Machine Learning
  • Web3

Skills

Python

Git

R

Data Viz

SQL

Machine Learning

Statistics

Domain Knowledge

Stakeholder Management

Skills Breakdown

Technical Skills

  • Python
  • R
  • SQL
  • SAS
  • AWS
  • Fivetran
  • DBT
  • Airflow
  • Docker
  • Spark (PySpark)
  • Palantir Foundry
  • Cloudera
  • Tableau
  • Git
  • Unix (bash) scripting
  • Plotly Dash
  • HDFS: Hadoop, Impala, Hive
  • Web Development
  • Package Building

Soft Skills/Business Skills

  • Stakeholder Management
  • Public Speaking

Interests/Hobbies

  • Sustainability
  • Web3
  • Football, Skiing & Tennis
  • Vegan Food

Experience

 
 
 
 
 

Data Engineering Manager

Penguin Random House UK

Feb 2023 – Present London
 
 
 
 
 

Data Engineer

Palantir (Contractor via Hexegic)

Aug 2022 – Oct 2022 London

● Developing data pipelines using Python, Spark & SQL – debugging any issues with the pipelines using the foundry platform within Palantir.

● Promoting a goal-setting and result-orientated attitude within the team using agile development techniques. Role further includes developing technical framework for data engineering for the healthcare data pipelines, which Palantir has a contract with the NHS.

● Liaising with technical and non-technical teams within Palantir to understand client requirements for the data pipelines and using Python-based ETL, with Git and CI/CD to ensure that the data scientists can operate.

 
 
 
 
 

Data Scientist

Santander UK

Sep 2019 – Jul 2022 London

● Chatbot optimization using natural language processing in Python to improve customer satisfaction and identify where agents/chatbots can improve responses and alert appropriate stakeholders when customer sentiment reaches a certain negative threshold.

● Mortgage churn model using Python and Plotly Dash to help improve customer retention by predicting customers who are likely to default on their mortgage. Improved customer retention by up to 3%.

● Entity resolution using Python to automate client remediation within the banks data lake using API calls from the third-party DueDil and implement a robust data engineering solution, presenting the project C-Suite stakeholders, technologies used include Python, Spark & Impala SQL.

● BBB reporting using SQL to create ETL scripts to automate the invoicing reports to the British Business Bank for COVID business loans. This replaced the slow manual reporting tools by processing 150,000 contract accruals daily, generating Β£28M revenue in 2020.

● Model experimentation using Python for a range of proof-of-concept models internally using AWS for deployment.

● Automating Data Lake ETL, CI/CD and model build in Docker images using Cloudera CDSW and AWS (Redshift, S3, Sagemaker, ECR).

● Operating using the agile project management methodology within a team of data scientists.

● Credit Risk using SAS to develop capital (economic + regulatory) models for the corporate bank.

● Technical presentations such as CI/CD, Git for version control and AWS to employees within the bank.

 
 
 
 
 

Chief Technology Officer (CTO)

Suneeta London

Jan 2017 – Sep 2019 London

Managing the development of the Suneeta London website www.suneetalondon.co.uk and the respective CBD website www.suneetacbd.co.uk, whilst ensuring that we complied with legislation for the CBD website.

Attending a variety of events and acting on the companies behalf, most notable collaboration was being invited to the HQ of ASOS.

Certifications & Awards

BIMA 100 Class of 2021

Top 100 most influential individuals within the technology sector

Finalist in Outstanding Higher/Degree Apprentice Category (Level 6/7)

Data Scientist With Python

See certificate

Machine Learning

See certificate

Prestigious Principal’s Award In Computing For Outstanding Achievement

Recognition for the AI and software programmed in Computing. Involved in creating a unique ‘Flappy Bird’ style game and for demonstrating a strong interest in AI/Machine Learning.

Best Television Script

Recognition for my Media Studies coursework, and for extensive use of postmodernism within the television script. Awarded out of all Film and Media students within Reigate College.

Posts

Classifying Random Cat Facts using NLP and Sentiment Analysis

Natural Language Processing using SpaCy and Classifying Random Cat Facts Retrieved Via a Web API Into Positive, Negative or Neutral Using Sentiment Analysis.

Why I Chose A Degree Apprenticeship Over University

My thoughts on the traditional university route vs the modern degree apprenticeship path.

What Is Machine Learning?

Machine Learning explained in an easy-to-understand format without the jargon.

Some Of My Work

*

Algorithmic Trading

Algorithmic Trading using Moving Weighted Average trading strategy in Python.

My Personal Website

My website, the site you are on right now, made in R using the blogdown package.

Recreating Flappy Bird With a Twist

My sixth form coursework, made in C# with Unity based on Flappy Bird but with additional functionality.

Contact

For any business enquiries, use any of the hyperlinks