projects

Selected work across AI engineering, data engineering, and software engineering. Each card links to the case study + code where available.

20 projects
project TomTom

DCP Guardian, AI agent that validates data-collection plans

An AI-powered Slack bot that reads a data-collection-plan document on Confluence, cross-references it against the live event registry and metrics catalog, optionally analyses a GitHub PR alongside, and posts a structured pass/fail review back into the thread. Multi-agent workflow with Agno + ChromaDB vector search + deterministic result caching.

project TomTom

TomTom Traffic Agent (CES 2026 demo)

Production multi-agent system for traffic analytics, demoed by Product Management at CES 2026 Las Vegas. Five specialised agents (routing, route monitoring, junction analytics, area analysis, traffic volumes) orchestrated through Agno's Team abstraction, served over AG-UI v1.0 with persistent chat memory.

project TomTom

Neighbourly, AI location analyser (hackathon project)

Full-stack AI app that takes an address and returns a comprehensive neighbourhood profile (POIs, accessibility, routing to key destinations, market signal, livability score) by orchestrating a multi-step agent workflow over the TomTom MCP servers. Hackathon project that showcases MCP usage.

project TomTom

Vehicle-telemetry silver-layer ETL

Refactored a vehicle-telemetry processing pipeline. Transforms raw nested JSON / protobuf from millions of in-car navigation clients into clean Delta tables that power navigation-quality dashboards. Designed for query simplicity: PMs answer 'route success rate in country X this week' with a single SELECT.

project TomTom

ETL template for API analytics pipelines

Refactored the inconsistent set of ETL pipelines feeding TomTom's API analytics into a single OOP template, distributed as an internal Python package. Engineers subclass the base, get live Azure Data Explorer connections for free, and only write the business logic in extract / transform / load. Result: consistent, version-controlled pipelines across volume, response-time, and error-rate use cases.

project TomTom

Developer-portal analytics APIs

REST API layer that powers the analytics dashboards on developer.tomtom.com. Sits on top of an Azure Data Explorer (Kusto) backend that ingests every API call across TomTom's developer products. Volume reports, response-time percentiles, error-rate breakdowns, and per-product usage all flow through this layer.

project Oyo Vacation Homes

Enterprise data lake on GCP, from scratch

Architected an enterprise data lake on Google Cloud (BigQuery, Cloud Storage, Cloud Composer / Airflow) at Oyo Vacation Homes. Designed a metadata-driven ETL framework in PySpark that cut new-pipeline development time by ~70%. Established schema-versioning and quality-validation patterns the team still uses.

project Tata Consultancy Services

PySpark ETL optimisation for Citi Bank Singapore

Optimised long-running ETL pipelines for Citi Bank Singapore (TCS engagement). Cut Data Stage job execution from 4 hours to 1 hour through Python multiprocessing + PySpark parallelisation. Designed real-time message handling with Kafka, plus AVRO / Parquet conversion and HDFS compression strategies for the bank's machine-learning data pipelines.

project Tata Consultancy Services

User-management platform for Sainsbury's

First professional engagement: built REST APIs and backend services for Sainsbury's (UK's largest retailer) user-management platform via TCS. Python Flask, OOP design patterns, and direct collaboration with the Corporate Pingit Business Product Owners through an Agile SDLC.