Welcome to Toby Tuan Nguyen (aka Toshi) personal page
Here is what I would like you to know about me.
About me
- Senior Data Engineer with 7 years’ experience building cloud data platforms and automation systems across finance, e-commerce, and consultancy
- Delivered production pipelines, Azure/AWS infrastructure-as-code, and AI-integrated workflows at scale, with proven outcomes including £300M+ GMV campaign support, reliability improvements, and GDPR-aligned delivery across multi-region teams (London, Berlin, Singapore)
- I enjoy building reliable systems that make teams faster (and yes, I still code at night)
Experience
-
Fiftyminds — Exeter, Devon, UK
Senior Software Engineer, Jan 2026 – Present
Partnered directly with South West England SME clients to assess existing systems, lead requirements discovery, and design secure, scalable solution architectures
Architected AI-integrated and automation-driven solutions (Azure Functions, Power Automate) to streamline IT and business processes, reducing manual processing and improving efficiency
Implemented Computer Vision and NLP components (OpenCV, OpenAI API), achieving 80% document map image analysis accuracy
Developed and deployed scalable, secure Azure infrastructure (Blob Storage, Azure SQL Database, Azure Queue Storage and Service Bus) using Infrastructure-as-Code (Terraform/Bicep)
Implemented data protection and security-by-design practices (Key Vault, RBAC, VNet, Private Endpoints) to support UK GDPR and DPA 2018 compliance, including handling of NHS sensitive data -
Global Fashion Group S.A. (FWB: GFG) — London, Remote
Data Engineer II, Aug 2021 – Jul 2024
Developed and deployed production-grade 20+ Spark (Scala, PySpark) data product pipelines orchestrated with Airflow for GFG’s Pricing Engine as the sole Data Engineer
Powered dynamic pricing at scale in Zalora’s monthly and annual promotional campaigns, including Asia’s largest sales event 11/11, supporting ~£300M+ gross merchandise value
Architected the group data platform, including a Redshift data analytics warehouse and an S3 and Athena lakehouse, enabling Data-as-a-Service for subsidiaries across Europe (UK, Germany) and regions including South East Asia, Latin America and Australia
Automated data pipelines CI/CD with Jenkins, data platform infrastructure provisioning with Terraform and monitoring using observability stack (Prometheus, Grafana, Slack alerting), reducing downtime by 20%
Led Python/PySpark migration of 40+ legacy Scala/Spark pipelines onto Kubernetes clusters and dbt orchestration, cutting technical debt and enhancing maintainability -
Home Credit — Ho Chi Minh City, Vietnam
Data Engineer, Nov 2017 – Aug 2021
Promoted from BI Developer (2017–2018) to Data Engineer (2018–2021) in recognition of technical performance and business impact; awarded Honoured Employee in 2018
Developed Customer Overpayment Tool as sole data engineer to detect and return excess loan repayments, safeguarding >110B VND (~£3.5M), improving customer trust, and gaining visibility at C-level
Built cloud-based data lake using Keboola (Snowflake & AWS) integrating multiple on-prem databases (SQL Server, Oracle, MySQL, Google BigQuery, PostgreSQL, Cassandra) by both batch and stream processing (Apache Kafka)
Led rollout of the company’s first Hadoop-based Big Data Platform (Hive, Impala, Apache NiFi), reducing reliance on legacy systems (Oracle-based DWH) and enabling scalable analytics across business units
Automated ETL/ELT pipelines, reducing manual workloads and improving BI dashboards and ad-hoc reports delivery speed by ~50%
Technical Skills
Data Engineering & Cloud
- Languages: Python (PySpark), R, SQL (T-SQL, PL/SQL), Scala (Spark), Node.js
- Cloud Platforms: Microsoft Azure, Amazon Web Services (AWS), Google Cloud Platform (GCP)
- Infrastructure: Linux/Unix, Terraform, Bicep, Docker, Kubernetes/K8s, CI/CD (GitHub Actions, Jenkins)
- Data Systems & Databases: AWS Glue, Azure Data Factory, Amazon Redshift, Google BigQuery, Oracle, PostgreSQL, Cassandra, Snowflake, Databricks, MongoDB, Hadoop ecosystem (HDFS, HBase, Spark, Hive, Impala)
- Data Pipeline Orchestration: Apache Airflow, dbt, Apache NiFi
- Analytics Engineering & Visualisations: Power BI, Tableau, Superset
- Architecture: Distributed Systems, System Design, Dimensional Modelling, Data Warehouse, Data Lake, ETL/ELT, OLAP, Data Streaming (Kinesis/Kafka), Change Data Capture (CDC), DevOps, IaC, MLOps
- ML/AI: scikit-learn, XGBoost, PyTorch, NLP (Hugging Face, NLTK), Computer Vision (OpenCV), LLM integrations (OpenAI API)
- Security & Compliance: Key Vault, RBAC, VNet, Private Endpoints, UK GDPR / DPA 2018 compliance, secure-by-design delivery
Web Development
- Languages: C#, JavaScript
- Frameworks: ASP.NET Core, Razor, Node.js
- Webserver: Apache, NGINX
Fundamentals
CS Fundamentals
- Data structures & algorithms
- OOP and functional programming
- E/R and relational modelling
DB Fundamentals
- ACID transactions; OLTP vs OLAP
- Normalisation vs denormalisation
- CAP theorem; scaling patterns
- Dimensional modelling; ETL/ELT
Education
-
University of Exeter (UK)
Jan 2025 – Jan 2026
MSc Statistical Data Science (grade pending)
Key modules: Mathematics, Statistics, R Data Analysis, Statistical Data Modelling, Machine Learning/Deep Learning, Social Network & Text Analysis/Natural Language Processing
Awarded Postgraduate Taught Global Excellence Scholarship 2024/25 -
University of Science - Vietnam National University, Vietnam
Sep 2019 – May 2023
BSc Information Technology
Key Modules: Object-Oriented Programming, Databases, Computer Network, Data Structure & Algorithms, System Design -
International University - Vietnam National University, Vietnam
Sep 2012 – May 2017
BBA Business Management
Awarded Summer Scholarship, 2016 -
Le Hong Phong High School for the Gifted, Vietnam
Sep 2009 – May 2012
High School Diploma (GCSEs equivalent), Specialist School in Math and Computer Science
Certifications
Honors & Awards
- Postgraduate Taught Global Excellence Scholarship — University of Exeter (2024/25)
- Honoured Employee, IT Division — Home Credit Vietnam (2018)
- Vietnam Delegate — Japan-East Asia Network of Exchange for Students and Youths (JENESYS) (2016)
- Summer Scholarship — International University (2016)
- Student Excellency in Computer Science, City Level — Ho Chi Minh City, Vietnam (2009, 2011 & 2012)
Volunteering (Open to contribute)
- Happy to support non-profits/communities with data cleaning, dashboards, automation, and lightweight data pipelines.
- Prefer projects with clear impact and maintainable handover.
Connect
Toby (Tuan) Nguyen