Quang Minh Ha

AI Engineer / Data Scientist

AI Engineer and Data Science student based in Hong Kong & Vietnam. Specializing in MLOps, Generative AI, Machine Learning applications, and cloud technologies.

Get in Touch

Work Experience

Fleet Management Limited

AI Engineer Intern

🗓️ June 2024 — Aug 2024 📍 Hong Kong

Developed and deployed a GenAI document rewriting application using AWS Bedrock for LLM inference, Amazon EC2 for scalable hosting, and S3 for secure document storage.
Implemented a RAG pipeline leveraging Gemini models for text embeddings and inference via Google Vertex AI. Integrated Pinecone Vector Store for efficient data storage and retrieval, implemented hybrid search with BM25 and re-ranking strategies, and optimized deployment on Google Kubernetes Engine (GKE).
Developed advanced prompt engineering techniques to generate synthetic data and implemented an LLM-as-a-judge framework to evaluate metrics like response relevancy, contextual precision/recall, faithfulness for selecting best chunking strategies.
Implemented end-to-end observability on GKE using OpenTelemetry for distributed tracing, Grafana and Prometheus for real-time monitoring, and a CronJob for periodic online evaluation. Developed an interactive Tableau dashboard for daily reporting, providing stakeholders real-time access to chat histories and key performance metrics.

DRESIO Limited

AI Engineer Intern

🗓️ June 2023 — Aug 2023 📍 Hong Kong

Optimized application backend for real-time pose detection using MediaPipe models by implementing multi-stream data handling with highly parallelized architectures combining multithreading and multiprocessing, achieving 50 FPS with CPU processing.
Containerized the application with Docker and orchestrated deployments using Kubernetes; implemented Prometheus and Grafana to monitor application performance and visualize real-time metrics, ensuring scalability and system reliability.
Created a Streamlit-based health monitoring application with YOLO-NAS from Ultralytics framework, deployed via ONNX Runtime, achieving 95% accuracy in real-time mouth movement analysis for patient health assessments.
Developed a cross-platform mobile demo app using Flutter and Google ML Kit, showcasing real-time pose detection capabilities and enhancing usability across iOS and Android devices.

Education

City University of Hong Kong

Bachelor of Science in Data Science, Minor in Mathematics

🗓️ Aug 2021 — June 2025 📍 Hong Kong

GPA: 4.0/4.3 (Rank 1/73)
Dean's List, HKSAR Government Scholarship, XYD Scholarship, CityUHK Full Tuition Scholarship
Coursework: Fundamentals of Machine Learning, Big Data Algorithms, Reinforcement Learning, Data-Intensive Computing, Data Mining, Database Systems, Regression Analysis, Bayesian Analysis, Mathematical Finance

McGill University

Exchange Semester

🗓️ Jan 2024 — May 2024 📍 Montreal, Canada

GPA: 3.83/4.0
Shanghai Commercial Bank Exchange Scholarship
Coursework: Applied Machine Learning, NLP with Deep Learning, Stochastic Processes, Statistical Learning

Featured Projects

Spotifu Music

A music streaming app that emulates Spotify's core features.

Source

Shopp App

An e-commerce platform that replicates Shopify's key features.

Source

ClonTagram

A social network that replicates the features of Instagram

Source

Skills

Programming

Python C/C++ Java R SQL Scala

Data Science

Scikit-Learn PyTorch TensorFlow Keras OpenCV NLTK SpaCy MySQL MongoDB Tableau

MLOps & Cloud

Git Jenkins Docker Kubernetes AWS GCP Azure Ansible Terraform Prometheus Grafana Jaeger

Data Engineer

Hadoop Spark Hive Kafka Trino MinIO Delta Lake Iceberg Hudi Airflow Flink

About Me

Hi, I’m Quang Minh Ha, a passionate AI Engineer and Data Scientist with experience in MLOps, GenAI, and building scalable machine learning systems. I enjoy tackling challenging problems at the intersection of data, algorithms, and software engineering. Currently pursuing a BSc in Data Science at City University of Hong Kong with a minor in Mathematics, and recently completed an exchange semester at McGill University. My experience includes internships focused on deploying LLMs, building RAG pipelines, optimizing real-time ML models, and developing full-stack applications.