About Me
Hey there! đź‘‹
I’m Sum, a software engineer passionate about building AI systems that solve real-world problems. What started as curiosity about how machines could “think” during my Computer Science studies at HKU has evolved into a career focused on creating practical AI applications that make people’s lives easier.
Currently, I’m fascinated by the intersection of traditional software engineering and AI – building systems that don’t just work in research notebooks, but can handle thousands of documents daily, process audio in real-time, and seamlessly integrate into business workflows.
What I’m Working On
AI-Powered Chess Engine: ElephantFormer
I recently built an end-to-end transformer-based chess AI for Elephant Chess (Chinese Chess) that treats move prediction as a sequence modeling problem. Training on ~300 out of 41,738 professional games, I designed a custom tokenization system and multi-output architecture that achieves strategic pattern learning beyond random play. The project challenged me to think about game AI from a completely different angle – viewing chess moves as language sequences rather than traditional tree search problems.
Production ML Systems at iSWIM
At iSWIM Technology, I’ve been architecting end-to-end AI systems that process real business data. My latest project is a report generation system that combines React frontends, ASP.NET backends, and Python FastAPI services with RAG capabilities. The system uses RabbitMQ job queues for async audio transcription with local GPU inference, reducing report creation time by several hours per report.
I also built a data annotation tool that transforms unstructured notes into structured event data – the kind of “invisible AI” that makes workflows smoother without users even realizing there’s ML happening behind the scenes.
My Journey
My path into AI wasn’t linear. I started with traditional software development during internships at TRM and Aspex Management, building document management systems and data pipelines. But I kept gravitating toward the challenging problems: How do you extract meaning from messy documents? How do you predict outcomes from incomplete data?
This curiosity led me to dive deep into machine learning during my final year at HKU, where I built a large scale sentiment analysis system under Prof. Lingpeng Kong’s supervision. The project taught me that the real challenge isn’t just building pipelines – it’s understanding the math behind models like VAEs, where concepts like KL divergence show up again and again across machine learning. It deepened my intuition for how models learn structured representations, especially in noisy, high-dimensional data.
Since graduating, I’ve focused on bridging the gap between cutting-edge AI research and practical business applications. Whether it’s document classification models that process hundreds of pages daily or RAG systems that align with users’ writing styles, I’m most excited when AI becomes a natural, helpful part of someone’s workflow.
Beyond Code
When I’m not training models or debugging systems, keeping up with AI news, exploring new architectures, and experimenting with machine learning beyond my day-to-day work. ElephantFormer was one such project — sparked by my curiosity about applying Transformers to gameplay, a less traditional domain, and my desire to understand how reinforcement learning algorithms like PPO, used in the post-training phase, contribute to the success of modern LLMs.
I’m always looking for new ways to learn — whether through hands-on projects, reading about recent breakthroughs, or digging into the techniques that make modern AI systems work in practice.
Technical Background
Current Focus: Production ML systems, RAG applications, distributed AI architectures
Languages & Frameworks: Python, C#, JavaScript, PyTorch, React, ASP.NET, FastAPI
AI/ML: LLMs (OpenAI, Claude, Gemini), transformers, computer vision, NLP, time series forecasting
Infrastructure: Docker, RabbitMQ, GPU inference, cloud platforms (GCP, AWS, Azure)
Education: BEng Computer Science, University of Hong Kong (2019-2023)
Looking to collaborate on interesting AI projects or just want to chat about the future of artificial intelligence? Feel free to reach out – I’m always excited to connect with fellow builders and thinkers!