APPLIED AI ENGINEER

I build intelligent systems that solve real engineering problems.

My focus is on bridging the gap between research and production. I architect systems that optimize for latency, deployment constraints, and reliability in the real world.

Selected Projects

Edge Vision Analytics Pipeline

Scope: Continuous multi-threaded service managing 5 RTSP streams.

We needed to process 5 concurrent 1080p video streams on edge hardware. The initial PyTorch implementation had high latency (150ms), causing frame drops.

I migrated the inference to the OpenVINO runtime and applied INT8 quantization. I also replaced batch processing with an asynchronous streaming architecture to manage memory usage effectively.

METRICS
Latency Reduced from 150ms to 90ms
RAM Usage Stable under 450MB
*Constraint: Switched to stream-based processing as batching caused unacceptable latency spikes.

Hybrid RAG System

Scope: Self-contained API service handling ingestion and query serving.

Standard dense vector search struggled with queries requiring exact matches, such as specific part numbers or error codes.

I implemented a hybrid search system combining dense vectors with BM25 keyword retrieval. I also built a fallback mechanism to switch providers automatically if API rate limits were triggered.

PERFORMANCE
Retrieval P95 Under 250ms
*Limitation: The in-memory FAISS index would require migration to a vector database for datasets exceeding 5 million vectors.

Async Document Processing

Scope: Distributed task queue architecture.

Synchronous processing of large documents was blocking the main application thread, leading to timeouts.

I decoupled the ingestion process using FastAPI background tasks for lightweight updates and Celery workers for CPU-intensive OCR tasks. This ensured the application remained responsive under load.

THROUGHPUT
Capacity 100+ PDFs per hour

Traffic Anomaly Detection

View IEEE Paper

This research focused on optimizing vehicle detection accuracy under challenging lighting conditions.

I implemented the training pipeline and defined evaluation metrics on a custom annotated dataset. The findings highlighted specific failure cases relevant to deploying vision systems in real-world environments.

VALIDATION
F1-Score 92% on test set
*Published in IEEE ESCI 2024.

Technical Stack

Languages

  • Python
  • SQL
  • C (Basics)
  • Bash

Vision & Edge

  • OpenVINO
  • YOLOv8
  • DeepStream
  • OpenCV

Core AI

  • RAG Pipelines
  • LangChain
  • PyTorch
  • Hybrid Search

Backend

  • FastAPI
  • Docker
  • Vector DBs
  • Git

Engineering Writing

I write to clarify my thinking on system design and data strategy.

Read on Medium →