Siddharth Prabhu | AI Engineer

Edge Vision Analytics Pipeline

Scope: Continuous multi-threaded service managing 5 RTSP streams.

We needed to process 5 concurrent 1080p video streams on edge hardware. The initial PyTorch implementation had high latency (150ms), causing frame drops.

I migrated the inference to the OpenVINO runtime and applied INT8 quantization. I also replaced batch processing with an asynchronous streaming architecture to manage memory usage effectively.

METRICS

Latency Reduced from 150ms to 90ms

RAM Usage Stable under 450MB

*Constraint: Switched to stream-based processing as batching caused unacceptable latency spikes.

Hybrid RAG System

Scope: Self-contained API service handling ingestion and query serving.

Standard dense vector search struggled with queries requiring exact matches, such as specific part numbers or error codes.

I implemented a hybrid search system combining dense vectors with BM25 keyword retrieval. I also built a fallback mechanism to switch providers automatically if API rate limits were triggered.

PERFORMANCE

Retrieval P95 Under 250ms

*Limitation: The in-memory FAISS index would require migration to a vector database for datasets exceeding 5 million vectors.

Async Document Processing

Scope: Distributed task queue architecture.

Synchronous processing of large documents was blocking the main application thread, leading to timeouts.

I decoupled the ingestion process using FastAPI background tasks for lightweight updates and Celery workers for CPU-intensive OCR tasks. This ensured the application remained responsive under load.

THROUGHPUT

Capacity 100+ PDFs per hour

Traffic Anomaly Detection

View IEEE Paper

This research focused on optimizing vehicle detection accuracy under challenging lighting conditions.

I implemented the training pipeline and defined evaluation metrics on a custom annotated dataset. The findings highlighted specific failure cases relevant to deploying vision systems in real-world environments.

VALIDATION

F1-Score 92% on test set

*Published in IEEE ESCI 2024.

I build intelligent systems that solve real engineering problems.

Selected Projects

Edge Vision Analytics Pipeline

Hybrid RAG System

Async Document Processing

Traffic Anomaly Detection

Technical Stack

Languages

Vision & Edge

Core AI

Backend

Engineering Writing

Data Defensibility

Algorithmic Feedback Loops