Mohammad Ausaf Logo Image
Mohammad Ausaf

mohammad ausaf

GenAI · Backend · Infra
@ Galleri5 (Collective Artists Network)

get to know me

Software Engineer at Galleri5 (now part of Collective Artists Network). Joined when the tech team was around 10 people, and one of four engineers building our generative AI stack — so I've had the chance to work across backend architecture, ML research, DevOps, and internal tooling.

Core work revolves around Generative AI, ML and backend systems — everything from classical techniques (NLP, clustering, vector search powering trend-based product discovery for Myntra and Ajio) to generative AI for video and image creation.

As one of the core engineers on the team, I built the backend and AI pipelines powering actual production content — including Mahabharat - Ek Dharmayudh streaming on JioHotstar. Most recently: worked with the team on a hybrid AI + 3D pipeline with multi-model orchestration designed for cinema-grade output, powering the AI-generated teaser for Chiranjeevi Hanuman: The Eternal (8M+ on YouTube / 12M+ on Instagram, premiered on IMAX).

A big part of the work involves designing autonomous AI agents that decompose creative prompts into multi-step workflows with dependency resolution, orchestrating 78 models across 7 providers (Replicate, FAL, ElevenLabs, BytePlus, Google, RunPod, GPU cluster) — plus building the internal tooling that makes these workflows usable by non-technical teams.

Infrastructure work includes building distributed systems like the Inference Gateway with WRR-based fair scheduling and 4-level concurrency control, GPU Dispatcher managing an 8 H100 GPU server fleet with lease-based locking and crash recovery via WAL. Managing deployments and migrations across GCP, Azure, and AWS with on-call rotation and monitoring.

Currently co-developing AI asset platform with Microsoft — designing multi-layer architecture for versioned storage, semantic retrieval, automated quality evaluation, and preference-aware ranking with real-time review workflows.

On the R&D side: building ComfyUI workflows, training LoRAs for character consistency and style transfer (e.g., Flux LoRAs for cloth styling in AI cataloguing), fine-tuning models, and developing new generation pipelines — from research to production.

Email Me

Tech Stack

Agentic AI
Generative AI
Backend Development
Cloud & DevOps
Python
FastAPI
PyTorch
MongoDB
Redis
Google Cloud
Firebase
Pinecone
ComfyUI
Docker
Azure
LangChain
Google Gemini
Replicate API
RunPod
CLIP
OpenAI
FAL AI
ElevenLabs
Workflow Orchestration
Enterprise Permissions
Asset Management
Multi-Modal AI
Content Generation
Real-time Collaboration

experience

software engineer (ai)

Galleri5 - Bengaluru

Dec 2023 - Present

  • Agentic AI Platform: Led development of AI Studio - multi-tenant content creation platform with RBAC and enterprise permissions. Built Workflow Planner using Gemini for natural language interpretation and multi-step execution plans with dependency resolution
  • Microsoft Collaboration: Co-developing AI asset platform with Microsoft. Designed multi-layer architecture handling versioned storage, semantic retrieval, automated quality evaluation, and preference-aware ranking. Implemented real-time review workflows with human feedback integration for creative iteration
  • Distributed Infrastructure: Built Inference Gateway orchestrating 78 models across 7 providers with WRR-based fair scheduling and 4-level concurrency control. Designed GPU Dispatcher managing 8 H100 GPU server fleet with lease-based locking, priority queuing (Redis sorted sets), and crash recovery via WAL
  • ML Pipelines: Built end-to-end ML pipelines serving major e-commerce clients (Myntra, Ajio, H&M). Developed CLIP-based vectorization for semantic product search across millions of SKUs, text classification with sentence transformers (BGE, mxbai-embed), and real-time vector upsertion with Pinecone
  • R&D & Prototyping: Researched and prototyped SOTA image/video generation techniques. Built custom ComfyUI workflows for client-specific image generation. Developed image generation evaluation loops using Gemini Vision for automated quality assessment
  • Team Leadership: Technical lead for 4-person junior engineering team. Established code review processes, JIRA workflows, and onboarding practices

Tech Stack: Python, FastAPI, MongoDB, Redis, AWS, GCP, Azure, Firebase, PyTorch, CLIP, Pinecone, ComfyUI, Docker, LangChain, OpenAI, Gemini, Replicate, FAL AI

bachelor of technology - computer science

KIET Group of Institution, Ghaziabad

2024

Relevant Coursework:

  • Probability and Statistics, Calculus, Operating Systems
  • Data Structures and Algorithms, Machine Learning
  • Databases

work

Tinify Platform Screenshot

ai studio - agentic ai content platform

Enterprise agentic AI platform where autonomous agents orchestrate complex multi-modal content workflows. Features intelligent workflow planning with Gemini, real-time collaboration systems, and dynamic task execution across 78 models and 7 providers. Technologies: FastAPI, MongoDB, LangChain, Redis, Multi-Agent Systems, Computer Vision.

Read More
Searchy Screenshot

searchy - on-device ai photo manager

Native macOS photo management app with AI-powered features, entirely on-device with zero cloud dependency. Features CLIP-based semantic search, duplicate detection with visual similarity scoring, face recognition and grouping, smart filtering, and auto-indexing with directory watchers. Modular architecture with swappable ML backends. Technologies: Swift, SwiftUI, CLIP, Face Recognition, Metal (GPU).

View Project
Boring Notch Lyrics

boring.notch lyrics - macOS notch music widget

Fork of Boring Notch adding real-time synchronized lyrics display. Features multiple display modes (flowing, alternating, stacked), per-monitor configuration, and timing offset controls. Supports Spotify, Apple Music, and YouTube Music. Technologies: Swift, SwiftUI, macOS APIs, Music Integration.

Read More