Senior AI/ML & Software Engineer

Sameer Balraj Chopra

Senior AI/ML & Software Engineer with 15+ years of experience building production-grade AI systems, Python-based backend platforms, and scalable software solutions. Expert in Generative AI, LLM-powered applications, and machine learning pipelines using Python, Django, Flask, FastAPI, and Hugging Face Transformers. Skilled in designing and deploying retrieval-augmented workflows, multi-modal AI systems, and cloud-deployed backend architectures. Proven ability to deliver measurable business impact through high-performance, maintainable, and secure software solutions.

San Francisco, CA, USA15+ Years ExperienceB.S. CS, Carnegie Mellon

What I Build

I specialize in building:

Generative AI & RAG Applications

End-to-end retrieval-augmented generation pipelines using GPT-4, Claude, and LangChain — grounding LLM responses in real knowledge bases with FAISS and Pinecone vector search for production-grade accuracy

Agentic Workflow Automation

LLM-powered automation systems using function-calling and tool-use APIs that replace repetitive manual tasks — from content management to document processing — with intelligent, self-orchestrating workflows

Multi-Modal AI Systems

Combined text and image generation pipelines using Hugging Face Transformers, OpenAI DALL-E, and transformer-based NLP models for automated content creation, document intelligence, and visual asset generation

Python Backend APIs & Microservices

Scalable backend architectures with Django, Flask, and FastAPI — RESTful and GraphQL APIs, distributed microservices, and high-throughput data pipelines powering enterprise platforms with millions of users

Vector Search & Knowledge Retrieval

Semantic search infrastructure using FAISS, Pinecone, Weaviate, and Chroma — embedding pipelines, real-time knowledge base queries, and retrieval systems achieving 95%+ relevance accuracy at scale

Cloud-Deployed AI Platforms

Production AI infrastructure on AWS with Docker, Kubernetes, and Terraform — SageMaker model deployments, CI/CD pipelines for ML workflows, and automated scaling for high-traffic AI services

Technical Expertise

AI / LLM Engineering

GPT-4/3.5Anthropic ClaudeHugging Face TransformersLangChainRAGAgentic WorkflowsPrompt EngineeringLLM Fine-TuningMulti-Modal AIModel EvaluationHallucination Mitigation

Machine Learning & NLP

TensorFlowPyTorchScikit-learnBERTT5RoBERTaComputer VisionText ClassificationSummarizationNamed Entity Recognition

Backend & Software Engineering

Python (Django, Flask, FastAPI)RESTful APIsGraphQLMicroservices ArchitectureDistributed Systems

Databases & Vector Search

PostgreSQLMySQLMongoDBRedisPineconeFAISSWeaviateChromaETL PipelinesQuery OptimizationSemantic SearchEmbedding Models

Cloud & DevOps

AWS (EC2, S3, Lambda, SageMaker)DockerKubernetesCI/CD PipelinesTerraformLinuxInfrastructure Automation

Career Timeline

May 2023 — Feb 2026

Senior AI/ML & Full-Stack Engineer

Institute For Multi-Sensory Education • Southfield, MI

  • Architected and deployed end-to-end RAG pipelines using LangChain, FAISS, and OpenAI GPT-4, reducing customer support resolution time by 42% through an AI-powered knowledge retrieval assistant.
  • Engineered agentic workflow automation systems integrating LLM function-calling and tool-use APIs, automating 60%+ of repetitive content-management tasks across the e-learning platform.
  • Built multi-modal AI features for automated course content generation using Hugging Face Transformers and OpenAI DALL-E, increasing content production velocity by 3x.
  • Designed and maintained Django/PostgreSQL backend systems supporting enterprise e-learning, e-commerce, and CMS platforms with 50,000+ monthly active users.
  • Integrated payment processing infrastructure and product catalog systems handling $1M+ in annual online transactions with 99.9% uptime.
  • Implemented model evaluation frameworks to benchmark LLM output quality, reducing hallucination rates by 35% through systematic prompt engineering and fine-tuning strategies.
Jun 2021 — Apr 2023

AI/LLM Engineer & Full-Stack Developer

SymSoft Solutions • Sacramento, CA

  • Designed and deployed production LLM applications integrating OpenAI GPT-3.5/GPT-4 and Hugging Face models, serving 10,000+ daily API requests with sub-300ms average latency.
  • Built intelligent chatbot and virtual assistant platforms using LangChain agent frameworks with tool-use, memory, and retrieval capabilities, automating 70%+ of tier-1 customer support queries.
  • Developed automated document processing pipelines leveraging NLP and transformer-based models (BERT, T5) for entity extraction, summarization, and classification — reducing manual review effort by 55%.
  • Architected and optimized prompt engineering strategies (chain-of-thought, few-shot, retrieval-augmented) improving LLM task accuracy by 28% across multiple production use cases.
  • Built full-stack AI-powered web applications using Python (Django), Node.js, React, and Vue with RESTful APIs and microservices, improving operational efficiency by 40% across internal systems.
  • Integrated vector search (Pinecone, FAISS) for semantic retrieval, enabling real-time knowledge base queries with 95%+ relevance accuracy.
Jul 2015 — May 2021

Senior Software Engineer

Databricks • San Francisco, CA

  • Developed large-scale data and ML pipelines leveraging Spark, MLflow, and distributed processing systems.
  • Built AI-powered data processing workflows improving model training efficiency and scalability.
  • Designed microservices and REST APIs enabling real-time ML inference across enterprise systems.
  • Implemented feature engineering pipelines and model lifecycle management systems.
  • Improved platform performance by 40% through optimization of distributed workloads and caching strategies.
  • Collaborated with cross-functional teams to deliver AI-driven analytics solutions for enterprise clients.
May 2013 — Jun 2015

Software Engineer

Accenture • San Jose, CA

  • Developed scalable backend services using Python (Django) and PostgreSQL for enterprise SaaS platforms.
  • Built RESTful APIs enabling seamless integration across enterprise systems.
  • Designed ETL pipelines supporting real-time analytics and reporting dashboards.
  • Improved system performance by 30% through query optimization and architecture improvements.
  • Delivered solutions across multiple client engagements in finance and healthcare sectors.
Oct 2009 — Apr 2013

Software Engineer

Microsoft • Sunnyvale, CA

  • Developed enterprise-scale backend services and internal tooling using C# and .NET, supporting 10,000+ daily active internal users across engineering and operations teams.
  • Designed and implemented RESTful APIs and service integrations enabling communication between internal platforms and external enterprise systems at Microsoft scale.
  • Reduced internal workflow bottlenecks by 25% through improvements to system architecture, caching strategies, and automation of repetitive processes.
  • Built internal automation and reporting systems that reduced manual operational overhead by an estimated 15 engineer-hours per week across teams.

Academic Background

Bachelor of Science, Computer Science

Carnegie Mellon University

2006 — 2009 · Pittsburgh, PA

Coursework focused on software engineering, distributed systems, and AI-driven applications.

Selected Work

2,847Active Patients$84.2kRevenue MTD97.3%Uptime142AlertsPatient Activity (30 days)Recent Records
View Details

AI Knowledge Retrieval Assistant

Production RAG assistant for enterprise knowledge retrieval that reduced customer support resolution time by 42% across an e-learning platform.

PythonLangChainFAISSOpenAI GPT-4Django
Automation Pipeline OverviewAmazon APIeBay ScraperWalmart Feed+497 SourcesCelery WorkersData ExtractionPrice ComparisonRedis CacheQueue + CacheREST APIJSON OutputWebhooksCSV Export500+Data Sources40hrsSaved / Week99.8%Accuracy
View Details

Agentic Content Automation Platform

Agentic workflow automation system using LLM tool-use APIs to automate 60%+ of repetitive content-management work across an e-learning platform.

PythonLangChainFunction CallingAutomation APIsDjango
Document Analysis PipelineInput DocumentsPDFDOCIMGNLP EngineOCR + ExtractionEntity RecognitionClassificationSummarizationConfidence ScoreStructured Output96% AccuracyClassification ScoreEntities: 47 foundCategories: Contract, NDASummary generated
View Details

Multi-Modal Course Generation

Multi-modal AI content generation workflow using Hugging Face Transformers and OpenAI image generation, increasing course production velocity by 3x.

PythonHugging FaceOpenAIMulti-Modal AIDjango
AIAnalytics80%Auto-resolved2.3sAvg Response12Languages4.8SatisfactionSentimentTopicsBillingTechnicalAccount
View Details

LLM Support Assistant Platform

Production LLM assistant platform serving 10,000+ daily API requests with sub-300ms latency and automating 70%+ of tier-1 support queries.

PythonOpenAILangChainHugging FacePinecone
ETL Pipeline Architecture50+ SourcesPostgreSQLMySQLMongoDBREST APIsCSV / SFTPS3 Buckets+44 more...Apache AirflowExtractTransform (dbt)ValidateDeduplicateLoadSnowflakeData Warehouse10M+records / day99.97%uptimeOutputsREST APIDashboardsAlertsReports
View Details

Document Intelligence Pipeline

Automated document processing pipeline using transformer models for extraction, summarization, and classification, reducing manual review effort by 55%.

PythonBERTT5NLPAutomation
Distributed Scraping ArchitectureCrawlerEngineZillowListingsRealtorMLS DataRedfinPricingTruliaMarket+196 moreStructured Output200+SourcesJSON+ CSV99.5%Uptime
View Details

Predictive Analytics Platform

AI-powered document processing and predictive analytics platform that improved document classification accuracy to 94% and increased throughput by 40%.

TensorFlowPyTorchComputer VisionGraphQLRedis

Let's Work Together

Have a project in mind or just want to connect? I'd love to hear from you.

Get in Touch