AI / ML / Serverless

Planned

Portfolio AI Assistant

RAG-powered technical Q&A over curated portfolio content

Overview

Purpose-built to demonstrate practical AI agent architecture in a real production context. Rather than a generic chatbot, this assistant is scoped specifically to portfolio content: project documentation, architecture notes, and professional background. The RAG pipeline retrieves relevant content chunks from a vector store before passing context to the model, keeping responses grounded and auditable. The inference workflow is fully serverless: API Gateway routes requests through Lambda to the model API, with CloudWatch logging for every request and response. Safety controls constrain the prompt scope and prevent the assistant from generating content outside its defined boundaries. This is the same architecture pattern used in enterprise AI assistant deployments, applied at portfolio scale.

Key highlights

  • RAG pipeline over curated portfolio content: projects, architecture docs, background
  • Vector store for semantic retrieval (embeddings over structured portfolio data)
  • Serverless inference orchestration: API Gateway, Lambda, Bedrock or model API
  • Chat interface embedded into the portfolio site
  • Safety controls: prompt constraints, content boundaries, request logging
  • CloudWatch audit trail for all requests and responses
  • Designed to scale the same pattern to enterprise RAG deployments

Architecture

Architecture diagram coming soon. HLD and LLD documents will be linked below when complete.


Technology Stack

aws

BedrockLambdaAPI GatewayS3CloudWatchIAM

app

ReactTypeScriptChat UI component

tooling

Vector DB (TBD)GitHubVS Code

practices

RAGPrompt engineeringSafety controlsServerlessObservability

Documents

High-Level Design (HLD)

Coming soon

Coming soon

Low-Level Design (LLD)

Coming soon

Coming soon