Ad-hoc insights from our engineering and delivery work.
3/28/2026 By Chirag Pandya
A practical guide to identifying silent AI cost drift in production systems, instrumenting the right metrics, and controlling spend without hurting response quality.
llm-opsai-cost-optimizationproduction-aiunit-economicsobservabilityagent-systems
Read post
3/25/2026 By Chirag Pandya
A practical explanation of the Black-Scholes options pricing model, what every term means, how it connects to the order book, and where it fails in real markets including crypto.
black-scholesoptions-pricingderivativesvolatilityquantitative-financecrypto-options
Read post
3/25/2026 By Chirag Pandya
A plain-English guide to how high frequency trading firms became modern market makers, what improved, what got riskier, and why it matters for execution quality.
high-frequency-tradingnew-market-makersmarket-makingliquidityalgorithmic-trading
Read post
3/25/2026 By Chirag Pandya
A practical, simple-language guide to making RAG systems more accurate in production, from baseline setup to evaluation loops, dynamic chunking, hybrid retrieval, and reranking.
ragretrieval-augmented-generationrag-evaluationhybrid-searchrerankersllm-applications
Read post
3/25/2026 By Chirag Pandya
Hedge funds are named after a specific risk-management technique that most of them no longer use. Here's what the original hedge was, why it worked, and what the term actually means today.
hedge-fundsfinanceinvestinglong-short-equityalternative-investments
Read post
3/25/2026 By Chirag Pandya
Transformers and LLMs work because language has syntax, semantics, and recoverable structure. Financial return sequences have none of that. A deep look at why tokenizing returns the way you tokenize words is a category error.
machine-learningtransformersfinancial-returnstime-seriesquantitative-financellmtokenization
Read post
3/25/2026 By Chirag Pandya
A plain-English guide to Knowledge Graph RAG, why vector search alone falls short in complex domains, and how combining graph relationships with semantic search makes AI answers more accurate and explainable.
knowledge-graph-raggraphragragneo4jvector-searchllm-applications
Read post
3/25/2026 By Chirag Pandya
A plain-English guide to why FastAPI needs Celery and Redis for background tasks, what they actually do, and a step-by-step setup including auto-reload for development.
celeryredisfastapibackground-taskstask-queuespython
Read post
3/24/2026 By Chirag Pandya
A practical, plain-English explanation of why LLMs often output double dashes, what tokenization has to do with it, and how to control punctuation in generated text.
llmtokenizationprompt-engineeringai-writing-styledouble-dash
Read post
3/23/2026 By Chirag Pandya
A plain-English but technical guide to the Avellaneda-Stoikov market making model, from reservation price intuition to real-world implementation trade-offs.
avellaneda-stoikovmarket-making-strategyquantitative-tradinginventory-riskalgorithmic-trading
Read post
3/22/2026 By Chirag Pandya
A practical guide to Smart Order Routing and Consolidated Order Books, how they work together, and why they are essential for best execution in fragmented markets.
smart-order-routerconsolidated-order-bookbest-executionmarket-microstructurealgorithmic-trading
Read post
3/21/2026 By Chirag Pandya
A practical guide to how AGENTS.md files improve AI coding agent efficiency, and how engineering teams can apply these patterns in real repositories.
agents-mdai-coding-agentsdeveloper-productivitysoftware-engineering
Read post