Antaripa Saha

All my blog posts on AI, new research, coding agents, and math.

The needle-in-a-haystack problem in long-context models: the margin workflow, chunked prefill, the KV cache, and evaluation results.

Handle messy and complex tables for accurate retrieval using hierarchies, coordinates, and summaries.

RAG pipelines break on real-world documents because parsers flatten structure and chunkers chunk blindly.

Pair Tensorlake with Outlines to keep document pipelines clean: structured inputs in, schema-validated generations out.

Trace every tool call, visualize deviations, and stop hallucinated portfolio changes before they hit production workflows.

Add telemetry to every tool invocation, classify failures, and auto-heal retries so production agents degrade gracefully.

Define agent memory, understand its architectures, and see how it keeps instructions, traits, and goals alive across conversations.

Diagnose why stateless agents forget preferences, then layer in Mem0-style traces to keep context, taste, and tone consistent.

Build a hybrid retriever that weighs embeddings, freshness windows, and web snippets, then persists the good stuff back into memory.

Legal research pipelines need grounding, traceability, and evals. This blueprint shows how we shipped that stack for counsel teams.