Building a Second Brain for Organizations: An Experiment with Knowledge Graphs and GraphRAG

Apps like Notion and Obsidian market themselves as a “second brain” for individuals—a place where you can dump everything you know, connect ideas through links, and have instant access to your accumulated knowledge. Obsidian even visualizes your notes as a graph, revealing connections you never knew existed. But what about organizations? Company knowledge lives in a different kind of chaos: Slack threads, Jira tickets, Salesforce records, HR systems, meeting transcripts, project updates, email chains. This information exists across 10-15 different systems, owned by different teams, formatted differently, and rarely talking to each other. When an executive asks “What are our top organizational risks right now?” or “What’s the impact if we lose our senior platform engineer?”, someone has to spend days manually piecing together information from disparate sources. ...

December 25, 2025 · 11 min

How I Slashed a 1 million Email processing Pipeline from 11 Days to 38 Hours with Lightweight Parallelism

In the era of Generative AI, the quality and scale of data processing have become more critical than ever. While sophisticated language models and ML algorithms steal the spotlight, the behind-the-scenes work of data preparation remains the unsung hero of successful AI implementations. From cleaning inconsistent formats to transforming raw inputs into structured information, these preparatory steps directly impact model performance and output quality. However, as data volumes grow exponentially, traditional sequential processing approaches quickly become bottlenecks, turning what should be one-time tasks into resource-intensive operations that delay model training and deployment. For organizations working with moderate to large datasets—too small to justify a full Hadoop or Spark implementation, yet too unwieldy for single-threaded processing—finding the middle ground of efficient parallelism has become essential for maintaining agile AI development cycles. ...

April 18, 2025 · 16 min