Building a Medallion Architecture on Databricks (AWS)
The Medallion Architecture has become the de facto standard for organizing data in a lakehouse. In this post, we expl...
Hands-on Staff Data Engineer (Assistant Director, Data Engineering) at Moody's Analytics. Previously at FactSet, Franklin Templeton & S&P Global. I build the governed Databricks & Microsoft Fabric lakehouse platforms an engineering org builds on — and, most recently, my first GenAI/RAG proof-of-concept on top.
Enterprise-grade data platform designs and reference patterns
Bronze-Silver-Gold layered data platform on Databricks with Delta Lake and Unity Catalog governance.
Unified analytics platform leveraging OneLake, Data Factory, and Power BI for enterprise reporting.
Event-driven architecture with Kafka, Spark Structured Streaming, and Delta Live Tables.
Domain-oriented decentralized data ownership with federated governance and self-serve infrastructure.
A journey through data engineering leadership across global financial enterprises
Leading data platform modernization using Databricks on AWS — building lakehouse architectures, Delta Live Tables pipelines, and Unity Catalog governance frameworks.
Engineered high-throughput data pipelines processing financial market data at scale with Spark and cloud-native services.
Built ETL frameworks and data warehousing solutions for investment analytics and portfolio management systems.
Developed data integration solutions and analytics dashboards for credit risk and market intelligence platforms.
Evolving with the data stack — from Python & SQL pipelines to Databricks and Microsoft Fabric — building the platforms that serve analysts, data scientists and ML engineers
Aggregated and normalized real-time vendor pricing for US structured finance securities into one consistent store for the pricing desk.
View repository →Conformed multi-vendor market data into analysis-ready, point-in-time feature tables serving analysts and data scientists.
View repository →Built the ingestion, feature-engineering and train/serve-parity pipeline feeding downstream error-prediction and root-cause models.
View repository →First cloud build: a deployed data-quality tool for yield-curve validation, provisioned on AWS (EC2/S3/Redshift) with Infrastructure as Code.
View repository →Orchestrated an Azure data platform with a zoned lake and star schema, serving always-current self-service BI to stakeholders.
View repository →Moved to the lakehouse: a medallion architecture enriching the Orbis data product with large-scale international-trade analytics.
View repository →End-to-end automated integration with data quality as a first-class concern — validation gates, quarantine and a reusable ingestion template.
View repository →The team's first RAG proof-of-concept over financial research — prototyping grounded, cited retrieval on Databricks Mosaic AI Vector Search.
View repository →A unified Fabric lakehouse serving BI (Direct Lake), analysts (SQL) and data science from one governed Gold layer — mirroring the Databricks build to show both platforms.
View repository →Deep dives into data architecture, Databricks, and Microsoft Fabric
The Medallion Architecture has become the de facto standard for organizing data in a lakehouse. In this post, we expl...
Empowering the Telugu tech community — enterprise data engineering, demystified.
తెలుగులో డేటా ఇంజనీరింగ్ — మన భాషలో, మన కోసం.
డేటాబ్రిక్స్ లేక్హౌస్ — మొదటి నుండి ప్రొడక్షన్ వరకు
Your complete roadmap to mastering Databricks — clusters, notebooks, Delta Lake, Unity Catalog, and production-grade pipelines. Let's build this together!
మైక్రోసాఫ్ట్ ఫ్యాబ్రిక్ — యూనిఫైడ్ అనలిటిక్స్ విప్లవం
OneLake, Lakehouses, Data Factory, and Direct Lake mode — everything you need to architect modern analytics in Fabric. This changes the game.
మెడాలియన్ ఆర్కిటెక్చర్ — బ్రాంజ్, సిల్వర్ & గోల్డ్ వివరణ
The architecture pattern powering modern lakehouses. I'll walk you through real-world implementations with Delta Live Tables on Databricks.
డేటా ఇంజనీర్ల కోసం పైస్పార్క్ — ఇంటర్వ్యూ & అంతకు మించి
Not just interview prep — real production patterns. Transformations, window functions, performance tuning, and the questions top companies actually ask.
యూనిటీ క్యాటలాగ్ — ఎంటర్ప్రైజ్ డేటా గవర్నెన్స్
Access control, data lineage, and quality enforcement at scale. I'll show you how to set up governance that actually works across multi-cloud Databricks.
డెల్టా లైవ్ టేబుల్స్ — డిక్లరేటివ్ ETL పైప్లైన్స్
Stop writing boilerplate. DLT lets you declare your pipeline logic and handles orchestration, quality, and recovery. Let me show you how the pros do it.
Tools and technologies I build with daily
Building the future of enterprise data, one architecture at a time
I'm a hands-on Staff Data Engineer with deep expertise in designing, building, and owning enterprise-scale data platforms. My career spans senior data-engineering roles at some of the world's most respected financial-services firms — Moody's Analytics, FactSet, Franklin Templeton, and S&P Global.
Currently at Moody's Analytics as Assistant Director, Data Engineering (a Staff-level IC role), I build and own the governed Databricks (on AWS) and Microsoft Fabric lakehouse platform my engineering org builds on — design and CI/CD standards adopted across 12+ production pipelines, Unity Catalog governance, and FinOps that cut cross-environment compute ~35%. Most recently I built the team's first GenAI/RAG proof-of-concept on Mosaic AI Vector Search, and I'm extending into AI/ML platform engineering (feature stores, model serving, MLOps).
Beyond work, I create Telugu-language tutorials on YouTube, making data engineering concepts accessible to the Telugu-speaking tech community worldwide.