Data & AI Platform Engineering • Databricks • Microsoft Fabric • GenAI/RAG

Building Data & AI Platforms
at Enterprise Scale

Hands-on Staff Data Engineer (Assistant Director, Data Engineering) at Moody's Analytics. Previously at FactSet, Franklin Templeton & S&P Global. I build the governed Databricks & Microsoft Fabric lakehouse platforms an engineering org builds on — and, most recently, my first GenAI/RAG proof-of-concept on top.

Professional Path

A journey through data engineering leadership across global financial enterprises

Current

Assistant Director - Data Engineer

Moody's Analytics

Leading data platform modernization using Databricks on AWS — building lakehouse architectures, Delta Live Tables pipelines, and Unity Catalog governance frameworks.

Previous

Product Specialist II / Senior Data Engineer

FactSet

Engineered high-throughput data pipelines processing financial market data at scale with Spark and cloud-native services.

Previous

Research Analyst / Data Engineer

Franklin Templeton

Built ETL frameworks and data warehousing solutions for investment analytics and portfolio management systems.

Previous

Data Researcher II/ Data Engineer

S&P Global

Developed data integration solutions and analytics dashboards for credit risk and market intelligence platforms.

Project Journey

Evolving with the data stack — from Python & SQL pipelines to Databricks and Microsoft Fabric — building the platforms that serve analysts, data scientists and ML engineers

2014

Structured Finance Pricing Pipeline

Python 2.7 · SQL — Data Engineer

Aggregated and normalized real-time vendor pricing for US structured finance securities into one consistent store for the pricing desk.

View repository →
2016

Market Data Pipeline & Feature Platform

Python · pandas — Data Engineer

Conformed multi-vendor market data into analysis-ready, point-in-time feature tables serving analysts and data scientists.

View repository →
2018

Error Telemetry Pipeline (DE for ML)

Python · NLP — Data Engineer → Senior Data Engineer

Built the ingestion, feature-engineering and train/serve-parity pipeline feeding downstream error-prediction and root-cause models.

View repository →
2019

Yield Curve Outlier Detection

AWS · Streamlit · Terraform — Senior Data Engineer

First cloud build: a deployed data-quality tool for yield-curve validation, provisioned on AWS (EC2/S3/Redshift) with Infrastructure as Code.

View repository →
2021

Platform Usage Analytics

Azure (ADF · Synapse · ADLS Gen2) · Power BI — Senior Data Engineer

Orchestrated an Azure data platform with a zoned lake and star schema, serving always-current self-service BI to stakeholders.

View repository →
2022

Customs & Trade Analytics Lakehouse

Databricks · PySpark · Delta · Unity Catalog — Senior DE → Staff Data Engineer

Moved to the lakehouse: a medallion architecture enriching the Orbis data product with large-scale international-trade analytics.

View repository →
2023

Grant Data Integration Pipeline

Databricks · Delta · Great Expectations — Staff Data Engineer

End-to-end automated integration with data quality as a first-class concern — validation gates, quarantine and a reusable ingestion template.

View repository →
2025

Financial Research RAG

Databricks GenAI · Mosaic AI Vector Search · MLflow — Data & AI Platform Engineering

The team's first RAG proof-of-concept over financial research — prototyping grounded, cited retrieval on Databricks Mosaic AI Vector Search.

View repository →
2026 · Present

Enterprise Lakehouse on Microsoft Fabric

Microsoft Fabric · OneLake · Direct Lake — Data & AI Platform Engineering

A unified Fabric lakehouse serving BI (Direct Lake), analysts (SQL) and data science from one governed Gold layer — mirroring the Databricks build to show both platforms.

View repository →

Technical Blog

Deep dives into data architecture, Databricks, and Microsoft Fabric

Data Architect Telugu

Empowering the Telugu tech community — enterprise data engineering, demystified.
తెలుగులో డేటా ఇంజనీరింగ్ — మన భాషలో, మన కోసం.

Series
తెలుగు · Telugu

Databricks Lakehouse — Zero to Production

డేటాబ్రిక్స్ లేక్‌హౌస్ — మొదటి నుండి ప్రొడక్షన్ వరకు

Your complete roadmap to mastering Databricks — clusters, notebooks, Delta Lake, Unity Catalog, and production-grade pipelines. Let's build this together!

Series
తెలుగు · Telugu

Microsoft Fabric — The Unified Analytics Revolution

మైక్రోసాఫ్ట్ ఫ్యాబ్రిక్ — యూనిఫైడ్ అనలిటిక్స్ విప్లవం

OneLake, Lakehouses, Data Factory, and Direct Lake mode — everything you need to architect modern analytics in Fabric. This changes the game.

Deep Dive
తెలుగు · Telugu

Medallion Architecture — Bronze, Silver & Gold Explained

మెడాలియన్ ఆర్కిటెక్చర్ — బ్రాంజ్, సిల్వర్ & గోల్డ్ వివరణ

The architecture pattern powering modern lakehouses. I'll walk you through real-world implementations with Delta Live Tables on Databricks.

Masterclass
తెలుగు · Telugu

PySpark for Data Engineers — Interview & Beyond

డేటా ఇంజనీర్ల కోసం పైస్పార్క్ — ఇంటర్వ్యూ & అంతకు మించి

Not just interview prep — real production patterns. Transformations, window functions, performance tuning, and the questions top companies actually ask.

Tutorial
తెలుగు · Telugu

Unity Catalog — Enterprise Data Governance

యూనిటీ క్యాటలాగ్ — ఎంటర్‌ప్రైజ్ డేటా గవర్నెన్స్

Access control, data lineage, and quality enforcement at scale. I'll show you how to set up governance that actually works across multi-cloud Databricks.

Hands-On
తెలుగు · Telugu

Delta Live Tables — Declarative ETL Pipelines

డెల్టా లైవ్ టేబుల్స్ — డిక్లరేటివ్ ETL పైప్‌లైన్స్

Stop writing boilerplate. DLT lets you declare your pipeline logic and handles orchestration, quality, and recovery. Let me show you how the pros do it.

Subscribe to Data Architect Telugu

Tech Stack

Tools and technologies I build with daily

Cloud Platforms

Microsoft Azure AWS Microsoft Fabric Databricks

Data Engineering

Apache Spark PySpark Delta Lake Delta Live Tables Apache Kafka Airflow

Architecture Patterns

Medallion Architecture Data Mesh Lakehouse Data Vault 2.0 Star Schema

Governance & Security

Unity Catalog Purview RBAC Data Lineage Data Quality

Languages & Tools

Python SQL Scala Terraform Git Docker

Analytics & BI

Power BI Databricks SQL Azure Synapse dbt

About Me

Building the future of enterprise data, one architecture at a time

Kamalakar Peta

Kamalakar Peta

I'm a hands-on Staff Data Engineer with deep expertise in designing, building, and owning enterprise-scale data platforms. My career spans senior data-engineering roles at some of the world's most respected financial-services firms — Moody's Analytics, FactSet, Franklin Templeton, and S&P Global.

Currently at Moody's Analytics as Assistant Director, Data Engineering (a Staff-level IC role), I build and own the governed Databricks (on AWS) and Microsoft Fabric lakehouse platform my engineering org builds on — design and CI/CD standards adopted across 12+ production pipelines, Unity Catalog governance, and FinOps that cut cross-environment compute ~35%. Most recently I built the team's first GenAI/RAG proof-of-concept on Mosaic AI Vector Search, and I'm extending into AI/ML platform engineering (feature stores, model serving, MLOps).

Beyond work, I create Telugu-language tutorials on YouTube, making data engineering concepts accessible to the Telugu-speaking tech community worldwide.

11+
Years Experience
4
Tier-1 Firms
50+
Platforms Delivered