I'm a B.Tech Computer Science (Data Science) graduate from Sreyas Institute of Engineering and Technology, Hyderabad β building at the intersection of data engineering, machine learning, and analytics.
I turn raw, messy data into pipelines, predictions, and dashboards that drive real decisions.
- ποΈ Built scalable ETL & data lakehouse pipelines with PySpark, Snowflake, and Airflow β cutting query costs by ~50%
- π€ Designed an LSTM/GRU trading system for NIFTY 50 stocks with 85% prediction accuracy
- π Built Power BI dashboards analyzing Β£16M+ revenue across 1M+ records using RFM segmentation & DAX
- π Automated document extraction (OCR + NER) for Indian IDs, achieving 90% extraction accuracy
Open to roles in: Data Analyst Β· Data Engineer Β· ML/AI Engineer Β· Python Developer
ML Β· LSTM/GRU Β· Time Series Β· Telegram Bot
Automated trading intelligence system built on deep learning:
- π§ LSTM + GRU models with technical indicators β 85% prediction accuracy on NIFTY 50
- β‘ Real-time Telegram alerts β 35% faster response to market signals
- π Reduced manual analysis workload by 40%
Python TensorFlow LSTM GRU Pandas Telegram API
Data Engineering Β· PySpark Β· Snowflake Β· Cost Optimization
Production-grade data lakehouse for real-time financial transactions:
- βοΈ PySpark + Snowflake pipeline for real-time transaction processing
- π° Z-Ordering, caching & compression β ~50% query cost reduction
- π€ AI-based load prediction models for cluster pre-scaling at peak hours
PySpark Snowflake Apache Airflow SQL Python
Data Analytics Β· Power BI Β· RFM Segmentation Β· ETL
Full pipeline from raw transactions to executive-level BI dashboard:
- π§Ή Cleaned and prepared 797K+ records from 1.07M+ raw retail transactions
- π RFM segmentation revealed top 10% of customers drove ~63% of Β£16.36M revenue
- ποΈ Star-schema modeling + 15+ DAX measures across 5,864 customers
Python Pandas Power BI DAX SQL ETL
EDA Β· Power BI Β· Business Intelligence
3-page executive dashboard for a 500K+ order dataset:
- π EDA & cleaning on 500K+ records with Pandas & NumPy
- π΅ Analyzed $347M+ revenue, delivery KPIs, and customer behavior across cities
- π 15+ custom DAX measures across product categories and delivery dimensions
Python NumPy Pandas Power BI DAX
NLP Β· OCR Β· PostgreSQL Β· Snowflake
Automated parsing and verification system for Indian ID and financial documents:
- π OCR (Tesseract + AWS Textract) + NER + regex β 90% extraction accuracy
- β Confidence scoring & exception handling reduced manual review time by 20%
- ποΈ Scalable pipeline with PostgreSQL/Snowflake storage and data quality checks
Python Tesseract AWS Textract NER PostgreSQL Snowflake
SQL Β· Power BI Β· EDA Β· Income Segmentation
End-to-end banking BI solution:
- ποΈ Data cleaning, transformation & income segmentation (Low/Mid/High) on a 24-column dataset
- π Multi-page dashboards (Loan, Deposit, Summary) with SQL + DAX
- π Uncovered correlations across account types to support customer behavior analysis
MySQL Power BI DAX Python SQL
Data Engineering Β· Airflow Β· PySpark Β· Cloud
Production ETL optimization for large-scale data workloads:
- π Incremental PySpark pipelines β 60% runtime reduction, 35% cost savings
- π§ Broadcast joins, vectorized UDFs, predicate pushdown at scale
- π Airflow DAGs with retry & backfill logic β 90% fewer SLA breaches
PySpark Apache Airflow Python SQL
| Degree | Institution | Year | Score |
|---|---|---|---|
| B.Tech β CS (Data Science) | Sreyas Institute of Engineering & Technology, Hyderabad | 2026 | CGPA: 7.5 |
| Diploma β ECE | TKR College of Engineering & Technology, Hyderabad | 2023 | 70% |
- π΅ Cisco β Data Analytics
- π‘ IBM β Python for Data Science
- π΄ Scaler β Machine Learning
- π’ Scaler β DBMS
- β Scaler β Java
- HackAttack 2K25 β 24-hour National Level Hackathon participant
I'm actively looking for Data Analyst Β· Data Engineer Β· ML Engineer Β· Python Developer roles. If you're hiring or want to collaborate, let's talk.
π Hyderabad, Telangana Β· Open to Remote