Skip to content
View Ushnesha's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Ushnesha

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Ushnesha/README.md

👋 Hi, I'm Ushnesha Daripa

AI/ML & Data Engineer | MS in Computer Science @ Arizona State University (4.0 GPA)

I build intelligent systems at the intersection of large language models, retrieval, and large-scale data engineering. Previously spent ~5 years at Dell Technologies engineering real-time fraud-detection pipelines and semantic retrieval systems that led to a granted patent in language-agnostic hybrid search.

Building LLMs, RAG systems, and applied ML - from local multimodal PDF chat to real-time object detection on edge hardware.

  • 🔭 Currently building with LLMs, RAG, and multimodal pipelines
  • 🎓 MS CS @ ASU (Aug 2025 – May 2027)
  • 📄 Patent holder — Distributed Hybrid Search for Language-Agnostic Retrieval
  • 📫 Reach me at daripa.ushnesha9701@gmail.com

📊 GitHub Stats

GitHub Stats    Top Languages


🛠️ Tech Stack

AI / ML

PyTorch TensorFlow scikit-learn LangChain Hugging Face

Data & Systems

Python Scala Apache Spark Apache Kafka Databricks SQL

Cloud & Tools

Azure Docker Kubernetes


🚀 Featured Projects

🛰️ AI-Powered Autonomous Trash Interceptor Real-time object detection and trajectory prediction on a Raspberry Pi with an ArduCAM ToF depth sensor (45%+ catch accuracy). Built a dual prediction pipeline pairing a physics-based ballistic model with a RandomForest variant trained on 500+ trajectories.

🐦 BirdCLEF+ 2026 (Kaggle) Pretrained and fine-tuned EfficientNet backbones to identify 234 wildlife species from audio recordings of Brazil's Pantanal wetlands. Converted audio to mel-spectrograms, trained with augmentation and AUC-optimized loss, and ensembled segment-level predictions with blended ProtoSSM, SED and BirdNet achieving best score of 0.944.

📄 Multimodal RAG Chat with PDFs (Local LLM) End-to-end multimodal RAG system supporting text, tables, and images from PDFs — fully local and privacy-preserving using Ollama, vector embeddings, and Streamlit.


🤝 Connect

LinkedIn Email

Pinned Loading

  1. GPT-Tokenizer-from-scratch GPT-Tokenizer-from-scratch Public

    Byte Pair Encoding (BPE) tokenizer implemented from scratch in Python. Features an interactive Streamlit playground to visualize token merging, trace vocab rules, analyze compression ratios, and ve…

    Python 1

  2. BirdClef-2026 BirdClef-2026 Public

    A state-of-the-art bioacoustic classification pipeline for BirdCLEF 2026. Combines Google Perch v2, bidirectional Selective State Space Models (ProtoSSM/ResidualSSM), MLP Probes, and distilled SED …

    Python 1

  3. AI-Powered-Autonomous-Trash-Interceptor AI-Powered-Autonomous-Trash-Interceptor Public

    (Winning as Third Place in class Competition) An AI-powered robot that catches thrown objects using real-time computer vision, predictive physics, and machine learning. It uses Raspberry Pi 5 as br…

    Python 1

  4. dsa-crusade dsa-crusade Public

    Daily data-structures & algorithms practice — building toward an AI/ML engineering role

    Python 1

  5. ChatWithPDFs ChatWithPDFs Public

    A comprehensive RAG (Retrieval-Augmented Generation) application suite for interacting with PDF documents using Large Language Models. This repository contains two implementations: a text-only PDF …

    Python

  6. Multi-Agent-RAG Multi-Agent-RAG Public

    A multi-agent Retrieval-Augmented Generation (RAG) system with Agent2Agent communication, built with LangGraph. The pipeline uses three specialized agents (Researcher, Summarizer, and Answer) that …

    Python