infiniV/README.md

Raahim Arbaz

Computer vision engineer, based in Lahore. I do some research on the side and build the tooling I end up using.

Portfolio | LinkedIn | Email

What I'm up to

CV founding engineer at a clinical computer vision startup. I lead the annotation team, research new architectures, and run deployment and testing on edge hardware.

Projects

| Project | About | Stack |
| --- | --- | --- |
| VoiceFlow | Local voice dictation on faster-whisper. Runs on your GPU, nothing leaves your machine. | Python, faster-whisper, Pyloid |
| Vision-Dissect | Cracks open CV models to compare layer activations and attention maps across YOLO11, SAM, and DepthPro. | PyTorch, ONNX, Transformers |
| Android-Ui-MCP | MCP server for Android UI automation and testing workflows. | TypeScript, MCP |
| ultra-instinct-claude-code | 176 Claude Code tips distilled from 17 repos and 500k+ stars. Tagged by difficulty, nothing to install. | Research, Docs |

Research

Mapping Air Pollution Sources with Sequential Transformer Chaining
NeurIPS 2024 Climate Change AI Workshop. Second author.

Chained Vision Transformers with Remote CLIP to find factory and brick-kiln chimneys in South Asian satellite imagery. Filtered a 600K+ image dataset down to the ~1% that actually contained pollution sources. Paper.

LocaGraph: Learning Localized Graph Attention with Anisotropic Adaptation
NeurIPS 2025 submission. Lead author. Graph neural networks for spatial data, under review.

Pinned

  1. VoiceFlow (Public)

    VoiceFlow brings the power of OpenAI's Whisper directly to your Windows machine. It runs entirely on your hardware, ensuring your voice data never leaves your device. Designed for privacy, speed, a…

    TypeScript 345 24

  2. Android-Ui-MCP (Public)

    MCP server for AI-powered UI feedback across React Native, Flutter, and native Android development.

    TypeScript 21 3

  3. Vision-Dissect (Public)

    Learning repository for exploring deep learning vision models

    HTML

  4. rsearch (Public)

    Production-grade query translation service that converts search query syntax to PostgreSQL, MySQL, SQLite, and MongoDB queries.

    Go 1

  5. Anonator (Public)

    Desktop application for automatic face detection and anonymization in videos. Supports blur, pixelation, and solid black box anonymization methods.

    Python 1

  6. GitMind (Public)

    An AI-powered Git CLI manager that makes complex Git workflows simple through intelligent automation.

    Go 2 1