ulmentflam/README.md

Evan Owen - AI Researcher & Engineer


LinkedIn Blog Portfolio Email


Hey, I'm Evan

AI researcher and systems engineer with 10+ years shipping production systems at scale. As former Co-Founder & CTO at QWERKY AI, I distilled 70B-parameter LLMs into 3B-8B hybrid models on 24 H200 GPUs and have a patent pending on novel attention architectures. I'm currently pursuing an MS in Computer Science at Georgia Tech (BS CS, Summa Cum Laude, University of South Carolina). I've led teams of 20+ engineers and shipped 15+ production applications across AI, blockchain, and distributed systems.

What I'm Working On

  • LLM Architecture Research -- Custom CUDA kernels for novel attention mechanisms (pending patent)
  • State Space Models -- Contributed Mamba SSM architecture to Modular's MAX framework in Mojo
  • QDistill -- 70B→3B-8B hybrid distillation achieving 4x inference throughput and 1M-token context lengths
  • Open Source -- Selective scan, causal conv1d, and RMSNorm kernels in the Modular ecosystem

Tech Stack

Languages

C++, CUDA, Python, Go, Rust, Mojo, Swift, Solidity, Nix

AI / ML

PyTorch, Hugging Face, DeepSpeed, vLLM, TensorFlow, ROCm

Infrastructure

Kubernetes, Docker, Terraform, AWS, GCP

Featured Work

Modular MAX Framework
Mamba SSM architecture with custom selective scan, causal conv1d, and RMSNorm kernels in Mojo
Mojo, CUDA

Pulley
iOS Maps-style drawer library with 2k+ stars, created at 52 Inc.
Swift

QWERKY AI
Distilled 70B→3B-8B hybrid models on 24 H200 GPUs. 4x inference throughput, 1M token context. Pending patent on novel attention architecture.
Python, CUDA, C++

key-gen
BIP-0044 compatible multi-blockchain key generator
Go

Latest Writing

Read more on the QWERKY AI blog →


See more of my work below


Pinned

  1. modular/modular (Public)

    The Modular Platform (includes MAX & Mojo)

    Mojo · 25.9k stars · 2.8k forks

  2. 52inc/Pulley (Public)

    A library to imitate the iOS 10 Maps UI.

    Swift · 2k stars · 262 forks

  3. miguelgrinberg/Flask-SocketIO (Public)

    Socket.IO integration for Flask applications.

    Python · 5.5k stars · 898 forks

  4. key-gen (Public)

    A BIP-0044 compatible key generator for multiple blockchains. This is not a secure method, but is useful for quick key generation.

    Go · 1 star

  5. asdf-magic (Public)

    Shell

  6. tteck/Proxmox (Public archive)

    Proxmox VE Helper-Scripts

    Shell · 15.2k stars · 2.5k forks