Skip to content
View tounsils's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report tounsils

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
tounsils/README.md

Ilyes Tounsi

Senior Full-Stack Engineer — LLM evaluation, agentic workflows, advanced developer tooling Carlsbad, CA · tn76.com · LinkedIn · tounsils@gmail.com


I build production web applications end-to-end and design evaluation tasks that benchmark frontier AI agents. Recent focus: deterministic test harnesses, long-context puzzle design for LLM agents, and CLI-driven developer tooling.

What I'm working on

  • LLM evaluation tasks for an agent-benchmark program (Python, Docker, pytest). Authoring stateful environments with hidden constraints, mislabel traps, and seasonal-gate puzzle mechanics that calibrate agent difficulty against models like Claude Opus 4.6 and GPT-5.
  • Timecards — a local-first desktop time tracker for managing multiple concurrent projects (repo).
  • Digital QR Card — full-stack platform (Node, Express, MongoDB, Vite + React) at digitalqrcard.com.

Stack

Languages — Python, TypeScript, JavaScript, PHP, C++, Bash Frontend — Next.js, React, React Native, Vue, Tailwind, Bootstrap Backend — Node.js, Express, Laravel, REST APIs Testing — pytest (boundary + edge-case validation, fixture-driven integration), Jest, deterministic harnesses AI / LLM — agent evaluation, prompt-trap design, long-context puzzle authoring, RLHF-adjacent annotation workflows, Anthropic + OpenAI APIs DevOps / Cloud — AWS (Solutions Architect track), Docker, Kubernetes, Jenkins, Nginx, CI/CD Databases — MongoDB, PostgreSQL, MySQL, MariaDB Tooling — Claude Code, advanced CLI automation, WSL, git workflows

Selected projects

Project What it is Stack
timecards Local desktop time tracker; multi-project concurrent tracking Python, Windows binaries
technical-assessment-deploy Deployable hiring-assessment platform TypeScript, Next.js
tn76-radiotv Streaming/media web app TypeScript
i-CRM Self-hosted CRM (leads, invoicing, staff) — 6★ Laravel, Blade
Human-Resources-App HR platform: payroll, recruiting, performance JavaScript
AWS-Solutions-Architect-Professional Notes + reference material for the SAP-C02 cert

Reach me

Open to senior full-stack roles with an LLM-evaluation, agent-benchmarking, or developer-tooling component.

Pinned Loading

  1. timecards timecards Public

    Timecards — local desktop time tracker for working on multiple projects at once. Windows binary releases.

  2. i-CRM i-CRM Public

    i-CRM is a web-based CRM software that facilitates you to manage leads, customers, proposals, estimates, invoices, items, taxes, staff, messaging and other important features. Web based self hosted…

    Blade 6 1

  3. Human-Resources-App Human-Resources-App Public

    Human Resources application dealing with the people and issues related to people such as compensation and benefits, recruiting and hiring employees , onboarding employees, performance management, t…

    JavaScript 2 1

  4. AWS-Solutions-Architect-Professional AWS-Solutions-Architect-Professional Public

    AWS solutions architect professional

    1

  5. technical-assessment-deploy technical-assessment-deploy Public

    Deployable Next.js platform for delivering and grading technical hiring assessments. TypeScript, edge-case test coverage.

    TypeScript

  6. tn76-radiotv tn76-radiotv Public

    Streaming web app for radio/TV media (TypeScript, React). Production deployment at tn76.com.

    TypeScript