
Core PyTorch benchmark suite for Quadtrix #42

Merged
Eamon2009 merged 1 commit into master from exp on May 16, 2026

@Eamon2009
Owner

Summary

Introduces the primary Python benchmark runner, which records model metadata and measures data throughput, forward-pass latency, training-step latency, and autoregressive generation latency. Also includes utility functions for dynamic module loading, timing, and percentile calculation.
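For readers who want a feel for the timing utility mentioned above, here is a minimal sketch of a latency harness. The function name `time_calls` and its `warmup` and `iters` parameters are hypothetical and are not taken from this PR; the sketch only assumes that latencies are collected per call with a monotonic clock after a few unrecorded warm-up calls.

```python
import time
from typing import Callable, List


def time_calls(fn: Callable[[], object], warmup: int = 3, iters: int = 20) -> List[float]:
    """Return per-call wall-clock latencies in milliseconds (hypothetical helper)."""
    for _ in range(warmup):
        fn()  # warm-up calls run but are not recorded

    latencies_ms: List[float] = []
    for _ in range(iters):
        start = time.perf_counter()  # monotonic, high-resolution clock
        fn()
        latencies_ms.append((time.perf_counter() - start) * 1000.0)
    return latencies_ms


# Stand-in workload; a real run would time a forward pass or training step.
samples = time_calls(lambda: sum(range(10_000)), warmup=2, iters=5)
print(f"{len(samples)} samples, mean {sum(samples) / len(samples):.3f} ms")
```

When the timed callable launches CUDA work, it would also need to call `torch.cuda.synchronize()` before the clock is read, because CUDA kernels execute asynchronously with respect to the host.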

Model Benchmarking

- Latency Profiling: Tracks forward pass, training step, and autoregressive generation latencies.
- Throughput Tracking: Measures tokenizer processing speeds and data throughput.
- Resource Monitoring: Captures model metadata and system memory footprints during runs.

Math Utilities

- Dynamic Loading: Implements safe runtime module loading via importlib to dynamically interact with engine/inference.py.
- Statistical Metrics: Adds custom mathematical utility functions, including a precise percentile calculator ($P_{50}$, $P_{90}$, $P_{99}$) for latency distribution reporting.
- Standardized Exports: Lays the groundwork for structured JSON and CSV output formatting.
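The dynamic loading and percentile utilities described above could look roughly like the following sketch. The names `load_module` and `percentile` are hypothetical, and the PR does not say which percentile definition it uses; linear interpolation between closest ranks is assumed here. Only the path `engine/inference.py` comes from the description.

```python
import importlib.util
import math
import sys
from pathlib import Path
from types import ModuleType
from typing import Sequence


def load_module(path: str, name: str = "inference") -> ModuleType:
    """Load a Python file as a module at runtime (hypothetical helper)."""
    file = Path(path)
    if not file.is_file():
        raise FileNotFoundError(f"module file not found: {file}")
    spec = importlib.util.spec_from_file_location(name, file)
    if spec is None or spec.loader is None:
        raise ImportError(f"cannot build an import spec for {file}")
    module = importlib.util.module_from_spec(spec)
    sys.modules[name] = module  # register before executing, as in the importlib recipe
    spec.loader.exec_module(module)
    return module


def percentile(values: Sequence[float], p: float) -> float:
    """Percentile with linear interpolation between closest ranks (assumed definition)."""
    if not values:
        raise ValueError("percentile of an empty sequence is undefined")
    if not 0.0 <= p <= 100.0:
        raise ValueError("p must be in the range [0, 100]")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * p / 100.0
    lower, upper = math.floor(rank), math.ceil(rank)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


# Illustrative latencies in milliseconds (made-up numbers, not benchmark results).
latencies = [12.0, 15.0, 11.0, 30.0, 14.0]
for p in (50, 90, 99):
    print(f"P{p}: {percentile(latencies, p):.2f} ms")

# Usage against the engine module would be along the lines of:
# inference = load_module("engine/inference.py")
```

Validating the path and the import spec before executing the module is one plausible reading of "safe" loading: failures surface as clear exceptions rather than as an obscure error inside `exec_module`.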

@Eamon2009 Eamon2009 merged commit 2b004da into master May 16, 2026
6 checks passed
