TransInferSim - fast analysis of Transformer Network inference

TransInferSim is a cycle-accurate simulator for analyzing the hardware performance of Transformer NN inference on custom systolic-array accelerators. Combined with Accelergy, it reports latency, energy, area, and other efficiency metrics, enabling cache-policy analysis, memory-hierarchy optimization, hardware design-space exploration, and exportable execution plans for RTL validation and deployment.

Features

Analyzes Transformer NN inference on hardware
Integrates with Accelergy for energy estimation
Includes various plugins for Accelergy's flexibility

Reference

If you find our work useful, please refer our paper.

J. Klhufek, A. Marchisio, V. Mrazek, L. Sekanina and M. Shafique, "TransInferSim: Toward Fast and Accurate Evaluation of Embedded Hardware Accelerators for Transformer Networks," in IEEE Access, vol. 13, pp. 177215-177226, 2025, doi: 10.1109/ACCESS.2025.3621062.

@ARTICLE{transinfersim,
  author={Klhufek, Jan and Marchisio, Alberto and Mrazek, Vojtech and Sekanina, Lukas and Shafique, Muhammad},
  journal={IEEE Access}, 
  title={TransInferSim: Toward Fast and Accurate Evaluation of Embedded Hardware Accelerators for Transformer Networks}, 
  year={2025},
  volume={13},
  number={},
  pages={177215-177226},
  keywords={Transformers;Accuracy;Hardware acceleration;Computational modeling;Schedules;Analytical models;Data models;Computer architecture;Memory management;Register transfer level;Transformers;hardware accelerators;modeling tools;memory subsystem;evaluation and optimizations},
  doi={10.1109/ACCESS.2025.3621062}}

Installation

To get started with TransInferSim, follow these steps:

Prerequisites

Python 3.9 or higher This project requires Graphviz, uv, and basic build tools (make, g++) to be installed. On Ubuntu/Debian:

sudo apt install graphviz build-essential

Clone and build the Repository

Clone the repository and its submodules and build using uv:

git clone --recurse-submodules https://github.com/ehw-fit/TransInferSim
cd TransInferSim
make install
source .venv/bin/activate

Usage

You can find an example run in the example.py script, which demonstrates how to instantiate a transformer model or layer of your choice along with a showcase of an example hardware specification. The script then runs an inference simulation, and the runtime performance statistics are saved to a stats_out.txt file.

Memory Occupancy Profiling

To analyze memory utilization across the simulation, run the memory trace example from the project root:

python mem_trace_example.py

Licence

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.github/workflows		.github/workflows
accelergy @ 6911d15		accelergy @ 6911d15
accelergy_plugins		accelergy_plugins
analyzer		analyzer
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
compound_components.yaml		compound_components.yaml
example.py		example.py
mem_trace_example.py		mem_trace_example.py
mem_trace_roberta_base.svg		mem_trace_roberta_base.svg
overall.jpg		overall.jpg
pyproject.toml		pyproject.toml
stats_out.txt		stats_out.txt
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TransInferSim - fast analysis of Transformer Network inference

Features

Reference

Installation

Prerequisites

Clone and build the Repository

Usage

Memory Occupancy Profiling

Licence

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

TransInferSim - fast analysis of Transformer Network inference

Features

Reference

Installation

Prerequisites

Clone and build the Repository

Usage

Memory Occupancy Profiling

Licence

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages