LLMSimulator

Overview

LLMSimulator is a C++-based, cycle-accurate simulator built on graph execution of Large Language Models (LLMs). It supports state-of-the-art LLMs such as DeepSeek, Llama, and Mixtral. In addition to the Multi-Head Attention (MHA) mechanism, it supports Grouped-Query Attention (GQA), Multi-Query Attention (MQA), and Multi-head Latent Attention (MLA), and it can simulate Mixture of Experts (MoE) models. It integrates a modified Ramulator 2.0 for detailed memory modeling. LLMSimulator can evaluate several GPU generations, such as H100, B100, and B200, as well as bank-level PIM, bank-group-level PIM, and Logic-PIM.

Key features:

Supports flexible input/output length, batch sizes, request injection rates, and multi-node hardware configurations
Models energy consumption and performance metrics across various memory systems
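
To make the difference between the attention variants named in the overview concrete, here is a minimal C++ sketch of how the number of key/value heads changes the KV-cache footprint per token. It is an illustration only and is not code from LLMSimulator: the `AttentionConfig` struct, the `kv_cache_bytes_per_token` function, and the example model shape (32 layers, 32 query heads, head dimension 128, 2-byte elements) are all assumptions chosen for the example. MLA is left out because it caches a compressed latent vector rather than per-head keys and values.

```cpp
#include <cstdint>

// Hypothetical attention description for illustration; not an LLMSimulator type.
struct AttentionConfig {
    std::uint64_t num_layers;
    std::uint64_t num_query_heads;
    std::uint64_t num_kv_heads;       // MHA: equal to num_query_heads, MQA: 1, GQA: in between
    std::uint64_t head_dim;
    std::uint64_t bytes_per_element;  // e.g. 2 for FP16/BF16
};

// Bytes of KV cache stored per token: one key vector and one value vector
// per KV head, per layer.
constexpr std::uint64_t kv_cache_bytes_per_token(const AttentionConfig& c) {
    return 2 * c.num_layers * c.num_kv_heads * c.head_dim * c.bytes_per_element;
}

// Assumed example shapes; only num_kv_heads differs between the variants.
constexpr AttentionConfig kMha{32, 32, 32, 128, 2};
constexpr AttentionConfig kGqa{32, 32, 8, 128, 2};
constexpr AttentionConfig kMqa{32, 32, 1, 128, 2};

// Compile-time checks of the arithmetic for the shapes above.
static_assert(kv_cache_bytes_per_token(kMha) == 524288, "MHA: 512 KiB per token");
static_assert(kv_cache_bytes_per_token(kGqa) == 131072, "GQA: 128 KiB per token");
static_assert(kv_cache_bytes_per_token(kMqa) == 16384, "MQA: 16 KiB per token");
```

With these assumed shapes, moving from MHA to GQA with 8 KV heads cuts the per-token KV cache by 4x, and MQA cuts it by 32x. This is the kind of memory-traffic difference that a detailed memory model makes visible.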

Prerequisites

LLMSimulator has been tested with the following toolchain:

Compiler: g++ version 11.4.0
cmake, clang++

Getting Started

Building LLMSimulator

Clone the repository

   $ git clone https://github.com/scale-snu/LLMSimulator.git
   $ cd LLMSimulator
   $ git submodule update --init --recursive

Apply patch

   $ cd src/dram/ramulator2
   $ git apply ../../../patch/ramulator2_pim.patch
   $ cd ../../../

Build executable files

   $ mkdir build && cd build
   $ cmake ..
   $ make -j

How to run

LLMSimulator is configured through a config file (config.yaml), which you can modify to match your setup. After editing and saving config.yaml, run the simulator with the command below:

   $ ./run > test.log

Contact

Sungmin Yun sungmin.yun@snu.ac.kr

Kwanhee Kyung kwanhee.kyung@scale.snu.ac.kr

Juhwan Cho juhwan.cho@scale.snu.ac.kr

Note

This simulator builds upon the simulator introduced in the MICRO 2024 paper “Duplex: A Device for Large Language Models with Mixture of Experts, Grouped Query Attention, and Continuous Batching.”
