Real Time Visualization WebGPU Tutorial

by Lucas Melo, Lukas Herzberger, Patrick Komon

Real Time Visualization WebGPU Tutorial

Introduction

Welcome to the Real Time Visualization WebGPU Tutorial!

This is a 90 minute tutorial. It consists of 4 tasks and 1 bonus task. By the end of it, you will have built your own neat little app to visualize the trees of Vienna.

WebGPU-capable browsers:

Windows: Firefox, Chrome, Edge
Linux: Firefox Nightly or Chromium
- Install from here: https://github.com/scheib/chromium-latest-linux
- Enable the flags listed here: https://github.com/gpuweb/gpuweb/wiki/Implementation-Status#chromium-chrome-edge-etc
MacOS: Chrome or Safari (macOS Tahoe 26 or later)

First steps:

Clone this repository (git clone https://github.com/Welko/rtvis-webgpu-tutorial)
Open index.html on your favorite WebGPU-capable browser (no server needed). On Windows, your URL will look something like file:///C:/Projects/rtvis-webgpu-tutorial/index.html
Open tutorial.js in your favorite IDE. All your Javascript code will go there.
If you want a smoother workflow, we recommend using a live server

Other resources:

Task 0 - Initialize WebGPU

Unlike WebGL, WebGPU does not need a canvas. It can be used only for its compute capabilites.

async initializeWebGPU() {
    if (!this.gpu) {
        this.gpu = navigator.gpu;
        if (!this.gpu) {
            const message = "WebGPU is not supported in your browser. Please use/update Chrome or Edge.";
            alert(message);
            throw new Error(message);
        }
        console.log("Hooray! WebGPU is supported in your browser!");
    }
}

Now open your browser console and look at the message.

Chrome-based - Ctrl + Shift + J
Firefox - Ctrl + Shift + K

Task 1 - Compute Shader Basics

Duration: 10 minutes

We start by creating a very simple shader that adds a constant value to all elements of a list.

To get access to your GPU device and communicate with it, get the device

async initializeWebGPU() {
    ...
    this.adapter = await this.gpu.requestAdapter();
    this.device = await this.adapter.requestDevice();
}

Then, we upload some data to the GPU. For now, a fixed [1, 2, 3, 4] array is good enough.

async initializeBuffers() {
    this.data = new Uint32Array([1, 2, 3, 4]);
    this.buffer = this.device.createBuffer({
      size: this.data.byteLength,
      // Storage buffers can be indexed directly on the GPU
      usage: GPUBufferUsage.STORAGE | GPUBufferUsage.COPY_SRC | GPUBufferUsage.COPY_DST,
    });
    this.device.queue.writeBuffer(this.buffer, 0, this.data);
}

This is the data that we want to process on the GPU through a compute shader.

Open the compute shader shaders/add.js. Note that this is a Javascript file. The shader code is written as a string and stored in the window object with the key add.

The programming language of WebGPU shaders is wgsl. If you are using Visual Studio Code, we recommend you install the extension WGSL Literal.

The first thing we'll add here is the buffer binding.

// At binding 0, we have a read-write storage buffer
@group(0) @binding(0) var<storage, read_write> data: array<u32>; // Array of 32-bit unsigned integers

Then we add our compute entry point.

@compute
@workgroup_size(64)
fn main(@builtin(global_invocation_id) globalId: vec3u) {
    if (globalId.x >= arrayLength(&data)) {
        return;
    }
    // Our computations will go here :)
}

In WebGPU, a workgroup is a fixed-size group of threads. In our entry point, we must specify the size of the workgroup running our code.

Because a workgroup is fixed-size, it is possible that more threads than data elements are being dispatched. To prevent undefined access to our data, we add an out-of-bounds guard.

// globalId.x is our thread ID
if (globalId.x >= arrayLength(&data)) {
    // Current thread is out of bounds. Do nothing.
    return;
}

Finally, we can use the thread ID to do something with the data. In this case, we just add a hard-coded value. The entire shader should then look like this:

@group(0) @binding(0) var<storage, read_write> data: array<u32>;

@compute
@workgroup_size(64)
fn main(@builtin(global_invocation_id) globalId: vec3u) {
    if (globalId.x >= arrayLength(&data)) {
        return;
    }
    data[globalId.x] += 100u;
}

Now that our shader is done, we move back to tutorial.js and define the GPU pipeline that will run our shader.

async initializePipelines() {
    this.pipeline = this.device.createComputePipeline({
        // Use simplistic auto-generation
        // Bigger applications will manually generate a layout, and share it across muliple shaders
        layout: "auto", 
        compute: {
            module: this.device.createShaderModule({
                code: SHADERS.add
            }),
        }
    });
}

Next, we must create a connection between our GPU buffer and a binding in the pipeline. Because we used layout: "auto", the bindings are defined automatically for us. The pipeline bindings must match the shader bindings.

We create this connection via a bind group (a group of bindings).

async initializeBindGroups() {
    this.bindGroup = this.device.createBindGroup({
        layout: this.pipeline.getBindGroupLayout(0), // group(0)
        entries: [
            {
                binding: 0, // Matches our shader!
                resource: {
                    buffer: this.buffer // Our data!
                }
            }
        ]
    });
}

There is one last thing left before executing our pipeline. The pipeline with its bind group is executed through a GPU command. Commands in WebGPU are encoded in batch, so that they can all be sent to the GPU at once. That is done via a command encoder.

async render() {
    const commandEncoder = this.device.createCommandEncoder();
    {
        const computePass = commandEncoder.beginComputePass();
        computePass.setPipeline(this.pipeline);
        computePass.setBindGroup(0, this.bindGroup);
        computePass.dispatchWorkgroups(1); // Only one workgroup! (64 threads, as defined in the shader)
        computePass.end();
    }
    const commandBuffer = commandEncoder.finish();
}

With our command buffer created, we can finally submit it to the GPU, and our shader code will be executed!

async render() {
    ...
    this.device.queue.submit([commandBuffer]);
}

The last step for this task is now to read the data back to the CPU, where we can print it to the console.

However, we cannot read directly from our buffer. Our buffer was created with usage: GPUBufferUsage.STORAGE. To read from it, it must contain the usage flag GPUBufferUsage.MAP_READ. However, this usage flag cannot be used in combination with any other usage flags except for GPUBufferUsage.COPY_DST.

The solution: We must create a separate buffer to copy our data into.

In this part, we skip the details and encourage you to understand it in more detail on your own at another time.

async readBuffer(gpuBuffer, outputArray) {
    // This buffer can be read on the CPU because of MAP_READ
    const readBuffer = this.device.createBuffer({
        size: outputArray.byteLength,
        usage: GPUBufferUsage.MAP_READ | GPUBufferUsage.COPY_DST
    });

    // Copy from 'gpuBuffer' to 'readBuffer'
    const commandEncoder = this.device.createCommandEncoder();
    commandEncoder.copyBufferToBuffer(gpuBuffer, 0, readBuffer, 0, outputArray.byteLength);
    this.device.queue.submit([commandEncoder.finish()]);

    // Map the GPU data to the CPU
    await readBuffer.mapAsync(GPUMapMode.READ);

    // Read the data
    const ArrayType = /** @type {new (buffer: ArrayBufferLike) => T} */ (outputArray.constructor);
    const resultData = new ArrayType(readBuffer.getMappedRange());

    // Copy the data to the output array
    outputArray.set(resultData);

    // The read buffer is no longer needed
    readBuffer.destroy();

    return outputArray;
}

Important! Our buffer can be copied, because it has the GPUBufferUsage.COPY_SRC flag.

The last thing left to do is print our data on the console!

async render() {
    ...
    console.log(await this.readBuffer(this.buffer, new Uint32Array(this.data.length)));
}

And we're done!

Task 2 - Processing Real Data

Duration: 15 minutes

We start by loading some real data. The data we use here is from the Baumkataster bzw. Bäume Standorte Wien, a dataset of trees in Vienna.

The data can be conveniently loaded with the provided LOADER. We use this to load information per tree in a single list, in the format

[
    treeHeightCategory0, crownDiameterCategory0, districtNumber0, circumferenceAt1mInCm0,
    treeHeightCategory1, crownDiameterCategory1, districtNumber1, circumferenceAt1mInCm1,
    ...
]

and write it to a GPU buffer. We may completely replace the old data and buffers with the new ones.

async initializeBuffers() {
    this.trees = await LOADER.loadTrees(); // Load 100 trees

    // TreeInfo
    this.gpuTreeInfo = this.device.createBuffer({
        size: this.trees.getInfoBuffer().byteLength,
        usage: GPUBufferUsage.STORAGE | GPUBufferUsage.COPY_SRC | GPUBufferUsage.COPY_DST
    });
    this.device.queue.writeBuffer(this.gpuTreeInfo, 0, this.trees.getInfoBuffer());
}

Now we have replaced our data array with trees.getInfoBuffer() and renamed our buffer to gpuTreeInfo. This change also needs to be reflected in our initializeBindGroupsand render functions:

async initializeBindGroups() {
    ...
                    buffer: this.gpuTreeInfo // Our new tree data!
    ...
}
...
async render() {
    ...
    console.log(await this.readBuffer(this.gpuTreeInfo, new Uint32Array(this.trees.getInfoBuffer().length)));
}

If we now refresh the page, you'll notice that the first 64 values of our buffer are at least 100, as expected.

However, now we do something more interesting than that. We can now use this data to count the number of trees for each district in Vienna. For a large number of trees, it is much faster to do this on the GPU in parallel rather than on the CPU. For that, we write a new shader.

Open shaders/aggregate.js.

Note that some things are already set up for you. Most importantly, our buffer bindings:

struct AggregatedValues {
    // Array of 23 atomic unsigned integers (one for each district in Vienna)
    districtNumberOccurrences: array<atomic<u32>, 23>,
};

// Our storage buffer at binding 0 now is of type TreeInfo (see the TreeInfo struct)
@group(0) @binding(0) var<storage, read> treeInfo: array<TreeInfo>;

// A second storage buffer is added, where our (atomic) counts are stored
// We need to create a new buffer for this!
@group(0) @binding(1) var<storage, read_write> aggregatedValues: AggregatedValues;

The only thing left to add is accessing the tree info for each tree and incrementing the count of its district.

let treeInfo: TreeInfo = treeInfo[globalId.x];

// Increment one to district number
let districtNumber = treeInfo.districtNumber;
atomicAdd(&aggregatedValues.districtNumberOccurrences[districtNumber - 1], 1);

Back to Javascript, creating the buffer that we will use for the aggregated values is simple, since we don't need to initialize its data (it is initialized with zeros).

async initializeBuffers() {
    ...
    this.gpuAggregatedValues = this.device.createBuffer({
        size: 23 * Uint32Array.BYTES_PER_ELEMENT, // 23 unsigned integers (one per district in Vienna)
        usage: GPUBufferUsage.STORAGE | GPUBufferUsage.COPY_SRC
    });
}

With a new shader and a new buffer, we must also create a new pipeline and a new bind group.

async initializePipelines() {
    this.aggregatePipeline = this.device.createComputePipeline({
        layout: "auto",
        compute: {
            module: this.device.createShaderModule({ code: SHADERS.aggregate }),
        }
    });
}

async initializeBindGroups() {
    this.aggregateBindGroup = this.device.createBindGroup({
        layout: this.aggregatePipeline.getBindGroupLayout(0),
        entries: [
            { binding: 0, resource: { buffer: this.gpuTreeInfo } },
            // Now we have a second buffer on binding 1!
            { binding: 1,  resource: { buffer: this.gpuAggregatedValues } }
        ]
    });
}

Almost done. Now we adjust the number of workgroups we're dispatching. Instead of just one, we calculate it based on how many trees we have.

Don't forget to also rename the pipeline and bind group we're using.

async render() {
    ...
        const numTreeWorkgroups = Math.ceil(this.trees.getNumTrees() / 64); // 64 from shader
        const computePass = commandEncoder.beginComputePass();
        computePass.setPipeline(this.aggregatePipeline);
        computePass.setBindGroup(0, this.aggregateBindGroup);
        computePass.dispatchWorkgroups(numTreeWorkgroups);
        computePass.end();
    ...
}

Finally, we can now print the contents of the aggregates buffer.

async render() {
    ...
    console.log(await this.readBuffer(this.gpuAggregatedValues, new Uint32Array(23)));
}

You should now see displayed on the console the number of trees counted per district (note that we start at index 0). In the image below, district 9 has 30 trees, district 10 has 0, district 11 has 7, etc.

Task 3 - Render an Image

Duration: 10 minutes

Finally we will render something on the screen. In this case, it will be just a simple texture.

The first step is to load our map data, which includes four images of Vienna: satellite, streets, outdoors, and height

async initializeTextures() {
    this.map = await LOADER.loadMap(); // Load map images and geographical data
}

The images contained in map are ready to be used with WebGPU. So we can jump straight into creating our texture.

async initializeTextures() {
    ...
    const image = this.map.images.outdoors; // Try 'satellite' or 'streets' as well
    this.gpuMapTexture = this.device.createTexture({
      size: [image.width, image.height],
      format: "rgba8unorm",
      // TEXTURE_BINDING is needed to bind the texture to the pipeline
      // We also need to copy the image to the texture, so we need COPY_DST and RENDER_ATTACHMENT as well
      usage: GPUTextureUsage.COPY_DST | GPUTextureUsage.TEXTURE_BINDING  | GPUTextureUsage.RENDER_ATTACHMENT,
    });
    this.device.queue.copyExternalImageToTexture(
          {source: image, flipY: true}, // Source
          {texture: this.gpuMapTexture}, // Destination
          [image.width, image.height] // Size
    );
}

To finalize the initialization of our texture, we also create a sampler. It is through the sampler that we access the texture in the shaders.

async initializeTextures() {
    ...
    this.sampler = this.device.createSampler({
        magFilter: 'linear',
        minFilter: 'linear'
    });
}

Finally, we need to set the render size of the canvas to our image dimensions. This avoids sampling artifacts on the map and lets the browser do the work of downsampling our render output to the actual canvas (css) size.

async initializeTextures() {
    ...
    // Set canvas render size to image dimension
    this.canvas.width = image.width;
    this.canvas.height = image.height;
}

With the texture data set up on the GPU, we begin the process of showing it on the screen. For this tutorial, we will write one vertex shader and one fragment shader, and we will render a quad (two triangles forming a rectangle).

For loading large 3D models, it can be beneficial to create a vertex buffer, which is used to feed data into our vertex shader.

Since we only render one quad (6 triangles with triangle-list), we hard-code the vertex and UV positions in the shader.

Open shaders/image.js.

This time, you will find only the definitions of our vertices and UVs.

We begin by adding our bindings.

@group(0) @binding(0) var texture: texture_2d<f32>;
@group(0) @binding(1) var linearSampler: sampler;

Next, we define the vertex shader. Other graphics APIs provide global variables to manage access to the inputs and outputs of shaders (e.g., OpenGL with gl_VertexID or gl_FragColor). In WebGPU, the inputs and outputs are explicitly defined and are available as arguments in our entry point function.

struct VertexInput {
    @builtin(vertex_index) vertexIndex: u32, // Analogous to gl_VertexID in OpenGL
};

struct VertexOutput {
    @builtin(position) position: vec4f, // Built-in vertex position output
    @location(0) uv: vec2f, // Varying passed to fragment shader
};

@vertex
fn vertex(input: VertexInput) -> VertexOutput {
    return VertexOutput(
        // Index our vertex/UV data using the current vertex's index
        // %6 just for safety in case the draw call has more than 6 vertices
        vec4f(VERTICES[input.vertexIndex % 6], 0, 1),
        UVS[input.vertexIndex % 6],
    );
}

In the fragment shader, we then use our sampler to sample the texture.

struct FragmentInput {
    @location(0) uv: vec2f, // Varying from vertex shader
};

struct FragmentOutput {
    @location(0) color: vec4f, // Built-in fragment color output (analogous to gl_FragColor)
};

@fragment
fn fragment(input : FragmentInput) -> FragmentOutput {
    return FragmentOutput(
        // Sample our texture
        textureSample(texture, linearSampler, input.uv),
    );
}

Like previously, we must still create a pipeline and bind group. Additionally, we also need to create a color attachment, which describes how the output of our render pass behaves.

First, for the pipeline:

async initializePipelines() {
    const imageShaderModule = this.device.createShaderModule({ code: SHADERS.image });
    this.imageRenderPipeline = this.device.createRenderPipeline({
        layout: "auto",
        vertex: {
            module: imageShaderModule,
            // buffers: We don't need any vertex buffer :)
        },
        fragment: {
            module: imageShaderModule,
            targets: [
                {
                    // Only one render target, where our color will be drawn
                    format: this.gpu.getPreferredCanvasFormat()
                }
            ]
        },
    });
}

Then for the bind group:

async initializeBindGroups() {
    this.imageBindGroup = this.device.createBindGroup({
        layout: this.imageRenderPipeline.getBindGroupLayout(0),
        entries: [
            { binding: 0, resource: this.gpuMapTexture.createView() },
            { binding: 1, resource: this.sampler },
        ]
    });
}

And lastly, the color attachment:

async initializeAttachments() {
    // Set canvas css size
    const minSide = -100 + Math.min(this.canvas.parentElement.clientWidth, this.canvas.parentElement.clientHeight);
    this.canvas.style.width = minSide + "px";
    this.canvas.style.height = minSide + "px";

    // Color attachment to draw to
    /** @type {GPURenderPassColorAttachment} */
    this.colorAttachment = {
        view: null, // Will be set in render(), i.e., every frame
        loadOp: "clear",
        clearValue: {r: 0, g: 0, b: 0, a: 0},
        storeOp: "store"
    };
}

There is one last step before rendering: setting up the HTML canvas. That is done analogously to WebGL (where we use canvas.getContext("webgl)), followed by a configuration of the context:

async initializeWebGPU() {
    ...
    this.context = this.canvas.getContext("webgpu");
    this.context.configure({
        device: this.device,
        format: this.gpu.getPreferredCanvasFormat()
    });
}

And finally, we can encode our commands into a buffer and submit it to the GPU!

async render() {
    this.colorAttachment.view = this.context.getCurrentTexture().createView();

    const commandEncoder = this.device.createCommandEncoder();
    {
        const renderPass = commandEncoder.beginRenderPass({
            colorAttachments: [this.colorAttachment]
        });
        renderPass.setPipeline(this.imageRenderPipeline);
        renderPass.setBindGroup(0, this.imageBindGroup);
        renderPass.draw(6); // 6 vertices - one quad
        renderPass.end();
    }
    const commandBuffer = commandEncoder.finish();
    this.device.queue.submit([commandBuffer]);
}

And, finally, you should see a map of Vienna on your screen.

Task 4 - Render Trees as Markers

Duration: 20 minutes

Finally, in this task, we will get to render our tree data.

Previously, we had loaded the tree information. Now, in addition to that, we also load the coordinates of the trees.

async initializeBuffers() {
    ...
    // TreeCoordinates
    this.gpuTreeCoodinates = this.device.createBuffer({
        size: this.trees.getCoordinatesLatLonBuffer().byteLength,
        usage: GPUBufferUsage.STORAGE | GPUBufferUsage.COPY_DST,
    });
    this.device.queue.writeBuffer(this.gpuTreeCoodinates, 0, this.trees.getCoordinatesLatLonBuffer());
}

With this data now available, we move on to this task's shaders.

Open shaders/markers.js.

Our vertices and UVs are there again, together with the function latLonToXY, that converts latitude and longitude coordinates into XY coordinates in the range [0,1].

As you may have noticed, the function latLonToXY uses an object u that is not defined anywhere. This is our uniforms. Uniforms are not passed individually to a shader, but through an uniform buffer. We will define it now, together with our two other buffers.

struct TreeCoordinates {
    lat: f32,
    lon: f32,
};

struct TreeInfo {
    treeHeightCategory: u32,
    crownDiameterCategory: u32,
    districtNumber: u32,
    circumferenceAt1mInCm: u32,
};

struct Uniforms {
    mapWidth: f32,
    mapHeight: f32,
    mapLatitudeMin: f32,
    mapLatitudeMax: f32,
    mapLongitudeMin: f32,
    mapLongitudeMax: f32,
    markerSize: f32,
    unused: f32, // Padding
    markerColor: vec4f,
};

@group(0) @binding(0) var<storage, read> treeCoordinates: array<TreeCoordinates>;
@group(0) @binding(1) var<storage, read> treeInfo: array<TreeInfo>;
@group(0) @binding(2) var<uniform> u: Uniforms;

Our vertex shader will read from these buffers and use their information to decide where to place vertices and which color to use. Once again, we will not use vertex buffers here.

WebGPU, like other APIs, has instanced rendering. That allows us to draw one geometry several times with increased performance due to the reduced number of draw calls.

We will NOT use instanced rendering here. Instead, we go even more hardcore and derive the index of our (fake) instance based on its vertex index. Additionally to UV coordinates, we also pass a color to the fragment shader.

struct VertexInput {
    @builtin(vertex_index) vertexIndex: u32,
};

struct VertexOutput {
    @builtin(position) position: vec4f,
    @location(0) uv: vec2f,
    @location(1) color: vec4f,
};

@vertex
fn vertex(input: VertexInput) -> VertexOutput {
    // I'm so cool
    let treeIndex = input.vertexIndex / 6;

    // Get 2D position of tree
    let latLon = treeCoordinates[treeIndex];
    let xy = latLonToXY(latLon.lat, latLon.lon) * 2 - 1;

    // Calculate marker position and size
    let vertex = VERTICES[input.vertexIndex % 6] * u.markerSize + xy;

    // Get tree info
    let treeInfo = treeInfo[treeIndex];

    // Color based on tree district
    // THIS IS BAD VISUALIZATION
    // Ideally, the district number would be mapped into a qualitative color scheme
    // See colorbrewer2.org for some good ones
    let color23 = u.markerColor; // Color for Liesing
    let color1 = vec4f(0, 0, 0, color23.a); // Color for Innere Stadt
    let blendingFactor = f32(treeInfo.districtNumber - 1) / 22;
    let color = mix(color1, color23, blendingFactor);

    return VertexOutput(
        vec4f(vertex, 0, 1),
        UVS[input.vertexIndex % 6],
        color,
    );
}

And very little work is left for our fragment shader to do.

struct FragmentInput {
    @location(0) uv: vec2f,
    @location(1) color: vec4f,
};

struct FragmentOutput {
    @location(0) color: vec4f,
};

@fragment
fn fragment(input : FragmentInput) -> FragmentOutput {
    return FragmentOutput(
        input.color,
    );
}

And now, back to Javascript, we create the uniforms buffer.

async initializeBuffers() {
    ...
    // Uniforms
    this.uniforms = {
        markerSize: 0.01,
        markerColor: [255, 0, 0], // The screenshots use [117, 107, 177]
        markerAlpha: 0.01,
    };
    this.gpuUniforms = this.device.createBuffer({
        size: 1024, // Allocate 1024 bytes. Enough space for 256 floats/ints/uints (each is 4 bytes). That should be enough
        // UNIFORM (of course) and COPT_DST so that we can later write to it
        usage: GPUBufferUsage.UNIFORM | GPUBufferUsage.COPY_DST,
    });
    // Write to the uniforms buffer in render()
}

Note that we do not write into the uniforms buffer immediately. That is because its data may change very often. Because of that, we will write to it every frame.

async render() {
    // Copy the uniforms to the GPU buffer
    // Warning: the layout must match the layout of the uniform buffer in the shader
    this.device.queue.writeBuffer(this.gpuUniforms, 0, new Float32Array([
        this.map.width,
        this.map.height,
        this.map.latitude.min,
        this.map.latitude.max,
        this.map.longitude.min,
        this.map.longitude.max,
        this.uniforms.markerSize,
        0, // Unused
        this.uniforms.markerColor[0] / 255,
        this.uniforms.markerColor[1] / 255,
        this.uniforms.markerColor[2] / 255,
        this.uniforms.markerAlpha,
    ]));
    ...
}

Now that we have all the buffers set up, we can proceed to creating our pipeline and bind group for the new shaders.

async initializePipelines() {
    ...
    const markersShaderModule = this.device.createShaderModule({ code: SHADERS.markers });
    this.markersRenderPipeline = this.device.createRenderPipeline({
        layout: "auto",
        vertex: {
            module: markersShaderModule,
            // buffers: We don't need any vertex buffer :)
        },
        fragment: {
            module: markersShaderModule,
            targets: [{
                format: this.gpu.getPreferredCanvasFormat()
            }]
        }
    });
}

async initializeBindGroups() {
    ...
    this.markersBindGroup = this.device.createBindGroup({
        layout: this.markersRenderPipeline.getBindGroupLayout(0),
        entries: [
            { binding: 0, resource: { buffer: this.gpuTreeCoodinates } },
            { binding: 1, resource: { buffer: this.gpuTreeInfo } },
            { binding: 2, resource: { buffer: this.gpuUniforms } },
        ]
    });
}

And now finally we can render the markers! We can render them immediately after the map, in the same render pass.

async render() {
    ...
    const commandEncoder = this.device.createCommandEncoder();
    {
        const renderPass = commandEncoder.beginRenderPass({
            colorAttachments: [this.colorAttachment]
        });

        // Render map
        renderPass.setPipeline(this.imageRenderPipeline);
        renderPass.setBindGroup(0, this.imageBindGroup);
        renderPass.draw(6); // One quad

        // Render markers
        renderPass.setPipeline(this.markersRenderPipeline);
        renderPass.setBindGroup(0, this.markersBindGroup);
        renderPass.draw(6 * this.trees.getNumTrees()); // As many quads as we have trees

        renderPass.end();
    }
    const commandBuffer = commandEncoder.finish();
    this.device.queue.submit([commandBuffer]);
}

And, with that, you should be able to finally see the tree markers on the screen!

To load more trees, go back to where you load the date (initializeBuffers()) and pass a true to loadTrees(), like so:

this.trees = await LOADER.loadTrees(true); // Load 219,378 trees

BONUS 1/3! Activate blending (i.e., transparency)

async initializePipelines() {
    ...
    this.markersRenderPipeline = this.device.createRenderPipeline({
        ...
        fragment: {
            module: markersShaderModule,
            targets: [{
                format: this.gpu.getPreferredCanvasFormat(),
                blend: { // This activates blending!
                    color: {
                        operation: "add",
                        srcFactor: "src-alpha",
                        dstFactor: "one-minus-src-alpha",
                    },
                    alpha: {
                        operation: "add",
                        srcFactor: "src-alpha",
                        dstFactor: "one-minus-src-alpha",
                    }
                }
            }]
        }
    });
}

BONUS 2/3! Add UI controls

async initializeGUI() {
    const onChange = () => this.render();
    [
        this.gui.add(this.uniforms, "markerSize", 0.001, 0.1, 0.001),
        this.gui.addColor(this.uniforms, "markerColor"),
        this.gui.add(this.uniforms, "markerAlpha", 0.01, 1, 0.01),
    ]
    .forEach((controller) => controller.onChange(onChange));
}

BONUS 3/3! Another (still bad) color scheme

fn toMarkerColor(districtIndex: u32) -> vec3<f32> {
    // THIS IS STILL BAD VISUALIZATION
    // Ideally, the district number would be mapped into a qualitative color scheme
    // See colorbrewer2.org for some good ones

    // This number changes the color scheme
    let magicNumber = 555555u;
    return unpack4x8unorm(((districtIndex % 127) + 1) * magicNumber).rgb;
}

@vertex
fn vertex(input: VertexInput) -> VertexOutput {
    ...
    // Color based on tree district
    let color = vec4f(toMarkerColor(treeInfo.districtNumber - 1), u.markerColor.a);
    ....
}

And so we're done! Your map should have markers, just like the one in the screenshot below.

Bonus Task - Compute and Render Heatmap

Duration: 30 minutes

For the final task, we will combine compute and render passes to produce a heatmap that we display over the map of Vienna.

This will require the compute shader to go through each tree, find out in which cell of our heatmap grid it is contained, and then increment the cell's count by one. When rendering the grid cells, this value must then be mapped to a color. To do that, we must also find out what is the largest value stored among all grid cells.

Open shaders/heatmapCompute.js.

You will find the definitions of two functions. We already know latLonToXY. The other function, xyToCellIndex, converts the XY coordinate of a tree into the index of a cell in our grid.

We start, as per usual, by defining our buffers. We introduce one buffer that stores all the cells in the grid (array<Cell>), and one buffer that stores the aggregate values across the grid (Grid). We also add two new values to our uniforms: the width and height of the grid.

struct TreeCoordinates {
    lat: f32,
    lon: f32,
};

struct TreeInfo {
    treeHeightCategory: u32,
    crownDiameterCategory: u32,
    districtNumber: u32,
    circumferenceAt1mInCm: u32,
};

struct Cell {
    // Each cell will hold its tree count
    treeCount: atomic<u32>
};

struct Grid {
    // Largest number of trees stores in a single cell of the grid
    maxTreeCount: atomic<u32>,
}

struct Uniforms {
    mapWidth: f32,
    mapHeight: f32,
    mapLatitudeMin: f32,
    mapLatitudeMax: f32,
    mapLongitudeMin: f32,
    mapLongitudeMax: f32,
    markerSize: f32,
    unused: f32, // Padding
    markerColor: vec4f,
    gridWidth: f32, // Number of cells on the grid's width
    gridHeight: f32, // Number of cells on the grid's height
};

@group(0) @binding(0) var<storage, read> treeCoordinates: array<TreeCoordinates>;
@group(0) @binding(1) var<storage, read> treeInfo: array<TreeInfo>;
@group(0) @binding(2) var<storage, read_write> cells: array<Cell>;
@group(0) @binding(3) var<storage, read_write> grid: Grid;
@group(0) @binding(4) var<uniform> u: Uniforms;

In this shader, we will introduce three entry points. One (called count) will be responsible to go through each tree and accumulate values on the grid cells (such as tree count). The second one (called max) will be responsible for going through each grid cell and finding the largest tree count. The third one (called clear) will reset the grid values and prepare it for the next pass (set all counts and the max value to zero).

First, we add the count entry point, which is very similar to the add shader we wrote in Task 1.

@compute
@workgroup_size(64)
fn count(@builtin(global_invocation_id) globalId: vec3u) {
    if (globalId.x >= arrayLength(&treeInfo)) {
        return;
    }

    // Get tree index
    let treeIndex = globalId.x;

    // Get 2D position of tree
    let latLon = treeCoordinates[treeIndex];
    let xy = latLonToXY(latLon.lat, latLon.lon);

    // Get cell index
    let cellIndex = xyToCellIndex(xy);

    // Increment one to tree count and height category count
    atomicAdd(&cells[cellIndex].treeCount, 1);
}

Then, we add the max entry point.

@compute
@workgroup_size(64)
fn max(@builtin(global_invocation_id) globalId: vec3u) {
    if (globalId.x >= arrayLength(&cells)) {
        return;
    }

    let cellIndex = globalId.x;

    let treeCount = atomicLoad(&cells[cellIndex].treeCount);

    atomicMax(&grid.maxTreeCount, treeCount);
}

And, finally, the clear entry point.

@compute
@workgroup_size(64)
fn clear(@builtin(global_invocation_id) globalId: vec3u) {
    if (globalId.x >= arrayLength(&cells)) {
        return;
    }

    // Clear max tree count
    if (globalId.x == 0) {
        atomicStore(&grid.maxTreeCount, 0);
    }

    let cellIndex = globalId.x;

    // Clear tree count
    atomicStore(&cells[cellIndex].treeCount, 0);
}

Now having completed our compute shader, before we move on to the vertex and fragment shaders, we go back to Javascript and create the new buffers we introduced.

async initializeBuffers() {
    ...
    // Cells
    this.GRID_MAX_WIDTH = 200;
    this.GRID_MAX_HEIGHT = 200;
    this.gpuGridCells = this.device.createBuffer({
        // Reserve enough space for the maximum number of cells
        size: this.GRID_MAX_WIDTH * this.GRID_MAX_HEIGHT * Uint32Array.BYTES_PER_ELEMENT,
        usage: GPUBufferUsage.STORAGE,
    });
    
    // Grid 
    this.gpuGrid = this.device.createBuffer({
        // One 32-bit unsigned integer
        size: Uint32Array.BYTES_PER_ELEMENT,
        usage: GPUBufferUsage.STORAGE,
    });
}

Because we now introduced two new uniforms, we also set them up.

async initializeBuffers() {
    ...
    this.uniforms = {
        ...
        gridWidth: 50,
        gridHeight: 50,
    };
    ...
}

async render() {
    this.device.queue.writeBuffer(this.gpuUniforms, 0, new Float32Array([
        ...
        this.uniforms.gridWidth,
        this.uniforms.gridHeight,
    ]));
    ...
}

Now, for the creation of the pipeline and bind group of our shaders. Because we have three entry points in our shader, we will need three pipelines that use the same bind group. So far, we've been using the pipeline's "auto" layout, and its getBindGroupLayout() function. However, in order to share the bind group among pipelines, we now need to create this layout manually.

async initializeLayouts() {
    this.heatmapComputeBindGroupLayout = this.device.createBindGroupLayout({
        entries: [
            { binding: 0, visibility: GPUShaderStage.COMPUTE, buffer: { type: "read-only-storage" } },
            { binding: 1, visibility: GPUShaderStage.COMPUTE, buffer: { type: "read-only-storage" } },
            { binding: 2, visibility: GPUShaderStage.COMPUTE, buffer: { type: "storage" } },
            { binding: 3, visibility: GPUShaderStage.COMPUTE, buffer: { type: "storage" } },
            { binding: 4, visibility: GPUShaderStage.COMPUTE, buffer: { type: "uniform" } },
        ]
    });

    this.heatmapComputePipelineLayout = this.device.createPipelineLayout({
        bindGroupLayouts: [
            this.heatmapComputeBindGroupLayout,
        ]
    });
}

With the layouts created, we can then create the pipelines and the bind group as per usual

async initializePipelines() {
    ...
    const pipelineDescriptor = {
        layout: this.heatmapComputePipelineLayout,
        compute: {
            module: this.device.createShaderModule({ code: SHADERS.heatmapCompute }),
            entryPoint: "" // Set below for each pipeline
        }
    };
    pipelineDescriptor.compute.entryPoint = "clear";
    this.heatmapComputeClearPipeline = this.device.createComputePipeline(pipelineDescriptor);
    pipelineDescriptor.compute.entryPoint = "count";
    this.heatmapComputeCountPipeline = this.device.createComputePipeline(pipelineDescriptor);
    pipelineDescriptor.compute.entryPoint = "max";
    this.heatmapComputeMaxPipeline = this.device.createComputePipeline(pipelineDescriptor);
}

async initializeBindGroups() {
    ...
    this.heatmapComputeBindGroup = this.device.createBindGroup({
        layout: this.heatmapComputeBindGroupLayout,
        entries: [
            { binding: 0, resource: { buffer: this.gpuTreeCoodinates} },
            { binding: 1, resource: { buffer: this.gpuTreeInfo } },
            { binding: 2, resource: { buffer: this.gpuGridCells } },
            { binding: 3, resource: { buffer: this.gpuGrid } },
            { binding: 4, resource: { buffer: this.gpuUniforms } },
        ]
    });
}

We now add our new pipelines to a compute pass before rendering. We won't be able to see any result yet, but this will allow us to verify if our pipelines are set up correctly and there are no errors.

async render() {
    ...
    const commandEncoder = this.device.createCommandEncoder();
    {
        // Compute pass
        const computePass = commandEncoder.beginComputePass();
        
        // Calculate number of workgroups to dispatch
        const numWorkgroupsTrees = Math.ceil(this.trees.getNumTrees() / 64);
        const numWorkgroupCells = Math.ceil(this.uniforms.gridWidth * this.uniforms.gridHeight / 64);

        // Set one bindgroup for all three pipelines
        computePass.setBindGroup(0, this.heatmapComputeBindGroup);

        // Clear
        computePass.setPipeline(this.heatmapComputeClearPipeline);
        computePass.dispatchWorkgroups(numWorkgroupCells);

        // Count
        computePass.setPipeline(this.heatmapComputeCountPipeline);
        computePass.dispatchWorkgroups(numWorkgroupsTrees);

        // Max
        computePass.setPipeline(this.heatmapComputeMaxPipeline);
        computePass.dispatchWorkgroups(numWorkgroupCells);

        computePass.end();
    }
    {
        // Render Pass
        ... 
    }
    const commandBuffer = commandEncoder.finish();
    this.device.queue.submit([commandBuffer]);
}

Now that we're done with the computation of the heatmap, we move on to displaying it.

Open shaders/heatmapRender.js;

You will find the familiar vertices and UVs we saw before. That is because we are using the exact same technique we used to render the tree markers, where each grid cell will be rendered as a quad.

First, we introduce the buffer bindings.

struct Cell {
    treeCount: u32,
};

struct Grid {
    maxTreeCount: u32,
}

struct Uniforms {
    mapWidth: f32,
    mapHeight: f32,
    mapLatitudeMin: f32,
    mapLatitudeMax: f32,
    mapLongitudeMin: f32,
    mapLongitudeMax: f32,
    markerSize: f32,
    unused: f32,
    markerColor: vec4f,
    gridWidth: f32,
    gridHeight: f32,
};

@group(0) @binding(0) var<storage, read> cells: array<Cell>;
@group(0) @binding(1) var<storage, read> grid: Grid;
@group(0) @binding(2) var<uniform> u: Uniforms;

We have already discussed and created all buffers introduced here.

Next, we add the vertex shader.

struct VertexInput {
    @builtin(vertex_index) vertexIndex: u32,
};

struct VertexOutput {
    @builtin(position) position: vec4f,
    @location(0) uv: vec2f,
    @location(1) color: vec4f,
};

@vertex
fn vertex(input: VertexInput) -> VertexOutput {
    let cellIndex = input.vertexIndex / 6;

    // Get center of cell
    let xy = vec2f(
        (0.5 + f32(cellIndex % u32(u.gridWidth))) / u.gridWidth,
        (0.5 + f32(cellIndex / u32(u.gridWidth))) / u.gridHeight,
    ) * 2 - 1;

    // Get size of cell
    let size = vec2f(1 / u.gridWidth, 1 / u.gridHeight);
    
    // Calculate cell position and size
    let vertex = VERTICES[input.vertexIndex % 6] * size + xy;
    
    // Dummy value for now
    let color = vec4f(1, 0, 0, f32(cellIndex) / u.gridWidth / u.gridHeight);

    return VertexOutput(
        vec4f(vertex, 0, 1),
        UVS[input.vertexIndex % 6],
        color,
    );
}

Note that we are assigning an arbitrary color to each grid cell. Later, we will calculate it based on its tree count. For now, for debug purposes, we keep it like this.

Once again, the fragment shader doesn't do much.

struct FragmentInput {
    @location(0) uv: vec2f,
    @location(1) color: vec4f,
};

struct FragmentOutput {
    @location(0) color: vec4f,
};

@fragment
fn fragment(input : FragmentInput) -> FragmentOutput {
    return FragmentOutput(
        input.color,
    );
}

With the shader created, we go back to Javascript and introduce our new pipeline and bind group.

async initializePipelines() {
    ...
    const heatmapRenderShaderModule = this.device.createShaderModule({ code: SHADERS.heatmapRender });
    this.heatmapRenderPipeline = this.device.createRenderPipeline({
        layout: "auto",
        vertex: {
            module: heatmapRenderShaderModule,
            // buffers: We don't need any vertex buffer :)
        },
        fragment: {
            module: heatmapRenderShaderModule,
            targets: [{
                format: this.gpu.getPreferredCanvasFormat(),
                blend: {
                    color: { operation: "add", srcFactor: "src-alpha", dstFactor: "one-minus-src-alpha" },
                    alpha: { operation: "add", srcFactor: "src-alpha", dstFactor: "one-minus-src-alpha" }
                }
            }]
        }
    });
}

async initializeBindGroups() {
    ...
    this.heatmapRenderBindGroup = this.device.createBindGroup({
        layout: this.heatmapRenderPipeline.getBindGroupLayout(0),
        entries: [
            // We need to hide these for now until we use them in the shader (optimized out)
            //{ binding: 0, resource: { buffer: this.gpuGridCells } },
            //{ binding: 1, resource: { buffer: this.gpuGrid } },
            { binding: 2, resource: { buffer: this.gpuUniforms } },
        ]
    });
}

Now we render the heatmap to see if we did everything right so far.

    async render() {
        ...
        const commandEncoder = this.device.createCommandEncoder();
        {
            // Compute pass
            ...
        }
        {
            // Render Pass
            const renderPass = commandEncoder.beginRenderPass({
                colorAttachments: [this.colorAttachment]
            });

            // Render map
            renderPass.setPipeline(this.imageRenderPipeline);
            renderPass.setBindGroup(0, this.imageBindGroup);
            renderPass.draw(6); // One quad

            // Render markers (deactivated)
            //renderPass.setPipeline(this.markersRenderPipeline);
            //renderPass.setBindGroup(0, this.markersBindGroup);
            //renderPass.draw(6 * this.trees.getNumTrees()); // As many quads as we have trees

            // Render heatmap
            renderPass.setPipeline(this.heatmapRenderPipeline);
            renderPass.setBindGroup(0, this.heatmapRenderBindGroup);
            renderPass.draw(6 * this.uniforms.gridWidth * this.uniforms.gridHeight); // As many quads as we have grid cells

            renderPass.end();
        }
        const commandBuffer = commandEncoder.finish();
        this.device.queue.submit([commandBuffer]);
    }

You should now see a a red gradient over the map.

Uncomment our buffer bindings.

async initializeBindGroups() {
    ...
    this.heatmapRenderBindGroup = this.device.createBindGroup({
        layout: this.heatmapRenderPipeline.getBindGroupLayout(0),
        entries: [
            { binding: 0, resource: { buffer: this.gpuGridCells } },
            { binding: 1, resource: { buffer: this.gpuGrid } },
            { binding: 2, resource: { buffer: this.gpuUniforms } },
        ]
    });
}

Open shaders/heatmapRender.js.

Remove the // Dummy value for now section and instead calculate a color value based on the tree count.

// Map color based on tree count
let maxCount = grid.maxTreeCount;
let count = cells[cellIndex].treeCount;
let color1 = vec4f(u.markerColor.rgb, 1);
let color0 = vec4f(color1.rgb, color1.a * 0.2);

// Linear blending
let blendingFactor = f32(count) / f32(maxCount);

// Logarithmic blending
//let blendingFactor = log2(f32(count) + 1) / log2(f32(maxCount) + 1);

var color = mix(color0, color1, blendingFactor);
if (count == 0) {
    color.a = 0;
}

And finally, the heatmap is done!

BONUS! Add UI controls

async initializeGUI() {
    const onChange = () => this.render();
    [
        this.gui.add(this.uniforms, "gridWidth", 1, this.GRID_MAX_WIDTH, 1),
        this.gui.add(this.uniforms, "gridHeight", 1, this.GRID_MAX_HEIGHT, 1),
    ]
    .forEach((controller) => controller.onChange(onChange));
}

Congrats!

That concludes the tutorial :)

Name		Name	Last commit message	Last commit date
Latest commit History 113 Commits
data		data
images		images
lib		lib
map		map
shaders		shaders
tasks		tasks
LICENSE		LICENSE
README.md		README.md
index.html		index.html
jsconfig.json		jsconfig.json
loader.js		loader.js
slides.pdf		slides.pdf
tree.js		tree.js
tutorial.js		tutorial.js
types.js		types.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Real Time Visualization WebGPU Tutorial

Introduction

Task 0 - Initialize WebGPU

Task 1 - Compute Shader Basics

Task 2 - Processing Real Data

Task 3 - Render an Image

Task 4 - Render Trees as Markers

Bonus Task - Compute and Render Heatmap

Congrats!

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Real Time Visualization WebGPU Tutorial

Introduction

Task 0 - Initialize WebGPU

Task 1 - Compute Shader Basics

Task 2 - Processing Real Data

Task 3 - Render an Image

Task 4 - Render Trees as Markers

Bonus Task - Compute and Render Heatmap

Congrats!

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages