Introducing Kunit test into AMDGPU

 * based in part on anv driver which is:
 * Copyright © 2015 Intel Corporation

$ ./vkoverhead -test 0 -duration 3 -output-only
44854

$ ./vkoverhead -test 0 -duration 3 -output-only
59973

$ ./vkoverhead -test 0 -duration 3 -output-only
71192

$ ./vkoverhead -test 0 -output-only -duration 3
28345

$ VK_ICD_FILENAMES=/home/zmike/amd_pro.json ./vkoverhead -test 0 -output-only -duration 3
32889

$ ./vkoverhead -test 0 -output-only -duration 3
36006

$ ./vkoverhead -test 0 -output-only -duration 3
38629

$ ./vkoverhead -test 0 -output-only -duration 3
41878

$ ./vkoverhead -test 0 -output-only -duration 3
44073

┌────────────────┐           ┌────────────────┐           ┌────────────────┐
│                │           │                │           │                │
│   IRC client   │           │   IRC server   │           │  OAuth server  │
│                │           │                │           │                │
│ (gamja/goguma) │           │     (soju)     │           │  (meta.sr.ht)  │
│                │           │                │           │                │
└───────┬────────┘           └───────┬────────┘           └────────┬───────┘
        │                            │                             │
        │                                                          │
        │              1. Fetch OAuth server metadata              │
        ├────────────────────────────────────────────────────────► │
        │ ◄────────────────────────────────────────────────────────┤
        │                                                          │
        │                                                          │
        │              2. Redirect user to login page              │
        ├────────────────────────────────────────────────────────► │
        │ ◄────────────────────────────────────────────────────────┤
        │                     3. Get back a code                   │
        │                                                          │
        │                                                          │
        │                                                          │
        │                4. Exchange code for a token              │
        ├────────────────────────────────────────────────────────► │
        │ ◄────────────────────────────────────────────────────────┤
        │                                                          │
        │                                                          │
        │       5. Authenticate      │                             │
        │          with token        │                             │
        ├──────────────────────────► │                             │
        │                            │        6. Check token       │
        │                            ├───────────────────────────► │
        │                            │ ◄───────────────────────────┤
        │                            │      7. Get back username   │
        │                            │                             │
        │ ◄──────────────────────────┤                             │
        │                            │                             │
        │                            │                             │

> curl https://chat.sr.ht/.well-known/oauth-authorization-server
{
	"issuer": "https://meta.sr.ht",
	"authorization_endpoint": "https://meta.sr.ht/oauth2/authorize",
	"token_endpoint": "https://meta.sr.ht/oauth2/access-token",
	"response_types_supported": ["code"],
	"grant_types_supported": ["authorization_code"],
	"introspection_endpoint": "https://meta.sr.ht/oauth2/introspect",
	"introspection_endpoint_auth_methods_supported": ["none"]
}

> curl \
    --data-urlencode grant_type=authorization_code \
    --data-urlencode code=YYY \
    --data-urlencode client_id=XXX \
    https://meta.sr.ht/oauth2/access-token
{
	"access_token": "asdf"
}

> curl --data-urlencode token=asdf https://meta.sr.ht/oauth2/introspect
{
	"active": true,
	"username": "emersion"
}

$ ./vkoverhead -list
   0, draw
   1, draw_multi
   2, draw_vertex
   3, draw_multi_vertex
   4, draw_index_change
   5, draw_index_offset_change
   6, draw_rp_begin_end
   7, draw_rp_begin_end_dynrender
   8, draw_rp_begin_end_dontcare
   9, draw_rp_begin_end_dontcare_dynrender
  10, draw_multirt
  11, draw_multirt_dynrender
  12, draw_multirt_begin_end
  13, draw_multirt_begin_end_dynrender
  14, draw_multirt_begin_end_dontcare
  15, draw_multirt_begin_end_dontcare_dynrender
  16, draw_vbo_change
  17, draw_1vattrib_change
  18, draw_16vattrib
  19, draw_16vattrib_16vbo_change
  20, draw_16vattrib_change
  21, draw_16vattrib_change_dynamic
  22, draw_16vattrib_change_gpl
  23, draw_16vattrib_change_gpl_hashncache
  24, draw_1ubo_change
  25, draw_12ubo_change
  26, draw_1sampler_change
  27, draw_16sampler_change
  28, draw_1texelbuffer_change
  29, draw_16texelbuffer_change
  30, draw_1ssbo_change
  31, draw_8ssbo_change
  32, draw_1image_change
  33, draw_16image_change
  34, draw_1imagebuffer_change
  35, draw_16imagebuffer_change
  36, submit_noop
  37, submit_50noop
  38, submit_1cmdbuf
  39, submit_50cmdbuf
  40, submit_50cmdbuf_50submit
  41, descriptor_noop
  42, descriptor_1ubo
  43, descriptor_template_1ubo
  44, descriptor_12ubo
  45, descriptor_template_12ubo
  46, descriptor_1sampler
  47, descriptor_template_1sampler
  48, descriptor_16sampler
  49, descriptor_template_16sampler
  50, descriptor_1texelbuffer
  51, descriptor_template_1texelbuffer
  52, descriptor_16texelbuffer
  53, descriptor_template_16texelbuffer
  54, descriptor_1ssbo
  55, descriptor_template_1ssbo
  56, descriptor_8ssbo
  57, descriptor_template_8ssbo
  58, descriptor_1image
  59, descriptor_template_1image
  60, descriptor_16image
  61, descriptor_template_16image
  62, descriptor_1imagebuffer
  63, descriptor_template_1imagebuffer
  64, descriptor_16imagebuffer
  65, descriptor_template_16imagebuffer
  66, misc_resolve
  67, misc_resolve_mutable

  21, draw_16vattrib_change_dynamic
  22, draw_16vattrib_change_gpl
  23, draw_16vattrib_change_gpl_hashncache

KUNIT_EXPECT_EQ(test, memcmp(buffer1, buffer2, size), 0);
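A check built on a bare memcmp() result can only say that two buffers differ, not where. The following is a minimal userspace sketch of the extra detail a dedicated memory-block expectation could report. It is not the KUnit implementation; `first_mismatch` and `expect_memeq` are hypothetical names introduced here for illustration.

```c
#include <stddef.h>
#include <stdio.h>

/*
 * Hypothetical helper, not part of KUnit: returns the index of the first
 * byte that differs between the two blocks, or -1 if they are equal.
 */
static long first_mismatch(const void *left, const void *right, size_t size)
{
	const unsigned char *l = left;
	const unsigned char *r = right;

	for (size_t i = 0; i < size; i++) {
		if (l[i] != r[i])
			return (long)i;
	}
	return -1;
}

/*
 * Hypothetical expectation: returns 1 when the blocks match, otherwise
 * prints the offset of the first difference and returns 0.
 */
static int expect_memeq(const void *left, const void *right, size_t size)
{
	long at = first_mismatch(left, right, size);

	if (at < 0)
		return 1;
	fprintf(stderr, "blocks differ at byte %ld\n", at);
	return 0;
}
```

Reporting the offset is the main gain over comparing the memcmp() return value with zero, which tells the reader of a failed test nothing about the buffers themselves.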

#ifndef QUEUE_H_
#define QUEUE_H_

typedef struct Queue Queue;
struct Queue {
	int *buffer;
	int head;
	int size;
	int tail;
	int (*isFull)(Queue* const me);
	int (*isEmpty)(Queue* const me);
	int (*getSize)(Queue* const me);
	void (*insert)(Queue* const me, int k);
	int (*remove)(Queue* const me);
};

/* Constructors and destructors */
void Queue_Init(Queue* const me, int (*isFullFunction)(Queue* const me),
	int (*isEmptyFunction)(Queue* const me), int (*getSizeFunction)(Queue* const me),
	void (*insertFunction)(Queue* const me, int k), int (*removeFunction)(Queue* const me));

void Queue_Cleanup(Queue* const me);

/* Operations */
int Queue_isFull(Queue* const me);
int Queue_isEmpty(Queue* const me);
int Queue_getSize(Queue* const me);
void Queue_insert(Queue* const me, int k);
int Queue_remove(Queue* const me);

Queue *Queue_Create(void);
void Queue_Destroy(Queue* const me);

#endif

typedef struct CachedQueue CachedQueue;
struct CachedQueue {
	Queue *queue;

	/* new attributes */
	char name[80];
	int numberElementsOnDisk;

	/* aggregation in subclass */
	Queue *outputQueue;

	/* inherited virtual function */
	int (*isFull)(CachedQueue* const me);
	int (*isEmpty)(CachedQueue* const me);
	int (*getSize)(CachedQueue* const me);
	void (*insert)(CachedQueue* const me, int k);
	int (*remove)(CachedQueue* const me);

	/* new virtual functions */
	void (*flush)(CachedQueue* const me);
	int (*load)(CachedQueue* const me);
};
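To make the function-pointer approach concrete, here is a minimal sketch of a queue whose operations are bound at initialization time and then called through the object. It simplifies the declarations above under stated assumptions: the buffer is a fixed array of `QUEUE_SIZE` elements (a capacity chosen for this sketch), `Queue_Init` takes no function-pointer arguments, and `remove` returns -1 on an empty queue.

```c
#include <stdlib.h>

#define QUEUE_SIZE 8 /* assumed capacity, chosen for this sketch only */

typedef struct Queue Queue;
struct Queue {
	int buffer[QUEUE_SIZE];
	int head;
	int size;
	int tail;
	/* "virtual" operations, bound in Queue_Init() */
	int (*isFull)(Queue* const me);
	int (*isEmpty)(Queue* const me);
	void (*insert)(Queue* const me, int k);
	int (*remove)(Queue* const me);
};

static int Queue_isFull(Queue* const me) { return me->size == QUEUE_SIZE; }
static int Queue_isEmpty(Queue* const me) { return me->size == 0; }

static void Queue_insert(Queue* const me, int k)
{
	if (me->isFull(me))
		return; /* drop silently when full; real code would report it */
	me->buffer[me->head] = k;
	me->head = (me->head + 1) % QUEUE_SIZE;
	me->size++;
}

static int Queue_remove(Queue* const me)
{
	int value;

	if (me->isEmpty(me))
		return -1; /* sentinel used by this sketch only */
	value = me->buffer[me->tail];
	me->tail = (me->tail + 1) % QUEUE_SIZE;
	me->size--;
	return value;
}

/* Bind the default operations; a subclass could bind its own instead. */
static void Queue_Init(Queue* const me)
{
	me->head = 0;
	me->size = 0;
	me->tail = 0;
	me->isFull = Queue_isFull;
	me->isEmpty = Queue_isEmpty;
	me->insert = Queue_insert;
	me->remove = Queue_remove;
}

Queue *Queue_Create(void)
{
	Queue *me = malloc(sizeof(*me));

	if (me)
		Queue_Init(me);
	return me;
}

void Queue_Destroy(Queue* const me)
{
	free(me);
}
```

Callers go through the pointers (`q->insert(q, 7)`), so a derived type such as a cached queue can swap in different behavior without changing the call sites.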

start_processing(obj)
{
	mutex_lock(&obj->lock);
	/* set up the data for the async work */
	schedule_work(&obj->work);
	mutex_unlock(&obj->lock);
}

stop_processing(obj)
{
	mutex_lock(&obj->lock);
	/* clear the data for the async work */
	cancel_work_sync(&obj->work);
	mutex_unlock(&obj->lock);
}

work_fn(work)
{
	obj = container_of(work, work);

	mutex_lock(&obj->lock);
	/* do some processing */
	mutex_unlock(&obj->lock);
}

obj_find_in_cache(id)
{
	xa_lock();
	obj = xa_find(id);
	if (obj && !kref_get_unless_zero(&obj->kref))
		obj = NULL;
	xa_unlock();

	return obj;
}
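The cache lookup relies on an increment that refuses to revive an object whose reference count has already dropped to zero. A minimal userspace sketch of that operation with C11 atomics follows. It is not the kernel's kref code; `struct obj` and `obj_get_unless_zero` are hypothetical names used only here.

```c
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical userspace stand-in for a reference-counted object. */
struct obj {
	atomic_uint refcount;
};

/*
 * Take a reference only if the count is still nonzero. Returns false when
 * the last reference is already gone and the object must not be used.
 */
static bool obj_get_unless_zero(struct obj *obj)
{
	unsigned int old = atomic_load(&obj->refcount);

	while (old != 0) {
		/* On failure, old is refreshed with the current value. */
		if (atomic_compare_exchange_weak(&obj->refcount, &old, old + 1))
			return true;
	}
	return false;
}
```

The compare-and-exchange loop matters because a plain increment could turn a count of zero back into one while another thread is freeing the object.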

$ vlan
Device not provided

    vlan $DEV $VLAN $SUBNET

    vlan eth0 42 10.31.155.1/27

function vlan {
    DEV=$1
    VLAN=$2
    ADDR=$3

    HELP="

    vlan \$DEV \$VLAN \$SUBNET

    vlan eth0 42 10.31.155.1/27
"

    if [ -z "$DEV" ]; then
        echo "Device not provided"
        echo "$HELP"
        return 1
    fi

    ip link | grep "${DEV}: " >/dev/null 2>&1
    if [ $? -ne 0 ]; then
        echo "\"$DEV\" is not a valid device"
        echo "$HELP"
        return 1
    fi

    if [ -z "$VLAN" ]; then
        echo "VLAN not provided"
        echo "$HELP"
        return 1
    fi
    REGEX='^[0-9]+$'
    if ! [[ $VLAN =~ $REGEX ]] ; then
        echo "\"$VLAN\" is not a number" >&2; return 1
        echo …
static int drm_cmdline_test_force_D_only_not_digital(void *ignored)
{
	struct drm_cmdline_mode mode = { };

	FAIL_ON(!drm_mode_parse_command_line_for_connector("D",
							   &no_connector,
							   &mode));
	FAIL_ON(mode.specified);
	FAIL_ON(mode.refresh_specified);
	FAIL_ON(mode.bpp_specified);

	FAIL_ON(mode.rb);
	FAIL_ON(mode.cvt);
	FAIL_ON(mode.interlace);
	FAIL_ON(mode.margins);
	FAIL_ON(mode.force != DRM_FORCE_ON);

	return 0;
}

static void drm_cmdline_test_force_D_only_not_digital(struct kunit *test)
{
	struct drm_cmdline_mode mode = { };

	KUNIT_EXPECT_FALSE(test, !drm_mode_parse_command_line_for_connector("D",
							   &no_connector,
							   &mode));
	KUNIT_EXPECT_FALSE(test, mode.specified);
	KUNIT_EXPECT_FALSE(test, mode.refresh_specified);
	KUNIT_EXPECT_FALSE(test, mode.bpp_specified);

	KUNIT_EXPECT_FALSE(test, mode.rb);
	KUNIT_EXPECT_FALSE(test, mode.cvt);
	KUNIT_EXPECT_FALSE(test, mode.interlace);
	KUNIT_EXPECT_FALSE(test, mode.margins);
	KUNIT_EXPECT_FALSE(test, mode.force != DRM_FORCE_ON);
}

KUNIT_EXPECT_FALSE(test, !drm_mode_parse_command_line_for_connector("D",
							   &no_connector,
							   &mode));

KUNIT_EXPECT_TRUE(test, drm_mode_parse_command_line_for_connector("D",
							   &no_connector,
							   &mode));

static void drm_cmdline_test_force_D_only_not_digital(struct kunit *test)
{
	struct drm_cmdline_mode mode = { };

	KUNIT_EXPECT_TRUE(test, drm_mode_parse_command_line_for_connector("D",
							   &no_connector,
							   &mode));
	KUNIT_EXPECT_FALSE(test, mode.specified);
	KUNIT_EXPECT_FALSE(test, mode.refresh_specified);
	KUNIT_EXPECT_FALSE(test, mode.bpp_specified);

	KUNIT_EXPECT_FALSE(test, mode.rb);
	KUNIT_EXPECT_FALSE(test, mode.cvt);
	KUNIT_EXPECT_FALSE(test, mode.interlace);
	KUNIT_EXPECT_FALSE(test, mode.margins);
	KUNIT_EXPECT_EQ(test, mode.force, DRM_FORCE_ON);
}

#version 430 core
#extension GL_ARB_enhanced_layouts : require

layout(isolines, point_mode) in;

struct TestStruct {
   dmat3x4 a;
   double b;
   float c;
   dvec2 d;
};
struct OuterStruct {
    TestStruct inner_struct_a;
    TestStruct inner_struct_b;
};
layout (location = 0, xfb_offset = 0) flat out OuterStruct goku;

layout(std140, binding = 0) uniform Goku {
    TestStruct uni_goku;
};

void main()
{

    goku.inner_struct_a = uni_goku;
    goku.inner_struct_b = uni_goku;
}

/**
 * A single output for vertex transform feedback.
 */
struct pipe_stream_output
{
   unsigned register_index:6;  /**< 0 to 63 (OUT index) */
   unsigned start_component:2; /** 0 to 3 */
   unsigned num_components:3;  /** 1 to 4 */
   unsigned output_buffer:3;   /**< 0 to PIPE_MAX_SO_BUFFERS */
   unsigned dst_offset:16;     /**< offset into the buffer in dwords */
   unsigned stream:2;          /**< 0 to 3 */
};

/**
 * Stream output for vertex transform feedback.
 */
struct pipe_stream_output_info
{
   unsigned num_outputs;
   /** stride for an entire vertex for each buffer in dwords */
   uint16_t stride[PIPE_MAX_SO_BUFFERS];

   /**
    * Array of stream outputs, in the order they are to be written in.
    * Selected components are tightly packed into the output buffer.
    */
   struct pipe_stream_output output[PIPE_MAX_SO_OUTPUTS];
};

struct TestStruct {
   dmat3x4 a; <--this is effectively dvec3[4];
                 a dvec3 consumes 2 locations
                 so 4 * 2 is 8, so this consumes locations [0,7]
   double b; <--location 8
   float c; <--location 9
   dvec2 d; <--location 10
};
struct OuterStruct {
    TestStruct inner_struct_a; <--locations [0,10]
    TestStruct inner_struct_b; <--locations [11,21]
};

dmat3x4 a;
double b;
float c;
dvec2 d;
dmat3x4 a2;
double b2;
float c2;
dvec2 d2;

static bool
split_blocks(nir_shader *nir)
{
   bool progress = false;
   bool changed = false;
   do {
      progress = false;
      nir_foreach_shader_out_variable(var, nir) {
         const struct glsl_type *base_type = glsl_without_array(var->type);
         nir_variable *members[32]; //can't have more than this without breaking NIR
         if (!glsl_type_is_struct(base_type))
            continue;
         if (!glsl_type_is_struct(var->type) || glsl_get_length(var->type) == 1)
            continue;
         if (glsl_count_attribute_slots(var->type, false) == 1)
            continue;
         unsigned offset = 0;
         for (unsigned i = 0; i < glsl_get_length(var->type); i++) {
            members[i] = nir_variable_clone(var, nir);
            members[i]->type = glsl_get_struct_field(var->type, i);
            members[i]->name = (void*)glsl_get_struct_elem_name(var->type, i);
            members[i]->data.location += offset;
            offset += glsl_count_attribute_slots(members[i]->type, false);
            nir_shader_add_variable(nir, members[i]);
         }
         nir_foreach_function(function, nir) {
            bool func_progress = false;
            if (!function->impl)
               continue;
            nir_builder b;
            nir_builder_init(&b, function->impl);
            nir_foreach_block(block, function->impl) {
               nir_foreach_instr_safe(instr, block) {
                  switch (instr->type) {
                  case nir_instr_type_deref: {
                  nir_deref_instr *deref = nir_instr_as_deref(instr);
                  if (!(deref->modes & nir_var_shader_out))
                     continue;
                  if (nir_deref_instr_get_variable(deref) != var)
                     continue;
                  if (deref->deref_type != nir_deref_type_struct)
                     continue;
                  nir_deref_instr *parent = nir_deref_instr_parent(deref);
                  if (parent->deref_type != nir_deref_type_var)
                     continue;
                  deref->modes = nir_var_shader_temp;
                  parent->modes = nir_var_shader_temp;
                  b.cursor = nir_before_instr(instr);
                  nir_ssa_def *dest = &nir_build_deref_var(&b, members[deref->strct.index])->dest.ssa;
                  nir_ssa_def_rewrite_uses_after(&deref->dest.ssa, dest, &deref->instr);
                  nir_instr_remove(&deref->instr);
                  func_progress = true;
                  break;
                  }
                  default: break;
                  }
               }
            }
            if (func_progress)
               nir_metadata_preserve(function->impl, nir_metadata_none);
         }
         var->data.mode = nir_var_shader_temp;
         changed = true;
         progress = true;
      }
   } while (progress);
   return changed;
}

layout (depth_greater) out float gl_FragDepth;

#include <kunit/test.h>
#include "inc/bw_fixed.h"

static void abs_i64_test(struct kunit *test)
{
	KUNIT_EXPECT_EQ(test, 0ULL, abs_i64(0LL));

	/* Argument type limits */
	KUNIT_EXPECT_EQ(test, (uint64_t)MAX_I64, abs_i64(MAX_I64));
	KUNIT_EXPECT_EQ(test, (uint64_t)MAX_I64 + 1, abs_i64(MIN_I64));
}

static struct kunit_case bw_fixed_test_cases[] = {
	KUNIT_CASE(abs_i64_test),
	{  }
};

static struct kunit_suite bw_fixed_test_suite = {
	.name = "dml_calcs_bw_fixed",
	.test_cases = bw_fixed_test_cases,
};

kunit_test_suite(bw_fixed_test_suite);
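The MIN_I64 case in the test above is the interesting one: negating the most negative 64-bit value in signed arithmetic overflows. As a hedged illustration, here is a userspace sketch of an absolute-value helper that passes the same checks by negating in unsigned arithmetic. It is not the bw_fixed implementation; `abs_i64_sketch` is a hypothetical name, and the standard `INT64_MIN`/`INT64_MAX` limits stand in for the driver's macros.

```c
#include <stdint.h>

/*
 * Userspace sketch, not the driver code: convert to unsigned first so the
 * negation is well defined even for INT64_MIN.
 */
static uint64_t abs_i64_sketch(int64_t arg)
{
	if (arg >= 0)
		return (uint64_t)arg;
	return (uint64_t)0 - (uint64_t)arg;
}
```

With this form, `abs_i64_sketch(INT64_MIN)` evaluates to `(uint64_t)INT64_MAX + 1`, matching the last expectation in the test case.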

Multiple definitions of 'init_module'/'cleanup_module' at kunit_test_suites().

LD_PRELOAD=/usr/lib64/librenderdoc.so MESA_LOADER_DRIVER_OVERRIDE=zink <executable>

LD_PRELOAD=/usr/lib64/librenderdoc.so MESA_LOADER_DRIVER_OVERRIDE=zink glretrace --loop portal2.trace

b4 am https://lore.kernel.org/all/[email protected]/

VkResult ret = VKSCR(AllocateMemory)(screen->dev, &mai, NULL, &bo->mem);
if (!zink_screen_handle_vkresult(screen, ret)) {
   if (heap == ZINK_HEAP_DEVICE_LOCAL_VISIBLE) {
      heap = ZINK_HEAP_DEVICE_LOCAL;
      mesa_loge("zink: %p couldn't allocate memory! from BAR heap: retrying as device-local", bo);
      goto demote;
   }
   mesa_loge("zink: couldn't allocate memory! from heap %u", heap);
   goto fail;
}

$ ./scripts/run-tests.sh -t ".*amdgpu.*"

echo 0x19F | sudo tee /sys/module/drm/parameters/debug

git bisect start
git bisect bad # HEAD
git bisect good v5.16

make oldconfig
make bzImage
make modules

sudo make modules_install
sudo make install

dracut --force --kver {KERNEL_NAME}

grub2-editenv - unset menu_auto_hide

sed -i -e 's/GRUB_ENABLE_BLSCFG=true/GRUB_ENABLE_BLSCFG=false/g' /etc/default/grub

make CC="ccache clang" -j8

$ appstream-builder \
   --origin=yourcompanyname \
   --basename=appstream \
   --cache-dir=/tmp/asb-cache \
   --enable-hidpi \
   --max-threads=1 \
   --min-icon-size=32 \
   --output-dir=/tmp/asb-md \
   --packages-dir=x86_64/ \
   --temp-dir=/tmp/asb-icons

modifyrepo_c \
    --no-compress \
    --simple-md-filenames \
    /tmp/asb-md/appstream.xml.gz \
    x86_64/repodata/
modifyrepo_c \
    --no-compress \
    --simple-md-filenames \
    /tmp/asb-md/appstream-icons.tar.gz \
    x86_64/repodata/

/* matrix types always come from array (row) derefs */
assert(deref->deref_type == nir_deref_type_array);
nir_deref_instr *var_deref = nir_deref_instr_parent(deref);
/* let optimization clean up consts later */
nir_ssa_def *index = deref->arr.index.ssa;
/* this might be an indirect array index:
 * - iterate over matrix columns
 * - add if blocks for each column
 * - phi the loads using the array index
 */
unsigned cols = glsl_get_matrix_columns(matrix);
nir_ssa_def *dests[4];
for (unsigned idx = 0; idx < cols; idx++) {
   /* don't add an if for the final row: this will be handled in the else */
   if (idx < cols - 1)
      nir_push_if(&b, nir_ieq_imm(&b, index, idx));
   unsigned vec_components = glsl_get_vector_elements(matrix);
   /* always clamp dvec3 to 4 components */
   if (vec_components == 3)
      vec_components = 4;
   unsigned start_component = idx * vec_components * 2;
   /* struct member */
   unsigned member = start_component / 4;
   /* number of components remaining */
   unsigned remaining = num_components;
   /* component index */
   unsigned comp_idx = 0;
   for (unsigned i = 0; i < num_components; member++) {
      assert(member < glsl_get_length(var_deref->type));
      nir_deref_instr *strct = nir_build_deref_struct(&b, var_deref, member);
      nir_ssa_def *load = nir_load_deref(&b, strct);
      unsigned incr = MIN2(remaining, 4);
      /* repack the loads to 64bit */
      for (unsigned c = 0; c < incr / 2; c++, comp_idx++)
         comp[comp_idx] = nir_pack_64_2x32(&b, nir_channels(&b, load, BITFIELD_RANGE(c * 2, 2)));
      remaining -= incr;
      i += incr;
   }
   dest = dests[idx] = nir_vec(&b, comp, intr->num_components);
   if (idx < cols - 1)
      nir_push_else(&b, NULL);
}
/* loop over all the if blocks that were made, pop them, and phi the loaded+packed results */
for (unsigned idx = cols - 1; idx >= 1; idx--) {
   nir_pop_if(&b, NULL);
   dest = nir_if_phi(&b, dests[idx - 1], dest);
}

Name	Commit Count	Percentage of Total	Affiliation
Jason Ekstrand	1429	19.7%	Intel/Collabora
Timothy Arceri	714	9.9%	Collabora/Valve
Ian Romanick	577	8.0%	Intel
Rhys Perry	298	4.1%	Valve
Caio Oliveira	270	3.7%	Intel
Emma Anholt	268	3.7%	Google
Marek Olšák	260	3.6%	AMD
Kenneth Graunke	224	3.1%	Intel
Samuel Pitoiset	176	2.4%	Valve
Connor Abbott	168	2.3%	Intel/Valve

Patch	Status
#592 Add support for GPUs identified as “Display controller” in kw device	Accepted
#607 Enhance docs for kw-pomodoro and kw-report	Accepted

Patch	Status
lib/igt_kmod: fix trivial typos	Under review
CONTRIBUTING: Add reference for GTKDoc	Under review
lib/kselftests: Skip kselftest when opening kmsg fails	Under review
lib/igt_kmod: add igt_kselftests documentation	Under review

Patch	Status
Documentation: kunit: fix trivial typo	Accepted
Documentation: Kunit: Fix inconsistent titles	Accepted
Documentation: KUnit: Fix non-uml anchor	Accepted
Documentation: Kunit: Add ref for other kinds of tests	Accepted
Documentation: KUnit: remove duplicated docs for kunit_tool	Accepted
Documentation: KUnit: avoid repeating “kunit.py run” in start.rst	Accepted
Documentation: KUnit: add note about mrproper in start.rst	Accepted
Documentation: KUnit: Reword start guide for selecting tests	Accepted
Documentation: KUnit: add intro to the getting-started page	Accepted
Documentation: KUnit: update links in the index page	Accepted
lib: overflow: update reference to kunit-tool	Accepted
lib: stackinit: update reference to kunit-tool	Accepted
kunit: tool: fix --qemu_config help text	Under review

Patch	Status
drm/vkms: check plane_composer->map[0] before using it	Accepted
drm/vkms: return early if compose_plane fails	Discarded

Patch	Status
drm/amd/display: make hubp1_wait_pipe_read_start() static	Accepted
Update AMDGPU glossary and MAINTAINERS	Accepted
drm/amd/display: fix overflow on MIN_I64 definition	Accepted
drm/amd/display: fix minor codestyle problems	Accepted
drm/amd/display: remove unneeded defines from bios parser	Accepted

Case	Draws per second
21 draw_16vattrib_change_dynamic	7,965,000
22 draw_16vattrib_change_gpl	315,000
23 draw_16vattrib_change_gpl_hashncache	4,020,000

Patch	Status
docs: dependencies: Add pv to Fedora dependencies	Accepted
src: kwlib: check if the context is inside a git worktree	Accepted
Add deploy support to Fedora-based systems	Accepted
src: help: Fix renaming of configm to kernel-config-manager	Accepted

Patch	Status
[i-g-t,v2] tests/amdgpu: Skip multihead MPO tests on single display	Accepted
[i-g-t,v2] tests/amdgpu/amd_bypass: skip if connector is not DisplayPort	On Review

Patch	Status
Documentation: KUnit: Fix example with compilation error	Accepted
kunit: Introduce KUNIT_EXPECT_MEMEQ and KUNIT_EXPECT_MEMNEQ macros	Accepted
kunit: Add KUnit memory block assertions to the example_all_expect_macros_test	Accepted
kunit: Use KUNIT_EXPECT_MEMEQ macro	Accepted

Patch	Status
drm: selftest: convert drm_damage_helper selftest to KUnit	Accepted
drm: selftest: convert drm_cmdline_parser selftest to KUnit	Accepted
drm: selftest: convert drm_rect selftest to KUnit	Accepted
drm: selftest: convert drm_format selftest to KUnit	Accepted
drm: selftest: convert drm_plane_helper selftest to KUnit	Accepted
drm: selftest: convert drm_dp_mst_helper selftest to KUnit	Accepted
drm: selftest: convert drm_framebuffer selftest to KUnit	Accepted
drm: selftest: convert drm_buddy selftest to KUnit	Accepted
drm: selftest: convert drm_mm selftest to KUnit	Accepted
drm/tests: Split up test cases in igt_check_drm_format_min_pitch	Accepted
drm/mm: Reduce stack frame usage in __igt_reserve	On Review
drm/tests: Split drm_framebuffer_create_test into parameterized tests	On Review
drm/tests: Change “igt_” prefix to “test_drm_”	On Review

Patch	Status
drm/amd/display: Remove return value of Calculate256BBlockSizes	Accepted
drm/amd/display: Remove duplicate code across dcn30 and dcn31	Accepted
drm/amd/display: Remove unused variables from vba_vars_st	Accepted
drm/amdgpu: Write masked value to control register	Accepted
drm/amd/display: Change get_pipe_idx function scope	Accepted
drm/amd/display: Remove unused clk_src variable	Accepted
drm/amd/display: Remove unused dml32_CalculatedoublePipeDPPCLKAndSCLThroughput function	Accepted
drm/amd/display: Remove unused NumberOfStates variable	Accepted
drm/amd/display: Remove unused variables from dml_rq_dlg_get_dlg_params	Accepted
drm/amd/display: Remove unused value0 variable	On Review
drm/amd/display: Remove unused variables from dcn10_stream_encoder	Accepted
drm/amd/display: Remove unused MaxUsedBW variable	Accepted
drm/amd/display: Remove parameters from dml30_CalculateWriteBackDISPCLK	Rejected
drm/amd/display: Drop dm_sw_gfx7_2d_thin_l_vp and dm_sw_gfx7_2d_thin_gl	On Review
drm/amd/display: Remove duplicated CalculateWriteBackDISPCLK	On Review
drm/amd/display: Remove parameters from rq_dlg_get_dlg_reg	On Review
drm/amd/display: Rewrite CalculateWriteBackDISPCLK function	On Review
drm/amd/display: Remove unused struct freesync_context	Accepted
[PATCH 00/16] Remove entries from struct vba_vars_st	On Review
drm/amd/display: Drop XFCEnabled parameter from CalculatePrefetchSchedule	On Review
drm/amdgpu: Fix use-after-free on amdgpu_bo_list mutex	Accepted
drm/amd/display: Include missing header	Accepted

Patch	Status
drm/amd/display: Introduce KUnit tests to the bw_fixed library	On Review
drm/amd/display: Introduce KUnit tests to the display_mode_vba library	On Review
drm/amd/display: Introduce KUnit to dcn20/display_mode_vba_20 library	On Review
drm/amd/display: Introduce KUnit tests to dc_dmub_srv library	On Review
Documentation/gpu: Add Display Core Unit Test documentation	On Review

Date	Post
May 26, 2022	I’m in GSoC ‘22
Jun 11, 2022	Linux Kernel Developing with Fedora
Jul 11, 2022	About Kernel Symbol Table, Compilation, and more
Jul 19, 2022	From Selftests to KUnit
Aug 10, 2022	Does the Linux Kernel need software engineering?

Command	Time spent	Notes
koji build --scratch --arch-override=x86_64 f36 kernel.src.rpm	129 minutes	It's usually quicker, but that day must have been particularly busy
fedpkg local	70 minutes	No rpmmacros changes except setting the workdir in $HOME
powerprofilesctl launch fedpkg local	25 minutes
localmodconfig / bin-rpmpkg	19 minutes	Defaults to "-j2"
localmodconfig -j16 / bin-rpmpkg	1:48 minutes
powerprofilesctl launch localmodconfig ccache -j16 / bin-rpmpkg	7 minutes	Cold cache
powerprofilesctl launch localmodconfig ccache -j16 / bin-rpmpkg	1:45 minutes	Hot cache
powerprofilesctl launch localmodconfig xzdio -j16 / bin-rpmpkg	1:20 minutes

Hardware	Textures	Images	Samplers	Border Colors	Typed buffers	UBOs	SSBOs
NVIDIA (Kepler+)	H	H	H		H	D/F	D
AMD	D	D	D	H	D	D	D
Intel (Skylake+)	H	H	H		H	H/D/F	H/D
Intel (pre-Skylake)	F	F	F		F	D/F	F
Arm (Valhall+)	B	B	B		B	B/D/F	B/D
Arm (pre-Valhall)	F	F	F		F	D/F	D
Qualcomm (a5xx+)	B	B	B		B	B	B
Broadcom (vc5)	D	D	D		D	D	D

Type	CXL.cache	CXL.mem
1	y	n
2	y	y
3	n	y

Type	Max bandwidth (GB/s)
PCIe 4.0 x16	32
PCIe 5.0 x16	64
PCIe 6.0 x16	128
DDR4 (1 DIMM)	25.6
DDR5 (1 DIMM)	51.2
HBM3	819
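The PCIe and DDR figures in this table can be approximated from transfer rates. The sketch below assumes per-lane rates of 16 GT/s for PCIe 4.0 and 32 GT/s for PCIe 5.0, ignores encoding overhead, counts one direction only, and assumes the DDR rows correspond to DDR4-3200 and DDR5-6400 on a 64-bit DIMM. The function names are hypothetical.

```c
/*
 * Approximate one-direction bandwidth of an x16 link in GB/s:
 * (GT/s per lane) * 16 lanes / 8 bits per byte, ignoring encoding overhead.
 */
static double pcie_x16_gbytes_per_s(double gt_per_s_per_lane)
{
	return gt_per_s_per_lane * 16.0 / 8.0;
}

/*
 * Peak bandwidth of one 64-bit DIMM in GB/s:
 * (MT/s) * 8 bytes per transfer / 1000.
 */
static double dimm_gbytes_per_s(double mega_transfers_per_s)
{
	return mega_transfers_per_s * 8.0 / 1000.0;
}
```

For example, `pcie_x16_gbytes_per_s(16.0)` gives 32 and `dimm_gbytes_per_s(3200.0)` gives about 25.6, in line with the PCIe 4.0 and DDR4 rows.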

Recap

But How

For Some

It Must Be Said

It’s Happening.

Shoutouts

Cooking Up A Storm

No Memes

What’s next?

Moving to userspace consoles:

cgroups for GPU

Before We Begin

But Now

Barely Hanging On

Results

Homemade Spaghetti

Step 1: Read The Label

It’s Totally Cool

I’m Not Being Defensive

Now We’re Cooking

Finishing Touch

Motivation

High-level overview

Implementation

Future plans

Introducing Kunit test into AMDGPU

Contributing to FLOSS projects

KWorkflow

IGT

Linux Kernel - KUnit

Linux Kernel - DRM

Linux Kernel - AMDGPU

Acknowledgment

Next steps

What had to be done

What was actually done

On the Kernel

On other projects

KWorkflow

IGT GPU Tools

Blog posts

What still needs to be done

Need Another Hit

New Edition

TL;DR drawoverhead

Why Not drawoverhead?

vkoverhead: Mythbusting

More vkoverhead

Contributions during GSoC 2022

kworkflow

IGT

Linux Kernel - KUnit

Linux Kernel - DRM

Linux Kernel - AMDGPU

The KUnit AMDGPU Tests

The Unit Tests and Documentation

The Blog Posts

More than just code

Next Steps

Acknowledgment

How it started

How we made it work

NIR and SPIR-V

Lowering the shaders in the backend

Mesh shading draw calls in RADV

Mesh shading on Intel

The guy who wrote the most mesh shaders on Earth

Conclusion

What happens to NV_mesh_shader now?

When is it coming to my Steam Deck / Linux computer?

Waiting for the gang…

Integration

SP33D

Descriptors: Recap

Descriptors: Faster

More SP33D

New Month, New Post

Release

Render Passes

On Topic

Locking Antipattern: `preempt/local_irq/bh_disable()` and Friends …