path: root/src
Commit message | Author | Age | Files | Lines
* compiler: add VARYING_SLOT_VIEWPORT_MASK | Ilia Mirkin | 2020-04-15 | 4 | -0/+6
  See GL_NV_viewport_array2::gl_ViewportMask for how this is supposed to work.
  Signed-off-by: Ilia Mirkin <[email protected]>
  Reviewed-by: Marek Olšák <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4529>
* ir3: Handle load_ubo_ir3 when promoting to constants | Connor Abbott | 2020-04-15 | 1 | -10/+30
  This restores support for promoting UBO loads to constant loads when using LDC.
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4568>
* ir3: Fix LDC offset units | Connor Abbott | 2020-04-15 | 7 | -11/+101
  I had missed that LDC actually uses vec4 units for its offset. This means that we have to create a new instruction, and lower it in ir3_nir_lower_io_offsets, similar to the existing SSBO instructions. Unfortunately we can't assume that loads are always vec4-aligned, so we have to use the alignment information that NIR gives us. Unfortunately, it's currently woefully inadequate, and will have to be fixed to give us good codegen in the future.
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4568>
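The unit conversion described in the commit above can be sketched as follows. This is only an illustration, not the ir3 lowering code: the function names are hypothetical, and it assumes a vec4 is four 32-bit components (16 bytes) and that alignment information takes the form "offset is congruent to align_offset modulo align_mul".

```c
#include <stdbool.h>
#include <stdint.h>

/* Assumption: one vec4 is four 32-bit components, i.e. 16 bytes. */
#define VEC4_BYTES 16u

/* Hypothetical helper: returns true when the alignment information
 * (byte offset == k * align_mul + align_offset) proves that the byte
 * offset is a multiple of one vec4. */
static bool
offset_is_vec4_aligned(uint32_t align_mul, uint32_t align_offset)
{
   return align_mul % VEC4_BYTES == 0 && align_offset % VEC4_BYTES == 0;
}

/* Hypothetical helper: splits a byte offset into a vec4 index (the
 * return value) and the 32-bit component inside that vec4. */
static uint32_t
byte_offset_to_vec4(uint32_t byte_offset, uint32_t *component)
{
   *component = (byte_offset % VEC4_BYTES) / 4u;
   return byte_offset / VEC4_BYTES;
}
```

For example, byte offset 36 lands in vec4 number 2 at component 1. When the alignment test fails, a lowering pass cannot fold the component into the instruction and would have to handle the unaligned case separately.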
* Revert "nvc0: fix line width on GM20x+" | Karol Herbst | 2020-04-15 | 1 | -4/+1
  This reverts commit a0e57432b76c32f2109dab0ad3df0ba03967441c.
  It's unclear what caused the test to fail back then. Now it seems to be reversed. I tested with a close enough piglit and mesa branch and wasn't able to reproduce the same test result I've got in some older piglit runs.
  Fixes:
    dEQP-GLES2.functional.rasterization.primitives.lines_wide
    dEQP-GLES2.functional.rasterization.primitives.line_strip_wide
    dEQP-GLES2.functional.rasterization.primitives.line_loop_wide
    dEQP-GLES2.functional.rasterization.limits.points
    dEQP-GLES2.functional.clipping.line.wide_line_z_clip
    dEQP-GLES2.functional.clipping.line.wide_line_z_clip_viewport_center
    dEQP-GLES2.functional.clipping.line.wide_line_z_clip_viewport_corner
    dEQP-GLES2.functional.clipping.line.wide_line_clip
    dEQP-GLES2.functional.clipping.line.wide_line_clip_viewport_center
    dEQP-GLES2.functional.clipping.line.wide_line_clip_viewport_corner
    dEQP-GLES2.functional.clipping.line.wide_line_attrib_clip
    dEQP-GLES2.functional.polygon_offset.default_result_depth_clamp
    dEQP-GLES2.functional.polygon_offset.default_factor_1_slope
    dEQP-GLES2.functional.polygon_offset.fixed16_result_depth_clamp
    dEQP-GLES2.functional.polygon_offset.fixed16_factor_1_slope
  Signed-off-by: Karol Herbst <[email protected]>
  Reviewed-by: Ilia Mirkin <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4575>
* anv: Fix UBO range detection in anv_nir_compute_push_layout | Jason Ekstrand | 2020-04-15 | 1 | -10/+5
  This fixes two bugs: First, if the same block index showed up twice, we only pick the first one. Second, we weren't multiplying by 32. This didn't show up in tests because RBA testing is garbage. Found while looking at shaders from the UE4 Shooter demo.
  Fixes: e03f9652 "anv: Bounds-check pushed UBOs when..."
  Reviewed-by: Caio Marcelo de Oliveira Filho <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4578>
* anv: Advertise SEND count through VK_EXT_pipeline_executable_properties | Jason Ekstrand | 2020-04-15 | 4 | -0/+13
  Reviewed-by: Caio Marcelo de Oliveira Filho <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4578>
* iris: make BATCH_SZ smaller by BATCH_RESERVED bytes | Paulo Zanoni | 2020-04-15 | 2 | -7/+7
  Iris allocates gem buffers using buckets of allocation sizes that are page aligned. We always ask for batch buffers of size BATCH_SZ + BATCH_RESERVED, which is not page aligned: we ask for 65552 bytes, which ends up in the bucket of size 81920, resulting in 20% unused space.
  Adjust things so there is no waste of space: BATCH_SZ + BATCH_RESERVED is now 65536.
  Reviewed-by: Lionel Landwerlin <[email protected]>
  Signed-off-by: Paulo Zanoni <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4561>
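The arithmetic behind the 20% figure in the commit above can be checked with a small sketch. The three sizes come from the commit message; the 4 KiB page size and the helper names are assumptions made for illustration, and the code does not reproduce how iris actually picks a bucket.

```c
#include <stdbool.h>
#include <stdint.h>

/* Assumption: 4 KiB pages. */
#define SKETCH_PAGE_SIZE 4096u

/* Sizes quoted in the commit message. */
#define OLD_REQUEST 65552u /* old BATCH_SZ + BATCH_RESERVED */
#define NEW_REQUEST 65536u /* new BATCH_SZ + BATCH_RESERVED */
#define OLD_BUCKET  81920u /* bucket the old request fell into */

/* Hypothetical helper: true when the size is a whole number of pages. */
static bool
is_page_aligned(uint32_t size)
{
   return size % SKETCH_PAGE_SIZE == 0;
}

/* Hypothetical helper: bytes left unused when a request of
 * `request` bytes is served from a bucket of `bucket_size` bytes. */
static uint32_t
wasted_bytes(uint32_t bucket_size, uint32_t request)
{
   return bucket_size - request;
}
```

With the old sizes, wasted_bytes(OLD_BUCKET, OLD_REQUEST) is 16368 bytes, just under 20% of the 81920-byte bucket. The new request of 65536 bytes is exactly 16 pages.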
* iris: remove useless bo->gtt_offset assignment | Paulo Zanoni | 2020-04-15 | 1 | -1/+0
  We assign a real value a few lines below, and none of the lines in between rely on the zeroed bo->gtt_offset value.
  Reviewed-by: Lionel Landwerlin <[email protected]>
  Signed-off-by: Paulo Zanoni <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4561>
* iris: remove unnecessary forward declaration | Paulo Zanoni | 2020-04-15 | 1 | -4/+0
  Reviewed-by: Lionel Landwerlin <[email protected]>
  Signed-off-by: Paulo Zanoni <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4561>
* iris: remove hole from struct iris_bo | Paulo Zanoni | 2020-04-15 | 1 | -12/+12
  This decreases the size of the struct on a 64bit machine from 144 to 136. While that's not a lot, this is one of the structs that we're allocating all the time. For a full Aztec run on BDW we allocate this struct 3273 times, and we can have up to 3259 of them live at the same time. So we end up saving just a little over 6 pages for this benchmark.
  Spotted this while trying to add another bool for an unrelated feature.
  Reviewed-by: Lionel Landwerlin <[email protected]>
  Signed-off-by: Paulo Zanoni <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4561>
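How a padding hole arises, and how reordering members removes it, can be shown with a toy struct. The two structs below are hypothetical and far smaller than the real struct iris_bo; their member names are chosen only for illustration. The savings helper reuses the numbers from the commit message.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical layout with holes: each bool is followed by padding so
 * that the next 64-bit member (or the struct end) stays aligned. */
struct bo_with_hole {
   uint64_t size;
   bool external;       /* padding follows this member */
   uint64_t gtt_offset;
   bool reusable;       /* tail padding follows this member */
};

/* Same members, reordered so the two bools share one padded slot. */
struct bo_packed {
   uint64_t size;
   uint64_t gtt_offset;
   bool external;
   bool reusable;
};

/* Bytes saved when `live_objects` structs shrink from old_size to
 * new_size bytes each. */
static size_t
bytes_saved(size_t live_objects, size_t old_size, size_t new_size)
{
   return live_objects * (old_size - new_size);
}
```

On a typical 64-bit ABI the first struct occupies 32 bytes and the second 24. For the commit's figures, bytes_saved(3259, 144, 136) is 26072 bytes, which is a little over six 4096-byte pages.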
* Fix util/process test on Cygwin | Jon Turney | 2020-04-15 | 1 | -0/+5
  It seems meson returns the filename with extension for full_path(), even though Cygwin does its best to pretend the file doesn't have that extension.
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4514>
* llvmpipe/nir: free the nir shader | Dave Airlie | 2020-04-16 | 1 | -0/+2
  Fixes: 18f896e55d96 (llvmpipe: add initial nir support)
  Reviewed-by: Roland Scheidegger <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4563>
* draw/tess: free the NIR | Dave Airlie | 2020-04-16 | 1 | -1/+5
  Fixes: 0d02a7b8ca794 (draw: add main tessellation code)
  Reviewed-by: Roland Scheidegger <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4563>
* draw: free the NIR IR. | Dave Airlie | 2020-04-16 | 2 | -4/+11
  Not sure how I missed this, the ownership was a bit blurry, free the NIR.
  Fixes: bf12bc2dd7a2 (draw: add nir info gathering and building support)
  Reviewed-by: Roland Scheidegger <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4563>
* freedreno/turnip: Update GRAS_LAYER_CNTL to GRAS_MAX_LAYER_INDEX | Brian Ho | 2020-04-15 | 3 | -23/+6
  After some experimentation, I believe that GRAS_LAYER_CNTL is actually just a count register storing the number of layers in the render target. While debugging cube_array geometry tests, I noticed that the blob was setting an unknown 0x8 to LAYER_CNTL, so I checked the value of LAYER_CNTL for various layer sizes:
    1: LAYER_CNTL=0
    2: LAYER_CNTL=1
    3: LAYER_CNTL=2
    4: LAYER_CNTL=3
    9: LAYER_CNTL=8
    256: LAYER_CNTL=255
    2000: LAYER_CNTL=1999
  Seems like this register just stores a count of the largest layer that can be written to via gl_Layer.
  This commit updates the reg docs, freedreno's gs implementation, and turnip's gs implementation.
  Fixes dEQP-VK.geometry.layered.cube_array.*
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4541>
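The relationship observed in the table above (register value equals layer count minus one) can be written as a one-line helper. The function name is hypothetical and this is not the register-packing code from freedreno or turnip.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: value for a "maximum layer index" register,
 * given the number of layers in the render target. A target with
 * N layers has a largest writable layer index of N - 1. */
static uint32_t
max_layer_index(uint32_t layer_count)
{
   assert(layer_count > 0); /* a render target has at least one layer */
   return layer_count - 1;
}
```

This reproduces every row in the commit message, for example 9 layers gives 8 and 2000 layers gives 1999.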
* turnip: Emit geometry shader descriptor consts | Brian Ho | 2020-04-15 | 1 | -0/+9
  Without these consts, the geometry shader is unable to read from textures or uniforms.
  Fixes dEQP-VK.geometry.layered.*.readback
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4541>
* turnip: Correctly set layer stride for 3D images | Brian Ho | 2020-04-15 | 1 | -2/+4
  Previously we were using layout.layer_size for the layer stride, but in Vulkan, you can alias a 3D image as an array of 2D images via the VK_IMAGE_CREATE_2D_ARRAY_COMPATIBLE_BIT flag. One reason to use this behavior is so the geometry shader can write to a specific depth in a 3D framebuffer with gl_Layer.
  Since the 3D image is not a *true* layered image, layer_size is 0. Instead, we can copy what freedreno does and use the slice size.
  Fixes dEQP-VK.geometry.layered.3d.*
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4541>
* gallium: initialize viewport swizzle in cso_set_viewport_dims | Karol Herbst | 2020-04-15 | 1 | -0/+4
  Fixes: dEQP-GLES2.functional.fragment_ops.depth_stencil.stencil_* and more
  Fixes: 4137a79c2a7e ("gallium: add viewport swizzling state and cap")
  Signed-off-by: Karol Herbst <[email protected]>
  Reviewed-by: Ilia Mirkin <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4567>
* mesa: fix enum value of VIEWPORT_SWIZZLE_POSITIVE_W_NV | Karol Herbst | 2020-04-15 | 1 | -1/+1
  Fixes: ff168b297d94 ("mesa: add GL_NV_viewport_swizzle support")
  Reported-by: Roy Spliet <[email protected]>
  Signed-off-by: Karol Herbst <[email protected]>
  Reviewed-by: Ilia Mirkin <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4567>
* radv/aco: do not advertise VK_KHR_shader_subgroup_extended_types | Samuel Pitoiset | 2020-04-15 | 2 | -3/+3
  It's unsupported because small bitsizes are still not completely supported. It should have been disabled by default with ACO.
  Acked-by: Daniel Schürmann <[email protected]>
  Signed-off-by: Samuel Pitoiset <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4549>
* nvc0: enable ASTC and ETC on GM20B | Karol Herbst | 2020-04-15 | 1 | -5/+3
  Signed-off-by: Karol Herbst <[email protected]>
  Reviewed-by: Ilia Mirkin <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4554>
* glx: omit loader_loader() for macOS | Emil Velikov | 2020-04-15 | 1 | -0/+4
  Earlier commit added the code unconditionally, since the loader code itself is already built on macOS. Although it did not consider the #include mayhem that src/glx is.
  In particular, none of the __GLXDRI{screen,context,drawable} are available for macOS... those are pulled by dri_common.[ch]. Ideally we'll untangle that, but for the time being simply #ifdef out the include/call.
  Closes: https://gitlab.freedesktop.org/mesa/mesa/issues/2726
  Fixes: b699d070a6d ("glx: set the loader_logger early and for everyone")
  Signed-off-by: Emil Velikov <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4490>
* clover: expose cl_arm_shared_virtual_memory for devices with SVM support | Karol Herbst | 2020-04-15 | 1 | -1/+2
  Signed-off-by: Karol Herbst <[email protected]>
  Reviewed-by: Francisco Jerez <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/2076>
* clover: implement cl_arm_shared_virtual_memory | Karol Herbst | 2020-04-15 | 6 | -54/+287
  v2: use static array to keep name -> func mapping
  v3: use unordered_map
  v4: handle ARM constants
      reorder dispatch table
      wrap enqueue APIs as the command value differs between khr and arm
  v5: move declarations into dispatch.hpp
      handle CL_MEM_USES_SVM_POINTER_ARM in clGetMemObjectInfo
  v6: breaking long lines
  Signed-off-by: Karol Herbst <[email protected]>
  Reviewed-by: Francisco Jerez <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/2076>
* clover: implement SVM functions for devices with fine grained system SVM support | Karol Herbst | 2020-04-15 | 6 | -20/+259
  all of the functionality can be mapped to malloc/free if the device supports fine grained system SVM.
  v2: fix some API bugs found with the OpenCL CTS
  v3: remove validate_even_wait_list
      improve implementation of clSetKernelExecInfo
      make clEnqueueSVMFree spec compliant
      rename can_emulate_non_system_svm to has_system_svm and make it a member method
      improve validation in clEnqueueSVMMemFill
      handle CL_MEM_USES_SVM_POINTER in clGetMemObjectInfo
  v4: break long lines and other minor cosmetic adjustments
  Signed-off-by: Karol Herbst <[email protected]>
  Reviewed-by: Francisco Jerez <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/2076>
* clover: implement clSetKernelArgSVMPointer | Karol Herbst | 2020-04-15 | 3 | -3/+29
  it is pretty much identical to a clSetKernelArg for a scalar field, except it is only valid for global and constant memory pointers.
  Also the type equals void* on the Host, so we can just check the size of it.
  v2: prefer using target_size to extend the pointer value
  v3: handle more corner cases in combination to clSetKernelArg
  Signed-off-by: Karol Herbst <[email protected]>
  Reviewed-by: Pierre Moreau <[email protected]>
  Reviewed-by: Francisco Jerez <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/2076>
* clover: implement CL_DEVICE_SVM_CAPABILITIES | Karol Herbst | 2020-04-15 | 3 | -0/+28
  v2: without supporting userptrs SVM can't be implemented as it's impossible to ensure memory consistency with HOST_PTR buffers
  v3: fix comment style
  v4: fixes typo in comment
  Signed-off-by: Karol Herbst <[email protected]>
  Reviewed-by: Pierre Moreau <[email protected]>
  Reviewed-by: Francisco Jerez <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/2076>
* clover: add stubs for SVM | Karol Herbst | 2020-04-15 | 4 | -10/+117
  although most of those are 2.0 core functions, there is cl_arm_shared_virtual_memory to expose those in a 1.2 context. But we should be able to expose this extension with 1.1 as well as there is no technical reason why this shouldn't work.
  v2: move svm functions into existing files
  v3: rename func args to match convention
  Signed-off-by: Karol Herbst <[email protected]>
  Reviewed-by: Francisco Jerez <[email protected]>
  Reviewed-by: Pierre Moreau <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/2076>
* gallium: add PIPE_CAP_SYSTEM_SVM | Karol Herbst | 2020-04-15 | 3 | -0/+5
  v2: split enum in specific caps to abstract the CL enum
  v3: remove BUFFER_SVM caps
  Signed-off-by: Karol Herbst <[email protected]>
  Reviewed-by: Francisco Jerez <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/2076>
* aco: fix 1D textureGrad() on GFX9 | Rhys Perry | 2020-04-15 | 1 | -1/+1
  Signed-off-by: Rhys Perry <[email protected]>
  Reviewed-by: Samuel Pitoiset <[email protected]>
  Fixes: 6f718edcedd ('aco: simplify gathering of MIMG address components')
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4550>
* iris: drop cache coherent cpu mapping for external BO | Lionel Landwerlin | 2020-04-15 | 3 | -1/+19
  We have to assume any external buffer could be used by the display HW. In the case that buffer is also CPU mapped, we want to assume no cache coherency as it is only available between GT & CPU, not display.
  Many thanks to Michel Dänzer for the hint!
  v2: Move cache coherent drop to bufmgr (Chris)
  v3: Also make BO external if created with PIPE_BIND_SHARED (Eric)
  Signed-off-by: Lionel Landwerlin <[email protected]>
  Cc: <[email protected]>
  Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2552
  Reviewed-by: Eric Anholt <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4533>
* aco: fix nir_op_frexp_exp with 16-bit floats and negative exponents | Samuel Pitoiset | 2020-04-15 | 1 | -1/+6
  v_frexp_exp_i16_f16 returns the two's complement for negative exponents. For example, with 0.333252 it returns 0.666504 for the mantissa and 65535 for the exponent (-1 in decimal).
  RADV/LLVM and AMDVLK do a v_bfe_i32 and AMDGPU-PRO uses SDWA with the sign extension bit set. The latter is probably what we want to do in long term but for now RA doesn't support changing non-SDWA instructions to SDWA if useful/needed.
  Fixes dEQP-VK.glsl.builtin.precision_fp16_storage16b.frexp.compute.*.
  Signed-off-by: Samuel Pitoiset <[email protected]>
  Reviewed-by: Daniel Schürmann <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4546>
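The sign extension the commit above relies on can be illustrated on the CPU. This is a sketch of the arithmetic only, with a hypothetical function name; it says nothing about how ACO emits the instruction.

```c
#include <stdint.h>

/* Hypothetical helper: interprets the low 16 bits of `raw` as a
 * two's complement number and widens it to 32 bits. Flipping the
 * sign bit and then subtracting its weight avoids any
 * implementation-defined narrowing conversion. */
static int32_t
sign_extend_16(uint32_t raw)
{
   int32_t low = (int32_t)(raw & 0xffffu);
   return (low ^ 0x8000) - 0x8000;
}
```

With the value from the commit message, sign_extend_16(65535) yields -1, the exponent that pairs with the mantissa 0.666504 to give 0.333252.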
* u_blitter: fix stencil blitting | Dave Airlie | 2020-04-15 | 1 | -1/+1
  Fixes: KHR-GL45.packed_depth_stencil.blit.depth32f_stencil8
  Reviewed-by: Roland Scheidegger <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4560>
* draw: fix user culling pipeline order. (v2) | Dave Airlie | 2020-04-15 | 8 | -130/+214
  GL spec requires user culling, then clipping, then face culling. llvmpipe was doing clipping, then user culling, then face culling.
  Fix the ordering by adding a new user_cull stage that does the user culling.
  Fixes piglit clip_cull-4.shader_test
  v2: simplify this a lot (Roland)
  Reviewed-by: Roland Scheidegger <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4560>
* draw/cull: run pipeline for culled points. | Dave Airlie | 2020-04-15 | 1 | -0/+3
  This just appears to be missing:
  Fixes: KHR-GL45.cull_distance.functional
  Reviewed-by: Roland Scheidegger <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4560>
* llvmpipe/setup: move line stats collection earlier. | Dave Airlie | 2020-04-15 | 1 | -4/+4
  You have to count the stats pre-culling here.
  Fixes: KHR-GL45.pipeline_statistics_query_tests_ARB.functional_primitives_vertices_submitted_and_clipping_input_output_primitives
  Reviewed-by: Roland Scheidegger <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4560>
* draw: fix tessellation stats query | Dave Airlie | 2020-04-15 | 1 | -2/+5
  Reviewed-by: Roland Scheidegger <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4560>
* llvmpipe: fix no tokens detections. | Dave Airlie | 2020-04-15 | 1 | -1/+4
  this only applies to the TGSI path, fixes KHR-GLES31.core.geometry_shader.api.program_pipeline_vs_gs_capture
  Reviewed-by: Roland Scheidegger <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4560>
* gallivm/draw: calloc prim id to avoid undef | Dave Airlie | 2020-04-15 | 1 | -1/+1
  Otherwise masked off channels can access random bad memory
  Reviewed-by: Roland Scheidegger <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4560>
* gallivm/nir: lower implicit lod to tex. | Dave Airlie | 2020-04-15 | 1 | -0/+3
  Fixes some sampling issues in vertex shaders
  Reviewed-by: Roland Scheidegger <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4560>
* gallivm: fix left over shader vote debug | Dave Airlie | 2020-04-15 | 1 | -1/+0
  Reviewed-by: Roland Scheidegger <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4560>
* llvmpipe/query: fix transform feedback overflow any queries. | Dave Airlie | 2020-04-15 | 2 | -32/+52
  The any queries need to signal if any stream has overflowed, so we have to track all the streams.
  Reviewed-by: Roland Scheidegger <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4560>
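The "any stream" rule from the commit above amounts to an OR over per-stream overflow flags. The sketch below is a loose illustration, not llvmpipe's query code: the struct, its fields, and the function names are hypothetical, and it assumes a stream counts as overflowed when more primitives were needed than were actually written.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical per-stream counters kept while a query is active. */
struct so_stream_stats {
   uint64_t primitives_needed;  /* primitives the stream tried to write */
   uint64_t primitives_written; /* primitives that fit in the buffers */
};

/* Assumption: a stream has overflowed when it needed more
 * primitives than it was able to write. */
static bool
stream_overflowed(const struct so_stream_stats *s)
{
   return s->primitives_needed > s->primitives_written;
}

/* The "any" query result: true if at least one stream overflowed,
 * which is why every stream has to be tracked. */
static bool
any_stream_overflowed(const struct so_stream_stats *streams, unsigned count)
{
   for (unsigned i = 0; i < count; i++) {
      if (stream_overflowed(&streams[i]))
         return true;
   }
   return false;
}
```

Tracking only stream 0 would miss an overflow that happens on any other stream, which is the failure mode the commit message describes.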
* llvmpipe: report tessellation shader statistics. | Dave Airlie | 2020-04-15 | 2 | -0/+8
  Fixes KHR-GL45.pipeline_statistics_query_tests_ARB.functional_tess_queries
  Reviewed-by: Roland Scheidegger <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4560>
* draw: collect tessellation invocations statistics | Dave Airlie | 2020-04-15 | 1 | -0/+6
  Reviewed-by: Roland Scheidegger <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4560>
* llvmpipe: fixup context leaks. | Dave Airlie | 2020-04-15 | 1 | -16/+13
  Make sure we unreference all resources for all shaders on context destruction.
  Fixes: eb5227173f03 (llvmpipe: add support for tessellation shaders)
  Reviewed-by: Roland Scheidegger <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4560>
* swr: Remove Byte Order Mark. | Vinson Lee | 2020-04-14 | 1 | -1/+1
  before:
    $ file src/gallium/drivers/swr/rasterizer/codegen/gen_llvm_types.py
    src/gallium/drivers/swr/rasterizer/codegen/gen_llvm_types.py: Python script text executable, UTF-8 Unicode (with BOM) text
  after:
    $ file src/gallium/drivers/swr/rasterizer/codegen/gen_llvm_types.py
    src/gallium/drivers/swr/rasterizer/codegen/gen_llvm_types.py: Python script text executable, ASCII text
  This patch also fixes this build error.
    File "src/gallium/drivers/swr/rasterizer/codegen/gen_llvm_types.py", line 1
      # Copyright (C) 2014-2018 Intel Corporation. All Rights Reserved.
      ^
    SyntaxError: invalid character in identifier
  Fixes: c6e67f5a9373 ("gallium/swr: add OpenSWR rasterizer")
  Signed-off-by: Vinson Lee <[email protected]>
  Reviewed-by: Jan Zielinski <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4221>
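What the file utility reports as "with BOM" above is the three-byte UTF-8 encoding of U+FEFF (EF BB BF) at the start of the file. A minimal detection sketch follows; the function name is hypothetical and it is unrelated to how the commit itself removed the bytes.

```c
#include <stddef.h>

/* Hypothetical helper: returns 3 if the buffer starts with the UTF-8
 * byte order mark (EF BB BF), otherwise 0. The return value is the
 * number of leading bytes a caller would skip. */
static size_t
utf8_bom_length(const unsigned char *buf, size_t len)
{
   if (len >= 3 && buf[0] == 0xEF && buf[1] == 0xBB && buf[2] == 0xBF)
      return 3;
   return 0;
}
```

A tool that wants plain ASCII or BOM-free UTF-8 input can advance its read pointer by the returned length before parsing.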
* glsl/list: Fix undefined behaviour of foreach_* macros | Danylo Piliaiev | 2020-04-14 | 3 | -60/+352
  These macros produced a lot of errors with ubsan, preventing us from expanding the ubsan coverage on CIs.
  The C++ spec has this clause:
    "If the prvalue of type "pointer to cv1 B" points to a B that is actually a subobject of an object of type D, the resulting pointer points to the enclosing object of type D. Otherwise, the result of the cast is undefined."
  Ubsan error example:
    ../src/compiler/glsl/builtin_functions.cpp:4945:4: runtime error: downcast of address 0x559b926abb50 which does not point to an object of type 'ir_instruction'
    0x559b926abb50: note: object has invalid vptr
     9b 55 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 58 ba 6a 92 9b 55 00 00 01 00 00 00
     ^~~~~~~~~~~~~~~~~~~~~~~ invalid vptr
    #0 0x559b914dbe1a in call ../src/compiler/glsl/builtin_functions.cpp:4945
  Signed-off-by: Danylo Piliaiev <[email protected]>
  Acked-by: Matt Turner <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4129>
* freedreno/ir3: don't overwrite wrmask in ir3_SAM | Jonathan Marek | 2020-04-14 | 1 | -2/+2
  Fixes (with other patches to allow these tests to run):
    dEQP-VK.ycbcr.query.size_lod.vertex.*
  Suggested-by: Rob Clark <[email protected]>
  Signed-off-by: Jonathan Marek <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4557>
* freedreno/ir3: fix emit_tex_info split_dest | Jonathan Marek | 2020-04-14 | 1 | -2/+1
  Fixes a "free(): invalid next size (fast)" error in:
    dEQP-VK.glsl.texture_functions.query.texturequerylevels.*
  Signed-off-by: Jonathan Marek <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4557>
* gallium/tgsi_to_nir: Set nir_intrinsic_align_mul to 16 and offset to 0 | Gert Wollny | 2020-04-14 | 1 | -0/+1
  Since the alignment is now checked in the validator we must set it.
  v2: Use alignment of 4, i.e. dest bit size by eight.
  v3: Use alignment 16 (Rhys Perry & Jason Ekstrand)
  v4: Use nir_intrinsic_set_align to make it clear that align offset is 0 (Jason)
  Fixes: e78a7a182524f091e2d77ba97bfbe057c3975cab nir: Assert memory loads are aligned
  Signed-off-by: Gert Wollny <[email protected]>
  Reviewed-by: Jason Ekstrand <[email protected]>
  Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4544>