summaryrefslogtreecommitdiff
path: root/src/gallium
AgeCommit message (Collapse)Author
2009-05-28nv50: use multiple constant buffersChristoph Bumiller
Use different buffers for immds, FP params, and VP params. One has to map constant buffer indices in shader code to buffers defined via CB_DEF. In principle, we could use more buffers so we'd have to change the shader code less frequently.
2009-05-28nv50: don't look for unfreed temps in free_nv50_pcChristoph Bumiller
Since we stopped using alloc_temp to get hw indices for FP attrs there shouldn't be any non-deallocated temps left.
2009-05-28nv50: release hw TEMPs earlyChristoph Bumiller
Since we know when we don't use a TEMP or FP ATTR register anymore, we can release their hw resources early.
2009-05-28nv50: allow immediates for MOV, ADD and MULChristoph Bumiller
Immediates are inlined now where possible, so we need to set pc->allow32 to FALSE in LIT where we have the conditional MOV, since immediates swallow the predicate bits.
2009-05-28nv50: enable half insns for MOV and MULChristoph Bumiller
2009-05-28nv50: make sure half-long insns are pairedChristoph Bumiller
I chose to just convert unpaired 32 bit length instructions after parsing all instructions, although it might be possible to determine beforehand whether there would be any lone ones, and then even do some swapping to bring them together ...
2009-05-28nv50: enable KIL in register 19a8Christoph Bumiller
2009-05-28nv50: don't overwrite sources before they're usedChristoph Bumiller
This would have happened in p.e. ADD TEMP[0], TEMP[0].xyxy, TEMP[1] or RCP/RSQ TEMP[i], TEMP[i].
2009-05-28nv50: put FP outputs where they belongChristoph Bumiller
Depth output in fragment programs should end up in the first register after the color outputs.
2009-05-28nv50: modified FP attribute loadingChristoph Bumiller
VP outputs that should be loadable in the FP are mapped to interpolant indices by HPOS, COL0 etc.; of course HPOS is always written, so the highest byte of 1988 is a bitmask that selects which components of HPOS are used for interpolants, i.e. the FP inputs in COL0 start at index POPCNT(1988[24:28]).
2009-05-28nv50: inspect decl semantic and interpolation modeChristoph Bumiller
Record interpolation mode for attributes while parsing declarations, and also remember the indices of FP color inputs and FP depth output, which has to end up in the highest output register.
2009-05-28nv50: record last access to temp and attr regsChristoph Bumiller
We now inspect the TGSI instructions in tx_prep to determine where temps and FP attrs are last accessed. This will enable us to reclaim some temporaries early and we also use it to omit pre-loading FP attributes that aren't used.
2009-05-28nv50: save some space in immediate bufferChristoph Bumiller
We could do even better (like just allocating 1 value in alloc_immd), but that's fine for now I guess.
2009-05-28nv50: fix SIGN_SET case in tgsi_srcChristoph Bumiller
2009-05-28nv50: set dst.z,w to 0,1 in SCS and XPDChristoph Bumiller
According to tgsi-instruction-set.txt, if they are written, z and w should be set to 0 and 1 respectively in SCS, and w to 1.0 in XPD.
2009-05-28nv50: make LRP instruction nicerChristoph Bumiller
2009-05-28nv50: fix some memory leaks in shader assemblerChristoph Bumiller
2009-05-28nouveau: explicitly request mappable buffers for the momentBen Skeggs
2009-05-27draw: Fix assertion failure at fetch_emit_prepareMike Kaplinksiy
2009-05-27softpipe: commentsBrian Paul
2009-05-27softpipe: include sp_winsys.h to silence warning (unprototyped function)Brian Paul
2009-05-27softpipe: fix flat shading provoking vertex for PIPE_PRIM_POLYGONBrian Paul
Use the first vertex, not the last.
2009-05-21cell: perform triangle cull a little earlierJonathan Adamczewski
In spu_tri.c:setup_sort_vertices() triangles are culled after the vertices are sorted. This patch moves the check a little earlier and performs the actual check a little faster through intrinsics and a little trickery. Reduced code size and less work is done before a triangle is deemed OK to skip.
2009-05-21cell: unroll inner loop of spu_render.c:cmd_render()Jonathan Adamczewski
It was taking approximately 50 cycles to extract the vertex indices, calculate the vertex_header pointers and call tri_draw() for each three vertices - . Unrolled, it takes less than 100 cycles to extract, unpack, calculate pointers and call tri_draw() eight times. It does have a nasty jump-tabled switch. I'm sure that there's a better way... Code size of spu_render.o gets larger due to the extra constants and work in the inner loop, there are extra stack saves and loads because there are more registers in use, and an assert. spu_tri.o gets a little smaller.
2009-05-20r300-gallium: r500-fs: POW.Corbin Simpson
I feel so unclean.
2009-05-20r300-gallium: r500-fs: LRP.Corbin Simpson
Goddammit. This cannot be the "easy way." :C
2009-05-20r300-gallium: r500-fs: Combine function.Corbin Simpson
2009-05-20r300-gallium: Prevent assert when fogcoords are present.Corbin Simpson
Seems like this file is the source of all bad logic. (Pun intended.)
2009-05-20r300-gallium: Another constantbuf shader recompile test.Corbin Simpson
Less briefly... Shaders need to be recompiled if their constantbuf offsets have changed. However, since we only change them from shaders if immediates need to be emitted, we shouldn't bother if the shader doesn't use immediates.
2009-05-20r300-gallium: Raise constantbuf limits.Corbin Simpson
Still not correct, but really I don't care.
2009-05-20r300-gallium: fs: Remove cruft from way back when.Corbin Simpson
2009-05-20radeon-gallium: Add surface_buffer_create callback.Corbin Simpson
2009-05-20r300-gallium: Make surface_copy actually load the texture in shader.Corbin Simpson
2009-05-20r300-gallium: Add missing R481 PCI ID.Corbin Simpson
Per 74cb2aba on xf86-video-ati.
2009-05-20r300-gallium: Make surface_copy work, and refactor buffer validation.Corbin Simpson
2009-05-20radeon-gallium: Don't permit reading and writing a BO in one CS.Corbin Simpson
This fixes some silent problems in current libdrm_radeon. surface_copy still locks up hard.
2009-05-18trace: Improve shader wrappingJakob Bornecrantz
2009-05-18st/dri: Only create new textures if drawable has changedJakob Bornecrantz
2009-05-18r300-gallium: Fix (another) wrong value in MSPOS.Corbin Simpson
Again, thanks to agd5f.
2009-05-18radeon-gallium: Remove BO validation debug.Corbin Simpson
It appears that that area of code "just works" much like classic Mesa's version, so might as well not waste scrollback on it.
2009-05-18r300-gallium: Cleanup viewport state setup.Corbin Simpson
2009-05-18r300-gallium: Always do VTE, never software viewport.Corbin Simpson
This makes glxgears draw properly with SW TCL.
2009-05-18Merge branch 'mesa_7_5_branch'Brian Paul
Conflicts: Makefile src/mesa/main/version.h
2009-05-18softpipe: add texture target sanity check assertionBrian Paul
2009-05-17r300-gallium: Enable GLSL for r500.Corbin Simpson
Before you get all excited, this is *not* to be construed as actual support for GLSL shaders. The GL version is still 1.3, and stuff still sucks. Just flicking it on so that it can be tested and developed a bit easier.
2009-05-17r300-gallium: r500-fs: DDX and DDY support.Corbin Simpson
Oh, look, GLSL instructions. I wonder what I'll do next.
2009-05-17dri-gallium: Add GLSL support.Corbin Simpson
Oh, look, it's more features. :3
2009-05-17r300-gallium: Add half-right COS and SIN.Corbin Simpson
HW trig does a premultiply by 2pi, where Mesa does another premultiply by pi. This is a problem.
2009-05-17r300-gallium: Size mismatch.Corbin Simpson
2009-05-17r300-gallium, radeon-gallium: Nuke gb_pipes from orbit.Corbin Simpson
See the previous commit for an explanation. This is just all the support code for GB_TILE_CONFIG.