path: root/src/gallium/auxiliary
Age | Commit message | Author
2008-10-01util: Fix util_fast_pow/exp2/log2.José Fonseca
- Use a lookup table for log2.
- Compute (float) (1 << ipart) by tweaking with the exponent directly to avoid integer overflow and float conversion.
- Also table negative exponents to avoid float division and branching.
- Implement util_fast_exp as function of util_fast_exp2.
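The exponent trick in the second bullet can be sketched as follows. This is a minimal illustration, not the code from the commit: the helper name `pow2_int` is hypothetical, and it assumes IEEE-754 single-precision floats and an `ipart` in the normal exponent range.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical helper, not from the Mesa tree.
 * Builds 2^ipart as a float by writing the biased exponent field directly,
 * instead of evaluating (float)(1 << ipart). This avoids the integer
 * overflow for ipart >= 31 and also handles negative ipart without a
 * division. Assumes IEEE-754 binary32 and -126 <= ipart <= 127. */
static float pow2_int(int ipart)
{
    uint32_t bits = (uint32_t)(ipart + 127) << 23; /* sign = 0, mantissa = 0 */
    float result;
    memcpy(&result, &bits, sizeof result);         /* reinterpret without aliasing issues */
    return result;
}
```

For example, `pow2_int(10)` yields 1024.0f and `pow2_int(-1)` yields 0.5f, with no integer shift or int-to-float conversion involved.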
2008-09-30tgsi: SSE2 optimized exp2, log2 and pow implementations.José Fonseca
Special care must be taken when calling compiler-generated SSE2 functions from the runtime-generated SSE2 code: the xmm registers must be saved, and gcc must be told that the stack is not 16-byte aligned. It would be more efficient to keep the stack pointer 16-byte aligned, but that is too hairy and not consistent across all x86 architectures. This has been tested in Linux x86 and Windows x86 userspace. Not tested on x86-64 because it is broken for other reasons (even without this change).
2008-09-30util: Header for SSE2 intrinsics portability.José Fonseca
2008-09-29rtasm: Implement immediate group 1 instructions. Fix SIB emission.José Fonseca
2008-09-26gallium: SPU register commentsBrian Paul
2008-09-26util: Update fast_log2 article url.José Fonseca
2008-09-24add cso_hash_contains() functionAlan Hourihane
2008-09-23CELL: improve legibility of CELL_DEBUG environment variable outputRobert Ellison
2008-09-19cell: use different opcodes for spe_move() depending on even/odd addressBrian Paul
2008-09-19gallium: added spe_code_size()Brian Paul
2008-09-19cell: change spe_complement() to take a src and dst reg, like other instructionsBrian Paul
2008-09-19Merge branch 'gallium-0.2' of ssh+git://git.freedesktop.org/git/mesa/mesa into gallium-0.2José Fonseca
2008-09-19util: Use OpenGL rasterization rules in blits and mipmap generation.José Fonseca
2008-09-19CELL: add codegen for logic op, color maskRobert Ellison
- rtasm_ppc_spe.c, rtasm_ppc_spe.h: added a new macro function "spe_load_uint" for loading and splatting unsigned integers in a register; it will use "ila" for values 18 bits or less, "ilh" for word values that are symmetric across halfwords, "ilhu" for values that have zeroes in their bottom halfwords, or "ilhu" followed by "iohl" for general 32-bit values. Of the 15 color masks of interest, 4 are 18 bits or less, 2 are symmetric across halfwords, 3 are zero in the bottom halfword, and 6 require two instructions to load.
- cell_gen_fragment.c: added full codegen for logic op and color mask.
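The instruction selection described for spe_load_uint can be sketched as a small classifier. This is an illustration only: the enum, `classify_load_uint`, and `count_color_mask_loads` are hypothetical names, no SPE instructions are emitted, and the set of "15 color masks of interest" is assumed here to be every per-channel 0x00/0xff byte combination except the all-channels-enabled mask, which is one reading that reproduces the counts quoted in the message.

```c
#include <stdint.h>

/* Hypothetical labels for the four load strategies named in the message. */
enum load_kind { LOAD_ILA, LOAD_ILH, LOAD_ILHU, LOAD_ILHU_IOHL };

/* Pick the cheapest strategy for a 32-bit value, in the order the
 * commit message lists them. */
static enum load_kind classify_load_uint(uint32_t value)
{
    uint32_t hi = value >> 16;
    uint32_t lo = value & 0xffffu;

    if (value < (1u << 18))
        return LOAD_ILA;        /* fits in 18 bits */
    if (hi == lo)
        return LOAD_ILH;        /* same halfword repeated */
    if (lo == 0)
        return LOAD_ILHU;       /* bottom halfword is zero */
    return LOAD_ILHU_IOHL;      /* general case: two instructions */
}

/* Tally strategies over the assumed set of 15 color masks:
 * each of 4 channels is 0x00 or 0xff, all-enabled (0xffffffff) excluded. */
static void count_color_mask_loads(unsigned counts[4])
{
    unsigned channels, c;

    for (c = 0; c < 4; c++)
        counts[c] = 0;

    for (channels = 0; channels < 15; channels++) {
        uint32_t mask = 0;
        for (c = 0; c < 4; c++) {
            if (channels & (1u << c))
                mask |= 0xffu << (8 * c);
        }
        counts[classify_load_uint(mask)]++;
    }
}
```

Under that assumption the tally comes out as 4, 2, 3 and 6 for the four strategies, matching the figures in the commit message.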
2008-09-18Merge commit 'origin/gallium-0.1' into gallium-0.2Keith Whitwell
Conflicts: src/mesa/shader/slang/slang_link.c
2008-09-18tgsi: Build tgsi_text with makeJakob Bornecrantz
2008-09-18util: A few more memory debugging checks.José Fonseca
2008-09-18util: Add missing p_debug.h include.José Fonseca
2008-09-18gallium: fix surface object memory leak in cso moduleBrian Paul
2008-09-18tgsi: Make tgsi dumps look more like mesa shader dumps.Jakob Bornecrantz
2008-09-18CELL: finish fragment ops blending (except for unusual D3D modes)Robert Ellison
- Added new "macro" functions spe_float_min() and spe_float_max() to rtasm_ppc_spe.{ch}. These emit instructions that cause the minimum or maximum of each element in a vector of floats to be saved in the destination register.
- Major changes to cell_gen_fragment.c to implement all the blending modes (except for the mysterious D3D-based PIPE_BLENDFACTOR_SRC1_COLOR, PIPE_BLENDFACTOR_SRC1_ALPHA, PIPE_BLENDFACTOR_INV_SRC1_COLOR, and PIPE_BLENDFACTOR_INV_SRC1_ALPHA).
- Some revamping of code in cell_gen_fragment.c: use the new spe_float_min() and spe_float_max() functions (instead of expanding these calculations inline via macros); create and use an inline utility function for handling "optional" register allocation (for the {1,1,1,1} vector, and the blend color vectors) instead of expanding with macros; use the Float Multiply and Subtract (fnms) instruction to simplify and optimize many blending calculations.
2008-09-18pipebuffer: New callback to flush all temporary-held buffers.José Fonseca
Used mostly to aid debugging memory issues or to clean up resources when the drivers are long lived.
2008-09-17gallium: fix wide point / point coord semantic info (generic, not fog)Brian Paul
2008-09-17gallium: fix tgsi sanity checker with respect to END.Brian Paul
Subroutine code may be found after the END instruction so it's not always the last instruction. At least check for presence of exactly one END instruction though.
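The relaxed check described above can be sketched like this. It is a minimal illustration over a made-up opcode enum; the names `example_opcode` and `has_exactly_one_end` are hypothetical and this is not the code from tgsi_sanity.c.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical opcode set for illustration only. */
enum example_opcode { EX_MOV, EX_CAL, EX_END, EX_BGNSUB, EX_RET, EX_ENDSUB };

/* Accept a program if it contains exactly one END instruction, wherever
 * it sits. END is not required to be last, because subroutine bodies
 * may follow it. */
static bool has_exactly_one_end(const enum example_opcode *insts, size_t count)
{
    size_t end_count = 0;
    size_t i;

    for (i = 0; i < count; i++) {
        if (insts[i] == EX_END)
            end_count++;
    }
    return end_count == 1;
}
```

A program of the form MOV, CAL, END, BGNSUB, RET, ENDSUB passes this check even though END is not the final instruction, while a program with no END or with two of them is rejected.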
2008-09-17gallium: fix lack of surface reference counting in cso_set/save/restore_framebuffer()Brian Paul
Fixes assorted problems with FBO / render to texture.
2008-09-16gallium: fix info entries for KIL, KILPBrian Paul
KIL takes 1 src register. KILP uses no registers (uses cond codes).
2008-09-16tgsi: Make tgsi_sanity.c compile with makeJakob Bornecrantz
2008-09-15gallium: emit SPU instructions in assembler-compatible syntaxBrian Paul
2008-09-15Fixed emit_RRRJonathan White
2008-09-14draw: Silence compiler warnings on Windows.Michal Krol
2008-09-13gallium: add another value check to util_fast_pow()Brian Paul
Fixes glitches seen in morph3d demo.
2008-09-13gallium: add another value check to util_fast_pow()Brian Paul
Fixes glitches seen in morph3d demo.
2008-09-12gallium: added print/dump code to SPE code emitterBrian Paul
2008-09-12gallium: added spe_splat_word()Brian Paul
2008-09-12gallium: avoid redundant tgsi_exec_machine_bind_shader() calls on draw exec pathBrian Paul
tgsi_exec_machine_bind_shader() isn't cheap so avoiding unnecessary calls is a big win. A similar change should be done for softpipe's fragment exec path but extra care needs to be taken with the texture sampler state/params.
2008-09-12gallium: use copy_token() function to avoid type punning/aliasing problemsBrian Paul
This fixes parsing errors seen with optimized builds on PPC (which led to crashes). The memcpy() is heavy-handed, but works. A lighter uint assignment could be used on x86...
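The aliasing problem and the memcpy() workaround can be sketched as follows. The function name copy_token() comes from the commit subject, but its body here, the `example_token` struct, and the 32-bit token size are assumptions made for illustration.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical 32-bit token made of bitfields, standing in for a real
 * token struct. The exact bit layout is compiler-defined. */
struct example_token {
    unsigned type : 4;
    unsigned size : 8;
    unsigned pad  : 20;
};

_Static_assert(sizeof(struct example_token) == sizeof(uint32_t),
               "sketch assumes the token occupies exactly one 32-bit word");

/* Copy one token-sized word between objects of unrelated types.
 * Writing *(uint32_t *)dst = *(const uint32_t *)src instead would break
 * strict-aliasing rules, which an optimizing compiler may exploit. */
static void copy_token(void *dst, const void *src)
{
    memcpy(dst, src, sizeof(uint32_t));
}
```

Compilers typically turn a fixed-size 4-byte memcpy() into a single load and store at higher optimization levels, so the cost of the "heavy-handed" version is often small in practice.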
2008-09-12gallium: use new compare32() function to fix warnings about type punning and aliasingBrian Paul
2008-09-12gallium: silence warningBrian Paul
2008-09-12gallium: minor optimization to spe_load_int()Brian Paul
2008-09-12gallium: initial PPC/Altivec codegenBrian Paul
2008-09-11gallium: add special cases in spe_load_float(), spe_load_int(), added spe_splat()Brian Paul
2008-09-11gallium: assorted additions and fixes to Cell SPE rtasm codeBrian Paul
Fix incorrect opcode for fsmbi. Added "macro" functions for loading floats/ints, register complement, zero, move. Added #defines for return address and stack pointer registers. Added assertions to check that the instruction buffer doesn't overflow.
2008-09-11gallium: comments, assertions, etcBrian Paul
2008-09-10tgsi: Fix newline pos.José Fonseca
2008-09-10tgsi: Verify constants are set before attempting to read them.José Fonseca
2008-09-09util: Ensure we always have a full qualified file name on windows display.José Fonseca
2008-09-09util: Enable u_stream_std.c for PIPE_SUBSYSTEM_WINDOWS_USER.Michal Krol
2008-09-08tgsi: Cleanup code.Michal Krol
2008-09-08util: Allow to define the maximum file size.José Fonseca
This avoids splitting the bitmaps in many files.
2008-09-08util: Dump surfaces to BMP.José Fonseca
This allows quick inspection of surfaces in mass scale.