mirrors/ryujinx - nin0git: A never-online, self-hosted Forgejo

mirrors/ryujinx

Author	SHA1	Message	Date
gdkchan	0bcbe32367	Only initialize shader outputs that are actually used on the next stage (#3054 ) * Only initialize shader outputs that are actually used on the next stage * Shader cache version bump	2022-03-06 20:42:13 +01:00
gdkchan	0a24aa6af2	Allow textures to have their data partially mapped (#2629 ) * Allow textures to have their data partially mapped * Explicitly check for invalid memory ranges on the MultiRangeList * Update GetWritableRegion to also support unmapped ranges	2022-02-22 13:34:16 -03:00
riperiperi	c9c65af59e	Perform unscaled 2d engine copy on CPU if source texture isn't in cache. (#3112 ) * Initial implementation of fast 2d copy TODO: Partial copy for mismatching region/size. * WIP * Cleanup * Update Ryujinx.Graphics.Gpu/Engine/Twod/TwodClass.cs Co-authored-by: gdkchan <gab.dark.100@gmail.com> Co-authored-by: gdkchan <gab.dark.100@gmail.com>	2022-02-22 11:21:29 -03:00
Berkan Diler	644b497df1	Collapse AsSpan().Slice(..) calls into AsSpan(..) (#3145 ) * Collapse AsSpan().Slice(..) calls into AsSpan(..) Less code and a bit faster * Collapse an Array.Clear(array, 0, array.Length) call to Array.Clear(array)	2022-02-22 10:32:10 -03:00
gdkchan	72e543e946	Prefer texture over textureSize for sampler type (#3132 ) * Prefer texture over textureSize for sampler type * Shader cache version bump	2022-02-18 02:44:46 +01:00
gdkchan	3bd357045f	Do not allow render targets not explicitly written by the fragment shader to be modified (#3063 ) * Do not allow render targets not explicitly written by the fragment shader to be modified * Shader cache version bump * Remove blank lines * Avoid redundant color mask updates * HostShaderCacheEntry can be null * Avoid more redundant glColorMask calls * nit: Mask -> Masks * Fix currentComponentMask * More efficient way to update _currentComponentMasks	2022-02-16 23:15:39 +01:00
gdkchan	7bfb5f79b8	When copying linear textures, DMA should ignore region X/Y (#3121 )	2022-02-16 11:13:45 +01:00
Berkan Diler	8f35345729	Use Enum and Delegate.CreateDelegate generic overloads (#3111 ) * Use Enum generic overloads * Remove EnumExtensions.cs * Use Delegate.CreateDelegate generic overloads	2022-02-13 10:50:07 -03:00
gdkchan	f861f0bca2	Fix missing geometry shader passthrough inputs (#3106 ) * Fix missing geometry shader passthrough inputs * Shader cache version bump	2022-02-11 19:52:20 +01:00
Mary	6dffe0fad4	misc: Make PID unsigned long instead of long (#3043 )	2022-02-09 17:18:07 -03:00
gdkchan	b944941733	Fix bug that could cause depth buffer to be missing after clear (#3067 )	2022-01-31 00:11:43 -03:00
riperiperi	c52158b733	Add timestamp to 16-byte/4-word semaphore releases. (#3049 ) * Add timestamp to 16-byte semaphore releases. BOTW was reading a ulong 8 bytes after a semaphore return. Turns out this is the timestamp it was trying to do performance calculation with, so I've made it write when necessary. This mode was also added to the DMA semaphore I added recently, as it is required by a few games. (i think quake?) The timestamp code has been moved to GPU context. Check other games with an unusually low framerate cap or dynamic resolution to see if they have improved. * Cast dma semaphore payload to ulong to fill the space * Write timestamp first Might be just worrying too much, but we don't want the applcation reading timestamp if it sees the payload before timestamp is written.	2022-01-27 22:50:32 +01:00
riperiperi	fd6d3ec88f	Fix res scale parameters not being updated in vertex shader (#3046 ) This fixes an issue where the render scale array would not be updated when technically the scales on the flat array were the same, but the start index for the vertex scales was different.	2022-01-27 14:17:13 -03:00
gdkchan	42c75dbb8f	Add support for BC1/2/3 decompression (for 3D textures) (#2987 ) * Add support for BC1/2/3 decompression (for 3D textures) * Optimize and clean up * Unsafe not needed here * Fix alpha value interpolation when a0 <= a1	2022-01-22 19:23:00 +01:00
gdkchan	7e967d796c	Stop using glTransformFeedbackVaryings and use explicit layout on the shader (#3012 ) * Stop using glTransformFeedbackVarying and use explicit layout on the shader * This is no longer needed * Shader cache version bump * Fix gl_PerVertex output for tessellation control shaders	2022-01-21 12:35:21 -03:00
gdkchan	0e59573f2b	Add capability for BGRA formats (#3011 )	2022-01-20 08:37:21 -03:00
gdkchan	fb853f13e9	Scale scissor used for clears (#3002 )	2022-01-16 20:23:00 -03:00
gdkchan	6e0799580f	Fix render target clear when sizes mismatch (#2994 )	2022-01-11 20:15:17 +01:00
riperiperi	ef24c8983d	Fix adjacent 3d texture slices being detected as Incompatible Overlaps (#2993 ) This fixes some regressions caused by #2971 which caused rendered 3D texture data to be lost for most slices. Fixes issues with Xenoblade 2's colour grading, probably a ton of other games. This also removes the check from TextureCache, making it the tiniest bit smaller (any win is a win here).	2022-01-11 09:37:40 +01:00
gdkchan	7f6b3d234a	Implement IMUL, PCNT and CONT shader instructions, fix FFMA32I and HFMA32I (#2972 ) * Implement IMUL shader instruction * Implement PCNT/CONT instruction and fix FFMA32I * Add HFMA232I to the table * Shader cache version bump * No Rc on Ffma32i	2022-01-10 12:08:00 -03:00
gdkchan	952c6e4d45	Fix sampled multisample image size (#2984 )	2022-01-10 08:45:25 +01:00
riperiperi	cda659955c	Texture Sync, incompatible overlap handling, data flush improvements. (#2971 ) * Initial test for texture sync * WIP new texture flushing setup * Improve rules for incompatible overlaps Fixes a lot of issues with Unreal Engine games. Still a few minor issues (some caused by dma fast path?) Needs docs and cleanup. * Cleanup, improvements Improve rules for fast DMA * Small tweak to group together flushes of overlapping handles. * Fixes, flush overlapping texture data for ASTC and BC4/5 compressed textures. Fixes the new Life is Strange game. * Flush overlaps before init data, fix 3d texture size/overlap stuff * Fix 3D Textures, faster single layer flush Note: nosy people can no longer merge this with Vulkan. (unless they are nosy enough to implement the new backend methods) * Remove unused method * Minor cleanup * More cleanup * Use the More Fun and Hopefully No Driver Bugs method for getting compressed tex too This one's for metro * Address feedback, ASTC+ETC to FormatClass * Change offset to use Span slice rather than IntPtr Add * Fix this too	2022-01-09 13:28:48 -03:00
riperiperi	79adba4402	Add support for render scale to vertex stage. (#2763 ) * Add support for render scale to vertex stage. Occasionally games read off textureSize on the vertex stage to inform the fragment shader what size a texture is without querying in there. Scales were not present in the vertex shader to correct the sizes, so games were providing the raw upscaled texture size to the fragment shader, which was incorrect. One downside is that the fragment and vertex support buffer description must be identical, so the full size scales array must be defined when used. I don't think this will have an impact though. Another is that the fragment texture count must be updated when vertex shader textures are used. I'd like to correct this so that the update is folded into the update for the scales. Also cleans up a bunch of things, like it making no sense to call CommitRenderScale for each stage. Fixes render scale causing a weird offset bloom in Super Mario Party and Clubhouse Games. Clubhouse Games still has a pixelated look in a number of its games due to something else it does in the shader. * Split out support buffer update, lazy updates. * Commit support buffer before compute dispatch * Remove unnecessary qualifier. * Address Feedback	2022-01-08 14:48:48 -03:00
gdkchan	15131d4350	Force crop when presentation cached texture size mismatches (#2957 )	2021-12-31 12:00:42 -03:00
gdkchan	c05c8e09d4	Add support for the R4G4 texture format (#2956 )	2021-12-30 17:10:54 +01:00
gdkchan	ef39b2ebdd	Flip scissor box when the YNegate bit is set (#2941 ) * Flip scissor box when the YNegate bit is set * Flip scissor based on screen scissor state, account for negative scissor Y * No need for abs when we already know the value is negative	2021-12-28 08:37:23 -03:00
gdkchan	a87f7f2029	Fix DMA copy fast path line size when xCount < stride (#2942 )	2021-12-26 13:05:26 -03:00
gdkchan	451673ada5	Fix I2M texture copies when line length is not a multiple of 4 (#2938 ) * Fix I2M texture copies when line length is not a multiple of 4 * Do not copy padding bytes for 1D copies * Nit	2021-12-26 12:39:07 -03:00
gdkchan	e7c2dc8ec3	Fix for texture pool not being updated when it should + buffer texture related fixes (#2911 )	2021-12-19 11:50:44 -03:00
riperiperi	521a07e612	Add support for releasing a semaphore to DmaClass (#2926 ) * Add support for releasing a semaphore to DmaClass Fixes freezes in OpenGL games, primarily GameMaker ones such as Undertale. * Address Feedback	2021-12-19 11:32:52 -03:00
gdkchan	119a3a1887	Fix SUATOM and other texture shader instructions with RZ dest (#2885 ) * Fix SUATOM and other texture shader instructions with RZ dest * Shader cache version bump	2021-12-08 18:36:09 -03:00
riperiperi	bc4e70b6fa	Move texture anisotropy check to SetInfo (#2843 ) Rather than calculating this for every sampler, this PR calculates if a texture can force anisotropy when its info is set, and exposes the value via a public boolean. This should help texture/sampler heavy games when anisotropic filtering is not Auto, like UE4 ones (or so i hear?). There is another cost where samplers are created twice when anisotropic filtering is enabled, but I'm not sure how relevant this one is.	2021-12-08 18:09:36 -03:00
gdkchan	650cc41c02	Implement remaining shader double-precision instructions (#2845 ) * Implement remaining shader double-precision instructions * Shader cache version bump	2021-12-08 17:54:12 -03:00
gdkchan	acc0b0f313	Fix FLO.SH shader instruction with a input of 0 (#2876 ) * Fix FLO.SH shader instruction with a input of 0 * Shader cache version bump	2021-12-05 13:25:05 +01:00
Mary	57d3296ba4	infra: Migrate to .NET 6 (#2829 ) * infra: Migrate to .NET 6 * Rollback version naming change * Workaround .NET 6 ZipArchive API issues * ci: Switch to VS 2022 for AppVeyor CI is now ready for .NET 6 * Suppress WebClient warning in DoUpdateWithMultipleThreads * Attempt to workaround System.Drawing.Common changes on 6.0.0 * Change keyboard rendering from System.Drawing to ImageSharp * Make the software keyboard renderer multithreaded * Bump ImageSharp version to 1.0.4 to fix a bug in Image.Load * Add fallback fonts to the keyboard renderer * Fix warnings * Address caian's comment * Clean up linux workaround as it's uneeded now * Update readme Co-authored-by: Caian Benedicto <caianbene@gmail.com>	2021-11-28 21:24:17 +01:00
gdkchan	30b7aaefca	Better depth range detection (#2754 ) * Better depth range detection * PR feedback * Move depth mode set out of the loop and to a separate method	2021-11-21 10:25:03 -03:00
riperiperi	788aec511f	Limit Custom Anisotropic Filtering to mipmapped textures with many levels (#2832 ) * Limit Custom Anisotropic Filtering to only fully mipmapped textures There's a major flaw with the anisotropic filtering setting that causes @GamerzHell9137 to report graphical bugs that otherwise wouldn't be there, because he just won't set it to Auto. This should fix those issues, hopefully. These bugs are generally because anisotropic filtering is enabled on something that it shouldn't be, such as a post process filter or some data texture. This PR maintains two host samplers when custom AF is enabled, and only uses the forced AF one when the texture is 2d and fully mipmapped (goes down to 1x1). This is because game textures are the ideal target for this filtering, and they are typically fully mipmapped, unlike things like screen render targets which usually have 1 or just a few levels. This also only enables AF on mipmapped samplers where the filtering is bilinear or trilinear. This should be self explanatory. This PR also allows the changing of Anisotropic Filtering at runtime, and you can immediately see the changes. All samplers are flushed from the cache if the setting changes, causing them to be recreated with the new custom AF value. This brings it in line with our resolution scale. 😌 * Expected minimum mip count for large textures rather than all, address feedback * Use Target rather than Info.Target * Retrigger build? * Fix rebase	2021-11-13 16:04:21 -03:00
gdkchan	611bec6e44	Implement DrawTexture functionality (#2747 ) * Implement DrawTexture functionality * Non-NVIDIA support * Disable some features that should not affect draw texture (slow path) * Remove space from shader source * Match 2D engine names * Fix resolution scale and add missing XML docs * Disable transform feedback for draw texture fallback	2021-11-10 15:37:49 -03:00
gdkchan	911ea38e93	Support shader gl_Color, gl_SecondaryColor and gl_TexCoord built-ins (#2817 ) * Support shader gl_Color, gl_SecondaryColor and gl_TexCoord built-ins * Shader cache version bump * Fix back color value on fragment shader * Disable IPA multiplication for fixed function attributes and back color selection	2021-11-08 13:18:46 -03:00
gdkchan	3dee712164	Fix bindless/global memory elimination with inverted predicates (#2826 ) * Fix bindless/global memory elimination with inverted predicates * Shader cache version bump	2021-11-08 12:57:28 -03:00
gdkchan	b7a1544e8b	Fix InvocationInfo on geometry shader and bindless default integer const (#2822 ) * Fix InvocationInfo on geometry shader and bindless default integer const * Shader cache version bump * Consistency for the default value	2021-11-08 11:39:30 -03:00
gdkchan	e48530e9d9	When waiting on CPU, do not return a time out error from EventWait (#2780 ) * When waiting on CPU, do not return a time out error from EventWait * And while I'm at it...	2021-11-01 19:10:02 -03:00
gdkchan	99445dd0a6	Add support for fragment shader interlock (#2768 ) * Support coherent images * Add support for fragment shader interlock * Change to tree based match approach * Refactor + check for branch targets and external registers * Make detection more robust * Use Intel fragment shader ordering if interlock is not available, use nothing if both are not available * Remove unused field	2021-10-28 19:53:12 -03:00
gdkchan	0d174cbd45	EventWait should not signal the event when it returns Success (#2739 ) * Fix race when EventWait is called and a wait is done on the CPU * This is useless now * Fix EventSignal * Ensure the signal belongs to the current fence, to avoid stale signals	2021-10-19 17:25:32 -03:00
gdkchan	63f1663fa9	Fix shader 8-bit and 16-bit STS/STG (#2741 ) * Fix 8 and 16-bit STG * Fix 8 and 16-bit STS * Shader cache version bump	2021-10-18 20:24:15 -03:00
riperiperi	052deebf26	Another workaround for NVIDIA driver 496.13 shader bug (#2750 ) * Another workaround for NVIDIA driver 496.13 shader bug This might work better than the other one. Give this a test to see if it fixes/doesn't fix issues with the other one. The problem seems to be when any variable assignment happens with a negation. `temp_1 = -temp_0;` seems to trigger weird behaviour, but `temp_1 = 0.0 - temp_0;` does not. This also might to extend towards integer types? * Update cache version * Add disclaimer comment * Wording	2021-10-18 20:04:06 -03:00
gdkchan	d512ce122c	Initial tessellation shader support (#2534 ) * Initial tessellation shader support * Nits * Re-arrange built-in table * This is not needed anymore * PR feedback	2021-10-18 18:38:04 -03:00
gdkchan	25fd4ef10e	Extend bindless elimination to work with masked and shifted handles (#2727 ) * Extent bindless elimination to work with masked handles * Extend bindless elimination to catch shifted pattern, refactor handle packing/unpacking	2021-10-17 17:28:18 -03:00
gdkchan	d05573bfd1	Implement SHF (funnel shift) shader instruction (#2702 ) * Implement SHF shader instruction * Shader cache version bump * Better name	2021-10-17 17:02:20 -03:00
gdkchan	464a92d8a7	Force index buffer update for games using Vulkan (#2726 )	2021-10-12 23:46:42 +02:00

1 2 3 4 5 ...