Merry 
							
						 
					 
					
						
						
						
						
							
						
						
							1770503185 
							
						 
					 
					
						
						
							
							xbyak: Update include path  
						
						
						
						
					 
					
						2021-08-15 19:26:38 +01:00 
						 
				 
			
				
					
						
							
							
								bunnei 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							87d63b858a 
							
						 
					 
					
						
						
							
							Merge pull request  #6861  from yzct12345/const-mempy-is-all-the-speed  
						
						... 
						
						
						
						decoders: Optimize memcpy for the other functions 
						
						
					 
					
						2021-08-15 02:38:12 -07:00 
						 
				 
			
				
					
						
							
							
								bunnei 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							0509fe3377 
							
						 
					 
					
						
						
							
							Merge pull request  #6838  from ameerj/sws-align  
						
						... 
						
						
						
						vic: Specify sws_scale height stride. 
						
						
					 
					
						2021-08-12 11:28:33 -07:00 
						 
				 
			
				
					
						
							
							
								ameerj 
							
						 
					 
					
						
						
						
						
							
						
						
							356e10898f 
							
						 
					 
					
						
						
							
							codec: Replace deprecated av_init_packet usage  
						
						
						
						
					 
					
						2021-08-12 01:28:01 -04:00 
						 
				 
			
				
					
						
							
							
								ameerj 
							
						 
					 
					
						
						
						
						
							
						
						
							659039ca6d 
							
						 
					 
					
						
						
							
							nvdec: Implement GPU accelerated decoding for all platforms  
						
						... 
						
						
						
						Supplements the VAAPI intel gpu decoder by implementing the D3D11VA decoder for Windows, and CUVID/VDPAU for Nvidia and AMD on drivers linux respectively. 
						
						
					 
					
						2021-08-12 01:28:01 -04:00 
						 
				 
			
				
					
						
							
							
								yzct12345 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							430255caf8 
							
						 
					 
					
						
						
							
							decoders: Templates allow memcpy optimizations  
						
						
						
						
					 
					
						2021-08-12 04:45:25 +00:00 
						 
				 
			
				
					
						
							
							
								Fernando S 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							6a082df427 
							
						 
					 
					
						
						
							
							Merge pull request  #6820  from yzct12345/split-cache  
						
						... 
						
						
						
						texture_cache: Split out template definitions 
						
						
					 
					
						2021-08-10 12:23:05 +02:00 
						 
				 
			
				
					
						
							
							
								ameerj 
							
						 
					 
					
						
						
						
						
							
						
						
							a779cede7c 
							
						 
					 
					
						
						
							
							vic: Specify sws_scale height stride.  
						
						... 
						
						
						
						Silences a sws_scale runtime warning about unaligned strides. 
						
						
					 
					
						2021-08-09 23:24:16 -04:00 
						 
				 
			
				
					
						
							
							
								Mai M 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							2da91ec75b 
							
						 
					 
					
						
						
							
							Merge pull request  #6844  from ameerj/vp9-empty-frame  
						
						... 
						
						
						
						vp9: Ensure the first frame is complete 
						
						
					 
					
						2021-08-08 19:02:39 -04:00 
						 
				 
			
				
					
						
							
							
								ameerj 
							
						 
					 
					
						
						
						
						
							
						
						
							fa22695705 
							
						 
					 
					
						
						
							
							vp9: Ensure the first frame is complete  
						
						... 
						
						
						
						Silences a runtime error due to the first frame missing the frame data, and being set to hidden despite being a key-frame. 
						
						
					 
					
						2021-08-08 13:49:00 -04:00 
						 
				 
			
				
					
						
							
							
								yzct12345 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							c4eafcc861 
							
						 
					 
					
						
						
							
							texture_cache: Address ameerj's review  
						
						
						
						
					 
					
						2021-08-08 11:02:51 +00:00 
						 
				 
			
				
					
						
							
							
								Fernando S 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							859deda3bb 
							
						 
					 
					
						
						
							
							Merge pull request  #6834  from K0bin/buffer-image-granularity  
						
						... 
						
						
						
						Respect Vulkan bufferImageGranularity 
						
						
					 
					
						2021-08-08 11:57:40 +02:00 
						 
				 
			
				
					
						
							
							
								bunnei 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							bd0e1d3a25 
							
						 
					 
					
						
						
							
							Merge pull request  #6830  from ameerj/nvdec-unimpld-codec  
						
						... 
						
						
						
						nvdec: Better logging for unimplemented codecs 
						
						
					 
					
						2021-08-07 12:37:39 -07:00 
						 
				 
			
				
					
						
							
							
								Robin Kertels 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							bb29dcb7f2 
							
						 
					 
					
						
						
							
							vulkan_memory_allocator: Respect bufferImageGranularity  
						
						
						
						
					 
					
						2021-08-07 15:28:05 +02:00 
						 
				 
			
				
					
						
							
							
								ameerj 
							
						 
					 
					
						
						
						
						
							
						
						
							928b64d2ce 
							
						 
					 
					
						
						
							
							nvdec: Better logging for unimplemented codecs  
						
						
						
						
					 
					
						2021-08-07 01:08:33 -04:00 
						 
				 
			
				
					
						
							
							
								bunnei 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							268b5764c7 
							
						 
					 
					
						
						
							
							Merge pull request  #6791  from ameerj/astc-opt  
						
						... 
						
						
						
						astc_decoder: Various performance and memory optimizations 
						
						
					 
					
						2021-08-06 21:45:24 -07:00 
						 
				 
			
				
					
						
							
							
								yzct12345 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							e80323b8b0 
							
						 
					 
					
						
						
							
							texture_cache: Address ameerj's review  
						
						
						
						
					 
					
						2021-08-07 01:27:47 +00:00 
						 
				 
			
				
					
						
							
							
								bunnei 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							f183668a87 
							
						 
					 
					
						
						
							
							Merge pull request  #6799  from ameerj/vp9-fixes  
						
						... 
						
						
						
						nvdec: Fix VP9 reference frame refreshes 
						
						
					 
					
						2021-08-06 17:46:46 -07:00 
						 
				 
			
				
					
						
							
							
								ameerj 
							
						 
					 
					
						
						
						
						
							
						
						
							e3688f0627 
							
						 
					 
					
						
						
							
							vp9: Cleanup unused variables  
						
						... 
						
						
						
						With reference frames refreshes fix, we no longer need to buffer two frames in advance.
We can also remove other unused or otherwise unneeded variables. 
						
						
					 
					
						2021-08-06 20:08:11 -04:00 
						 
				 
			
				
					
						
							
							
								ameerj 
							
						 
					 
					
						
						
						
						
							
						
						
							a3f80a97a3 
							
						 
					 
					
						
						
							
							vp9: Fix reference frame refreshes  
						
						... 
						
						
						
						This resolves the artifacting when decoding VP9 streams. 
						
						
					 
					
						2021-08-06 20:08:08 -04:00 
						 
				 
			
				
					
						
							
							
								yzct12345 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							02e98f6c93 
							
						 
					 
					
						
						
							
							texture_cache: Don't change copyright year  
						
						
						
						
					 
					
						2021-08-05 20:52:12 +00:00 
						 
				 
			
				
					
						
							
							
								yzct12345 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							5566f3dbc0 
							
						 
					 
					
						
						
							
							texture_cache: Address ameerj's review  
						
						
						
						
					 
					
						2021-08-05 20:46:24 +00:00 
						 
				 
			
				
					
						
							
							
								yzct12345 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							f9563c8f24 
							
						 
					 
					
						
						
							
							texture_cache: Split templates out  
						
						
						
						
					 
					
						2021-08-05 13:52:30 +00:00 
						 
				 
			
				
					
						
							
							
								yzct12345 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							2868d4ba84 
							
						 
					 
					
						
						
							
							nvdec: Implement VA-API hardware video acceleration ( #6713 )  
						
						... 
						
						
						
						* nvdec: VA-API
* Verify formatting
* Forgot a semicolon for Windows
* Clarify comment about AV_PIX_FMT_NV12
* Fix assert log spam from missing negation
* vic: Remove forgotten debug code
* Address lioncash's review
* Mention VA-API is Intel/AMD
* Address v1993's review
* Hopefully fix CMakeLists style this time
* vic: Improve cache locality
* vic: Fix off-by-one error
* codec: Async
* codec: Forgot the GetValue()
* nvdec: Address ameerj's review
* codec: Fallback to CPU without VA-API support
* cmake: Address lat9nq's review
* cmake: Make VA-API optional
* vaapi: Multiple GPU
* Apply suggestions from code review
Co-authored-by: Ameer J <52414509+ameerj@users.noreply.github.com>
* nvdec: Address ameerj's review
* codec: Use anonymous instead of static
* nvdec: Remove enum and fix memory leak
* nvdec: Address ameerj's review
* codec: Remove preparation for threading
Co-authored-by: Ameer J <52414509+ameerj@users.noreply.github.com> 
						
						
					 
					
						2021-08-03 23:43:11 -04:00 
						 
				 
			
				
					
						
							
							
								yzct12345 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							f56d0db5bd 
							
						 
					 
					
						
						
							
							decoders: Optimize swizzle copy performance ( #6790 )  
						
						... 
						
						
						
						This makes UnswizzleTexture up to two times faster. It is the main bottleneck in NVDEC video decoding. 
						
						
					 
					
						2021-08-02 11:18:58 -04:00 
						 
				 
			
				
					
						
							
							
								Fernando S 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							30f0b7cf31 
							
						 
					 
					
						
						
							
							Merge pull request  #6720  from ameerj/vk-screenshot  
						
						... 
						
						
						
						renderer_vulkan: Implement screenshots 
						
						
					 
					
						2021-08-01 13:31:33 +02:00 
						 
				 
			
				
					
						
							
							
								Ameer J 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							db32c3762b 
							
						 
					 
					
						
						
							
							Merge pull request  #6765  from ReinUsesLisp/y-negate-vk  
						
						... 
						
						
						
						vk_rasterizer: Flip viewport on Y_NEGATE 
						
						
					 
					
						2021-08-01 01:47:37 -04:00 
						 
				 
			
				
					
						
							
							
								ameerj 
							
						 
					 
					
						
						
						
						
							
						
						
							c439fc9be9 
							
						 
					 
					
						
						
							
							astc_decoder: Reduce workgroup size  
						
						... 
						
						
						
						This reduces the amount of over dispatching when there are odd dimensions (i.e. ASTC 8x5), which rarely evenly divide into 32x32. 
						
						
					 
					
						2021-08-01 01:22:27 -04:00 
						 
				 
			
				
					
						
							
							
								ameerj 
							
						 
					 
					
						
						
						
						
							
						
						
							5ab8053511 
							
						 
					 
					
						
						
							
							astc_decoder: Compute offset swizzles in-shader  
						
						... 
						
						
						
						Alleviates the dependency on the swizzle table and a uniform which is constant for all ASTC texture sizes. 
						
						
					 
					
						2021-08-01 01:22:26 -04:00 
						 
				 
			
				
					
						
							
							
								ameerj 
							
						 
					 
					
						
						
						
						
							
						
						
							b2862e4772 
							
						 
					 
					
						
						
							
							astc_decoder: Make use of uvec4 for payload data  
						
						
						
						
					 
					
						2021-07-31 22:28:04 -04:00 
						 
				 
			
				
					
						
							
							
								ameerj 
							
						 
					 
					
						
						
						
						
							
						
						
							a75d70fa90 
							
						 
					 
					
						
						
							
							astc_decoder: Simplify Select2DPartition  
						
						
						
						
					 
					
						2021-07-31 21:36:26 -04:00 
						 
				 
			
				
					
						
							
							
								ameerj 
							
						 
					 
					
						
						
						
						
							
						
						
							5665d05547 
							
						 
					 
					
						
						
							
							astc_decoder: Optimize the use EncodingData  
						
						... 
						
						
						
						This buffer was a list of EncodingData structures sorted by their bit length, with some duplication from the cpu decoder implementation.
We can take advantage of its sorted property to optimize its usage in the shader.
Thanks to wwylele for the optimization idea. 
						
						
					 
					
						2021-07-31 21:36:26 -04:00 
						 
				 
			
				
					
						
							
							
								ameerj 
							
						 
					 
					
						
						
						
						
							
						
						
							15c0c213b1 
							
						 
					 
					
						
						
							
							astc.h: Move data to cpp implementation  
						
						... 
						
						
						
						Moves leftover values that are no longer used by the gpu decoder back to the cpp implementation. 
						
						
					 
					
						2021-07-31 21:26:42 -04:00 
						 
				 
			
				
					
						
							
							
								bunnei 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							7530594602 
							
						 
					 
					
						
						
							
							Merge pull request  #6759  from ReinUsesLisp/pipeline-statistics  
						
						... 
						
						
						
						renderer_vulkan: Add setting to log pipeline statistics 
						
						
					 
					
						2021-07-30 11:18:52 -07:00 
						 
				 
			
				
					
						
							
							
								ReinUsesLisp 
							
						 
					 
					
						
						
						
						
							
						
						
							b185567a03 
							
						 
					 
					
						
						
							
							vk_rasterizer: Flip viewport on Y_NEGATE  
						
						... 
						
						
						
						Matches OpenGL's behavior. I don't believe this register flips geometry,
but we have to try to match behavior on both backends. 
						
						
					 
					
						2021-07-29 02:17:53 -03:00 
						 
				 
			
				
					
						
							
							
								ameerj 
							
						 
					 
					
						
						
						
						
							
						
						
							7ac99bb127 
							
						 
					 
					
						
						
							
							renderers: Add explicit invert_y bool to screenshot callback  
						
						... 
						
						
						
						OpenGL and Vulkan images render in different coordinate systems. This allows us to specify the coordinate system of the screenshot within each renderer 
						
						
					 
					
						2021-07-28 21:46:08 -04:00 
						 
				 
			
				
					
						
							
							
								ameerj 
							
						 
					 
					
						
						
						
						
							
						
						
							75e7f54fb0 
							
						 
					 
					
						
						
							
							renderer_vulkan: Implement screenshots  
						
						
						
						
					 
					
						2021-07-28 21:45:55 -04:00 
						 
				 
			
				
					
						
							
							
								ameerj 
							
						 
					 
					
						
						
						
						
							
						
						
							548bac8989 
							
						 
					 
					
						
						
							
							vk_blit_screen: Add public CreateFramebuffer method  
						
						
						
						
					 
					
						2021-07-28 21:43:02 -04:00 
						 
				 
			
				
					
						
							
							
								ameerj 
							
						 
					 
					
						
						
						
						
							
						
						
							1e6c5d323d 
							
						 
					 
					
						
						
							
							vk_blit_screen: Make Draw method more generic  
						
						... 
						
						
						
						Allows specifying the framebuffer and render area dimensions, rather than being hard coded for the render window. 
						
						
					 
					
						2021-07-28 21:37:30 -04:00 
						 
				 
			
				
					
						
							
							
								ReinUsesLisp 
							
						 
					 
					
						
						
						
						
							
						
						
							3b006f4fe2 
							
						 
					 
					
						
						
							
							renderer_vulkan: Add setting to log pipeline statistics  
						
						... 
						
						
						
						Use VK_KHR_pipeline_executable_properties when enabled and available to
log statistics about the pipeline cache in a game.
For example, this is on Turing GPUs when generating a pipeline cache
from Super Smash Bros. Ultimate:
Average pipeline statistics
==========================================
Code size:       6433.167
Register count:    32.939
More advanced results could be presented, at the moment it's just an
average of all 3D and compute pipelines. 
						
						
					 
					
						2021-07-27 21:29:24 -03:00 
						 
				 
			
				
					
						
							
							
								bunnei 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							92887a65f0 
							
						 
					 
					
						
						
							
							Merge pull request  #6749  from lioncash/rtarget  
						
						... 
						
						
						
						render_target: Add missing initializer for size extent 
						
						
					 
					
						2021-07-27 17:28:53 -07:00 
						 
				 
			
				
					
						
							
							
								Rodrigo Locatti 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							ab206d6378 
							
						 
					 
					
						
						
							
							Merge pull request  #6748  from lioncash/engine-init  
						
						... 
						
						
						
						video_core/engine: Consistently initialize rasterizer pointers 
						
						
					 
					
						2021-07-27 16:17:20 -03:00 
						 
				 
			
				
					
						
							
							
								bunnei 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							2717e79c74 
							
						 
					 
					
						
						
							
							Merge pull request  #6745  from lioncash/copies  
						
						... 
						
						
						
						video_core: Remove some unused variables 
						
						
					 
					
						2021-07-27 11:38:32 -07:00 
						 
				 
			
				
					
						
							
							
								Lioncash 
							
						 
					 
					
						
						
						
						
							
						
						
							00e100de08 
							
						 
					 
					
						
						
							
							render_target: Add missing initializer for size extent  
						
						... 
						
						
						
						Everything else has a default constructor that does the straightforward
thing of initializing most members to a default value, except for the
size.
We explicitly initialize the size (and others, for consistency), to
prevent potential uninitialized reads from occurring. Particularly given
the largeish surface area that this struct is used in. 
						
						
					 
					
						2021-07-27 07:41:21 -04:00 
						 
				 
			
				
					
						
							
							
								Lioncash 
							
						 
					 
					
						
						
						
						
							
						
						
							f8964dd89a 
							
						 
					 
					
						
						
							
							video_core/engine: Consistently initialize rasterizer pointers  
						
						... 
						
						
						
						Ensures all of the engines have consistent and deterministic
initialization of the rasterizer pointers. 
						
						
					 
					
						2021-07-27 07:30:57 -04:00 
						 
				 
			
				
					
						
							
							
								Lioncash 
							
						 
					 
					
						
						
						
						
							
						
						
							8c82c594f0 
							
						 
					 
					
						
						
							
							vulkan_wrapper: Fix SetObjectName() always indicating objects as images  
						
						... 
						
						
						
						We should be using the passed in object type instead. 
						
						
					 
					
						2021-07-27 07:19:15 -04:00 
						 
				 
			
				
					
						
							
							
								Lioncash 
							
						 
					 
					
						
						
						
						
							
						
						
							ec56a17acd 
							
						 
					 
					
						
						
							
							buffer_cache: Remove unused small_vector in CommitAsyncFlushesHigh()  
						
						... 
						
						
						
						Given this is non-trivial, the constructor is required to execute, so
this removes a bit of redundant codegen. 
						
						
					 
					
						2021-07-27 06:24:44 -04:00 
						 
				 
			
				
					
						
							
							
								Lioncash 
							
						 
					 
					
						
						
						
						
							
						
						
							075a744e38 
							
						 
					 
					
						
						
							
							gl_shader_cache: Remove unused variable  
						
						
						
						
					 
					
						2021-07-27 06:23:49 -04:00 
						 
				 
			
				
					
						
							
							
								Lioncash 
							
						 
					 
					
						
						
						
						
							
						
						
							296728ec46 
							
						 
					 
					
						
						
							
							vk_compute_pass: Remove unused captures  
						
						... 
						
						
						
						Resolves two compiler warnings. 
						
						
					 
					
						2021-07-27 06:17:52 -04:00 
						 
				 
			
				
					
						
							
							
								bunnei 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							d6c799494c 
							
						 
					 
					
						
						
							
							Merge pull request  #6696  from ameerj/speed-limit-rename  
						
						... 
						
						
						
						general: Rename "Frame Limit" references to "Speed Limit" 
						
						
					 
					
						2021-07-26 18:51:00 -07:00