Difference between revisions of "GPU Classes"

From Nintendo Switch Brew
Line 11:
 |-
 | 0xB0B5 || 4 || [[#MAXWELL_DMA_COPY_A|MAXWELL_DMA_COPY_A]]
+|-
+| 0xB06F || - || MAXWELL_CHANNEL_GPFIFO_A
 |}

Revision as of 19:49, 14 February 2022

Id Subchannel (nvn) Name
0xB197 0 MAXWELL_B
0xB1C0 1 MAXWELL_COMPUTE_B
0xA140 2 KEPLER_INLINE_TO_MEMORY_B
0x902D 3 FERMI_TWOD_A
0xB0B5 4 MAXWELL_DMA_COPY_A
0xB06F - MAXWELL_CHANNEL_GPFIFO_A
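
The class table above can be held as a small lookup structure. The following is a minimal Python sketch, not taken from any official source: the names `GPU_CLASSES` and `class_for_subchannel` are made up for illustration, and `None` stands in for the "-" subchannel entry.

```python
# Hypothetical lookup built from the class table above.
# Keys are class ids; values are (nvn subchannel or None, class name).
GPU_CLASSES = {
    0xB197: (0, "MAXWELL_B"),
    0xB1C0: (1, "MAXWELL_COMPUTE_B"),
    0xA140: (2, "KEPLER_INLINE_TO_MEMORY_B"),
    0x902D: (3, "FERMI_TWOD_A"),
    0xB0B5: (4, "MAXWELL_DMA_COPY_A"),
    0xB06F: (None, "MAXWELL_CHANNEL_GPFIFO_A"),  # listed with "-" as subchannel
}


def class_for_subchannel(subchannel):
    """Return (class_id, name) for an nvn subchannel, or None if unlisted."""
    if subchannel is None:
        return None
    for class_id, (sub, name) in GPU_CLASSES.items():
        if sub == subchannel:
            return class_id, name
    return None
```

For example, `class_for_subchannel(3)` yields the FERMI_TWOD_A entry, while a subchannel not listed in the table yields `None`.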

MAXWELL_B

Register Name Size Type Notes
0 SetObject 1 bitfield Bit0-15: ClassId, bit16-20: EngineId
0x40 NoOperation 1
0x41 SetNotifyA 1 gpuva
0x42 SetNotifyB 1 gpuva
0x43 Notify 1 0=WriteOnly, 1=WriteThenAwaken
0x44 WaitForIdle 1
0x45 LoadMmeInstructionRamPointer 1 uint
0x46 LoadMmeInstructionRam 1 pipe Writes to and increments MmeInstructionRamPointer by 1.
0x47 LoadMmeStartAddressRamPointer 1 uint
0x48 LoadMmeStartAddressRam 1 pipe
0x49 SetMmeShadowRamControl 1 uint 0=MethodTrack, 1=MethodTrackWithFilter, 2=MethodPassthrough, 3=MethodReplay. Used during ClearColor.
0x83 LineWidthSeparate 1 bool
0x84 ForceEarlyFragmentTests 1 bool
0x87 Barrier 1 bitfield
0xBA 1 SetRenderTargets writes 1/0 here depending on whether an optional buffer != NULL, and depending on its type.
0xBE 1 pipe SetRenderTargets writes here 16 times in a row, if an optional buffer != NULL and is a given type.
0xBF 1 bool SetRenderTargets writes 1 here after 0xBE has been written 16 times and 0xDC has been written.
0xC2 CacheSplit 1 enum
0xC8 TesselationMode 1 bitfield Bit0-1: PrimitiveType (0=Isolines, 1=Triangles, 2=Quads), bit4-5: Spacing (0=Equal, 1=FractionalOdd, 2=FractionalEven), bit8-9: WindingAndConnectednessFlags (0=CcwAndNotConnected, 1=IsolineConnected, 1=TriangleCw, 2=TriangleConnected, 3=TriangleConnectedCw)
0xC9 TesselationOuterLevels 4 float[4]
0xCD TesselationInnerLevels 2 float[2]
0xDC 1 bool SetRenderTargets writes 1 here, if an optional buffer != NULL and is a given type.
0xDF RasterizerEnable 1 bool
0xE0+8*N TransformFeedbackBufferEnable 1 bool n=0..3
0xE1+8*N TransformFeedbackBufferAddr 2 gpuva
0xE3+8*N TransformFeedbackBufferFlags 1
0x1C0+4*N TransformFeedbackBufferStream 1 bitfield
0x1C1+4*N TransformFeedbackVaryingCount 1 uint
0x1C2+4*N TransformFeedbackStride 1 uint
0x1D1 TransformFeedbackEnable 1 bool
0x1D3 1 GpuInit writes 0x3f here.
0x1D5 SampleShading 1 bitfield Bit0-3: ?, bit4: Enable
0x1DF LocalBase 1 uint TODO
0x1F0 ZCullWidth 1 SetRenderTargets optionally uses this.
0x1F1 ZCullHeight 1 SetRenderTargets optionally uses this.
0x1F2 1 SetRenderTargets optionally uses this.
0x1F3 1 SetRenderTargets optionally writes 0 here.
0x1F8 1 SetRenderTargets optionally uses this.
0x1F9 1 SetRenderTargets optionally uses this.
0x1FA ZCullContextStartAddr 2 gpuva
0x1FC ZCullContextEndAddr 2 gpuva
0x200+16*N RenderTargetNAddr 2 gpuva
0x202+16*N RenderTargetNHorizontal 1
0x203+16*N RenderTargetNVertical 1
0x204+16*N RenderTargetNFormat 1
0x205+16*N RenderTargetNTileMode 1 bitfield Bit0-3: Width, bit4-7: Height, bit8-10: Depth, bit12: Layout, bit16: ?
0x206+16*N RenderTargetNArrayMode 1 bitfield Bit0-15: Layers, bit16: Volume
0x207+16*N RenderTargetNLayerStride 1 uint In units of 4 bytes.
0x208+16*N RenderTargetNBaseLayer 1
0x280+8*N ViewportNScaleX 1 float n=0..15
0x281+8*N ViewportNScaleY 1 float
0x282+8*N ViewportNScaleZ 1 float
0x283+8*N ViewportNTranslateX 1 float
0x284+8*N ViewportNTranslateY 1 float
0x285+8*N ViewportNTranslateZ 1 float
0x286+8*N ViewportNSwizzles 1 bitfield
0x287+8*N ViewportNSubpixelPrecisionBias 1 bitfield Bit0-4: BiasX, bit8-11: BiasY
0x300+4*N ViewportNHorizontal 1 bitfield Bit0-15: X, bit16-31: Width
0x301+4*N ViewportNVertical 1 bitfield Bit0-15: Y, bit16-31: Height
0x302+4*N ViewportNDepthRangeNear 1 float
0x303+4*N ViewportNDepthRangeFar 1 float
0x340+2*N ClipRectNHorizontal 1 n=0..7, TODO
0x341+2*N ClipRectNVertical 1 TODO
0x359 CallLimit 1 uint 0=FullLimit, 1...8=?, 15=Unlimited
0x35A CounterEnable 1 bitfield Bit0: ?, bit1: ?, ... bit15: ?
0x35D VertexBufferFirst 1
0x35E VertexBufferCount 1
0x35F DepthClipAllowNegativeZ 1 bool
0x360 ClearColor 4 float
0x364 ClearDepth 1 float
0x368 ClearStencil 1
0x36B PolygonModeFront 1 bitfield 0x1B00/0x1B01/0x1B02
0x36C PolygonModeBack 1 bitfield 0x1B00/0x1B01/0x1B02
0x36D PolygonSmoothEnable 1 bool
0x36F ? 1 bitfield Bit0:?, bit16:?. Used by ClearDepthStencil.
0x370 PolygonOffsetPointEnable 1 bool
0x371 PolygonOffsetLineEnable 1 bool
0x372 PolygonOffsetFillEnable 1 bool
0x373 PatchSize 1 Small value, always fits in 12 bits. In number of vertices.
0x374 1 0 written here for "simple" BlendState.
0x379 WatchdogTimer 1
0x37A PrimitiveRestartWithDrawArrays 1 bool
0x37E WindowOffsetX 1 int
0x37F WindowOffsetY 1 int
0x380+4*N ScissorNEnable 1 bool n=0..15. GpuInit writes 1 here.
0x381+4*N ScissorNHorizontal 1 bitfield Bit0-15: min, bit16-31: max
0x382+4*N ScissorNVertical 1 bitfield Bit0-15: min, bit16-31: max
0x3D5 StencilBackRefValue 1
0x3D6 StencilBackEnable 1 bool
0x3D7 StencilBackValueMask 1
0x3D8 TiledCacheAction0 1 bool Trigger?
0x3D9 TiledCacheTileSize 1 bitfield Bit0-15: ?, bit16-31: ?
0x3DE DiscardTrigger 1 bitfield Bit4-6: DiscardColorIndex, bit0: DiscardDepthStencil
0x3E0 TiledCacheAction1 1 bool Trigger?
0x3E7 DepthBounds 2 float[2]
0x3ED MultisampleRasterEnable 1 bool Also written 0 when clearing all colors.
0x3EE MultisampleRasterSamples 1 bitfield 2=Four, 4=Eight, 5=?, 6=Sixteen
0x3EF MultisampleCoverageModulationMode 1
0x3F0 SampleMask0 1
0x3F1 SampleMask1 1
0x3F2 SampleMask2 1
0x3F5 Multisample_Related2 1 TODO
0x3F6 CoverageModulationEnable 1 bool
0x3F8 Unknown?Addr 2 gpuva SetRenderTargets writes the address of an optional buffer here.
0x3FA Unknown?TileMode 1 SetRenderTargets writes here optionally.
0x3FB Unknown?ArrayMode 1 SetRenderTargets writes here optionally.
0x3FC Unknown?BaseLayer 1 SetRenderTargets writes here optionally.
0x3FD 1 bitfield?[2] SetRenderTargets writes here
0x40C CoverageModulationTable 4 float[4]
0x433 1 GpuInit writes 4 here.
0x438 1 GpuInit writes 0xFF here.
0x439 1 GpuInit writes 0xFF here.
0x43B 1 GpuInit writes 0xFF here.
0x43C 1 GpuInit writes 4 here.
0x43D Unbind (???) 1 bitfield Bit0: InvalidateTextureHeaders, bit4: InvalidateTextureSamplers, bit8: InvalidateConstBuffers
0x43E ClearFlags? 1 GpuInit writes 0x101 here.
0x446 DrawElementsEnableBaseVertex? 1 bool TODO
0x44D TiledCacheAction2 1 bool Trigger?
0x452 RasterEnable 1 bool
0x458 VertexStreamNAttrib 4 bitfield[16] Bit0-4: StreamIndex, bit7-20: Format0, bit21-30: Format1
0x478 MultisampleGrid 3 bitfield[3] Bit0-3: x0, bit4-7: y0, bit8-11: x1, etc.
0x47C 1 trigger BindProgram writes 0 here sometimes.
0x47E MultisampleCoverageToColor 1 bitfield Bit0: Enable, bit4-6: ?
0x47F DepthBufferResolve 1 trigger 1 is written here to trigger.
0x487 RenderTargetControl 1 bitfield Bit0-3: NumberOfRenderTargets, bunch of other flags. Used by SetRenderTargets.
0x48A 1 Optionally used by SetRenderTargets.
0x48B 1 Optionally used by SetRenderTargets.
0x48C 1 Optionally used by SetRenderTargets.
0x48D SamplerBinding (???) 1 enum 0=Independently, 1=ViaHeaderBinding
0x4A2 InvalidateTextureDataNoWfi (???) 1 bitfield bit0: 0=AllLines, 1=OneLine, bit4-25: Tag
0x4AB ShaderScheduling (???) 1 enum 0=OldestThreadFirst, 1=RoundRobin
0x4B3 DepthTestEnable 1 bool Enables DepthWriteEnable and DepthFunc.
0x4B8 MultisampleAlphaToCoverageDither 1 bool
0x4B9 BlendIndependent 1 bool 1 written here for "simple" BlendState.
0x4BA DepthWriteEnable 1 bool
0x4BB AlphaTestEnable 1 bool
0x4C3 DepthFunc 1 bitfield Bit0-3: DepthFunc
0x4C4 AlphaTestRefValue 1 float
0x4C5 AlphaTestFunc 1 bitfield Bit0-3: AlphaTestFunc
0x4C7 BlendColor 4 float[4] TODO
0x4CC InvalidateSamplerCache (???) 1 bitfield bit0: 0=AllLines, 1=OneLine, bit4-25: Tag
0x4CD InvalidateTextureHeaderCache (???) 1 bitfield bit0: 0=AllLines, 1=OneLine, bit4-25: Tag
0x4CE InvalidateTextureDataCache (???) 1 bitfield bit0: 0=AllLines, 1=OneLine, bit4-25: Tag
0x4E0 StencilEnable 1 bool
0x4E5 StencilFrontRefValue 1
0x4E6 StencilFrontMaskValue 1
0x4E7 StencilFrontEnable 1 bool
0x4EC LineWidthSmooth 1 float
0x4ED LineWidthAliased 1 float
0x50D VertexBufferElementBase 1 uint
0x519 ZCullContextSave 1 trigger 0 is written here to trigger ctx-save, uses ZCullContextStartAddr.
0x51F PolygonOffsetClamp0 1 float
0x540 ZCullContextRestore 1 trigger 0 is written here to trigger ctx-restore, uses ZCullContextStartAddr.
0x546 PointSize 1 float
0x54C CounterReset 1 Value written decides which counter to reset.
0x54D MultisampleEnable 1
0x54E 1 bool SetRenderTargets writes 0 here if an optional buffer is NULL.
0x54F MultisampleControl 1 bool bit0: AlphaToCoverageEnable, bit1+: ?
0x554 RenderEnableOffset 2 gpuva
0x556 RenderEnableMode 1 enum 0=False, 1=True, 2=Conditional, 3=RenderIfEqual, 4=RenderIfNotEqual
0x557 TexSamplerPoolOffset (???) 2 gpuva
0x558 TexSamplerPoolMaximumIndex (???) 1 bitfield Bit0-19: Maximum
0x55B PolygonOffsetFactor 1 float
0x55D TexHeaderPoolOffset (???) 2 gpuva
0x55F TexHeaderPoolMaximumIndex (???) 1 bitfield Bit0-21: Maximum
0x56D CsaaEnable 1 bool
0x56F PolygonOffsetOffset 1 float Float multiplied by 2 is written here.
0x574 MultisampleMode 1 Written by SetRenderTargets, possible values: 0, 2, 4, 5, 6.
0x57F 1 SetRenderTargets optionally writes 0 here.
0x580 1 SetRenderTargets optionally writes 0 here.
0x582 ProgramRegion (???) 2 gpuva
0x591 PrimitiveRestartEnable 1 bool
0x592 PrimitiveRestartIndex 1 uint
0x599 CubeMapInterFaceFiltering (???) 1 bitfield Bit0-1: Mode (0=UseWrap, 1=OverrideWrap, 2=AutoSpanSeam, 3=AutoCrossSeam)
0x5A4 ShaderControl (???) 1 bitfield Bit0: Partial (0=Zero, 1=Infinity), bit1: Fp32NanBehavior (0=Legacy, 1=Fp64Compatible), bit2: Fp32F21NanBehavior (0=PassZero, 1=PassIndefinite), bit16: ZeroTimesAnythingIsZero
0x5A5 BindConstantBuffer (???) 1 bitfield Bit0: Valid, bit8-12: ShaderSlot
0x5A6 InvalidateShaderCachesNoWfi (???) 1 bitfield Bit0: Instruction, bit4: GlobalData, bit8: Uniform, bit12: Constant
0x5F2 DrawElementsIndirectAddr 2 gpuva TODO
0x5F6 DrawElementsIndirectFlag? 1 bool TODO
0x620+N VertexStreamNEnableDivisor 1 bool n=0..15.
0x646 PolygonCullFaceEnable 1 bool
0x647 PolygonFrontFace 1 bitfield Bit0: Enable. Always ORed with 0x9000.
0x648 PolygonCullFaceConfig 1 bitfield TODO: 0x404/0x405/0x408
0x64C InvalidateConstantBufferCache (???) 1 bitfield Bit0: ThruL2
0x64F DepthClamp 1 TODO: 0x101A is written when enabled, 0x181D when disabled.
0x651 RenderEnableOverride 1 bitfield Bit0-1: 0=UseRenderEnable, 1=AlwaysRender, 2=NeverRender
0x66F DepthBoundsEnable 1 bool
0x671 ColorLogicOpEnable 1 bool Used for all LogicOps except 3.
0x672 ColorLogicOpType 1 bitfield Bit0-7: LogicOp, bit8-15: unknown, always 0x15.
0x689 SetSpare (???) 1 uint[4]
0x68B PipeNop 1 trigger 0 is always written here. Written during zcull ctx-save, spammed when enabling raster, ...
0x6C0 ReportSemaphoreOffset 2 gpuva
0x6C2 ReportSemaphorePayload 1 0 is written here during most queries.
0x6C3 ReportSemaphoreControl 1 bitfield Bit0-1: Operation (0=Release, 3=Trap), bit2: FlushDisable, bit20: AwakenEnable, bit28: StructureSize (0=FourWords, 1=OneWord)
0x700+4*N VertexStreamNStride 1 uint Bit0-11: Stride. n=0..31. TODO: According to nouveau, this register has additional fields.
0x701+4*N VertexBufferStartAddr 2 gpuva TODO: Incorrect
0x703+4*N VertexStreamNDivisor
0x781+8*N BlendNRgbEquation 1 bitfield Bit0-2: BlendEquation
0x782+8*N BlendNRgbFunctionSrc 1 bitfield Bit0-4: ?, bit14-15: ?
0x783+8*N BlendNRgbFunctionDst 1 bitfield Bit0-4: ?, bit14-15: ?
0x784+8*N BlendNAlphaEquation 1 bitfield Bit0-2: BlendEquation
0x785+8*N BlendNAlphaFunctionSrc 1 bitfield Bit0-4: ?, bit14-15: ?
0x786+8*N BlendNAlphaFunctionDst 1 bitfield Bit0-4: ?, bit14-15: ?
0x7C0+2*N VertexBufferEndAddr 2 gpuva
0x820 1 Select tessellation control program.
0x821 1 Specify tessellation control program entry point.
0x830 1 Select tessellation evaluation program.
0x840 1 Select geometry program.
0x850 1 Select fragment program.
0x8C0+N FirmwareCallN 8 uint Invokes GPU microcode firmware blob.
0x8E0 ConstantBufferSelectorSize 1 uint Bit0-16: Size
0x8E1 ConstantBufferSelectorAddr 2 gpuva
0x8E3 LoadConstantBufferOffset 1 uint Bit0-15: Offset. BindImage writes "8*i + 0x120" here. BindSeparateSampler writes "8*i + 0x568" here. BindSeparateTexture uses "8*i + 0x168". BindTexture uses "8*i + 32". TODO: BindStorageBuffer, UpdateUniformBuffer, etc.
0x8E4 LoadConstantBuffer 1 uint[16]
0x904+8*N ConstBufferNBind 1 bitfield n=0..4. Bit0: Valid, bit4-8: Index
0xD00 SetMmeShadowScratch 128 uint[0x80] Mostly general purpose, sometimes accessed by FirmwareCall.
0xE00+N*2 CallMmeMacroN 1 uint n=0..0x7f. Writing here calls a bound macro with value written put in r1.
0xE01+N*2 CallMmeDataN 1 pipe Used to send variable number of arguments to macro via the "parm" instruction.
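
Several rows above give a register as base+stride*N, and many describe a packed bitfield. A minimal Python sketch of that arithmetic follows. The helper names are hypothetical, and the bit positions are copied from the Notes column (SetObject, ScissorNHorizontal/Vertical, ReportSemaphoreControl) rather than verified against hardware.

```python
def indexed_register(base, stride, n, count):
    """Register index for rows written as base+stride*N, with n in 0..count-1."""
    if not 0 <= n < count:
        raise ValueError("index out of range")
    return base + stride * n


def pack_set_object(class_id, engine_id):
    """SetObject: bit0-15 ClassId, bit16-20 EngineId."""
    if not 0 <= class_id <= 0xFFFF or not 0 <= engine_id <= 0x1F:
        raise ValueError("field out of range")
    return class_id | (engine_id << 16)


def pack_min_max(lo, hi):
    """ScissorNHorizontal/ScissorNVertical: bit0-15 min, bit16-31 max."""
    if not 0 <= lo <= 0xFFFF or not 0 <= hi <= 0xFFFF:
        raise ValueError("field out of range")
    return lo | (hi << 16)


def pack_report_semaphore_control(operation, flush_disable=False,
                                  awaken_enable=False, one_word=False):
    """ReportSemaphoreControl: bit0-1 Operation, bit2 FlushDisable,
    bit20 AwakenEnable, bit28 StructureSize (0=FourWords, 1=OneWord)."""
    if not 0 <= operation <= 3:
        raise ValueError("operation out of range")
    return (operation
            | (int(flush_disable) << 2)
            | (int(awaken_enable) << 20)
            | (int(one_word) << 28))
```

With these helpers, `indexed_register(0x381, 4, 2, 16)` gives the ScissorNHorizontal register for n=2, and `pack_min_max(0, 0x500)` gives the matching value for a min of 0 and a max of 0x500.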

TODO: (Tiled)Downsample, Copy*, Dispatch*, DrawTexture/DrawTransformFeedback, BindImages, BindProgram, BindSeparateSamplers, BindSeparateTextures, BindTextures.

MAXWELL_COMPUTE_B

Register Name Size Type Notes
0 SetObject 1 bitfield Bit0-15: ClassId, bit16-20: EngineId
0x40 NoOperation 1
0x41 SetNotifyA 1 gpuva
0x42 SetNotifyB 1 gpuva
0x43 Notify 1 0=WriteOnly, 1=WriteThenAwaken
0x44 WaitForIdle 1
0x4C SetGlobalRenderEnableA 1
0x4D SetGlobalRenderEnableB 1
0x4E SetGlobalRenderEnableC 1
0x4F SendGoIdle 1
0x50 PmTrigger 1
0x51 PmTriggerWfi 1
0x54 SetInstrumentationMethodHeader 1
0x55 SetInstrumentationMethodData 1
0x60 LineLengthIn 1
0x61 LineCount 1
0x62 OffsetOutUpper 1
0x63 OffsetOut 1
0x64 PitchOut 1
0x65 SetDstBlockSize 1
0x66 SetDstWidth 1
0x67 SetDstHeight 1
0x68 SetDstDepth 1
0x69 SetDstLayer 1
0x6A SetDstOriginBytesX 1
0x6B SetDstOriginSamplesY 1
0x6C LaunchDma 1
0x6D LoadInlineData 1
0x77 SetI2mSemaphoreA 1
0x78 SetI2mSemaphoreB 1
0x79 SetI2mSemaphoreC 1
0x7C SetI2mSpareNoop00 1
0x7D SetI2mSpareNoop01 1
0x7E SetI2mSpareNoop02 1
0x7F SetI2mSpareNoop03 1
0x80 SetValidSpanOverflowAreaA 1
0x81 SetValidSpanOverflowAreaB 1
0x82 SetValidSpanOverflowAreaC 1
0x83 SetCoalesceWaitingPeriodUnit 1
0x84 PerfmonTransfer 1
0x85 SetShaderSharedMemoryWindow 1
0x86 SetSelectMaxwellTextureHeaders 1
0x87 InvalidateShaderCaches 1
0x88 SetReservedSwMethod00 1
0x89 SetReservedSwMethod01 1
0x8A SetReservedSwMethod02 1
0x8B SetReservedSwMethod03 1
0x8C SetReservedSwMethod04 1
0x8D SetReservedSwMethod05 1
0x8E SetReservedSwMethod06 1
0x8F SetReservedSwMethod07 1
0x90 SetCwdControl 1
0x91 InvalidateTextureHeaderCacheNoWfi 1
0x92 SetCwdRefCounter 1
0x93 SetReservedSwMethod08 1
0x94 SetReservedSwMethod09 1
0x95 SetReservedSwMethod10 1
0x96 SetReservedSwMethod11 1
0x97 SetReservedSwMethod12 1
0x98 SetReservedSwMethod13 1
0x99 SetReservedSwMethod14 1
0x9A SetReservedSwMethod15 1
0x9B SetGwcScgType 1
0x9C SetScgControl 1
0x9D InvalidateConstantBufferCacheA 1
0x9E InvalidateConstantBufferCacheB 1
0x9F InvalidateConstantBufferCacheC 1
0xA0 SetComputeClassVersion 1
0xA1 CheckComputeClassVersion 1
0xA2 SetQmdVersion 1
0xA3 SetWfiConfig 1
0xA4 CheckQmdVersion 1
0xA5 WaitForIdleScgType 1
0xA6 InvalidateSkedCaches 1
0xA7 SetScgRenderEnableControl 1
0xAC SetCwdSlotCount 1
0xAD SendPcasA 1
0xAE SendPcasB 1
0xAF SendSignalingPcasB 1
0xB9 SetShaderLocalMemoryNonThrottledA 1
0xBA SetShaderLocalMemoryNonThrottledB 1
0xBB SetShaderLocalMemoryNonThrottledC 1
0xBC SetShaderLocalMemoryThrottledA 1
0xBD SetShaderLocalMemoryThrottledB 1
0xBE SetShaderLocalMemoryThrottledC 1
0xC4 SetSpaVersion 1
0x140 SetFalcon00 1
0x141 SetFalcon01 1
0x142 SetFalcon02 1
0x143 SetFalcon03 1
0x144 SetFalcon04 1
0x145 SetFalcon05 1
0x146 SetFalcon06 1
0x147 SetFalcon07 1
0x148 SetFalcon08 1
0x149 SetFalcon09 1
0x14A SetFalcon10 1
0x14B SetFalcon11 1
0x14C SetFalcon12 1
0x14D SetFalcon13 1
0x14E SetFalcon14 1
0x14F SetFalcon15 1
0x150 SetFalcon16 1
0x151 SetFalcon17 1
0x152 SetFalcon18 1
0x153 SetFalcon19 1
0x154 SetFalcon20 1
0x155 SetFalcon21 1
0x156 SetFalcon22 1
0x157 SetFalcon23 1
0x158 SetFalcon24 1
0x159 SetFalcon25 1
0x15A SetFalcon26 1
0x15B SetFalcon27 1
0x15C SetFalcon28 1
0x15D SetFalcon29 1
0x15E SetFalcon30 1
0x15F SetFalcon31 1
0x1DF SetShaderLocalMemoryWindow 1
0x1E4 SetShaderLocalMemoryA 1
0x1E8 SetShaderLocalMemoryB 1
0x365 SetShaderCacheControl 1
0x379 SetSmTimeoutInterval 1
0x3D1 SetSpareNoop12 1
0x3D2 SetSpareNoop13 1
0x3D3 SetSpareNoop14 1
0x3D4 SetSpareNoop15 1
0x410 SetSpareNoop00 1
0x411 SetSpareNoop01 1
0x412 SetSpareNoop02 1
0x413 SetSpareNoop03 1
0x414 SetSpareNoop04 1
0x415 SetSpareNoop05 1
0x416 SetSpareNoop06 1
0x417 SetSpareNoop07 1
0x418 SetSpareNoop08 1
0x419 SetSpareNoop09 1
0x41A SetSpareNoop10 1
0x41B SetSpareNoop11 1
0x483 InvalidateSamplerCacheAll 1
0x484 InvalidateTextureHeaderCacheAll 1
0x4A2 InvalidateTextureDataCacheNoWfi 1
0x4AA ActivatePerfSettingsForComputeContext 1
0x4CC InvalidateSamplerCache 1
0x4CD InvalidateTextureHeaderCache 1
0x4CE InvalidateTextureDataCache 1
0x509 InvalidateSamplerCacheNoWfi 1
0x54A SetShaderExceptions 1
0x554 SetRenderEnableA 1
0x555 SetRenderEnableB 1
0x556 SetRenderEnableC 1
0x557 SetTexSamplerPoolA 1
0x558 SetTexSamplerPoolB 1
0x559 SetTexSamplerPoolC 1
0x55D SetTexHeaderPoolA 1
0x55E SetTexHeaderPoolB 1
0x55F SetTexHeaderPoolC 1
0x582 SetProgramRegionA 1
0x583 SetProgramRegionB 1
0x5A6 InvalidateShaderCachesNoWfi 1
0x651 SetRenderEnableOverride 1
0x68B PipeNop 1
0x68C SetSpare00 1
0x68D SetSpare01 1
0x68E SetSpare02 1
0x68F SetSpare03 1
0x6C0 SetReportSemaphoreA 1
0x6C1 SetReportSemaphoreB 1
0x6C2 SetReportSemaphoreC 1
0x6C3 SetReportSemaphoreD 1
0x982 SetBindlessTexture 1
0x983 SetTrapHandler 1
0xCCF SetShaderPerformanceCounterValueUpper 8
0xCD7 SetShaderPerformanceCounterValue 8
0xCDF SetShaderPerformanceCounterEvent 8
0xCE7 SetShaderPerformanceCounterControlA 8
0xCEF SetShaderPerformanceCounterControlB 8
0xCF7 SetShaderPerformanceCounterTrapControl 1
0xCF8 StartShaderPerformanceCounter 1
0xCF9 StopShaderPerformanceCounter 1
0xD00 SetMmeShadowScratch 8

FERMI_TWOD_A

Register Name Size Type Notes
0 SetObject 1 bitfield Bit0-15: ClassId, bit16-20: EngineId
0x40 NoOperation 1
0x41 SetNotifyA 1 gpuva
0x42 SetNotifyB 1 gpuva
0x43 Notify 1 0=WriteOnly, 1=WriteThenAwaken
0x44 WaitForIdle 1
0x45 LoadMmeInstructionRamPointer 1 uint
0x46 LoadMmeInstructionRam 1 pipe Writes to and increments MmeInstructionRamPointer by 1.
0x47 LoadMmeStartAddressRamPointer 1 uint
0x48 LoadMmeStartAddressRam 1 pipe
0x49 SetMmeShadowRamControl 1 uint 0=MethodTrack, 1=MethodTrackWithFilter, 2=MethodPassthrough, 3=MethodReplay. Used during ClearColor.
0x4C SetGlobalRenderEnableA 1
0x4D SetGlobalRenderEnableB 1
0x4E SetGlobalRenderEnableC 1
0x4F SendGoIdle 1
0x50 PmTrigger 1
0x54 SetInstrumentationMethodHeader 1
0x55 SetInstrumentationMethodData 1
0x7B SetMmeSwitchState 1
0x80 SetDstFormat 1
0x81 SetDstMemoryLayout 1
0x82 SetDstBlockSize 1
0x83 SetDstDepth 1
0x84 SetDstLayer 1
0x85 SetDstPitch 1
0x86 SetDstWidth 1
0x87 SetDstHeight 1
0x88 SetDstOffsetUpper 1
0x89 SetDstOffsetLower 1
0x8A FlushAndInvalidateRopMiniCache 1
0x8B SetSpareNoop06 1
0x8C SetSrcFormat 1
0x8D SetSrcMemoryLayout 1
0x8E SetSrcBlockSize 1
0x8F SetSrcDepth 1
0x90 TwodInvalidateTextureDataCache 1
0x91 SetSrcPitch 1
0x92 SetSrcWidth 1
0x93 SetSrcHeight 1
0x94 SetSrcOffsetUpper 1
0x95 SetSrcOffsetLower 1
0x96 SetPixelsFromMemorySectorPromotion 1
0x97 SetSpareNoop12 1
0x98 SetNumProcessingClusters 1
0x99 SetRenderEnableA 1
0x9A SetRenderEnableB 1
0x9B SetRenderEnableC 1
0x9C SetSpareNoop08 1
0x9D SetSpareNoop01 1
0x9E SetSpareNoop11 1
0x9F SetSpareNoop07 1
0xA0 SetClipX0 1
0xA1 SetClipY0 1
0xA2 SetClipWidth 1
0xA3 SetClipHeight 1
0xA4 SetClipEnable 1
0xA5 SetColorKeyFormat 1
0xA6 SetColorKey 1
0xA7 SetColorKeyEnable 1
0xA8 SetRop 1
0xA9 SetBeta1 1
0xAA SetBeta4 1
0xAB SetOperation 1
0xAC SetPatternOffset 1
0xAD SetPatternSelect 1
0xAE SetDstColorRenderToZetaSurface 1
0xAF SetSpareNoop04 1
0xB0 SetSpareNoop15 1
0xB1 SetSpareNoop13 1
0xB2 SetSpareNoop03 1
0xB3 SetSpareNoop14 1
0xB4 SetSpareNoop02 1
0xB5 SetCompression 1
0xB6 SetSpareNoop09 1
0xB7 SetRenderEnableOverride 1
0xB8 SetPixelsFromMemoryDirection 1
0xB9 SetSpareNoop10 1
0xBA SetMonochromePatternColorFormat 1
0xBB SetMonochromePatternFormat 1
0xBC SetMonochromePatternColor0 1
0xBD SetMonochromePatternColor1 1
0xBE SetMonochromePattern0 1
0xBF SetMonochromePattern1 1
0xC0 ColorPatternX8R8G8B8 64
0x100 ColorPatternR5G6B5 32
0x120 ColorPatternX1R5G5B5 32
0x140 ColorPatternY8 16
0x150 SetRenderSolidPrimColor0 1
0x151 SetRenderSolidPrimColor1 1
0x152 SetRenderSolidPrimColor2 1
0x153 SetRenderSolidPrimColor3 1
0x154 SetMmeMemAddressA 1
0x155 SetMmeMemAddressB 1
0x156 SetMmeDataRamAddress 1
0x157 MmeDmaRead 1
0x158 MmeDmaReadFifoed 1
0x159 MmeDmaWrite 1
0x15A MmeDmaReduction 1
0x15B MmeDmaSysmembar 1
0x15C MmeDmaSync 1
0x15D SetMmeDataFifoConfig 1
0x160 RenderSolidPrimMode 1
0x161 SetRenderSolidPrimColorFormat 1
0x162 SetRenderSolidPrimColor 1
0x163 SetRenderSolidLineTieBreakBits 1
0x178 RenderSolidPrimPointXY 1
0x180 RenderSolidPrimPoint 128
0x200 SetPixelsFromCpuDataType 1
0x201 SetPixelsFromCpuColorFormat 1
0x202 SetPixelsFromCpuIndexFormat 1
0x203 SetPixelsFromCpuMonoFormat 1
0x204 SetPixelsFromCpuWrap 1
0x205 SetPixelsFromCpuColor0 1
0x206 SetPixelsFromCpuColor1 1
0x207 SetPixelsFromCpuMonoOpacity 1
0x20E SetPixelsFromCpuSrcWidth 1
0x20F SetPixelsFromCpuSrcHeight 1
0x210 SetPixelsFromCpuDxDuFrac 1
0x211 SetPixelsFromCpuDxDuInt 1
0x212 SetPixelsFromCpuDyDvFrac 1
0x213 SetPixelsFromCpuDyDvInt 1
0x214 SetPixelsFromCpuDstX0Frac 1
0x215 SetPixelsFromCpuDstX0Int 1
0x216 SetPixelsFromCpuDstY0Frac 1
0x217 SetPixelsFromCpuDstY0Int 1
0x218 PixelsFromCpuData 1
0x21C SetBigEndianControl 1
0x220 SetPixelsFromMemoryBlockShape 1
0x221 SetPixelsFromMemoryCorralSize 1
0x222 SetPixelsFromMemorySafeOverlap 1
0x223 SetPixelsFromMemorySampleMode 1
0x22C SetPixelsFromMemoryDstX0 1
0x22D SetPixelsFromMemoryDstY0 1
0x22E SetPixelsFromMemoryDstWidth 1
0x22F SetPixelsFromMemoryDstHeight 1
0x230 SetPixelsFromMemoryDuDxFrac 1
0x231 SetPixelsFromMemoryDuDxInt 1
0x232 SetPixelsFromMemoryDvDyFrac 1
0x233 SetPixelsFromMemoryDvDyInt 1
0x234 SetPixelsFromMemorySrcX0Frac 1
0x235 SetPixelsFromMemorySrcX0Int 1
0x236 SetPixelsFromMemorySrcY0Frac 1
0x237 PixelsFromMemorySrcY0Int 1
0x238 SetFalcon00 1
0x239 SetFalcon01 1
0x23A SetFalcon02 1
0x23B SetFalcon03 1
0x23C SetFalcon04 1
0x23D SetFalcon05 1
0x23E SetFalcon06 1
0x23F SetFalcon07 1
0x240 SetFalcon08 1
0x241 SetFalcon09 1
0x242 SetFalcon10 1
0x243 SetFalcon11 1
0x244 SetFalcon12 1
0x245 SetFalcon13 1
0x246 SetFalcon14 1
0x247 SetFalcon15 1
0x248 SetFalcon16 1
0x249 SetFalcon17 1
0x24A SetFalcon18 1
0x24B SetFalcon19 1
0x24C SetFalcon20 1
0x24D SetFalcon21 1
0x24E SetFalcon22 1
0x24F SetFalcon23 1
0x250 SetFalcon24 1
0x251 SetFalcon25 1
0x252 SetFalcon26 1
0x253 SetFalcon27 1
0x254 SetFalcon28 1
0x255 SetFalcon29 1
0x256 SetFalcon30 1
0x257 SetFalcon31 1
0x37B MmeDmaWriteMethodBarrier 1
0xD00 SetMmeShadowScratch 256 uint[0x100] Mostly general purpose, sometimes accessed by FirmwareCall.
0xE00+N*2 CallMmeMacroN 1 uint n=0..0x7f. Writing here calls a bound macro with value written put in r1.
0xE01+N*2 CallMmeDataN 1 pipe Used to send variable number of arguments to macro via the "parm" instruction.
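
The CallMmeMacroN and CallMmeDataN rows suggest a simple register sequence for invoking macro N. The sketch below is an assumption based only on the notes above: the first value goes to CallMmeMacroN and any further arguments go to CallMmeDataN. The function name and the (register, value) tuple format are hypothetical.

```python
def macro_call_writes(n, first_arg, extra_args=()):
    """Build (register, value) writes that call bound macro n.

    first_arg is written to CallMmeMacroN (0xE00+N*2) and, per the notes,
    ends up in r1; each extra argument goes to CallMmeDataN (0xE01+N*2).
    """
    if not 0 <= n <= 0x7F:
        raise ValueError("macro index must be in 0..0x7f")
    writes = [(0xE00 + n * 2, first_arg & 0xFFFFFFFF)]
    writes += [(0xE01 + n * 2, arg & 0xFFFFFFFF) for arg in extra_args]
    return writes
```

Calling `macro_call_writes(3, 1, [2, 3])` produces one write to 0xE06 followed by two writes to 0xE07.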

MAXWELL_DMA_COPY_A

Register Name Size Type Notes
0x40 Nop 1
0x50 PmTrigger 1
0x90 SetSemaphoreA 1
0x91 SetSemaphoreB 1
0x92 SetSemaphorePayload 1
0x95 SetRenderEnableA 1
0x96 SetRenderEnableB 1
0x97 SetRenderEnableC 1
0x98 SetSrcPhysMode 1
0x99 SetDstPhysMode 1
0xC0 LaunchDma 1 bitfield With 0x186, Src/DstStride is not used. With 0x586, memset functionality is used.
0x100 OffsetInUpper 1 gpuva
0x101 OffsetInLower 1 gpuva
0x102 OffsetOutUpper 1 gpuva
0x103 OffsetOutLower 1 gpuva
0x104 PitchIn 1 uint
0x105 PitchOut 1 uint
0x106 LineLengthIn 1 uint At most 0x3FFFFF.
0x107 LineCount 1 uint At most 0x3FFFFF.
0x1C0 SetRemapConstA 1 uint
0x1C1 SetRemapConstB 1 uint
0x1C2 SetRemapComponents 1 bitfield Seen: 0x34444
0x1C3 SetDstBlockSize 1 uint
0x1C4 SetDstWidth 1 uint In units of 4 bytes.
0x1C5 SetDstHeight 1 Seen: 1
0x1C6 SetDstDepth 1
0x1C7 SetDstLayer 1
0x1C8 SetDstOrigin 1
0x1CA SetSrcBlockSize 1
0x1CB SetSrcWidth 1
0x1CC SetSrcHeight 1
0x1CD SetSrcDepth 1
0x1CE SetSrcLayer 1
0x1CF SetSrcOrigin 1
0x445 PmTriggerEnd 1
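
As a rough illustration of how the address and size registers above fit together, the sketch below builds the register writes for a single-line copy. It assumes a 64-bit gpuva is split into upper and lower 32-bit halves for the Offset*Upper/Lower pairs, and it uses the LaunchDma value 0x186 mentioned in the notes. The function name and the write-list format are hypothetical, and the unit of LineLengthIn is not stated in the table, so `length` is passed through unchanged.

```python
MAX_LINE = 0x3FFFFF  # limit given above for LineLengthIn and LineCount


def linear_copy_writes(src, dst, length):
    """Return (register, value) pairs for a copy with LineCount = 1."""
    if not 0 < length <= MAX_LINE:
        raise ValueError("length outside the LineLengthIn limit")
    return [
        (0x100, src >> 32),          # OffsetInUpper (assumed: high 32 bits)
        (0x101, src & 0xFFFFFFFF),   # OffsetInLower (assumed: low 32 bits)
        (0x102, dst >> 32),          # OffsetOutUpper
        (0x103, dst & 0xFFFFFFFF),   # OffsetOutLower
        (0x106, length),             # LineLengthIn
        (0x107, 1),                  # LineCount
        (0xC0, 0x186),               # LaunchDma, Src/DstStride not used
    ]
```

LaunchDma is written last in this sketch on the assumption that it triggers the transfer once the other registers are set.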