GPU Classes: Difference between revisions
		
		
		
		Jump to navigation
		Jump to search
		
| Line 424: | Line 424: | ||
| | 0x786+8*N || BlendNAlphaFunctionDst || 1 || bitfield || Bit0-4: ?, bit14-15: ? | | 0x786+8*N || BlendNAlphaFunctionDst || 1 || bitfield || Bit0-4: ?, bit14-15: ? | ||
| |- style="border-top: double" | |- style="border-top: double" | ||
| | 0x820 || || 1 || ||  | | 0x820 || || 1 || || Select tessellation control program. | ||
| |- | |- | ||
| | 0x821 || || 1 || ||  | | 0x821 || || 1 || || Specify tessellation control program entry point. | ||
| |- style="border-top: double" | |- style="border-top: double" | ||
| | 0x830 || || 1 || ||  | | 0x830 || || 1 || || Select tessellation evaluation program. | ||
| |- style="border-top: double" | |- style="border-top: double" | ||
| | 0x840 || || 1 || ||  | | 0x840 || || 1 || || Select geometry program. | ||
| |- style="border-top: double" | |- style="border-top: double" | ||
| | 0x850 || || 1 || ||  | | 0x850 || || 1 || || Select fragment program. | ||
| |- style="border-top: double" | |- style="border-top: double" | ||
| | 0x8C0+N || FirmwareCallN || 8 || uint || Invokes GPU microcode firmware blob. | | 0x8C0+N || FirmwareCallN || 8 || uint || Invokes GPU microcode firmware blob. | ||
Revision as of 12:35, 25 May 2018
Subchannels:
| Id | Subchannel (nvn) | Name | 
|---|---|---|
| 0xB197 | 0 | 3D | 
| 0xB1C0 | 1 | Compute | 
| 0xA140 | 2 | Inline-to-Memory | 
| 0x902D | 3 | 2D | 
| 0xB0B5 | 4 | DMA | 
3D
| Register | Name | Size | Type | Notes | 
|---|---|---|---|---|
| 0 | SetObject | 1 | bitfield | Bit0-15: ClassId, bit16-20: EngineId | 
| 0x40 | NoOperation | 1 | ||
| 0x41 | SetNotify | 2 | gpuva | |
| 0x43 | NotifyType | 1 | 0=WriteOnly, 1=WriteThenAwaken | |
| 0x44 | WaitForIdle | 1 | ||
| 0x45 | MmeInstructionRamPointer | 1 | uint | |
| 0x46 | MmeInstructionRamLoad | 1 | pipe | Writes to and increments MmeInstructionRamPointer by 1. | 
| 0x47 | MmeStartAddressRamPointer | 1 | uint | |
| 0x48 | MmeStartAddressRamLoad | 1 | pipe | |
| 0x49 | MmeShadowRamControl | 1 | uint | 0=MethodTrack, 1=MethodTrackWithFilter, 2=MethodPassthrough, 3=MethodReplay. Used during ClearColor. | 
| 0x83 | LineWidthSeparate | 1 | bool | |
| 0x84 | ForceEarlyFragmentTests | 1 | bool | |
| 0x87 | Barrier | 1 | bitfield | |
| 0xBA | 1 | SetRenderTargets writes 1/0 here an optional buffer != NULL, and depending on type. | ||
| 0xBE | 1 | pipe | SetRenderTargets writes here repeatedly 16 times, if an optional buffer != NULL and is a given type. | |
| 0xBF | 1 | bool | SetRenderTargets writes 1 here after 0xBE has been written 16 times, and 0xDC been written. | |
| 0xC2 | CacheSplit | 1 | enum | |
| 0xC8 | TesselationMode | 1 | bitfield | Bit0-1: PrimitiveType (0=Isolines, 1=Triangles, 2=Quads), bit4-5: Spacing (0=Equal, 1=FractionalOdd, 2=FractionalEven), bit8-9: WindingAndConnectednessFlags (0=CcwAndNotConnected, 1=IsolineConnected, 1=TriangleCw, 2=TriangleConnected, 3=TriangleConnectedCw) | 
| 0xC9 | TesselationOuterLevels | 4 | float[4] | |
| 0xCD | TesselationInnerLevels | 2 | float[2] | |
| 0xDC | 1 | bool | SetRenderTargets writes 1 here, if an optional buffer != NULL and is a given type. | |
| 0xDF | RasterizerEnable | 1 | bool | |
| 0xE0+8*N | TransformFeedbackBufferEnable | 1 | bool | n=0..3 | 
| 0xE1+8*N | TransformFeedbackBufferAddr | 2 | gpuva | |
| 0xE3+8*N | TransformFeedbackBufferFlags | 1 | ||
| 0x1C0+4*N | TransformFeedbackBufferStream | 1 | bitfield | |
| 0x1C1+4*N | TransformFeedbackVaryingCount | 1 | uint | |
| 0x1C2+4*N | TransformFeedbackStride | 1 | uint | |
| 0x1D1 | TransformFeedbackEnable | 1 | bool | |
| 0x1D3 | 1 | GpuInit writes 0x3f here. | ||
| 0x1D5 | SampleShading | 1 | bitfield | Bit0-3: ?, bit4: Enable | 
| 0x1DF | LocalBase | 1 | uint | TODO | 
| 0x1F0 | ZCullWidth | 1 | SetRenderTargets optionally uses this. | |
| 0x1F1 | ZCullHeight | 1 | SetRenderTargets optionally uses this. | |
| 0x1F2 | 1 | SetRenderTargets optionally uses this. | ||
| 0x1F3 | 1 | SetRenderTargets optionally writes 0 here. | ||
| 0x1F8 | 1 | SetRenderTargets optionally uses this. | ||
| 0x1F9 | 1 | SetRenderTargets optionally uses this. | ||
| 0x1FA | ZCullContextStartAddr | 2 | gpuva | |
| 0x1FC | ZCullContextEndAddr | 2 | gpuva | |
| 0x200+16*N | RenderTargetNAddr | 2 | gpuva | |
| 0x202+16*N | RenderTargetNHorizontal | 1 | ||
| 0x203+16*N | RenderTargetNVertical | 1 | ||
| 0x204+16*N | RenderTargetNFormat | 1 | ||
| 0x205+16*N | RenderTargetNTileMode | 1 | bitfield | Bit0-3: Width, bit4-7: Height, bit8-10: Depth, bit12: Layout, bit16: ? | 
| 0x206+16*N | RenderTargetNArrayMode | 1 | bitfield | Bit0-15: Layers, bit16: Volume | 
| 0x207+16*N | RenderTargetNLayerStride | 1 | uint | In units of 4 bytes. | 
| 0x208+16*N | RenderTargetNBaseLayer | 1 | ||
| 0x280+8*N | ViewportNScaleX | 1 | float | n=0..15 | 
| 0x281+8*N | ViewportNScaleY | 1 | float | |
| 0x282+8*N | ViewportNScaleZ | 1 | float | |
| 0x283+8*N | ViewportNTranslateX | 1 | float | |
| 0x284+8*N | ViewportNTranslateY | 1 | float | |
| 0x285+8*N | ViewportNTranslateZ | 1 | float | |
| 0x286+8*N | ViewportNSwizzles | 1 | bitfield | |
| 0x287+8*N | ViewportNSubpixelPrecisionBias | 1 | bitfield | Bit0-4: BiasX, bit8-bit11: BiasY | 
| 0x300+4*N | ViewportNHorizontal | 1 | bitfield | Bit0-15: X, bit16-31: Width | 
| 0x301+4*N | ViewportNVertical | 1 | bitfield | Bit0-15: Y, bit16-31: Height | 
| 0x302+4*N | ViewportNDepthRangeNear | 1 | float | |
| 0x303+4*N | ViewportNDepthRangeFar | 1 | float | |
| 0x340+2*N | ClipRectNHorizontal | 1 | n=0..7, TODO | |
| 0x341+2*N | ClipRectNVertical | 1 | TODO | |
| 0x359 | CallLimit | 1 | uint | 0=FullLimit, 1...8=?, 15=Unlimited | 
| 0x35A | CounterEnable | 1 | bitfield | Bit0: ?, bit1: ?, ... bit15: ? | 
| 0x35D | VertexBufferFirst | 1 | ||
| 0x35E | VertexBufferCount | 1 | ||
| 0x35F | DepthClipAllowNegativeZ | 1 | bool | |
| 0x360 | ClearColor | 4 | float | |
| 0x364 | ClearDepth | 1 | float | |
| 0x368 | ClearStencil | 1 | ||
| 0x36B | PolygonModeFront | 1 | bitfield | 0x1B00/0x1B01/0x1B02 | 
| 0x36C | PolygonModeBack | 1 | bitfield | 0x1B00/0x1B01/0x1B02 | 
| 0x36D | PolygonSmoothEnable | 1 | bool | |
| 0x36F | ? | 1 | bitfield | Bit0:?, bit16:?. Used by ClearDepthStencil. | 
| 0x370 | PolygonOffsetPointEnable | 1 | bool | |
| 0x371 | PolygonOffsetLineEnable | 1 | bool | |
| 0x372 | PolygonOffsetFillEnable | 1 | bool | |
| 0x373 | PatchSize | 1 | Small value, always fits in 12 bits. In number of vertices. | |
| 0x374 | 1 | 0 written here for "simple" BlendState. | ||
| 0x379 | WatchdogTimer | 1 | ||
| 0x37A | PrimitiveRestartWithDrayArrays | 1 | boolean | |
| 0x37E | WindowOffsetX | 1 | int | |
| 0x37F | WindowOffsetY | 1 | int | |
| 0x380+4*N | ScissorNEnable | 1 | bool | n=0..15. GpuInit writes 1 here. | 
| 0x381+4*N | ScissorNHorizontal | 1 | bitfield | Bit0-15: min, bit16-31: max | 
| 0x382+4*N | ScissorNVertical | 1 | bitfield | Bit0-15: min, bit16-31: max | 
| 0x3D5 | StencilBackRefValue | 1 | ||
| 0x3D6 | StencilBackEnable | 1 | bool | |
| 0x3D7 | StencilBackValueMask | 1 | ||
| 0x3D8 | TiledCacheAction0 | 1 | bool | Trigger? | 
| 0x3D9 | TiledCacheTileSize | 1 | bitfield | Bit0-15: ?, bit16-31: ? | 
| 0x3DE | DiscardTrigger | 1 | bitfield | Bit4-6: DiscardColorIndex, bit0: DiscardDepthStencil | 
| 0x3E0 | TiledCacheAction1 | 1 | bool | Trigger? | 
| 0x3E7 | DepthBounds | 2 | float[2] | |
| 0x3ED | MultisampleRasterEnable | 1 | bool | Also written 0 when clearing all colors. | 
| 0x3EE | MultisampleRasterSamples | 1 | bitfield | 2=Four, 4=Eight, 5=?, 6=Sixteen | 
| 0x3EF | MultisampleCoverageModulationMode | 1 | ||
| 0x3F0 | SampleMask0 | 1 | ||
| 0x3F1 | SampleMask1 | 1 | ||
| 0x3F2 | SampleMask2 | 1 | ||
| 0x3F5 | Multisample_Related2 | 1 | TODO | |
| 0x3F6 | CoverageModulationEnable | 1 | bool | |
| 0x3F8 | Unknown?Addr | 2 | gpuva | SetRenderTargets writes the address of an optional buffer here. | 
| 0x3FA | Unknown?TileMode | 1 | SetRenderTargets writes here optionally. | |
| 0x3FB | Unknown?ArrayMode | 1 | SetRenderTargets writes here optionally. | |
| 0x3FC | Unknown?BaseLayer | 1 | SetRenderTargets writes here optionally. | |
| 0x3FD | 1 | bitfield?[2] | SetRenderTargets writes here | |
| 0x40C | CoverageModulationTable | 4 | float[4] | |
| 0x433 | 1 | GpuInit writes 4 here. | ||
| 0x438 | 1 | GpuInit writes 0xFF here. | ||
| 0x439 | 1 | GpuInit writes 0xFF here. | ||
| 0x43B | 1 | GpuInit writes 0xFF here. | ||
| 0x43C | 1 | GpuInit writes 4 here. | ||
| 0x43D | Unbind (???) | 1 | bitfield | Bit0: InvalidateTextureHeaders, bit4: InvalidateTextureSamplers, bit8: InvalidateConstBuffers | 
| 0x43E | ClearFlags? | 1 | GpuInit writes 0x101 here. | |
| 0x446 | DrawElementsEnableBaseVertex? | 1 | bool | TODO | 
| 0x44D | TiledCacheAction2 | 1 | bool | Trigger? | 
| 0x452 | RasterEnable | 1 | bool | |
| 0x458 | VertexStreamNAttrib | 4 | bitfield[16] | Bit0-4: StreamIndex, bit7-20: Format0, bit21-30: Format1 | 
| 0x478 | MultisampleGrid | 3 | bitfield[3] | Bit0-3: x0, bit4-7: y0, bit8-11: x1, etc.. | 
| 0x47C | 1 | trigger | BindProgram writes 0 here sometimes. | |
| 0x47E | MultisampleCoverageToColor | 1 | bitfield | Bit0: Enable, bit4-6: ? | 
| 0x47F | DepthBufferResolve | 1 | trigger | 1 is written here to trigger. | 
| 0x487 | RenderTargetControl | 1 | bitfield | Bit0-3: NumberOfRenderTargets, bunch of other flags. Used by SetRenderTargets. | 
| 0x48A | 1 | Optionally used by SetRenderTargets. | ||
| 0x48B | 1 | Optionally used by SetRenderTargets. | ||
| 0x48C | 1 | Optionally used by SetRenderTargets. | ||
| 0x48D | SamplerBinding (???) | 1 | enum | 0=Independently, 1=ViaHeaderBinding | 
| 0x4A2 | InvalidateTextureDataNoWfi (???) | 1 | bitfield | bit0: 0=AllLines, 1=OneLine, bit4-25: Tag | 
| 0x4AB | ShaderScheduling (???) | 1 | enum | 0=OldestThreadFirst, 1=RoundRobin | 
| 0x4B3 | DepthTestEnable | 1 | bool | Enables DepthWriteEnable and DepthFunc. | 
| 0x4B8 | MultisampleAlphaToCoverageDither | 1 | bool | |
| 0x4B9 | BlendIndependent | 1 | bool | 1 written here for "simple" BlendState. | 
| 0x4BA | DepthWriteEnable | 1 | bool | |
| 0x4BB | AlphaTestEnable | 1 | bool | |
| 0x4C3 | DepthFunc | 1 | bitfield | Bit0-3: DepthFunc | 
| 0x4C4 | AlphaTestRefValue | 1 | float | |
| 0x4C5 | AlphaTestFunc | 1 | bitfield | Bit0-3: AlphaTestFunc | 
| 0x4C7 | BlendColor | 4 | float[4] | TODO | 
| 0x4CC | InvalidateSamplerCache (???) | 1 | bitfield | bit0: 0=AllLines, 1=OneLine, bit4-25: Tag | 
| 0x4CD | InvalidateTextureHeaderCache (???) | 1 | bitfield | bit0: 0=AllLines, 1=OneLine, bit4-25: Tag | 
| 0x4CE | InvalidateTextureDataCache (???) | 1 | bitfield | bit0: 0=AllLines, 1=OneLine, bit4-25: Tag | 
| 0x4E0 | StencilEnable | 1 | bool | |
| 0x4E5 | StencilFrontRefValue | 1 | ||
| 0x4E6 | StencilFrontMaskValue | 1 | ||
| 0x4E7 | StencilFrontEnable | 1 | bool | |
| 0x4EC | LineWidthSmooth | 1 | float | |
| 0x4ED | LineWidthAliased | 1 | float | |
| 0x50D | VertexBufferElementBase | 1 | uint | |
| 0x519 | ZCullContextSave | 1 | trigger | 0 is written here to trigger ctx-save, uses ZCullContextStartAddr. | 
| 0x51F | PolygonOffsetClamp0 | 1 | float | |
| 0x540 | ZCullContextRestore | 1 | trigger | 0 is written here to trigger ctx-restore, uses ZCullContextStartAddr. | 
| 0x546 | PointSize | 1 | float | |
| 0x54C | CounterReset | 1 | Value written decides which counter to reset. | |
| 0x54D | MultisampleEnable | 1 | ||
| 0x54E | 1 | bool | SetRenderTargets writes 0 here if an optional buffer is NULL. | |
| 0x54F | MultisampleControl | 1 | bool | bit0: AlphaToCoverageEnable, bit1+: ? | 
| 0x554 | RenderEnableOffset | 2 | gpuva | |
| 0x556 | RenderEnableMode | 1 | enum | 0=False, 1=True, 2=Conditional, 3=RenderIfEqual, 4=RenderIfNotEqual, | 
| 0x557 | TexSamplerPoolOffset (???) | 2 | gpuva | |
| 0x558 | TexSamplerPoolMaximumIndex (???) | 1 | bitfield | Bit0-19: Maximum | 
| 0x55B | PolygonOffsetFactor | 1 | float | |
| 0x55D | TexHeaderPoolOffset (???) | 2 | gpuva | |
| 0x55F | TexHeaderPoolMaximumIndex (???) | 1 | bitfield | Bit0-21: Maximum | 
| 0x56D | CsaaEnable | 1 | bool | |
| 0x56F | PolygonOffsetOffset | 1 | float | Float multiplied by 2 is written here. | 
| 0x574 | MultisampleMode | 1 | Written by SetRenderTargets, possible values: 0, 2, 4, 5, 6. | |
| 0x57F | 1 | SetRenderTargets optionally writes 0 here. | ||
| 0x580 | 1 | SetRenderTargets optionally writes 0 here. | ||
| 0x582 | ProgramRegion (???) | 2 | gpuva | |
| 0x591 | PrimitiveRestartEnable | 1 | bool | |
| 0x592 | PrimitiveRestartIndex | 1 | uint | |
| 0x599 | CubeMapInterFaceFiltering (???) | 1 | bitfield | Bit0-1: Mode (0=UseWrap, 1=OverrideWrap, 2=AutoSpanSeam, 3=AutoCrossSeam) | 
| 0x5A4 | ShaderControl (???) | 1 | bitfield | Bit0: Partial (0=Zero, 1=Infinity), bit1: Fp32NanBehavior (0=Legacy, 1=Fp64Compatible), bit2: Fp32F21NanBehavior (0=PassZero, 1=PassIndefinite), bit16: ZeroTimesAnythingIsZero | 
| 0x5A5 | BindConstantBuffer (???) | 1 | bitfield | Bit0: Valid, bit8-12: ShaderSlot | 
| 0x5A6 | InvalidateShaderCachesNoWfi (???) | 1 | bitfield | Bit0: Instruction, bit4: GlobalData, bit8: Uniform, bit12: Constant | 
| 0x5F2 | DrawElementsIndirectAddr | 2 | gpuva | TODO | 
| 0x5F6 | DrawElementsIndirectFlag? | 1 | bool | TODO | 
| 0x620+N | VertexStreamNEnableDivisor | 1 | bool | n=0...15. | 
| 0x646 | PolygonCullFaceEnable | 1 | bool | |
| 0x647 | PolygonFrontFace | 1 | bitfield | Bit0: Enable. Always ORR'd with 0x9000. | 
| 0x648 | PolygonCullFaceConfig | 1 | bitfield | TODO: 0x404/0x405/0x408 | 
| 0x651 | RenderEnableOverride | 1 | bitfield | Bit0-1: 0=UseRenderEnable, 1=AlwaysRender, 2=NeverRender | 
| 0x64C | InvalidateConstantBufferCache (???) | 1 | bitfield | Bit0: ThruL2 | 
| 0x64F | DepthClamp | 1 | TODO: 0x101A is written when enabled, 0x181D when disabled. | |
| 0x66F | DepthBoundsEnable | 1 | bool | |
| 0x671 | ColorLogicOpEnable | 1 | bool | Used for all LogicOps except 3. | 
| 0x672 | ColorLogicOpType | 1 | bitfield | Bit0-7: LogicOp, bit8-15: unknown, always 0x15. | 
| 0x689 | SetSpare (???) | 1 | uint[4] | |
| 0x68B | PipeNop | 1 | trigger | Always 0 is written here. During zcull ctx-save, spammed when enabling raster, ... | 
| 0x6C0 | ReportSemaphoreOffset | 2 | gpuva | |
| 0x6C2 | ReportSemaphorePayload | 1 | 0 is written here during most queries. | |
| 0x6C3 | ReportSemaphoreControl | 1 | bitfield | Bit0-1: Operation (0=Release, 3=Trap), bit2: FlushDisable, bit20: AwakenEnable, bit28: StructureSize (0=FourWords, 1=OneWord) | 
| 0x700+4*N | VertexStreamNStride | 1 | uint | Bit0-11: Stride. TODO: This has more stuff according to nouveau. n=0..31 | 
| 0x701+4*N | VertexBufferStartAddr | 2 | gpuva | TODO: Incorrecto | 
| 0x703+4*N | VertexStreamNDivisor | |||
| 0x7C0+2*N | VertexBufferEndAddr | 2 | gpuva | |
| 0x781+8*N | BlendNRgbEquation | 1 | bitfield | Bit0-2: BlendEquation. | 
| 0x782+8*N | BlendNRgbFunctionSrc | 1 | bitfield | Bit0-4: ?, bit14-15: ? | 
| 0x783+8*N | BlendNRgbFunctionDst | 1 | bitfield | Bit0-4: ?, bit14-15: ? | 
| 0x784+8*N | BlendNAlphaEquation | 1 | bitfield | Bit0-2: BlendEquation | 
| 0x785+8*N | BlendNAlphaFunctionSrc | 1 | bitfield | Bit0-4: ?, bit14-15: ? | 
| 0x786+8*N | BlendNAlphaFunctionDst | 1 | bitfield | Bit0-4: ?, bit14-15: ? | 
| 0x820 | 1 | Select tessellation control program. | ||
| 0x821 | 1 | Specify tessellation control program entry point. | ||
| 0x830 | 1 | Select tessellation evaluation program. | ||
| 0x840 | 1 | Select geometry program. | ||
| 0x850 | 1 | Select fragment program. | ||
| 0x8C0+N | FirmwareCallN | 8 | uint | Invokes GPU microcode firmware blob. | 
| 0x8E0 | ConstantBufferSelectorSize | 1 | uint | Bit0-16: Size | 
| 0x8E1 | ConstantBufferSelectorAddr | 2 | gpuva | |
| 0x8E3 | LoadConstantBufferOffset | 1 | uint | Bit0-15: Offset. BindImage writes "8*i + 0x120" here. BindSeparateSampler writes "8*i + 0x568" here. BindSeparateTexture uses "8*i + 0x168". BindTexture uses 8*i + 32. TODO: BindStorageBuffer, UpdateUniformBuffer, etc | 
| 0x8E4 | LoadConstantBuffer | 1 | uint[16] | |
| 0x904+8*N | ConstBufferNBind | 1 | bitfield | n=0..4. Bit0: Valid, bit4-8: Index | 
| 0xD00 | MmeShadowScratch | uint[0x80] | 128 | Mostly general purpose, sometimes accessed by FirmwareCall. | 
| 0xE00+N*2 | MmeMacroNCall | 1 | uint | n=0..0x7f. Writing here calls a bound macro with value written put in r1. | 
| 0xE01+N*2 | MmeMacroNData | 1 | pipe | Used to send variable number of arguments to macro via the "parm" instruction. | 
TODO: (Tiled)Downsample, Copy*, Dispatch*, DrawTexture/DrawTransformFeedback, BindImages, BindProgram, BindSeprarateSamplers, BindSeprarateTextures, BindTextures.
DMA
| Register | Name | Size | Type | Notes | 
|---|---|---|---|---|
| 0x0C0 | Control | 1 | bitfield | With 0x186 Src/DstStride is not used. With 0x586 memset-functionality is used. | 
| 0x100 | SourceAddr | 2 | gpuva | |
| 0x102 | DestinationAddr | 2 | gpuva | |
| 0x104 | SourceStride? | 1 | uint | |
| 0x105 | DestinationStride? | 1 | uint | |
| 0x106 | Count | 1 | uint | At most 0x3FFFFF. | 
| 0x1C0 | MemsetValue? | 1 | uint | |
| 0x1C2 | MemsetControl? | 1 | bitfield | Seen: 0x34444 | 
| 0x1C4 | MemsetLength? | 1 | uint | In units of 4 bytes. | 
| 0x1C5 | ? | 1 | Seen: 1 |