aboutsummaryrefslogtreecommitdiff
path: root/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c
AgeCommit message (Expand)AuthorFilesLines
2024-07-10drm/amdgpu: timely save bad pages to eeprom after gpu ras reset is completedYiPeng Chai1-1/+5
2024-07-10drm/amdgpu: flush all cached ras bad pages to eepromYiPeng Chai1-6/+29
2024-07-08drm/amdgpu: add ras event state device attribute supportYang Wang1-4/+52
2024-07-08drm/amdgpu: add ras POSION_CONSUMPTION event id supportYang Wang1-3/+13
2024-07-08drm/amdgpu: add ras POSION_CREATION event id supportYang Wang1-3/+14
2024-07-08drm/amdgpu: refine amdgpu ras event id core codeYang Wang1-18/+84
2024-07-08drm/amdgpu: sysfs node disable query error count during gpu resetYiPeng Chai1-0/+3
2024-07-01drm/amdgpu: Fix hbm stack id in boot error reportHawking Zhang1-1/+1
2024-06-27drm/amdgpu: add gpu reset check and exception handlingYiPeng Chai1-0/+53
2024-06-27drm/amdgpu: refine poison consumption interrupt handlerYiPeng Chai1-18/+37
2024-06-27drm/amdgpu: refine poison creation interrupt handlerYiPeng Chai1-22/+17
2024-06-27drm/amdgpu: add variable to record the deferred error number read by driverYiPeng Chai1-18/+44
2024-06-14drm/amdgpu: set RAS fed status for more casesTao Zhou1-0/+1
2024-06-14drm/amdgpu: create amdgpu_ras_in_recovery to simplify codeTao Zhou1-12/+19
2024-06-14drm/amdgpu: trigger mode1 reset for RAS RMA statusTao Zhou1-6/+22
2024-06-14drm/amdgpu: move aca/mca init functions into ras_init() stageYang Wang1-23/+50
2024-06-14drm/amdgpu: add reset source in various casesEric Huang1-0/+1
2024-06-05drm/amdgpu: add RAS is_rma flagTao Zhou1-5/+4
2024-06-05drm/amdgpu: Update programming for boot error reportingHawking Zhang1-54/+45
2024-06-05drm/amdgpu: Estimate RAS reservation when report capacity v2Hawking Zhang1-0/+20
2024-05-29drm/amdgpu: fix typo in amdgpu_ras_aca_sysfs_read() functionYang Wang1-1/+1
2024-05-23drm/amdgpu: skip to create ras xxx_err_count node when ACA is enabledYang Wang1-0/+6
2024-05-17drm/amdgpu: fix ACA no query result after gpu resetYang Wang1-5/+4
2024-05-17drm/amdgpu: fix compiler 'side-effect' check issue for RAS_EVENT_LOG()Yang Wang1-0/+18
2024-05-17drm/amdgpu: Fix the null pointer dereference to ras_managerMa Jun1-2/+5
2024-05-17drm/amdgpu: Remove dead code in amdgpu_ras_add_mca_err_addrMa Jun1-13/+0
2024-05-08drm/amdgpu: change log levelYiPeng Chai1-1/+1
2024-05-08drm/amdgpu: fix RAS unload driver issue in SRIOVYang Wang1-6/+8
2024-05-02drm/amdgpu: Add psp v13_0_14 ip blockHawking Zhang1-0/+2
2024-04-30drm/amdgpu: Remove redundant function callYiPeng Chai1-16/+6
2024-04-30drm/amdgpu: add MCA smu cache supportYang Wang1-0/+9
2024-04-26drm/amdgpu: Fix ras mode2 reset failure in ras aca modeYiPeng Chai1-0/+4
2024-04-26drm/amdgpu: Use new interface to reserve bad pageYiPeng Chai1-3/+1
2024-04-26drm/amdgpu: Fix address translation defectYiPeng Chai1-1/+1
2024-04-26drm/amdgpu: add poison consumption handlerYiPeng Chai1-4/+39
2024-04-26drm/amdgpu: Add delay work to retire bad pagesYiPeng Chai1-1/+35
2024-04-26drm/amdgpu: add interface to update umc v12_0 ecc statusYiPeng Chai1-0/+2
2024-04-26drm/amdgpu: add poison creation handlerYiPeng Chai1-7/+69
2024-04-26drm/amdgpu: prepare for logging ecc errorsYiPeng Chai1-0/+32
2024-04-26drm/amdgpu: add message fifo to handle RAS poison eventsYiPeng Chai1-0/+35
2024-04-26drm/amdgpu: Add interface to reserve bad pageYiPeng Chai1-0/+19
2024-04-09drm/amdgpu: Set fatal errror detected flag earlierLijo Lazar1-13/+28
2024-03-22drm/amdgpu: add ras event id support for ACAYang Wang1-5/+6
2024-03-20drm/amdgpu: add aca deferred error type supportYang Wang1-2/+6
2024-03-20drm/amdgpu: make reset method configurable for RAS poisonTao Zhou1-2/+2
2024-03-20drm/amdgpu: add ras event id supportYang Wang1-71/+136
2024-02-26drm/amdgpu: Fix ineffective ras_mask settingsStanley.Yang1-0/+1
2024-02-26drm/amdgpu: Add fatal error detected flagLijo Lazar1-0/+32
2024-01-31drm/amdgpu: disable RAS feature when finiTao Zhou1-1/+1
2024-01-31drm/amdgpu: Update boot time errors polling sequenceHawking Zhang1-1/+13