33 Commits

Author SHA1 Message Date
wjiang
fae7c0263f
Add some improvements and add new modules to HiSilicon common section 2025-04-30 10:56:55 +08:00
znzjugod
f0b4938899 fix ras-mc-ctl --summary failure 2025-04-19 11:15:50 +08:00
luckky
ab151e3a74 set to default when param is overflow 2024-12-16 17:31:13 +08:00
Bing Xia
c6a2b9b56f rasdaemon: ras-mc-ctl: Modify check for HiSilicon KunPeng9xx error fields
Modify check for valid HiSilicon KunPeng9xx error fields.
Fixes an issue where error data is not printed when its value is 0.
2024-04-25 14:52:52 +08:00
Shengwei Luo
be4fee4058 Fix cpu isolate errors when some cpus are offline before the service started 2024-04-23 17:53:17 +08:00
zhuofeng
c200e16c47 fix ras-mc-ctl.service startup failed when selinux is on 2024-03-27 15:00:20 +08:00
zhangruifang2020
49d1da09ac backport patches from upstream 2024-03-25 14:44:27 +08:00
zhuofeng
e8e59ad901 update version to 0.8.0 2024-02-04 20:50:44 +08:00
Lv Ying
ef66da8259 backport bugfix patches from community
Signed-off-by: Lv Ying <lvying6@huawei.com>
2023-12-31 22:55:05 +08:00
renxichen
49c470f4c0 bugfix on rasdaemon.service 2023-12-04 09:30:55 +08:00
markeryang
c0c9e9e521 backport libtraceevent patch to adapt to kernel ftrace ring buffer change 2023-06-17 16:06:56 +08:00
Shiju Jose
e0f5dee5f1 rasdaemon: Add fix patches for rasdaemon and Add support for creating the vendor error tables at startup
Add the following fix patches and changes,
1. Fix return value type issue of read/write function from unistd.h.
2. Fix issue of signed and unsigned integer comparison.
3. Remove redundant header file and do some clean-up.
4. Add support for create/open the vendor error tables at rasdaemon startup.
5. Make changes in the HiSilicon error handling code for the same.
6. Add four modules supported by HiSilicon common section.

Signed-off-by: Shiju Jose <shiju.jose@huawei.com>
2023-06-02 12:58:41 +01:00
huangfangrun
09eae3860b rasdaemon: Fix for regression in ras_mc_create_table() if some cpus are offline at the system start and Fix poll() on per_cpu trace_pipe_raw blocks indefinitely 2023-04-04 10:24:15 +08:00
renxichen
8f14bff537 backport patches from upstream 2023-03-24 09:47:09 +08:00
Lv Ying
6b355f4933 rasdaemon: upgrade to 0.6.8 2023-01-19 11:39:20 +08:00
Xiaofei Tan
e5920d76da rasdaemon: Fix startup core dumped issue.
Add the following patch to fix startup core dumped issue.
0001-rasdaemon-use-standard-length-PATH_MAX-for-path-name.patch

Signed-off-by: Xiaofei Tan <tanxiaofei@huawei.com>
(cherry picked from commit c5e8cb9176935b30077e38e50a2b7e280903460e)
2022-08-29 17:32:46 +08:00
Shiju Jose
f9eb9d8c57 rasdaemon: Update with the latest patches for the CPU fault isolation, Hisilicon Kunpeng9xx common error records and improvements in the ras-mc-ctl for the Hisilicon Kunpeng9xx errors
Update with the latest patches for the
1. CPU online fault isolation for arm event.
2. Modify recording Hisilicon common error data in the rasdaemon
3. In the ras-mc-ctl,
3.1. Improve Hisilicon common error statistics.
3.2. Add support to display the HiSilicon vendor-errors for a specified module.
3.3. Add printing usage if necessary parameters are not passed for the HiSilicon vendor-errors options.
3.4. Reformat error info of the HiSilicon Kunpeng920.
3.5. Relocate reading and display Kunpeng920 errors to under Kunpeng9xx.
3.6. Updated the HiSilicon platform name as KunPeng9xx.
4. Fixed a memory out-of-bounds issue in the rasdaemon.

Signed-off-by: Shiju Jose <shiju.jose@huawei.com>
2022-05-30 09:30:03 +01:00
Lostwayzxc
d594e333f4 sync master and 22.03 2022-03-28 16:19:44 +08:00
xujing
513c13e9c3 Update software to v0.6.7 2022-01-17 14:33:01 +08:00
Xiaofei Tan
cdfd2385d7 rasdaemon: enable compilation of memory fault prediction feature
Enable compilation of the feature memory fault prediction based on
corrected error.

Signed-off-by: Xiaofei Tan <tanxiaofei@huawei.com>
2021-12-09 15:46:26 +08:00
Xiaofei Tan
463e8e4d56 Backport the feature of memory failure events support
Only one patch:
rasdaemon: add support for memory_failure events

Signed-off-by: Xiaofei Tan <tanxiaofei@huawei.com>
2021-12-03 11:46:16 +08:00
Lostwayzxc
130f7d6303 sync three patches from sp2 2021-10-27 11:51:26 +08:00
Xiaofei Tan
f8001e8ca5 Some little fixes and add some modules support for kunpeng series
Backport one patch, and some little fixes and add some modules support
for kunpeng series, including following 5 patches:
rasdaemon: Modify non-standard error decoding interface using linked list
rasdaemon: Fix the issue of sprintf data type mismatch in uuid_le()
rasdaemon: Fix the issue of command option -r for hip08
rasdaemon: Fix some print format issues for hisi common error section
rasdaemon: Add some modules supported by hisi common error section

Signed-off-by: Xiaofei Tan <tanxiaofei@huawei.com>
2021-10-21 11:26:19 +08:00
Xiaofei Tan
defae9844b rasdaemon: backport hisilicon common section patches from community 2021-07-30 14:29:14 +08:00
xujing
959a34bc6f rasdaemon: fix disk error log storm 2021-05-15 18:24:31 +08:00
lvying
0b9d06a79e rasdaemon: backport bugfix patch from community
Fix error print handle_ras_events:
00115dda85

Signed-off-by: lvying <lvying6@huawei.com>
2021-04-28 12:06:47 +08:00
Lv Ying
d708bdf82b rasdaemon: backport bugfix patches from community
1. ras-page-isolation: do_page_offline always considers page offline was successful
e4d27840e1
2. ras-page-isolation: page which is PAGE_OFFLINE_FAILED can be offlined again
c329012ce4
2021-03-31 11:28:01 -07:00
lvying
a78599da8d rasdaemon:update Source0
Signed-off-by: lvying <lvying6@huawei.com>
2020-09-25 01:15:25 -07:00
chengquan
600f22fc95 update software to v0.6.6 2020-07-24 16:47:49 +08:00
chengquan
7e5fdbf6c8 fix file descriptor leak in ras-report.c:setup_report_socket() 2020-03-18 15:44:27 +08:00
chengquan
7970b44d84 fix file descriptor leak in ras-report.c:setup_report_socket() 2020-03-11 16:38:47 +08:00
zhuchunyi
ae37fc0910 update code 2019-11-06 19:51:08 +08:00
overweight
4f1fc1e5a7 Package init 2019-09-30 11:16:11 -04:00