View Issue Details

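ID: 0000142
Project: Anolis OS 8
Category: kernel
View Status: public
Last Update: 2021-12-17 16:30
Reporter: qingming2021
Assigned To: Shiloong
Priority: low
Severity: block
Reproducibility: always
Status: assigned
Resolution: open
Platform: x86_64
OS: Anolis OS
OS Version: 8
Product Version: 8.2 GA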
Summary: 0000142: [AnolisOS-8.2] BUG: unable to handle kernel paging request at ffffb14c0839cff6 - tpacket_rcv+0x741/0xa50
Description: With the AnolisOS-8.2-GA-x86_64-RHCK.qcow2 image running on Alibaba Cloud, an LTP test run triggers a kernel panic:

[13337.364401] LTP: starting sendmsg03
[13337.368502] raw_sendmsg: sendmsg03 forgot to set AF_INET. Fix it!
[13338.252430] LTP: starting sendmmsg01
[13338.255910] LTP: starting sendto01
[13338.259092] LTP: starting sendto02
[13338.263948] LTP: starting sendto03
[13338.303220] BUG: unable to handle kernel paging request at ffffb14c0839cff6
[13338.303857] PGD 107c7d067 P4D 107c7d067 PUD 107c7e067 PMD fbbbe5067 PTE 0
[13338.304456] Oops: 0002 [#1] SMP PTI
[13338.304815] CPU: 13 PID: 795895 Comm: sendto03 Kdump: loaded Not tainted 4.18.0-193.el8.x86_64 #1
[13338.305568] Hardware name: Alibaba Cloud Alibaba Cloud ECS, BIOS e623647 04/01/2014
[13338.306239] RIP: 0010:tpacket_rcv+0x741/0xa50
[13338.306684] Code: 48 03 2d 3a 2f 1b 4f 48 83 45 00 01 e8 78 c8 7c ff 48 2b 44 24 08 8b 54 24 10 48 01 45 08 e9 b1 f9 ff ff 31 c9 49 8d 44 13 f6 <48> c7 00 00 00 00 00 66 89 48 08 41 8b 8f dc 00 00 00 49 03 8f e0
[13338.308308] RSP: 0018:ffffb14c137a7bf0 EFLAGS: 00010246
[13338.308845] RAX: ffffb14c0839cff6 RBX: ffff9fab0cfb0800 RCX: 0000000000000000
[13338.309526] RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff9fab0cfb0800
[13338.310205] RBP: 0000000000000001 R08: ffff9f9e87b2ec78 R09: 0000000000000000
[13338.310892] R10: ffff9fab0cfb0002 R11: ffffb14c0839d000 R12: ffff9f9c87c56000
[13338.311585] R13: ffff9f9c87c56000 R14: 0000000000000000 R15: ffff9fab3943c000
[13338.312284] FS: 00007fec6b626580(0000) GS:ffff9fab40b40000(0000) knlGS:0000000000000000
[13338.313051] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[13338.313664] CR2: ffffb14c0839cff6 CR3: 0000000f7f556001 CR4: 00000000003606e0
[13338.314382] DR0: 0000000000000001 DR1: 0000000000000000 DR2: 0000000000000000
[13338.315101] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000600
[13338.315824] Call Trace:
[13338.316231] ? __skb_clone+0x29/0x130
[13338.316717] ? dev_queue_xmit_nit+0xd9/0x260
[13338.317243] dev_hard_start_xmit+0x68/0x1e0
[13338.317771] __dev_queue_xmit+0x7e3/0x9f0
[13338.318289] ? packet_parse_headers.isra.61+0xd7/0x110
[13338.318885] packet_sendmsg+0x5fe/0x920
[13338.319396] sock_sendmsg+0x4c/0x50
[13338.319884] __sys_sendto+0xee/0x160
[13338.320386] ? syscall_trace_enter+0x1d3/0x2c0
[13338.320947] ? __audit_syscall_exit+0x249/0x2a0
[13338.321520] __x64_sys_sendto+0x24/0x30
[13338.322043] do_syscall_64+0x5b/0x1a0
[13338.322558] entry_SYSCALL_64_after_hwframe+0x65/0xca
[13338.323171] RIP: 0033:0x7fec6b1403eb
[13338.323684] Code: 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 f3 0f 1e fa 48 8d 05 75 42 2c 00 41 89 ca 8b 00 85 c0 75 14 b8 2c 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 75 c3 0f 1f 40 00 41 57 4d 89 c7 41 56 41 89
[13338.325555] RSP: 002b:00007ffdadd00e98 EFLAGS: 00000246 ORIG_RAX: 000000000000002c
[13338.326384] RAX: ffffffffffffffda RBX: 0000000000000007 RCX: 00007fec6b1403eb
[13338.327188] RDX: 0000000000000400 RSI: 00000000004238c0 RDI: 0000000000000007
[13338.327994] RBP: 00000000004238c0 R08: 0000000000423880 R09: 0000000000000014
[13338.328803] R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000416004
[13338.329611] R13: 00000000000000af R14: 0000000000000400 R15: 0000000000000014
[13338.330424] Modules linked in: tun brd overlay vfat fat fuse xfs libcrc32c loop intel_rapl_msr intel_rapl_common nfit libnvdimm intel_powerclamp cirrus drm_kms_helper syscopyarea sysfillrect sysimgblt crct10dif_pclmul crc32_pclmul fb_sys_fops drm ghash_clmulni_intel virtio_net virtio_balloon i2c_piix4 net_failover failover pcspkr joydev intel_rapl_perf sunrpc ip_tables ext4 mbcache jbd2 ata_generic ata_piix libata crc32c_intel serio_raw virtio_console virtio_blk
[13338.334588] Features: eBPF/sock
[13338.335159] CR2: ffffb14c0839cff6
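
The register dump above can be cross-checked with a little arithmetic. The following is a minimal sketch and not part of the original report: it copies three lines from the log, parses the registers, and compares the faulting address in CR2 with RAX and R11. The helper name parse_registers is hypothetical.

```python
import re

# Three lines copied verbatim from the oops above (timestamps removed).
OOPS_LINES = """\
RAX: ffffb14c0839cff6 RBX: ffff9fab0cfb0800 RCX: 0000000000000000
R10: ffff9fab0cfb0002 R11: ffffb14c0839d000 R12: ffff9f9c87c56000
CR2: ffffb14c0839cff6 CR3: 0000000f7f556001 CR4: 00000000003606e0
"""


def parse_registers(text):
    """Return {register name: integer value} for 'NAME: 16-hex-digit' pairs."""
    pairs = re.findall(r"\b([A-Z][A-Z0-9]{1,2}): ([0-9a-f]{16})\b", text)
    return {name: int(value, 16) for name, value in pairs}


regs = parse_registers(OOPS_LINES)

fault_addr = regs["CR2"]                 # address whose access faulted
gap_below_r11 = regs["R11"] - fault_addr  # distance from the fault to R11
r11_page_offset = regs["R11"] & 0xFFF     # 0 means R11 is 4 KiB aligned

print(f"CR2 == RAX: {fault_addr == regs['RAX']}")
print(f"R11 - CR2 : {gap_below_r11} bytes")
print(f"R11 page offset: {r11_page_offset:#x}")
```

With the values from this log, CR2 equals RAX, and the faulting address lies 10 bytes below the page-aligned value held in R11, that is, just before a page boundary. Together with "Oops: 0002" (a kernel-mode write to a non-present page) and "PTE 0" in the page-table walk, this suggests a write that landed in an unmapped page immediately preceding the address in R11. What R11 points to cannot be determined from the log alone.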
Tags: No tags attached.

Activities

jacobwang

2021-06-06 14:54

manager   ~0000241

Last edited: 2021-06-06 15:09

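Background:
There are currently two versions of the LTP test suite: one targeting the 4.19 kernel and one specifically adapted for the 5.10 kernel.
Note: the LTP version that reproduces the crash is the one adapted for the 5.10 kernel.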

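First, validation with the same LTP version that was used during the Anolis OS GA release phase (the 4.19 version) shows no crash.
            This is therefore not a product regression but a newly discovered issue
            (note: the LTP version that reproduces the crash is the one adapted for the 5.10 kernel).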

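Second, the issue also reproduces on a CentOS 8.2 GA instance running kernel 4.18.0-193.el8.x86_64 with the LTP 5.10 version.
            Because Anolis OS currently aims for bug-to-bug compatibility with CentOS in testing, the priority of this issue has been lowered.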

Third, the issue does not reproduce on a CentOS 8.2 GA instance running kernel 4.18.0-240.22.1.el8_3 with the LTP 5.10 version.

The issue appears to have been fixed upstream:
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=acf69c946233259ab4d64f8869d4037a198c7f06
It has also been fixed in versions after 4.18.0-193.51.1.el8_2.

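Testers, please repeat the test on a version after 4.18.0-193.51.1.el8_2. If the issue is resolved there,
Anolis OS only needs to keep following the regular EUS updates for the RHCK kernel.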
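
Before re-running the test, it can help to confirm that the kernel under test really is newer than the 4.18.0-193.51.1.el8_2 baseline named above. The following is a rough sketch under stated assumptions: release_key and is_after_baseline are hypothetical helpers, the comparison looks only at the numeric fields before ".el" rather than implementing full RPM version ordering, and the baseline itself is treated as not yet fixed because the note says "after".

```python
import re

# Baseline named in the note above.
BASELINE = "4.18.0-193.51.1.el8_2"


def release_key(uname_r):
    """Turn an el8-style kernel release string into a tuple of integers.

    Only the part before ".el" is used, so "4.18.0-193.51.1.el8_2.x86_64"
    becomes (4, 18, 0, 193, 51, 1). This is a simplification, not rpmvercmp.
    """
    head = uname_r.split(".el", 1)[0]
    return tuple(int(number) for number in re.findall(r"\d+", head))


def is_after_baseline(uname_r, baseline=BASELINE):
    """Return True if uname_r sorts strictly after the baseline release."""
    return release_key(uname_r) > release_key(baseline)


# The two kernels mentioned in this note.
for kernel in ("4.18.0-193.el8.x86_64", "4.18.0-240.22.1.el8_3"):
    print(kernel, "after baseline:", is_after_baseline(kernel))
```

On the machine under test, the same check could be applied to the running kernel with is_after_baseline(platform.release()) after importing the standard platform module. The sketch is only meant for RHCK-style release strings containing ".el".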

Shiloong

2021-09-09 17:17

developer   ~0000391

This issue has been fixed in ANCK-4.19.91-24.

Issue History

Date Modified | Username | Field | Change
2021-05-28 18:12 | qingming2021 | New Issue |
2021-06-06 14:54 | jacobwang | Note Added: 0000241 |
2021-06-06 14:54 | jacobwang | Assigned To | => jacobwang
2021-06-06 14:54 | jacobwang | Priority | high => low
2021-06-06 14:54 | jacobwang | Status | new => assigned
2021-06-06 15:09 | jacobwang | Note Edited: 0000241 |
2021-09-09 17:17 | Shiloong | Note Added: 0000391 |
2021-12-17 16:30 | jacobwang | Assigned To | jacobwang => Shiloong