[PATCH] do not check sp if ip points to user space - Crash-utility - Crash Utility List Archives

[PATCH] do not check sp if ip points to user space


Wen Congyang

Friday, 23 September 2011

2:48 a.m.

If the task is a user program, the sp can point anywhere, because we can modify sp in assembly. For example:

        .globl main
        .type main, @function
main:
        finit
        subq $16, (%rsp)
        movq $0, (%rsp)
.loop:
        jmp .loop

Attachments:

0001-do-not-check-sp-if-ip-points-to-user-space.patch (text/x-patch — 1.3 KB)


Dave Anderson

Friday, 23 September

8:41 a.m.

----- Original Message -----

> If the task is a user program, the sp can point anywhere,
> because we can modify sp in assembly.
> For example:
>
>         .globl main
>         .type main, @function
> main:
>         finit
>         subq $16, (%rsp)
>         movq $0, (%rsp)
> .loop:
>         jmp .loop

Why would any user task do that?

And what happens when a backtrace is attempted on such a task?

Since the current code would not set BT_USER_SPACE, I'm guessing that it would run into this (at least on x86_64):

        if (!(bt->flags & BT_USER_SPACE) && (!rsp || !accessible(rsp))) {
                error(INFO, "cannot determine starting stack pointer\n");
                return;
        }

I do believe that I put the additional in_user_stack() checks in those locations for a reason. Consider a task running in kernel mode that corrupts its return address stack location with a non-kernel address, or called a function indirectly that had a NULL pointer in it. That would cause a kernel crash with a non-kernel RIP in its exception frame, and your patch would mistake it for user-space.

In any case, you're going to have to come up with a more compelling reason to change all of these locations. (And for that matter, I wonder why you didn't patch Fujitsu's get_sadump_regs() the same way?)

Dave


Andrew Suffield

7:19 p.m.

On 23 September 2011 14:41, Dave Anderson <anderson(a)redhat.com> wrote:

> Why would any user task do that?

Generally because it's buggy and has just smashed the stack, which dovetails nicely with the question "Why am I running a debugger?"

(I'm not really sure what the right behaviour is here)


Dave Anderson

Saturday, 24 September

10:33 a.m.


----- Original Message -----

> On 23 September 2011 14:41, Dave Anderson <anderson(a)redhat.com> wrote:
>
> > Why would any user task do that?
>
> Generally because it's buggy and has just smashed the stack, which
> dovetails nicely with the question "Why am I running a debugger?"
>
> (I'm not really sure what the right behaviour is here)

But a buggy task such as that would only be relevant to the crash utility *if*:

 (1) it were to generate a kernel-mode crash (highly unlikely), or
 (2) if it were the active task on a cpu when a kernel crash occurred on another cpu.

In the second case, it would receive an NMI from the crashing cpu, which would bring it into the kernel, and the backtrace on that cpu would start from the NMI stack.

Now, in that bizarre case, I'm not sure whether the transition from the NMI stack back (in this case) to user space would work as expected. That's why I asked for an example of a backtrace.

But is it even worth caring about? And if it is, it should probably be addressed in the backtrace code, and not as the patch did it.

Dave


Wen Congyang

Monday, 26 September

12:47 a.m.

At 09/23/2011 09:41 PM, Dave Anderson wrote:

> ----- Original Message -----
> > If the task is a user program, the sp can point anywhere,
> > because we can modify sp in assembly.
> > For example:
> >
> >         .globl main
> >         .type main, @function
> > main:
> >         finit
> >         subq $16, (%rsp)
> >         movq $0, (%rsp)
> > .loop:
> >         jmp .loop
>
> Why would any user task do that?
>
> And what happens when a backtrace is attempted on such a task?
>
> Since the current code would not set BT_USER_SPACE, I'm guessing that it
> would run into this (at least on x86_64):
>
>         if (!(bt->flags & BT_USER_SPACE) && (!rsp || !accessible(rsp))) {
>                 error(INFO, "cannot determine starting stack pointer\n");
>                 return;
>         }

Yes, crash will run into this on x86_64.

> I do believe that I put the additional in_user_stack() checks in those
> locations for a reason. Consider a task running in kernel mode that
> corrupts its return address stack location with a non-kernel address,
> or called a function indirectly that had a NULL pointer in it. That
> would cause a kernel crash with a non-kernel RIP in its exception frame,
> and your patch would mistake it for user-space.

I know the reason why you check whether sp is in the user stack. What about this:

        if !is_kernel_text(ip) && (sp is in a kernel stack (including irq))
                try to backtrace according to sp
        else
                display registers

If both ip and sp are corrupted, and we cannot determine sp from the contents of the stack, I think we should display the registers.
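The proposal above can be sketched as a small decision function. This is only an illustration under stated assumptions: the `layout` structure, the `in_range()` helper and the `bt_action` values are hypothetical stand-ins, not crash's own data structures, and the sketch treats an ip inside kernel text as the existing, unchanged path, which is one reading of the pseudocode.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical address ranges; crash derives the real ones from the dump. */
struct range {
    uint64_t start;   /* inclusive */
    uint64_t end;     /* exclusive */
};

struct layout {
    struct range kernel_text;
    struct range process_stack;   /* the task's kernel-mode stack */
    struct range irq_stack;       /* the per-cpu IRQ stack */
};

enum bt_action {
    BT_NORMAL,         /* ip is kernel text: existing path, unchanged */
    BT_FROM_SP,        /* ip is not kernel text, but sp is usable */
    BT_DISPLAY_REGS    /* neither ip nor sp can be trusted */
};

static bool in_range(const struct range *r, uint64_t addr)
{
    return addr >= r->start && addr < r->end;
}

/* Decide how to start a backtrace from a saved ip/sp pair. */
static enum bt_action choose_action(const struct layout *l,
                                    uint64_t ip, uint64_t sp)
{
    bool sp_in_kernel_stack = in_range(&l->process_stack, sp) ||
                              in_range(&l->irq_stack, sp);

    if (in_range(&l->kernel_text, ip))
        return BT_NORMAL;
    if (sp_in_kernel_stack)
        return BT_FROM_SP;
    return BT_DISPLAY_REGS;
}
```

With this shape, a NULL or user-space ip paired with an sp inside a kernel stack still yields a backtrace attempt, which covers the corrupted-return-address case Dave describes, while the assembly example from the first post (user ip, arbitrary sp) falls through to the register display.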

> In any case, you're going to have to come up with a more compelling
> reason to change all of these locations. (And for that matter, I wonder
> why you didn't patch Fujitsu's get_sadump_regs() the same way?)

Not only Fujitsu's sadump; I think kvmdump has the same problem.

By the way, crash tries to use the register values when the dump format is diskdump. I think we should use the register values when the dump format is Fujitsu's sadump.

Thanks
Wen Congyang

> Dave

--
Crash-utility mailing list
Crash-utility(a)redhat.com
https://www.redhat.com/mailman/listinfo/crash-utility


Dave Anderson

7:51 a.m.

----- Original Message -----

> At 09/23/2011 09:41 PM, Dave Anderson wrote:
> > [...]
> > Since the current code would not set BT_USER_SPACE, I'm guessing that it
> > would run into this (at least on x86_64):
> >
> >         if (!(bt->flags & BT_USER_SPACE) && (!rsp || !accessible(rsp))) {
> >                 error(INFO, "cannot determine starting stack pointer\n");
> >                 return;
> >         }
>
> Yes, crash will run into this on x86_64.

OK, so why not change the above to do something like this:

        if (!(bt->flags & BT_USER_SPACE) && (!rsp || !accessible(rsp))) {
                fprintf(ofp, "cannot determine starting stack pointer\n");
                if (KVMDUMP_DUMPFILE())
                        kvmdump_display_regs(bt->tc->processor, ofp);
                else if (ELF_NOTES_VALID() && DISKDUMP_DUMPFILE())
                        diskdump_display_regs(bt->tc->processor, ofp);
                else if (SADUMP_DUMPFILE())
                        sadump_display_regs(bt->tc->processor, ofp);
                return;
        }

Dave
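The control flow of that suggestion can be tried outside crash with a self-contained sketch. Everything named here is a hypothetical stand-in: `dump_format` replaces the KVMDUMP_DUMPFILE(), DISKDUMP_DUMPFILE() and SADUMP_DUMPFILE() tests, `show_regs()` replaces the three *_display_regs() routines, and the boolean parameters replace the BT_USER_SPACE flag and the rsp checks.

```c
#include <stdbool.h>
#include <stdio.h>

/* Hypothetical stand-in for crash's per-format dumpfile tests. */
enum dump_format { FMT_KVMDUMP, FMT_DISKDUMP, FMT_SADUMP, FMT_OTHER };

/* Stand-in for kvmdump/diskdump/sadump_display_regs(). */
static void show_regs(const char *source, int cpu, FILE *ofp)
{
    fprintf(ofp, "CPU %d registers (from %s)\n", cpu, source);
}

/*
 * Returns true when the backtrace may proceed. Returns false when the
 * starting stack pointer is unusable; in that case the message is
 * printed and, if the dump format carries registers, they are shown.
 */
static bool check_starting_sp(bool user_space, bool sp_usable,
                              enum dump_format fmt, bool elf_notes_valid,
                              int cpu, FILE *ofp)
{
    if (user_space || sp_usable)
        return true;

    fprintf(ofp, "cannot determine starting stack pointer\n");
    if (fmt == FMT_KVMDUMP)
        show_regs("kvmdump", cpu, ofp);
    else if (elf_notes_valid && fmt == FMT_DISKDUMP)
        show_regs("diskdump ELF notes", cpu, ofp);
    else if (fmt == FMT_SADUMP)
        show_regs("sadump", cpu, ofp);
    return false;
}
```

The point of the design is that the message and the early return stay unconditional, so a dump format without saved registers behaves exactly as before, and the register display is purely additional output.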


Wen Congyang

Wednesday, 28 September

4:03 a.m.

At 09/26/2011 08:51 PM, Dave Anderson wrote:

> [...]
> OK, so why not change the above to do something like this:
>
>         if (!(bt->flags & BT_USER_SPACE) && (!rsp || !accessible(rsp))) {
>                 fprintf(ofp, "cannot determine starting stack pointer\n");
>                 [...]
>
> Dave

Agreed. But we should initialize ofp earlier, and we should add the same code to the function x86_back_trace_cmd().

Thanks
Wen Congyang


Dave Anderson

7:53 a.m.

----- Original Message -----

> [...]
> Agree with it. But we should init ofp earlier, and we should add the same
> code in the function x86_back_trace_cmd().
>
> Thanks
> Wen Congyang

Yes, I'll do that as well.

Thanks Wen,
Dave


HATAYAMA Daisuke

Monday, 13 November

8:14 a.m.

Hello all,

From: Wen Congyang <wency(a)cn.fujitsu.com>
Subject: Re: [Crash-utility] [PATCH] do not check sp if ip points to user space
Date: Mon, 26 Sep 2011 13:47:39 +0800

> At 09/23/2011 09:41 PM, Dave Anderson wrote:
> [...]
> > In any case, you're going to have to come up with a more compelling
> > reason to change all of these locations. (And for that matter, I wonder
> > why you didn't patch Fujitsu's get_sadump_regs() the same way?)
>
> Not only Fujitsu's sadump; I think kvmdump has the same problem.
>
> By the way, crash tries to use the register values when the dump format
> is diskdump. I think we should use the register values when the dump
> format is Fujitsu's sadump.

To be honest, I doubted the logic just as you did when I read get_kvmdump_regs() for the first time, but I then understood why it is so: just as Dave has already explained, the bt command in the crash utility mainly targets kernel stacks generated by C programs, and that seems reasonable enough as the feature users want.

BTW, it is a fact that looking only at RIP and RSP is not sufficient to decide the execution mode of a given task. More conditions are necessary.

On x86 architectures, for example, CS carries that information, and it can cover the assembly example you showed in the first post. But of course this logic is specific to x86 and x86_64; I have no idea about other architectures, and it might be inapplicable to kvmdump targeting architectures other than x86.

Thanks.
HATAYAMA, Daisuke
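As a minimal sketch of the CS-based test mentioned above: in protected and long mode, the low two bits of the CS selector hold the privilege level, so a saved CS with those bits equal to 3 means the CPU was in user mode, wherever RIP and RSP point. The helper names below are hypothetical and not part of crash; the selector values in the usage note are the usual Linux x86_64 ones and are given only as examples.

```c
#include <stdbool.h>
#include <stdint.h>

/* Low two bits of an x86 segment selector: the privilege level. */
#define SEGMENT_RPL_MASK 0x3u

/* Privilege level (0..3) encoded in a saved CS selector. */
static unsigned cs_privilege_level(uint16_t cs)
{
    return cs & SEGMENT_RPL_MASK;
}

/* True if the saved CS says the CPU was executing in user mode (ring 3). */
static bool cs_is_user_mode(uint16_t cs)
{
    return cs_privilege_level(cs) == 3;
}
```

For example, `cs_is_user_mode(0x33)` is true for the usual 64-bit user code selector and `cs_is_user_mode(0x10)` is false for the usual kernel code selector. The test only makes sense when CS is a protected-mode or long-mode selector; a CS value captured in real-address mode is a segment base, not a selector, and cannot be interpreted this way.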


Dave Anderson

Monday, 26 September

8:20 a.m.

----- Original Message -----

> [...]
> BTW, it is a fact that looking only at RIP and RSP is not sufficient
> to decide the execution mode of a given task. More conditions are
> necessary.
>
> On x86 architectures, for example, CS carries that information, and
> it can cover the assembly example you showed in the first post. But
> of course this logic is specific to x86 and x86_64; I have no idea
> about other architectures, and it might be inapplicable to kvmdump
> targeting architectures other than x86.

Right, but the fact of the matter is that a backtrace cannot be performed for a task with a nonsensical SP value, so it doesn't make a difference whether it was in user-space or kernel-space. So the "cannot determine starting stack pointer" error message should still be displayed as it currently is -- and with my patch suggestion, the registers can be dumped (if available) before returning.

Dave


HATAYAMA Daisuke

Monday, 13 November

8:14 a.m.

From: Dave Anderson <anderson(a)redhat.com>
Subject: Re: [Crash-utility] [PATCH] do not check sp if ip points to user space
Date: Mon, 26 Sep 2011 09:20:26 -0400 (EDT)

> [...]
> Right, but the fact of the matter is that a backtrace cannot be
> performed for a task with a nonsensical SP value, so it doesn't
> make a difference whether it was in user-space or kernel-space.
> So the "cannot determine starting stack pointer" error message
> should still be displayed as it currently does -- and with my patch
> suggestion, the registers can be dumped (if available) before
> returning.

I understand. The condition to use here is whether the backtrace can be performed or not, which is implied by SP pointing outside all of the various kernel stacks; I guess this is what you mean by a nonsensical SP. I think the new behaviour is reasonable.

By the way, I have a question: what is your intention behind !is_kernel_text(ip)? Is it just to exclude the case of user text? I guess you also intend to exclude other possibilities.

Because sadump runs beyond the logic of the kernel, the register values contained in the vmcore are sometimes ones from real-address mode, apparently captured while running in some firmware; this often happens with crashes during boot time.

I'm also wondering if there are other dump mechanisms that can lead to this kind of situation. For example, although I don't understand the detailed behaviour of NMI, suppose an NMI is triggered immediately even while running in firmware, without rolling back the register values saved on entering the firmware context from the kernel. Then the register values in the NMI frame would be the firmware's, and the same situation could happen on kdump (and other mechanisms that use NMI); but I have never seen such register values on kdump...

Thanks.
HATAYAMA, Daisuke


Dave Anderson

Tuesday, 27 September

7:46 a.m.

----- Original Message -----

> > Right, but the fact of the matter is that a backtrace cannot be
> > performed for a task with a nonsensical SP value, so it doesn't
> > make a difference whether it was in user-space or kernel-space.
> > [...]
>
> I understand. The condition to use here is whether the backtrace can
> be performed or not, which is implied by SP pointing outside all of
> the various kernel stacks; I guess this is what you mean by a
> nonsensical SP. I think the new behaviour is reasonable.
>
> By the way, I have a question: what is your intention behind
> !is_kernel_text(ip)? Is it just to exclude the case of user text? I
> guess you also intend to exclude other possibilities.

Right, just to prevent the inadvertent setting of BT_USER_SPACE.

> Because sadump runs beyond the logic of the kernel, the register
> values contained in the vmcore are sometimes ones from real-address
> mode, apparently captured while running in some firmware; this often
> happens with crashes during boot time.

That's news to me. I don't know how you would want the backtrace mechanism to perform in that case, but I'm presuming that you would *not* want BT_USER_SPACE set.

> I'm also wondering if there are other dump mechanisms that can lead
> to this kind of situation. For example, although I don't understand
> the detailed behaviour of NMI, suppose an NMI is triggered immediately
> even while running in firmware, without rolling back the register
> values saved on entering the firmware context from the kernel. Then
> the register values in the NMI frame would be the firmware's, and the
> same situation could happen on kdump (and other mechanisms that use
> NMI); but I have never seen such register values on kdump...

I've never seen, or heard of, such a situation. I would guess that the design of SMI interrupts would prevent that from happening.

Dave


HATAYAMA Daisuke

Monday, 13 November

8:14 a.m.

From: Dave Anderson <anderson(a)redhat.com>
Subject: Re: [Crash-utility] [PATCH] do not check sp if ip points to user space
Date: Tue, 27 Sep 2011 08:46:27 -0400 (EDT)

> [...]
> I've never seen, or heard of, such a situation. I would guess that the
> design of SMI interrupts would prevent that from happening.

I had been concerned that, if such a situation were possible, users could lose the kernel-side backtrace up to the firmware path. According to your answer, it seems that the interrupt mechanism properly resumes its context when returning from firmware. Of course, I'll follow up on this locally, because I have not confirmed it at the specification level. But this is very helpful for my understanding. Thanks for answering the question.

Thanks.
HATAYAMA, Daisuke


devel@lists.crash-utility.osci.io


participants (4)

Andrew Suffield
Dave Anderson
HATAYAMA Daisuke
Wen Congyang