----- Original Message -----
Thanks so much for your reply, Dave.
On Mon, Jan 19, 2015 at 03:33:41PM -0500, Dave Anderson wrote:
>
> Han,
>
> This is much worse than I thought. When you said "sometimes", you must
> mean "all of the time" with respect to the active tasks? Because
that's
> what I see here.
Yes, looks like bt doesn't show stack frame all the time ...
>
> So Han, can you find out who in IBM should be responsible for supporting
> ppc64le in the crash utility? Or is it you?
>
No, I'm just a tester. I'll try to file a bug on this problem. It'll be
assigned to the person who in charge of supporting crash utility.
Thanks again.
Han,
I've attached the patch that I've queued for crash-7.1.0:
https://github.com/crash-utility/crash/commit/dc4ea682a21567dd9d093862ec5...
As I mentioned earlier, it fixes the default stack-search method for determining
the backtrace starting hooks.
For example, without the patch, here is what I see on a ppc64le compressed kdump:
crash> bt -a
PID: 12674 TASK: c00000002cc08810 CPU: 0 COMMAND: "bash"
PID: 0 TASK: c0000001ee020000 CPU: 1 COMMAND: "swapper/1"
PID: 0 TASK: c0000001ee021370 CPU: 2 COMMAND: "swapper/2"
PID: 0 TASK: c0000001ee0226e0 CPU: 3 COMMAND: "swapper/3"
PID: 0 TASK: c0000001ee023a50 CPU: 4 COMMAND: "swapper/4"
PID: 0 TASK: c0000001ee024dc0 CPU: 5 COMMAND: "swapper/5"
PID: 0 TASK: c0000001ee026130 CPU: 6 COMMAND: "swapper/6"
PID: 0 TASK: c0000001ee0274a0 CPU: 7 COMMAND: "swapper/7"
crash>
With the patch applied, it looks like this:
crash> bt -a
PID: 12674 TASK: c00000002cc08810 CPU: 0 COMMAND: "bash"
#0 [c00000002925b6f0] crash_kexec at c00000000016e9d8
#1 [c00000002925b8e0] die at c000000000021628
#2 [c00000002925b980] bad_page_fault at c0000000000549e0
#3 [c00000002925b9f0] handle_page_fault at c000000000009584
Data Access error [300] exception frame:
R0: c000000000562b00 R1: c00000002925bce0 R2: c0000000010ade68
R3: 0000000000000063 R4: c000000001608018 R5: c000000001618c88
R6: 00000000000057a8 R7: c000000001259420 R8: c000000000c6de68
R9: 0000000000000001 R10: 0000000000000000 R11: 000000000000013f
R12: c000000000561ec0 R13: c000000007e40000 R14: 0000000010139e60
R15: 0000000040000000 R16: 0000000000000000 R17: 0000000000000000
R18: 00000000101306fc R19: 0000000010139dec R20: 0000000010139df0
R21: 0000000010092c40 R22: 0000000000000000 R23: 0000000000000000
R24: 0000000000000001 R25: 0000000000000007 R26: c00000000100add8
R27: 0000000000000063 R28: c0000000014a9b98 R29: c00000000100b198
R30: c000000000fd8ba0 R31: 0000000000000002
NIP: c000000000561ee0 MSR: 8000000000009033 OR3: c000000000010a48
CTR: c000000000561ec0 LR: c000000000562b20 XER: 0000000020000000
CCR: 0000000048222822 MQ: 0000000000000000 DAR: 0000000000000000
DSISR: 0000000042000000 Syscall Result: 0000000000000000
#4 [c00000002925bce0] sysrq_handle_crash at c000000000561ee0
[Link Register ] [c00000002925bce0] write_sysrq_trigger at c000000000562b20
#5 [c00000002925bd90] proc_reg_write at c00000000037f4f4
#6 [c00000002925bdd0] sys_write at c0000000002d6158
#7 [c00000002925be30] syscall_exit at c00000000000a0fc
syscall [c00] exception frame:
R0: 0000000000000004 R1: 00003fffd7e8e5c0 R2: 00003fff957e9768
R3: 0000000000000001 R4: 00003fff8ed30000 R5: 0000000000000002
R6: 0000000000000000 R7: 0000000000000000 R8: 0000000000000000
R9: 0000000000000000 R10: 0000000000000000 R11: 0000000000000000
R12: 0000000000000000 R13: 00003fff958eb450 R14: 0000000010139e60
R15: 0000000040000000 R16: 0000000000000000 R17: 0000000000000000
R18: 00000000101306fc R19: 0000000010139dec R20: 0000000010139df0
R21: 0000000010092c40 R22: 0000000000000000 R23: 0000000000000000
R24: 000000001013b5f0 R25: 00000000100f9a48 R26: 000001000bf78c50
R27: 0000000000000001 R28: 0000000000000002 R29: 00003fff957e12e8
R30: 00003fff8ed30000 R31: 0000000000000002
NIP: 00003fff95712960 MSR: 800000010280f033 OR3: 0000000000000001
CTR: 0000000000000000 LR: 00003fff95696574 XER: 0000000000000000
CCR: 0000000048222822 MQ: 0000000000000001 DAR: 00003fff95860308
DSISR: 000000000a000000 Syscall Result: 0000000000000000
PID: 0 TASK: c0000001ee020000 CPU: 1 COMMAND: "swapper/1"
#0 [c0000001eff6b9f0] crash_ipi_callback at c000000000051440
#1 [c0000001eff6ba30] smp_ipi_demux at c000000000048124
#2 [c0000001eff6ba70] doorbell_exception at c000000000041d4c
#3 [c0000001eff6baa0] doorbell_super_common at c000000000002714
reserved [a01] exception frame:
R0: c000000000718a8c R1: c0000001eff6bd90 R2: c0000000010ade68
R3: 0000000000000a00 R4: ffffffffffffffff R5: 0000000000000001
R6: 0005eafdf829448b R7: 00000000009e0000 R8: 0000000000000002
R9: 7fffffffffffffff R10: 000000000000003b R11: 000000000000001f
R12: 0000000000000000 R13: c000000007e40380
NIP: c000000000010880 MSR: 8000000100009033 OR3: c00000000071a95c
CTR: c00000000071a7d0 LR: c000000000010880 XER: 0000000000000000
CCR: 0000000044000084 MQ: 0000000000000001 DAR: c0000000010f09d8
DSISR: c0000000016482f8 Syscall Result: 0000000000000000
#4 [c0000001eff6bd90] arch_local_irq_restore at c000000000010880 (unreliable)
#5 [c0000001eff6bdb0] cpuidle_idle_call at c000000000718a8c
#6 [c0000001eff6be10] pseries_lpar_idle at c0000000000894d8
#7 [c0000001eff6be70] arch_cpu_idle at c000000000018118
#8 [c0000001eff6bea0] cpu_startup_entry at c0000000001455a0
#9 [c0000001eff6bf20] start_secondary at c000000000049080
#10 [c0000001eff6bf90] start_secondary_prolog at c000000000009a6c
PID: 0 TASK: c0000001ee021370 CPU: 2 COMMAND: "swapper/2"
#0 [c0000001eff6f9f0] crash_ipi_callback at c000000000051440
#1 [c0000001eff6fa30] smp_ipi_demux at c000000000048124
#2 [c0000001eff6fa70] doorbell_exception at c000000000041d4c
#3 [c0000001eff6faa0] doorbell_super_common at c000000000002714
reserved [a01] exception frame:
R0: c000000000718a8c R1: c0000001eff6fd90 R2: c0000000010ade68
R3: 0000000000000a00 R4: ffffffffffffffff R5: 0000000000000001
R6: 0005eafdf8294338 R7: 0000000000a20000 R8: 0000000000000002
R9: 0000000000000000 R10: c00000000001ec00 R11: 0000000000000079
R12: 0000000000000000 R13: c000000007e40700
NIP: c000000000010880 MSR: 8000000100009033 OR3: c000000000147548
CTR: c00000000001ec00 LR: c000000000010880 XER: 0000000000000000
CCR: 0000000044000084 MQ: 0000000000000001 DAR: c0000000010f09d8
DSISR: c0000000016882f8 Syscall Result: 0000000000000000
#4 [c0000001eff6fd90] arch_local_irq_restore at c000000000010880 (unreliable)
#5 [c0000001eff6fdb0] cpuidle_idle_call at c000000000718a8c
#6 [c0000001eff6fe10] pseries_lpar_idle at c0000000000894d8
#7 [c0000001eff6fe70] arch_cpu_idle at c000000000018118
#8 [c0000001eff6fea0] cpu_startup_entry at c0000000001455a0
#9 [c0000001eff6ff20] start_secondary at c000000000049080
#10 [c0000001eff6ff90] start_secondary_prolog at c000000000009a6c
PID: 0 TASK: c0000001ee0226e0 CPU: 3 COMMAND: "swapper/3"
#0 [c0000001eff739f0] crash_ipi_callback at c000000000051440
#1 [c0000001eff73a30] smp_ipi_demux at c000000000048124
#2 [c0000001eff73a70] doorbell_exception at c000000000041d4c
#3 [c0000001eff73aa0] doorbell_super_common at c000000000002714
reserved [a01] exception frame:
R0: c000000000718a8c R1: c0000001eff73d90 R2: c0000000010ade68
R3: 0000000000000a00 R4: ffffffffffffffff R5: 0000000000000001
R6: 0005eafdf82943ca R7: 0000000000a60000 R8: 0000000000000002
R9: 0000000000000020 R10: c0000000010ede68 R11: 0000000000000144
R12: 00003fff8605280c R13: 00003fff7f806900
NIP: c000000000010880 MSR: 8000000100009033 OR3: c000000000718a40
CTR: 0000000000000000 LR: c000000000010880 XER: 0000000000000000
CCR: 0000000044000084 MQ: 0000000000000001 DAR: c0000000016c0380
DSISR: c0000001eff73ca0 Syscall Result: 0000000000000000
#4 [c0000001eff73d90] arch_local_irq_restore at c000000000010880 (unreliable)
#5 [c0000001eff73db0] cpuidle_idle_call at c000000000718a8c
#6 [c0000001eff73e10] pseries_lpar_idle at c0000000000894d8
#7 [c0000001eff73e70] arch_cpu_idle at c000000000018118
#8 [c0000001eff73ea0] cpu_startup_entry at c0000000001455a0
#9 [c0000001eff73f20] start_secondary at c000000000049080
#10 [c0000001eff73f90] start_secondary_prolog at c000000000009a6c
PID: 0 TASK: c0000001ee023a50 CPU: 4 COMMAND: "swapper/4"
#0 [c0000001eff779a0] crash_ipi_callback at c000000000051440
#1 [c0000001eff779e0] smp_ipi_demux at c000000000048124
#2 [c0000001eff77a20] doorbell_exception at c000000000041d4c
#3 [c0000001eff77a50] doorbell_super_common at c000000000002714
reserved [a01] exception frame:
R0: 0000000024000084 R1: c0000001eff77d40 R2: c0000000010ade68
R3: 0000000000000000 R4: c00000000101b980 R5: 0000000000000000
R6: 001bc0f45c000000 R7: 0000000000000000 R8: 0000000000000000
R9: c000000007c40c00 R10: 0000000000000001 R11: 0005eafdf7a13a2f
R12: 0000000000000000 R13: c000000007e40e00
NIP: c000000000086ae4 MSR: 8000000100009033 OR3: c00000000071b334
CTR: c0000000000aca7c LR: c00000000071b338 XER: 0000000000000000
CCR: 0000000024000084 MQ: 0000000000000001 DAR: 0000000000000001
DSISR: c0000001eff77c60 Syscall Result: 0000000000000000
#4 [c0000001eff77d40] plpar_hcall_norets at c000000000086ae4
[Link Register ] [c0000001eff77d40] shared_cede_loop at c00000000071b338
#5 [c0000001eff77db0] cpuidle_idle_call at c000000000718a70
#6 [c0000001eff77e10] pseries_lpar_idle at c0000000000894d8
#7 [c0000001eff77e70] arch_cpu_idle at c000000000018118
#8 [c0000001eff77ea0] cpu_startup_entry at c0000000001455a0
#9 [c0000001eff77f20] start_secondary at c000000000049080
#10 [c0000001eff77f90] start_secondary_prolog at c000000000009a6c
PID: 0 TASK: c0000001ee024dc0 CPU: 5 COMMAND: "swapper/5"
#0 [c0000001eff7b9a0] crash_ipi_callback at c000000000051440
#1 [c0000001eff7b9e0] smp_ipi_demux at c000000000048124
#2 [c0000001eff7ba20] doorbell_exception at c000000000041d4c
#3 [c0000001eff7ba50] doorbell_super_common at c000000000002714
reserved [a01] exception frame:
R0: 0000000024000084 R1: c0000001eff7bd40 R2: c0000000010ade68
R3: 0000000000000000 R4: c00000000101b980 R5: 0000000000000000
R6: 001bc0f45c000000 R7: 0000000000000000 R8: 0000000000000000
R9: c000000007c41000 R10: 0000000000000001 R11: 0005eafdf7a13a2f
R12: 0000000000000000 R13: c000000007e41180
NIP: c000000000086ae4 MSR: 8000000100009033 OR3: c00000000071acec
CTR: c0000000000aca7c LR: c00000000071b338 XER: 0000000000000000
CCR: 0000000024000084 MQ: 0000000000000001 DAR: 0000000000000001
DSISR: c0000001eff7bc60 Syscall Result: 0000000000000000
#4 [c0000001eff7bd40] plpar_hcall_norets at c000000000086ae4
[Link Register ] [c0000001eff7bd40] shared_cede_loop at c00000000071b338
#5 [c0000001eff7bdb0] cpuidle_idle_call at c000000000718a70
#6 [c0000001eff7be10] pseries_lpar_idle at c0000000000894d8
#7 [c0000001eff7be70] arch_cpu_idle at c000000000018118
#8 [c0000001eff7bea0] cpu_startup_entry at c0000000001455a0
#9 [c0000001eff7bf20] start_secondary at c000000000049080
#10 [c0000001eff7bf90] start_secondary_prolog at c000000000009a6c
PID: 0 TASK: c0000001ee026130 CPU: 6 COMMAND: "swapper/6"
#0 [c0000001eff7f9a0] crash_ipi_callback at c000000000051440
#1 [c0000001eff7f9e0] smp_ipi_demux at c000000000048124
#2 [c0000001eff7fa20] doorbell_exception at c000000000041d4c
#3 [c0000001eff7fa50] doorbell_super_common at c000000000002714
reserved [a01] exception frame:
R0: 0000000024000084 R1: c0000001eff7fd40 R2: c0000000010ade68
R3: 0000000000000000 R4: c00000000101b980 R5: 0000000000000000
R6: 001bc0f45c000000 R7: 0000000000000000 R8: 0000000000000000
R9: c000000007c41400 R10: 0000000000000001 R11: 0005eafdf7a13a2f
R12: 0000000000000000 R13: c000000007e41500
NIP: c000000000086ae4 MSR: 8000000100009033 OR3: c00000000047a0b4
CTR: c0000000000aca7c LR: c00000000071b338 XER: 0000000000000000
CCR: 0000000024000084 MQ: 0000000000000001 DAR: 0000000000000001
DSISR: c0000001eff7fc60 Syscall Result: 0000000000000000
#4 [c0000001eff7fd40] plpar_hcall_norets at c000000000086ae4
[Link Register ] [c0000001eff7fd40] shared_cede_loop at c00000000071b338
#5 [c0000001eff7fdb0] cpuidle_idle_call at c000000000718a70
#6 [c0000001eff7fe10] pseries_lpar_idle at c0000000000894d8
#7 [c0000001eff7fe70] arch_cpu_idle at c000000000018118
#8 [c0000001eff7fea0] cpu_startup_entry at c0000000001455a0
#9 [c0000001eff7ff20] start_secondary at c000000000049080
#10 [c0000001eff7ff90] start_secondary_prolog at c000000000009a6c
PID: 0 TASK: c0000001ee0274a0 CPU: 7 COMMAND: "swapper/7"
#0 [c0000001ee0839a0] crash_ipi_callback at c000000000051440
#1 [c0000001ee0839e0] smp_ipi_demux at c000000000048124
#2 [c0000001ee083a20] doorbell_exception at c000000000041d4c
#3 [c0000001ee083a50] doorbell_super_common at c000000000002714
reserved [a01] exception frame:
R0: 0000000024000084 R1: c0000001ee083d40 R2: c0000000010ade68
R3: 0000000000000000 R4: c00000000101b980 R5: 0000000000000000
R6: 001bc0f45c000000 R7: 0000000000000000 R8: 0000000000000000
R9: c000000007c41800 R10: 0000000000000001 R11: 0005eafdf7a13a2f
R12: 0000000000000000 R13: c000000007e41880
NIP: c000000000086ae4 MSR: 8000000100009033 OR3: c0000000001475dc
CTR: c0000000000aca7c LR: c00000000071b338 XER: 0000000000000000
CCR: 0000000024000084 MQ: 0000000000000001 DAR: 0000000000000001
DSISR: c0000001ee083c60 Syscall Result: 0000000000000000
#4 [c0000001ee083d40] plpar_hcall_norets at c000000000086ae4
[Link Register ] [c0000001ee083d40] shared_cede_loop at c00000000071b338
#5 [c0000001ee083db0] cpuidle_idle_call at c000000000718a70
#6 [c0000001ee083e10] pseries_lpar_idle at c0000000000894d8
#7 [c0000001ee083e70] arch_cpu_idle at c000000000018118
#8 [c0000001ee083ea0] cpu_startup_entry at c0000000001455a0
#9 [c0000001ee083f20] start_secondary at c000000000049080
#10 [c0000001ee083f90] start_secondary_prolog at c000000000009a6c
crash>
Thanks,
Dave