questions about crash utility

[PATCH]: minimal mode extensions

dmesg content from crash without...

卜弋天

Thursday, 17 January 2013 Thu, 17 Jan '13

10:30 a.m.

Hello: i am using crash utility 6.0.8 to parse the dump file of kernel 3.4. my platform will generate ebi.bin after crash, this binary file dumps ddr from address 0x0 to 0x20000000, total 512MB ram. after i get this binary file, i prefix a elf header to it, the function to generate elf header is as below: static size_t mkelfheader(void *buf) { struct elf_phdr *nhdr, *phdr; struct elfhdr *elf; size_t offset = 0; void *bufp = buf; elf = (Elf32_Ehdr *) bufp; bufp += sizeof(Elf32_Ehdr); offset += sizeof(struct elfhdr); memcpy(elf->e_ident, ELFMAG, SELFMAG); elf->e_ident[EI_CLASS] = ELFCLASS32; elf->e_ident[EI_DATA] = ELFDATA2LSB; elf->e_ident[EI_VERSION]= EV_CURRENT; elf->e_ident[EI_OSABI] = ELFOSABI_NONE; memset(elf->e_ident+EI_PAD, 0, EI_NIDENT-EI_PAD); elf->e_type = ET_CORE; elf->e_machine = EM_ARM; elf->e_version = EV_CURRENT; elf->e_entry = 0; elf->e_phoff = sizeof(struct elfhdr); elf->e_shoff = 0; elf->e_flags = 0; elf->e_ehsize = sizeof(struct elfhdr); elf->e_phentsize= sizeof(struct elf_phdr); elf->e_phnum = 2; elf->e_shentsize= 0; elf->e_shnum = 0; elf->e_shstrndx = 0; nhdr = (struct elf_phdr *) bufp; bufp += sizeof(struct elf_phdr); offset += sizeof(struct elf_phdr); nhdr->p_type = PT_NOTE; nhdr->p_offset = 0; nhdr->p_vaddr = 0; nhdr->p_paddr = 0; nhdr->p_filesz = 0; nhdr->p_memsz = 0; nhdr->p_flags = 0; nhdr->p_align = 0; phdr = (struct elf_phdr *) bufp; bufp += sizeof(struct elf_phdr); offset += sizeof(struct elf_phdr); phdr->p_type = PT_LOAD; phdr->p_flags = PF_R|PF_W|PF_X; phdr->p_offset = offset; phdr->p_vaddr = 0xc0000000; phdr->p_paddr = 0x00200000; phdr->p_filesz = phdr->p_memsz = MEMSIZE; phdr->p_align = 0; return offset; } after all, there will be a cdump.elf which contains the generated elf header, tailed by ebi.bin. then i use crash utility to load this cdump.elf together with the vmlinux. it has below error: WARNING: could not find MAGIC_START! WARNING: cpu_present_mask indicates more than 4 (NR_CPUS) cpus crash: cannot determine base kernel version crash: vmlinux and cdump.elf do not match! our platform set CONFIG_PHYS_OFFSET=0x00200000 in kernel .config file, which means that the virtual address 0xc0000000 will map to physical address 0x00200000. for this reason, i set phdr->p_paddr = 0x00200000 when generate the elf header. please help me to find out what is wrong, thanks very much. Best Regards

Attachments:

attachment.html (text/html — 3.3 KB)

Show replies by date

Dave Anderson

Thursday, 17 January Thu, 17 Jan

1:17 p.m.

----- Original Message -----

...

The fact that crash gets as far as it does at least means that the ELF header you've created was deemed acceptable as an ARM vmcore. However, the error messages re: "cpu_present_mask indicates..." and "cannot determine base kernel version" indicate that the data that was read from the vmcore was clearly not the correct data. The "cpu_present_mask" value that it read contained too many bits -- presuming that the 32-bit ARM processor is still limited to only 4 cpus. (looks like upstream that CONFIG_NR_CPUS is still 2 in the arch/arm/configs files.) But more indicative of the wrong data being read is the second "cannot determine base kernel version" message, which was generated after it read the kernel's "init_uts_ns" uts_namespace structure. After reading it, it sees that the "release" string contains non-ASCII data, whereas it should contain the kernel version: crash> p init_uts_ns init_uts_ns = $3 = { kref = { refcount = { counter = 2 } }, name = { sysname = "Linux\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", nodename = "phenom-01.lab.bos.redhat.com\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", release = "2.6.32-313.el6.x86_64\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", version = "#1 SMP Thu Sep 27 16:25:19 EDT 2012\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", machine = "x86_64\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", domainname = "(none)\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000" } } crash> So it appears that you're reading data from the wrong locations in the dumpfile. You should be able to verify that by bringing up the crash session with the --minimal flag like this: $ crash --minimal vmlinux vmcore That will bypass most of the initialization, including all readmem() calls of the vmcore. Then do this: crash> rd linux_banner 20 ffffffff818000a0: 65762078756e694c 2e33206e6f697372 Linux version 3. ffffffff818000b0: 63662e312d312e35 365f3638782e3731 5.1-1.fc17.x86_6 ffffffff818000c0: 626b636f6d282034 69756240646c6975 4 (mockbuild@bui ffffffff818000d0: 2e33322d6d76646c 6465662e32786870 ldvm-23.phx2.fed ffffffff818000e0: 656a6f727061726f 202967726f2e7463 oraproject.org) ffffffff818000f0: 7265762063636728 372e34206e6f6973 (gcc version 4.7 ffffffff81800100: 303231303220302e 6465522820373035 .0 20120507 (Red ffffffff81800110: 372e342074614820 47282029352d302e Hat 4.7.0-5) (G ffffffff81800120: 3123202920294343 75685420504d5320 CC) ) #1 SMP Thu ffffffff81800130: 3120392067754120 2033343a30353a37 Aug 9 17:50:43 crash> rd -a linux_banner ffffffff818000a0: Linux version 3.5.1-1.fc17.x86_64 (mockbuild(a)buildvm-23.phx2 ffffffff818000dc: .fedoraproject.org) (gcc version 4.7.0 20120507 (Red Hat 4.7 ffffffff81800118: .0-5) (GCC) ) #1 SMP Thu Aug 9 17:50:43 UTC 2012 crash> I'm guessing that you will not see a string starting with "Linux version" with your dumpfile as shown above. If that's the case, then it's clear that the readmem() function is ultimately reading from the wrong vmcore file offset. Here's what you can try doing. Taking the linux_banner example above, you can check where in the dumpfile it's reading from by setting the debug flag, before doing a simple read -- like this example on an ARM dumpfile: crash> set debug 8 debug: 8 crash> rd linux_banner <addr: c033ea10 count: 1 flag: 488 (KVADDR)> <readmem: c033ea10, KVADDR, "32-bit KVADDR", 4, (FOE), ff94f048> <read_kdump: addr: c033ea10 paddr: 33ea10 cnt: 4> read_netdump: addr: c033ea10 paddr: 33ea10 cnt: 4 offset: 33f088 c033ea10: 756e694c Linu crash> The linux_banner is at virtual address c033ea10 (addr). First it gets translated into physical address 33ea10 (paddr). Then that paddr is translated into the vmcore file offset of 33f088. It lseeks to vmcore file offset 33f088 and reads 4 bytes, which contain "756e694c", or the first 4 bytes of the "Linux version ..." string. Note that if I subtract the physical address from vmcore file offset I get this: crash> eval 33f088 - 33ea10 hexadecimal: 678 decimal: 1656 octal: 3170 binary: 00000000000000000000011001111000 crash> which would put physical address 0 at a vmcore file offset of 0x678, and therefore implying that that the ELF header comprises the first 0x678 bytes. And looking at the vmcore, that can be verified: $ readelf -a vmcore ELF Header: Magic: 7f 45 4c 46 01 01 01 00 00 00 00 00 00 00 00 00 Class: ELF32 Data: 2's complement, little endian Version: 1 (current) OS/ABI: UNIX - System V ABI Version: 0 Type: CORE (Core file) Machine: ARM Version: 0x1 Entry point address: 0x0 Start of program headers: 52 (bytes into file) Start of section headers: 0 (bytes into file) Flags: 0x0 Size of this header: 52 (bytes) Size of program headers: 32 (bytes) Number of program headers: 3 Size of section headers: 0 (bytes) Number of section headers: 0 Section header string table index: 0 There are no sections in this file. There are no sections to group in this file. Program Headers: Type Offset VirtAddr PhysAddr FileSiz MemSiz Flg Align NOTE 0x000094 0x00000000 0x004e345c 0x005e4 0x005e4 0 LOAD 0x000678 0xc0000000 0x00000000 0x5600000 0x5600000 RWE 0 LOAD 0x5600678 0xc5700000 0x05700000 0x100000 0x100000 RWE 0 ... Note that the "Offset" value of the first PT_LOAD segment has a file offset value of 0x678. Another thing to do is to verify that your phys_base of 0x20000000 is being properly seen. In the --minimal session, you can verify that by doing this: crash> help -m | grep phys_base Trying the above should yield some clues into the problem you're encountering. Dave

卜弋天

9:29 p.m.

Hi Dave: thank you very much for your detail answer, this really helpful. please see my inline words. thanks.> Date: Thu, 17 Jan 2013 14:17:36 -0500

...

From: anderson(a)redhat.com To: crash-utility(a)redhat.com Subject: Re: [Crash-utility] questions about crash utility

...

The fact that crash gets as far as it does at least means that the ELF header you've created was deemed acceptable as an ARM vmcore. However, the error messages re: "cpu_present_mask indicates..." and "cannot determine base kernel version" indicate that the data that was read from the vmcore was clearly not the correct data. The "cpu_present_mask" value that it read contained too many bits -- presuming that the 32-bit ARM processor is still limited to only 4 cpus. (looks like upstream that CONFIG_NR_CPUS is still 2 in the arch/arm/configs files.) But more indicative of the wrong data being read is the second "cannot determine base kernel version" message, which was generated after it read the kernel's "init_uts_ns" uts_namespace structure. After reading it, it sees that the "release" string contains non-ASCII data, whereas it should contain the kernel version: crash> p init_uts_ns init_uts_ns = $3 = { kref = { refcount = { counter = 2 } }, name = { sysname = "Linux\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", nodename = "phenom-01.lab.bos.redhat.com\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", release = "2.6.32-313.el6.x86_64\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", version = "#1 SMP Thu Sep 27 16:25:19 EDT 2012\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", machine = "x86_64\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", domainname = "(none)\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000" } } crash> So it appears that you're reading data from the wrong locations in the dumpfile. You should be able to verify that by bringing up the crash session with the --minimal flag like this: $ crash --minimal vmlinux vmcore That will bypass most of the initialization, including all readmem() calls of the vmcore. Then do this: crash> rd linux_banner 20 ffffffff818000a0: 65762078756e694c 2e33206e6f697372 Linux version 3. ffffffff818000b0: 63662e312d312e35 365f3638782e3731 5.1-1.fc17.x86_6 ffffffff818000c0: 626b636f6d282034 69756240646c6975 4 (mockbuild@bui ffffffff818000d0: 2e33322d6d76646c 6465662e32786870 ldvm-23.phx2.fed ffffffff818000e0: 656a6f727061726f 202967726f2e7463 oraproject.org) ffffffff818000f0: 7265762063636728 372e34206e6f6973 (gcc version 4.7 ffffffff81800100: 303231303220302e 6465522820373035 .0 20120507 (Red ffffffff81800110: 372e342074614820 47282029352d302e Hat 4.7.0-5) (G ffffffff81800120: 3123202920294343 75685420504d5320 CC) ) #1 SMP Thu ffffffff81800130: 3120392067754120 2033343a30353a37 Aug 9 17:50:43 crash> rd -a linux_banner ffffffff818000a0: Linux version 3.5.1-1.fc17.x86_64 (mockbuild(a)buildvm-23.phx2 ffffffff818000dc: .fedoraproject.org) (gcc version 4.7.0 20120507 (Red Hat 4.7 ffffffff81800118: .0-5) (GCC) ) #1 SMP Thu Aug 9 17:50:43 UTC 2012 crash> I'm guessing that you will not see a string starting with "Linux version" with your dumpfile as shown above. If that's the case, then it's clear that the readmem() function is ultimately reading from the wrong vmcore file offset. Here's what you can try doing. Taking the linux_banner example above, you can check where in the dumpfile it's reading from by setting the debug flag, before doing a simple read -- like this example on an ARM dumpfile: crash> set debug 8 debug: 8 crash> rd linux_banner <addr: c033ea10 count: 1 flag: 488 (KVADDR)> <readmem: c033ea10, KVADDR, "32-bit KVADDR", 4, (FOE), ff94f048> <read_kdump: addr: c033ea10 paddr: 33ea10 cnt: 4> read_netdump: addr: c033ea10 paddr: 33ea10 cnt: 4 offset: 33f088 c033ea10: 756e694c Linu crash> The linux_banner is at virtual address c033ea10 (addr). First it gets translated into physical address 33ea10 (paddr). Then that paddr is translated into the vmcore file offset of 33f088. It lseeks to vmcore file offset 33f088 and reads 4 bytes, which contain "756e694c", or the first 4 bytes of the "Linux version ..." string. Note that if I subtract the physical address from vmcore file offset I get this: crash> eval 33f088 - 33ea10 hexadecimal: 678 decimal: 1656 octal: 3170 binary: 00000000000000000000011001111000 crash> which would put physical address 0 at a vmcore file offset of 0x678, and therefore implying that that the ELF header comprises the first 0x678 bytes. And looking at the vmcore, that can be verified: yes you are right, here i get the result as below:crash> set debug 8

...

$ readelf -a vmcore ELF Header: Magic: 7f 45 4c 46 01 01 01 00 00 00 00 00 00 00 00 00 Class: ELF32 Data: 2's complement, little endian Version: 1 (current) OS/ABI: UNIX - System V ABI Version: 0 Type: CORE (Core file) Machine: ARM Version: 0x1 Entry point address: 0x0 Start of program headers: 52 (bytes into file) Start of section headers: 0 (bytes into file) Flags: 0x0 Size of this header: 52 (bytes) Size of program headers: 32 (bytes) Number of program headers: 3 Size of section headers: 0 (bytes) Number of section headers: 0 Section header string table index: 0 There are no sections in this file. There are no sections to group in this file. Program Headers: Type Offset VirtAddr PhysAddr FileSiz MemSiz Flg Align NOTE 0x000094 0x00000000 0x004e345c 0x005e4 0x005e4 0 LOAD 0x000678 0xc0000000 0x00000000 0x5600000 0x5600000 RWE 0 LOAD 0x5600678 0xc5700000 0x05700000 0x100000 0x100000 RWE 0 ... Note that the "Offset" value of the first PT_LOAD segment has a file offset value of 0x678. here i got the result as below:Program Headers:

Type Offset VirtAddr PhysAddr FileSiz MemSiz Flg Align NOTE 0x000000 0x00000000 0x00000000 0x00000 0x00000 0 LOAD 0x000074 0xc0000000 0x00200000 0x2fe00000 0x2fe00000 RWE 0 so the problem is i don't understand the elf header meaning accurately. if i modify code as below, everything is ok for me: offset += sizeof(struct elf_phdr); phdr->p_offset = offset+0x00200000; phdr->p_vaddr = 0xc0000000; phdr->p_paddr = 0x00200000; phdr->p_filesz = phdr->p_memsz = MEMSIZE-0x00200000; although my modification can make crash utility work well, i want to know exactly whether i am doing the right thing. 1. our platform has the ddr address from physical address 0x0. 2. when compiling Linux kernel, our platform set in .config file: CONFIG_PHYS_OFFSET=0x00200000 3. when Kernel crash, all ddr content will be dumped, from address 0x0~768MB. but kernel data starts from 0x00200000 actually. my questions are: 1. whether my setting of ELF header is correct this time? the offset, paddr, and p_memsz ? 2. i am wondering how does crash utility translate virtual address to physical address before and after it get the kernel page table? before get kernel page table, does it calculate as : (virtual_addr - p_vaddr + p_paddr) ? after get kernel page table, does it walk through the page table and find out the real physical address accordingly? 3. my real purpose is to get the ftrace content from dump file by crash utility , but seem the command trace is not for this case, do i need to compile the extension "trace" of crash utility? is there any guide to follow?

...

Another thing to do is to verify that your phys_base of 0x20000000 is being properly seen. In the --minimal session, you can verify that by doing this: crash> help -m | grep phys_base Trying the above should yield some clues into the problem you're encountering. Dave -- Crash-utility mailing list Crash-utility(a)redhat.com https://www.redhat.com/mailman/listinfo/crash-utility

Dave Anderson

Friday, 18 January Fri, 18 Jan

9:23 a.m.

----- Original Message -----

...

Hi Dave: thank you very much for your detail answer, this really helpful. please see my inline words. thanks. > Date: Thu, 17 Jan 2013 14:17:36 -0500 > From: anderson(a)redhat.com > To: crash-utility(a)redhat.com > Subject: Re: [Crash-utility] questions about crash utility > The fact that crash gets as far as it does at least means that the > ELF header you've created was deemed acceptable as an ARM vmcore. > However, the error messages re: "cpu_present_mask indicates..." and > "cannot determine base kernel version" indicate that the data > that was read from the vmcore was clearly not the correct data. > > The "cpu_present_mask" value that it read contained too > many bits -- presuming that the 32-bit ARM processor is > still limited to only 4 cpus. (looks like upstream that > CONFIG_NR_CPUS is still 2 in the arch/arm/configs files.) > > But more indicative of the wrong data being read is the second > "cannot determine base kernel version" message, which was generated > after it read the kernel's "init_uts_ns" uts_namespace structure. > After reading it, it sees that the "release" string contains > non-ASCII data, whereas it should contain the kernel version: > > crash> p init_uts_ns > init_uts_ns = $3 = { > kref = { > refcount = { > counter = 2 > } > }, > name = { > sysname = > "Linux\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", > nodename = > "phenom-01.lab.bos.redhat.com\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", > release = > "2.6.32-313.el6.x86_64\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", > version = "#1 SMP Thu Sep 27 16:25:19 EDT > 2012\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", > machine = > "x86_64\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", > domainname = > "(none)\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000" > } > } > crash> > > So it appears that you're reading data from the wrong > locations in the dumpfile. You should be able to verify > that by bringing up the crash session with the --minimal > flag like this: > > $ crash --minimal vmlinux vmcore > > That will bypass most of the initialization, including all > readmem() calls of the vmcore. Then do this: > > crash> rd linux_banner 20 > ffffffff818000a0: 65762078756e694c 2e33206e6f697372 Linux version > 3. > ffffffff818000b0: 63662e312d312e35 365f3638782e3731 5.1-1.fc17.x86_6 > ffffffff818000c0: 626b636f6d282034 69756240646c6975 4(mockbuild@bui > ffffffff818000d0: 2e33322d6d76646c 6465662e32786870 ldvm-23.phx2.fed > ffffffff818000e0: 656a6f727061726f 202967726f2e7463 oraproject.org) > ffffffff818000f0: 7265762063636728 372e34206e6f6973 (gcc version 4.7 > ffffffff81800100: 303231303220302e 6465522820373035 .0 20120507 > (Red > ffffffff81800110: 372e342074614820 47282029352d302e Hat 4.7.0-5) (G > ffffffff81800120: 3123202920294343 75685420504d5320 CC) ) #1 SMP Thu > ffffffff81800130: 3120392067754120 2033343a30353a37 Aug 9 17:50:43 > crash> rd -a linux_banner > ffffffff818000a0: Linux version 3.5.1-1.fc17.x86_64 (mockbuild(a)buildvm-23.phx2 > ffffffff818000dc: .fedoraproject.org) (gcc version 4.7.0 20120507 (Red Hat 4.7 > ffffffff81800118: .0-5) (GCC) ) #1 SMP Thu Aug 9 17:50:43 UTC 2012 > crash> > > I'm guessing that you will not see a string starting with "Linux version" > with your dumpfile as shown above. > > If that's the case, then it's clear that the readmem() function is ultimately > reading from the wrong vmcore file offset. > > Here's what you can try doing. Taking the linux_banner example above, > you can check where in the dumpfile it's reading from by setting the debug > flag, before doing a simple read -- like this example on an ARM dumpfile: > > crash> set debug 8 > debug: 8 > crash> rd linux_banner > <addr: c033ea10 count: 1 flag: 488 (KVADDR)> > <readmem: c033ea10, KVADDR, "32-bit KVADDR", 4, (FOE), ff94f048> > <read_kdump: addr: c033ea10 paddr: 33ea10 cnt: 4> > read_netdump: addr: c033ea10 paddr: 33ea10 cnt: 4 offset: 33f088 > c033ea10: 756e694c Linu > crash> > > The linux_banner is at virtual address c033ea10 (addr). First it gets translated > into physical address 33ea10 (paddr). Then that paddr is translated into the > vmcore file offset of 33f088. It lseeks to vmcore file offset 33f088 and > reads 4 bytes, which contain "756e694c", or the first 4 bytes of the > "Linux version ..." string. > > Note that if I subtract the physical address from vmcore file offset > I get this: > > crash> eval 33f088 - 33ea10 > hexadecimal: 678 > decimal: 1656 > octal: 3170 > binary: 00000000000000000000011001111000 > crash> > > which would put physical address 0 at a vmcore file offset of 0x678, and > therefore implying that that the ELF header comprises the first 0x678 bytes. > And looking at the vmcore, that can be verified: > yes you are right, here i get the result as below: crash> set debug 8 debug: 8 crash> rd linux_banner <addr: c065a071 count: 1 flag: 488 (KVADDR)> <readmem: c065a071, KVADDR, "32-bit KVADDR", 4, (FOE), ffdf297c> <read_kdump: addr: c065a071 paddr: 85a071 cnt: 4> read_netdump: addr: c065a071 paddr: 85a071 cnt: 4 offset: 65a0e5 c065a071: 03e59130 0... the virtual address is 0xc065a071 , and the physical address is 0x85a071 , and the offset is 0x65a0e5. my elf header is 116 bytes long, 0x65a0e5 - 116=0x65A071, which has a gap 0x00200000 with the physical address 0x85a071. > $ readelf -a vmcore > ELF Header: > Magic: 7f 45 4c 46 01 01 01 00 00 00 00 00 00 00 00 00 > Class: ELF32 > Data: 2's complement, little endian > Version: 1 (current) > OS/ABI: UNIX - System V > ABI Version: 0 > Type: CORE (Core file) > Machine: ARM > Version: 0x1 > Entry point address: 0x0 > Start of program headers: 52 (bytes into file) > Start of section headers: 0 (bytes into file) > Flags: 0x0 > Size of this header: 52 (bytes) > Size of program headers: 32 (bytes) > Number of program headers: 3 > Size of section headers: 0 (bytes) > Number of section headers: 0 > Section header string table index: 0 > > There are no sections in this file. > > There are no sections to group in this file. > > Program Headers: > Type Offset VirtAddr PhysAddr FileSiz MemSiz Flg Align > NOTE 0x000094 0x00000000 0x004e345c 0x005e4 0x005e4 0 > LOAD 0x000678 0xc0000000 0x00000000 0x5600000 0x5600000 RWE 0 > LOAD 0x5600678 0xc5700000 0x05700000 0x100000 0x100000 RWE 0 > ... > > Note that the "Offset" value of the first PT_LOAD segment has a file offset > value of 0x678. > here i got the result as below: Program Headers: Type Offset VirtAddr PhysAddr FileSiz MemSiz Flg Align NOTE 0x000000 0x00000000 0x00000000 0x00000 0x00000 0 LOAD 0x000074 0xc0000000 0x00200000 0x2fe00000 0x2fe00000 RWE 0 so the problem is i don't understand the elf header meaning accurately. if i modify code as below, everything is ok for me: offset += sizeof(struct elf_phdr); phdr->p_offset = offset+0x00200000; phdr->p_vaddr = 0xc0000000; phdr->p_paddr = 0x00200000; phdr->p_filesz = phdr->p_memsz = = MEMSIZE-0x00200000; although my modification can make crash utility work well, i want to know exactly whether i am doing the right thing. 1. our platform has the ddr address from physical address 0x0. 2. when compiling Linux kernel, our platform set in .config file: CONFIG_PHYS_OFFSET=0x00200000 3. when Kernel crash, all ddr content will be dumped, from address 0x0~768MB. but kernel data starts from 0x00200000 actually. my questions are: 1. whether my setting of ELF header is correct this time? the offset, paddr, and p_memsz?

I'm not really sure. Even though you've got it to work OK, I don't understand your new phdr->p_offset and phdr->p_filesz/phdr->p_memsz settings. The phdr->p_offset value typically points to the beginning of the physical memory segment, which in your case, would be at physical address 0x0 at file offset 0x74. And the phdr->p_filesz/phdr->p_memsz values are typically equal to the full size of the physical memory segment (MEMSIZE). I only have one ELF ARM dumpfile sample, but it does not have any physical offset: crash> vtop c0000000 VIRTUAL PHYSICAL c0000000 0 PAGE DIRECTORY: c0004000 PGD: c0007000 => 1140e PMD: c0007000 => 1140e PAGE: 0 (1MB) PAGE PHYSICAL MAPPING INDEX CNT FLAGS c042d000 0 0 0 0 80000 crash> Does "vtop c0000000" work as expected on your vmcore? Also, can you read the last physical page of memory? For example, on my ARM dump, I can check that by doing this: crash> kmem -p | tail -5 c04dcf60 57fb000 0 0 1 400 c04dcf80 57fc000 0 0 1 400 c04dcfa0 57fd000 0 0 1 400 c04dcfc0 57fe000 0 0 1 400 c04dcfe0 57ff000 0 0 1 400 crash> rd -p 57ff000 57ff000: ef9f0000 .... crash> Also, can you confirm that your kernel's symbol list starts at c0000000, i.e., something like this: crash> sym -l c0004000 (A) swapper_pg_dir c0008000 (t) .init c0008000 (T) __init_begin c0008000 (T) _sinittext c0008000 (T) _stext c0008000 (T) stext c0008040 (t) __create_page_tables c00080e4 (t) __enable_mmu_loc c00080f0 (t) __error_a c00080f4 (t) __lookup_machine_type c0008128 (t) __lookup_machine_type_data ... I just want to make sure that the kernel symbols actually start at c000000, and not c2000000.

...

2. i am wondering how does crash utility translate virtual address to physical address before and after it get the kernel page table? before get kernel page table, does it calculate as : (virtual_addr - p_vaddr + p_paddr) ? after get kernel page table, does it walk through the page table and find out the real physical address accordingly?

For kernel unity-mapped kernel virtual addresses, it's not necessary to walk the page tables. It simply does this: #define VTOP(X) \ ((unsigned long)(X)-(machdep->kvbase)+(machdep->machspec->phys_base)) You can check your machdep->kvbase and machdep->machspec->phys_base values by entering "help -m", for example: crash> help -m | grep -e kvbase -e phys_base kvbase: c0000000 phys_base: 0 crash> Certainly vmalloc (and user-space) virtual addresses require a page table walkthough, but the arm_kvtop() function does this: static int arm_kvtop(struct task_context *tc, ulong kvaddr, physaddr_t *paddr, int verbose) { if (!IS_KVADDR(kvaddr)) return FALSE; if (!vt->vmalloc_start) { *paddr = VTOP(kvaddr); return TRUE; } if (!IS_VMALLOC_ADDR(kvaddr)) { *paddr = VTOP(kvaddr); <=== unity-mapped kernel virtual addresses if (!verbose) return TRUE; } return arm_vtop(kvaddr, (ulong *)vt->kernel_pgd[0], paddr, verbose); } and where vmalloc addresses fall through and arm_vtop() is called to walk the page tables. However, you can translate unity-mapped addresses using the kernel page tables with the "vtop" command, as shown in the "vtop c000000" example above.

...

3. my real purpose is to get the ftrace content from dump file by crash utility , but seem the command trace is not for this case, do i need to compile the extension "trace" of crash utility? is there any guide to follow?

That's correct. You can do this: $ wget http://people.redhat.com/anderson/crash-6.1.2.tar.gz ... $ tar xvzmf crash-6.1.2.tar.gz ... $ cd crash-6.1.2 $ make ... $ make extensions ... $ ./crash vmlinux vmcore ... crash> extend trace.so ./extensions/trace.so: shared object loaded crash> help trace ... Dave

卜弋天

11:04 a.m.

...

Date: Fri, 18 Jan 2013 10:23:00 -0500 From: anderson(a)redhat.com To: crash-utility(a)redhat.com Subject: Re: [Crash-utility] questions about crash utility ----- Original Message ----- > > > > Hi Dave: > > thank you very much for your detail answer, this really helpful. > please see my inline words. thanks. > > > > Date: Thu, 17 Jan 2013 14:17:36 -0500 > > From: anderson(a)redhat.com > > To: crash-utility(a)redhat.com > > Subject: Re: [Crash-utility] questions about crash utility > > > The fact that crash gets as far as it does at least means that the > > ELF header you've created was deemed acceptable as an ARM vmcore. > > However, the error messages re: "cpu_present_mask indicates..." and > > "cannot determine base kernel version" indicate that the data > > that was read from the vmcore was clearly not the correct data. > > > > The "cpu_present_mask" value that it read contained too > > many bits -- presuming that the 32-bit ARM processor is > > still limited to only 4 cpus. (looks like upstream that > > CONFIG_NR_CPUS is still 2 in the arch/arm/configs files.) > > > > But more indicative of the wrong data being read is the second > > "cannot determine base kernel version" message, which was generated > > after it read the kernel's "init_uts_ns" uts_namespace structure. > > After reading it, it sees that the "release" string contains > > non-ASCII data, whereas it should contain the kernel version: > > > > crash> p init_uts_ns > > init_uts_ns = $3 = { > > kref = { > > refcount = { > > counter = 2 > > } > > }, > > name = { > > sysname = > > "Linux\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", > > nodename = > > "phenom-01.lab.bos.redhat.com\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", > > release = > > "2.6.32-313.el6.x86_64\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", > > version = "#1 SMP Thu Sep 27 16:25:19 EDT > > 2012\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", > > machine = > > "x86_64\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", > > domainname = > > "(none)\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000" > > } > > } > > crash> > > > > So it appears that you're reading data from the wrong > > locations in the dumpfile. You should be able to verify > > that by bringing up the crash session with the --minimal > > flag like this: > > > > $ crash --minimal vmlinux vmcore > > > > That will bypass most of the initialization, including all > > readmem() calls of the vmcore. Then do this: > > > > crash> rd linux_banner 20 > > ffffffff818000a0: 65762078756e694c 2e33206e6f697372 Linux version > > 3. > > ffffffff818000b0: 63662e312d312e35 365f3638782e3731 5.1-1.fc17.x86_6 > > ffffffff818000c0: 626b636f6d282034 69756240646c6975 4(mockbuild@bui > > ffffffff818000d0: 2e33322d6d76646c 6465662e32786870 ldvm-23.phx2.fed > > ffffffff818000e0: 656a6f727061726f 202967726f2e7463 oraproject.org) > > ffffffff818000f0: 7265762063636728 372e34206e6f6973 (gcc version 4.7 > > ffffffff81800100: 303231303220302e 6465522820373035 .0 20120507 > > (Red > > ffffffff81800110: 372e342074614820 47282029352d302e Hat 4.7.0-5) (G > > ffffffff81800120: 3123202920294343 75685420504d5320 CC) ) #1 SMP Thu > > ffffffff81800130: 3120392067754120 2033343a30353a37 Aug 9 17:50:43 > > crash> rd -a linux_banner > > ffffffff818000a0: Linux version 3.5.1-1.fc17.x86_64 (mockbuild(a)buildvm-23.phx2 > > ffffffff818000dc: .fedoraproject.org) (gcc version 4.7.0 20120507 (Red Hat 4.7 > > ffffffff81800118: .0-5) (GCC) ) #1 SMP Thu Aug 9 17:50:43 UTC 2012 > > crash> > > > > I'm guessing that you will not see a string starting with "Linux version" > > with your dumpfile as shown above. > > > > If that's the case, then it's clear that the readmem() function is ultimately > > reading from the wrong vmcore file offset. > > > > Here's what you can try doing. Taking the linux_banner example above, > > you can check where in the dumpfile it's reading from by setting the debug > > flag, before doing a simple read -- like this example on an ARM dumpfile: > > > > crash> set debug 8 > > debug: 8 > > crash> rd linux_banner > > <addr: c033ea10 count: 1 flag: 488 (KVADDR)> > > <readmem: c033ea10, KVADDR, "32-bit KVADDR", 4, (FOE), ff94f048> > > <read_kdump: addr: c033ea10 paddr: 33ea10 cnt: 4> > > read_netdump: addr: c033ea10 paddr: 33ea10 cnt: 4 offset: 33f088 > > c033ea10: 756e694c Linu > > crash> > > > > The linux_banner is at virtual address c033ea10 (addr). First it gets translated > > into physical address 33ea10 (paddr). Then that paddr is translated into the > > vmcore file offset of 33f088. It lseeks to vmcore file offset 33f088 and > > reads 4 bytes, which contain "756e694c", or the first 4 bytes of the > > "Linux version ..." string. > > > > Note that if I subtract the physical address from vmcore file offset > > I get this: > > > > crash> eval 33f088 - 33ea10 > > hexadecimal: 678 > > decimal: 1656 > > octal: 3170 > > binary: 00000000000000000000011001111000 > > crash> > > > > which would put physical address 0 at a vmcore file offset of 0x678, and > > therefore implying that that the ELF header comprises the first 0x678 bytes. > > And looking at the vmcore, that can be verified: > > > > yes you are right, here i get the result as below: > crash> set debug 8 > debug: 8 > crash> rd linux_banner > <addr: c065a071 count: 1 flag: 488 (KVADDR)> > <readmem: c065a071, KVADDR, "32-bit KVADDR", 4, (FOE), ffdf297c> > <read_kdump: addr: c065a071 paddr: 85a071 cnt: 4> > read_netdump: addr: c065a071 paddr: 85a071 cnt: 4 offset: 65a0e5 > c065a071: 03e59130 0... > > the virtual address is 0xc065a071 , and the physical address is > 0x85a071 , and the offset is 0x65a0e5. > my elf header is 116 bytes long, 0x65a0e5 - 116=0x65A071, which has a > gap 0x00200000 with the physical address 0x85a071. > > > > $ readelf -a vmcore > > ELF Header: > > Magic: 7f 45 4c 46 01 01 01 00 00 00 00 00 00 00 00 00 > > Class: ELF32 > > Data: 2's complement, little endian > > Version: 1 (current) > > OS/ABI: UNIX - System V > > ABI Version: 0 > > Type: CORE (Core file) > > Machine: ARM > > Version: 0x1 > > Entry point address: 0x0 > > Start of program headers: 52 (bytes into file) > > Start of section headers: 0 (bytes into file) > > Flags: 0x0 > > Size of this header: 52 (bytes) > > Size of program headers: 32 (bytes) > > Number of program headers: 3 > > Size of section headers: 0 (bytes) > > Number of section headers: 0 > > Section header string table index: 0 > > > > There are no sections in this file. > > > > There are no sections to group in this file. > > > > Program Headers: > > Type Offset VirtAddr PhysAddr FileSiz MemSiz Flg Align > > NOTE 0x000094 0x00000000 0x004e345c 0x005e4 0x005e4 0 > > LOAD 0x000678 0xc0000000 0x00000000 0x5600000 0x5600000 RWE 0 > > LOAD 0x5600678 0xc5700000 0x05700000 0x100000 0x100000 RWE 0 > > ... > > > > Note that the "Offset" value of the first PT_LOAD segment has a file offset > > value of 0x678. > > > > here i got the result as below: > Program Headers: > Type Offset VirtAddr PhysAddr FileSiz MemSiz Flg Align > NOTE 0x000000 0x00000000 0x00000000 0x00000 0x00000 0 > LOAD 0x000074 0xc0000000 0x00200000 0x2fe00000 0x2fe00000 RWE 0 > > so the problem is i don't understand the elf header meaning > accurately. if i modify code as below, everything is ok for me: > > offset += sizeof(struct elf_phdr); > phdr->p_offset = offset+0x00200000; > phdr->p_vaddr = 0xc0000000; > phdr->p_paddr = 0x00200000; > phdr->p_filesz = phdr->p_memsz = = MEMSIZE-0x00200000; > > > although my modification can make crash utility work well, i want to > know exactly whether i am doing the right thing. > 1. our platform has the ddr address from physical address 0x0. > 2. when compiling Linux kernel, our platform set in .config file: > CONFIG_PHYS_OFFSET=0x00200000 > 3. when Kernel crash, all ddr content will be dumped, from address > 0x0~768MB. but kernel data starts from 0x00200000 actually. > > my questions are: > 1. whether my setting of ELF header is correct this time? the offset, > paddr, and p_memsz? I'm not really sure. Even though you've got it to work OK, I don't understand your new phdr->p_offset and phdr->p_filesz/phdr->p_memsz settings. The phdr->p_offset value typically points to the beginning of the physical memory segment, which in your case, would be at physical address 0x0 at file offset 0x74. And the phdr->p_filesz/phdr->p_memsz values are typically equal to the full size of the physical memory segment (MEMSIZE). if i set p_offset to 0, the file offset seems not correct. for example, when i try to read linux_banner, i got below result: crash> set debug 8

debug: 8 crash> rd linux_banner <addr: c065a071 count: 1 flag: 488 (KVADDR)> <readmem: c065a071, KVADDR, "32-bit KVADDR", 4, (FOE), ffdf297c> <read_kdump: addr: c065a071 paddr: 85a071 cnt: 4> read_netdump: addr: c065a071 paddr: 85a071 cnt: 4 offset: 65a0e5 c065a071: 03e59130 0... virtual address is c065a071, physical address is 85a071, this is ok.but the tool said the file offset is 65a0e5, which is not correct. because my dump binarycontains ddr content from 0x0, the data of physical address 85a071 should be at dump fileoffset 85a071+74= 85a0e5, rather than 65a0e5. so i guess the elf header should be modified to set phdr->p_offset =header size + 0x00200000.i don't know how to tell crash utility it should add value 0x00200000 when read dump file.

...

I only have one ELF ARM dumpfile sample, but it does not have any physical offset: crash> vtop c0000000 VIRTUAL PHYSICAL c0000000 0 PAGE DIRECTORY: c0004000 PGD: c0007000 => 1140e PMD: c0007000 => 1140e PAGE: 0 (1MB) PAGE PHYSICAL MAPPING INDEX CNT FLAGS c042d000 0 0 0 0 80000 crash> Does "vtop c0000000" work as expected on your vmcore? yes i think the vtop command works well on my side: crash> vtop c0000000

VIRTUAL PHYSICAL c0000000 200000PAGE DIRECTORY: c0004000 PGD: c0007000 => 21140e PMD: c0007000 => 21140e PAGE: 200000 (1MB) PAGE PHYSICAL MAPPING INDEX CNT FLAGS c1370800 200000 e5d43061 42 1 80068

...

Also, can you read the last physical page of memory? For example, on my ARM dump, I can check that by doing this: crash> kmem -p | tail -5 c04dcf60 57fb000 0 0 1 400 c04dcf80 57fc000 0 0 1 400 c04dcfa0 57fd000 0 0 1 400 c04dcfc0 57fe000 0 0 1 400 c04dcfe0 57ff000 0 0 1 400 crash> rd -p 57ff000 57ff000: ef9f0000 .... crash> result is as below: crash> kmem -p |tail -5

c19b934c 2ccfb000 0 0 1 400 c19b9370 2ccfc000 0 0 1 400 c19b9394 2ccfd000 0 0 1 400 c19b93b8 2ccfe000 0 0 1 400 c19b93dc 2ccff000 0 0 1 400 crash> rd -p 2ccff000 2ccff000: fffdffff ....

...

Also, can you confirm that your kernel's symbol list starts at c0000000, i.e., something like this: crash> sym -l c0004000 (A) swapper_pg_dir c0008000 (t) .init c0008000 (T) __init_begin c0008000 (T) _sinittext c0008000 (T) _stext c0008000 (T) stext c0008040 (t) __create_page_tables c00080e4 (t) __enable_mmu_loc c00080f0 (t) __error_a c00080f4 (t) __lookup_machine_type c0008128 (t) __lookup_machine_type_data ... I just want to make sure that the kernel symbols actually start at c000000, and not c2000000. yes, the symbols actually start from c0000000: crash> sym -l

c0004000 (A) swapper_pg_dir c0005fb8 (A) __crc_scsi_host_get c0008000 (t) .head.text c0008000 (T) _text c0008000 (T) stext c0008050 (t) __create_page_tables c0008104 (t) __turn_mmu_on_loc c0008110 (T) secondary_startup

...

> 2. i am wondering how does crash utility translate virtual address to > physical address before and after it get the kernel page table? > before get kernel page table, does it calculate as : (virtual_addr - > p_vaddr + p_paddr) ? after get kernel page table, does it walk > through the page table and find out the real physical address > accordingly? For kernel unity-mapped kernel virtual addresses, it's not necessary to walk the page tables. It simply does this: #define VTOP(X) \ ((unsigned long)(X)-(machdep->kvbase)+(machdep->machspec->phys_base)) You can check your machdep->kvbase and machdep->machspec->phys_base values by entering "help -m", for example: crash> help -m | grep -e kvbase -e phys_base kvbase: c0000000 phys_base: 0 crash> my result is as below, should be ok: crash> help -m | grep -e kvbase -e phys_base

kvbase: c0000000 phys_base: 200000

...

Certainly vmalloc (and user-space) virtual addresses require a page table walkthough, but the arm_kvtop() function does this: static int arm_kvtop(struct task_context *tc, ulong kvaddr, physaddr_t *paddr, int verbose) { if (!IS_KVADDR(kvaddr)) return FALSE; if (!vt->vmalloc_start) { *paddr = VTOP(kvaddr); return TRUE; } if (!IS_VMALLOC_ADDR(kvaddr)) { *paddr = VTOP(kvaddr); <=== unity-mapped kernel virtual addresses if (!verbose) return TRUE; } return arm_vtop(kvaddr, (ulong *)vt->kernel_pgd[0], paddr, verbose); } and where vmalloc addresses fall through and arm_vtop() is called to walk the page tables. However, you can translate unity-mapped addresses using the kernel page tables with the "vtop" command, as shown in the "vtop c000000" example above. > 3. my real purpose is to get the ftrace content from dump file by > crash utility , but seem the command trace is not for this case, do > i need to compile the extension "trace" of crash utility? is there > any guide to follow? That's correct. You can do this: $ wget http://people.redhat.com/anderson/crash-6.1.2.tar.gz ... $ tar xvzmf crash-6.1.2.tar.gz ... $ cd crash-6.1.2 $ make ... $ make extensions ... $ ./crash vmlinux vmcore ... crash> extend trace.so ./extensions/trace.so: shared object loaded crash> help trace ... i have made the trace extension work, however, trace show need trace-cmd, but in my ubuntu PC, run "sudo apt-get install trace-cmd", i get below error:E: Couldn't find package trace-cmd

by Google, i found that there is a project git://git.kernel.org/pub/scm/linux/kernel/git/rostedt/trace-cmd.git. maybe i can only compile this tool and use it ?

...

Dave -- Crash-utility mailing list Crash-utility(a)redhat.com https://www.redhat.com/mailman/listinfo/crash-utility

Dave Anderson

11:41 a.m.

----- Original Message -----

...

> Date: Fri, 18 Jan 2013 10:23:00 -0500 > From: anderson(a)redhat.com > To: crash-utility(a)redhat.com > Subject: Re: [Crash-utility] questions about crash utility > > > > ----- Original Message ----- > > > > > > > > Hi Dave: > > > > thank you very much for your detail answer, this really helpful. > > please see my inline words. thanks. > > > > > > > Date: Thu, 17 Jan 2013 14:17:36 -0500 > > > From: anderson(a)redhat.com > > > To: crash-utility(a)redhat.com > > > Subject: Re: [Crash-utility] questions about crash utility > > > > > The fact that crash gets as far as it does at least means that > > > the > > > ELF header you've created was deemed acceptable as an ARM > > > vmcore. > > > However, the error messages re: "cpu_present_mask indicates..." > > > and > > > "cannot determine base kernel version" indicate that the data > > > that was read from the vmcore was clearly not the correct data. > > > > > > The "cpu_present_mask" value that it read contained too > > > many bits -- presuming that the 32-bit ARM processor is > > > still limited to only 4 cpus. (looks like upstream that > > > CONFIG_NR_CPUS is still 2 in the arch/arm/configs files.) > > > > > > But more indicative of the wrong data being read is the second > > > "cannot determine base kernel version" message, which was > > > generated > > > after it read the kernel's "init_uts_ns" uts_namespace > > > structure. > > > After reading it, it sees that the "release" string contains > > > non-ASCII data, whereas it should contain the kernel version: > > > > > > crash> p init_uts_ns > > > init_uts_ns = $3 = { > > > kref = { > > > refcount = { > > > counter = 2 > > > } > > > }, > > > name = { > > > sysname = > > > "Linux\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", > > > nodename = > > > "phenom-01.lab.bos.redhat.com\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", > > > release = > > > "2.6.32-313.el6.x86_64\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", > > > version = "#1 SMP Thu Sep 27 16:25:19 EDT > > > 2012\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", > > > machine = > > > "x86_64\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", > > > domainname = > > > "(none)\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000" > > > } > > > } > > > crash> > > > > > > So it appears that you're reading data from the wrong > > > locations in the dumpfile. You should be able to verify > > > that by bringing up the crash session with the --minimal > > > flag like this: > > > > > > $ crash --minimal vmlinux vmcore > > > > > > That will bypass most of the initialization, including all > > > readmem() calls of the vmcore. Then do this: > > > > > > crash> rd linux_banner 20 > > > ffffffff818000a0: 65762078756e694c 2e33206e6f697372 Linux > > > version > > > 3. > > > ffffffff818000b0: 63662e312d312e35 365f3638782e3731 > > > 5.1-1.fc17.x86_6 > > > ffffffff818000c0: 626b636f6d282034 69756240646c6975 > > > 4(mockbuild@bui > > > ffffffff818000d0: 2e33322d6d76646c 6465662e32786870 > > > ldvm-23.phx2.fed > > > ffffffff818000e0: 656a6f727061726f 202967726f2e7463 > > > oraproject.org) > > > ffffffff818000f0: 7265762063636728 372e34206e6f6973 (gcc > > > version 4.7 > > > ffffffff81800100: 303231303220302e 6465522820373035 .0 20120507 > > > (Red > > > ffffffff81800110: 372e342074614820 47282029352d302e Hat > > > 4.7.0-5) (G > > > ffffffff81800120: 3123202920294343 75685420504d5320 CC) ) #1 > > > SMP Thu > > > ffffffff81800130: 3120392067754120 2033343a30353a37 Aug 9 > > > 17:50:43 > > > crash> rd -a linux_banner > > > ffffffff818000a0: Linux version 3.5.1-1.fc17.x86_64 > > > (mockbuild(a)buildvm-23.phx2 > > > ffffffff818000dc: .fedoraproject.org) (gcc version 4.7.0 > > > 20120507 (Red Hat 4.7 > > > ffffffff81800118: .0-5) (GCC) ) #1 SMP Thu Aug 9 17:50:43 UTC > > > 2012 > > > crash> > > > > > > I'm guessing that you will not see a string starting with > > > "Linux version" > > > with your dumpfile as shown above. > > > > > > If that's the case, then it's clear that the readmem() function > > > is ultimately > > > reading from the wrong vmcore file offset. > > > > > > Here's what you can try doing. Taking the linux_banner example > > > above, > > > you can check where in the dumpfile it's reading from by > > > setting the debug > > > flag, before doing a simple read -- like this example on an ARM > > > dumpfile: > > > > > > crash> set debug 8 > > > debug: 8 > > > crash> rd linux_banner > > > <addr: c033ea10 count: 1 flag: 488 (KVADDR)> > > > <readmem: c033ea10, KVADDR, "32-bit KVADDR", 4, (FOE), > > > ff94f048> > > > <read_kdump: addr: c033ea10 paddr: 33ea10 cnt: 4> > > > read_netdump: addr: c033ea10 paddr: 33ea10 cnt: 4 offset: > > > 33f088 > > > c033ea10: 756e694c Linu > > > crash> > > > > > > The linux_banner is at virtual address c033ea10 (addr). First > > > it gets translated > > > into physical address 33ea10 (paddr). Then that paddr is > > > translated into the > > > vmcore file offset of 33f088. It lseeks to vmcore file offset > > > 33f088 and > > > reads 4 bytes, which contain "756e694c", or the first 4 bytes > > > of the > > > "Linux version ..." string. > > > > > > Note that if I subtract the physical address from vmcore file > > > offset > > > I get this: > > > > > > crash> eval 33f088 - 33ea10 > > > hexadecimal: 678 > > > decimal: 1656 > > > octal: 3170 > > > binary: 00000000000000000000011001111000 > > > crash> > > > > > > which would put physical address 0 at a vmcore file offset of > > > 0x678, and > > > therefore implying that that the ELF header comprises the first > > > 0x678 bytes. > > > And looking at the vmcore, that can be verified: > > > > > > > yes you are right, here i get the result as below: > > crash> set debug 8 > > debug: 8 > > crash> rd linux_banner > > <addr: c065a071 count: 1 flag: 488 (KVADDR)> > > <readmem: c065a071, KVADDR, "32-bit KVADDR", 4, (FOE), ffdf297c> > > <read_kdump: addr: c065a071 paddr: 85a071 cnt: 4> > > read_netdump: addr: c065a071 paddr: 85a071 cnt: 4 offset: 65a0e5 > > c065a071: 03e59130 0... > > > > the virtual address is 0xc065a071 , and the physical address is > > 0x85a071 , and the offset is 0x65a0e5. > > my elf header is 116 bytes long, 0x65a0e5 - 116=0x65A071, which > > has a > > gap 0x00200000 with the physical address 0x85a071. > > > > > > > $ readelf -a vmcore > > > ELF Header: > > > Magic: 7f 45 4c 46 01 01 01 00 00 00 00 00 00 00 00 00 > > > Class: ELF32 > > > Data: 2's complement, little endian > > > Version: 1 (current) > > > OS/ABI: UNIX - System V > > > ABI Version: 0 > > > Type: CORE (Core file) > > > Machine: ARM > > > Version: 0x1 > > > Entry point address: 0x0 > > > Start of program headers: 52 (bytes into file) > > > Start of section headers: 0 (bytes into file) > > > Flags: 0x0 > > > Size of this header: 52 (bytes) > > > Size of program headers: 32 (bytes) > > > Number of program headers: 3 > > > Size of section headers: 0 (bytes) > > > Number of section headers: 0 > > > Section header string table index: 0 > > > > > > There are no sections in this file. > > > > > > There are no sections to group in this file. > > > > > > Program Headers: > > > Type Offset VirtAddr PhysAddr FileSiz MemSiz Flg Align > > > NOTE 0x000094 0x00000000 0x004e345c 0x005e4 0x005e4 0 > > > LOAD 0x000678 0xc0000000 0x00000000 0x5600000 0x5600000 RWE 0 > > > LOAD 0x5600678 0xc5700000 0x05700000 0x100000 0x100000 RWE 0 > > > ... > > > > > > Note that the "Offset" value of the first PT_LOAD segment has a > > > file offset > > > value of 0x678. > > > > > > > here i got the result as below: > > Program Headers: > > Type Offset VirtAddr PhysAddr FileSiz MemSiz Flg Align > > NOTE 0x000000 0x00000000 0x00000000 0x00000 0x00000 0 > > LOAD 0x000074 0xc0000000 0x00200000 0x2fe00000 0x2fe00000 RWE 0 > > > > so the problem is i don't understand the elf header meaning > > accurately. if i modify code as below, everything is ok for me: > > > > offset += sizeof(struct elf_phdr); > > phdr->p_offset = offset+0x00200000; > > phdr->p_vaddr = 0xc0000000; > > phdr->p_paddr = 0x00200000; > > phdr->p_filesz = phdr->p_memsz = = MEMSIZE-0x00200000; > > > > > > although my modification can make crash utility work well, i want > > to > > know exactly whether i am doing the right thing. > > 1. our platform has the ddr address from physical address 0x0. > > 2. when compiling Linux kernel, our platform set in .config file: > > CONFIG_PHYS_OFFSET=0x00200000 > > 3. when Kernel crash, all ddr content will be dumped, from > > address > > 0x0~768MB. but kernel data starts from 0x00200000 actually. > > > > my questions are: > > 1. whether my setting of ELF header is correct this time? the > > offset, > > paddr, and p_memsz? > > I'm not really sure. Even though you've got it to work OK, I don't > understand your new phdr->p_offset and phdr->p_filesz/phdr->p_memsz > settings. The phdr->p_offset value typically points to the beginning > of the physical memory segment, which in your case, would be at physical > address 0x0 at file offset 0x74. And the phdr->p_filesz/phdr->p_memsz > values are typically equal to the full size of the physical memory > segment (MEMSIZE). > if i set p_offset to 0, the file offset seems not correct. for example, when i try to read linux_banner, i got below result: crash> set debug 8 debug: 8 crash> rd linux_banner <addr: c065a071 count: 1 flag: 488 (KVADDR)> <readmem: c065a071, KVADDR, "32-bit KVADDR", 4, (FOE), ffdf297c> <read_kdump: addr: c065a071 paddr: 85a071 cnt: 4> read_netdump: addr: c065a071 paddr: 85a071 cnt: 4 offset: 65a0e5 c065a071: 03e59130 0...

No, the phdr->p_offset should *not* be 0 -- it is a file pointer value that should point to the beginning of the physical memory dump in the vmcore. So it should be equal to the header size, or 0x74 in your case. I think maybe your vmcore is being recognized as a NETDUMP_ELF32 instead of a KDUMP_ELF32? What does "help -n" show on your system? On my ELF ARM vmcore, it starts like this: crash> help -n vmcore_data: flags: a0 (KDUMP_LOCAL|KDUMP_ELF32) ndfd: 6 ofp: a7599e8 header_size: 1656 num_pt_load_segments: 2 pt_load_segment[0]: file_offset: 678 phys_start: 0 phys_end: 5600000 zero_fill: 0 pt_load_segment[1]: file_offset: 5600678 phys_start: 5700000 phys_end: 5800000 zero_fill: 0 ... where the pt_load_segment's data shown above come from its 2 PT_LOAD segments: $ readelf -a vmcore ... Program Headers: Type Offset VirtAddr PhysAddr FileSiz MemSiz Flg Align NOTE 0x000094 0x00000000 0x004e345c 0x005e4 0x005e4 0 LOAD 0x000678 0xc0000000 0x00000000 0x5600000 0x5600000 RWE 0 LOAD 0x5600678 0xc5700000 0x05700000 0x100000 0x100000 RWE 0 ... But if your vmcore is being recognized as a NETUMP_ELF32, it will ignore the phdr->p_offset value, and simply do this in read_netdump(): read_netdump(int fd, void *bufptr, int cnt, ulong addr, physaddr_t paddr) { off_t offset; struct pt_load_segment *pls; int i; offset = 0; /* * The Elf32_Phdr has 32-bit fields for p_paddr, p_filesz and * p_memsz, so for now, multiple PT_LOAD segment support is * restricted to 64-bit machines for netdump/diskdump vmcores. * However, kexec/kdump has introduced the optional use of a * 64-bit ELF header for 32-bit processors. */ switch (DUMPFILE_FORMAT(nd->flags)) { case NETDUMP_ELF32: offset = (off_t)paddr + (off_t)nd->header_size; break; But it should be a KDUMP_ELF32 format, which uses the pt_load_segment data: case NETDUMP_ELF64: case KDUMP_ELF32: case KDUMP_ELF64: if (nd->num_pt_load_segments == 1) { offset = (off_t)paddr + (off_t)nd->header_size - (off_t)nd->pt_load_segments[0].phys_start; break; } for (i = offset = 0; i < nd->num_pt_load_segments; i++) { pls = &nd->pt_load_segments[i]; if ((paddr >= pls->phys_start) && (paddr < pls->phys_end)) { offset = (off_t)(paddr - pls->phys_start) + pls->file_offset; break; } if (pls->zero_fill && (paddr >= pls->phys_end) && (paddr < pls->zero_fill)) { memset(bufptr, 0, cnt); if (CRASHDEBUG(8)) fprintf(fp, "read_netdump: zero-fill: " "addr: %lx paddr: %llx cnt: %d\n", addr, (ulonglong)paddr, cnt); return cnt; } } If your vmcore is being seen as a NETDUMP_ELF32, then perhaps that's why your strange ELF header settings are "working". But it should be created such that it's recognized as a KDUMP_ELF32. If your vmcore is being seen as a KDUMP_ELF32, then it's not entirely clear to me why your ELF header settings work OK. But I suppose if everything is working OK, just leave it as is, and go about your business...

...

virtual address is c065a071, physical address is 85a071, this is ok. but the tool said the file offset is 65a0e5, which is not correct. because my dump binary contains ddr content from 0x0, the data of physical address 85a071 should be at dump file offset 85a071+74= 85a0e5, rather than 65a0e5. so i guess the elf header should be modified to set phdr->p_offset =header size + 0x00200000. i don't know how to tell crash utility it should add value 0x00200000 when read dump file. > I only have one ELF ARM dumpfile sample, but it does not have any > physical offset: > > crash> vtop c0000000 > VIRTUAL PHYSICAL > c0000000 0 > > PAGE DIRECTORY: c0004000 > PGD: c0007000 => 1140e > PMD: c0007000 => 1140e > PAGE: 0 (1MB) > > > PAGE PHYSICAL MAPPING INDEX CNT FLAGS > c042d000 0 0 0 0 80000 > crash> > > Does "vtop c0000000" work as expected on your vmcore? yes i think the vtop command works well on my side: crash> vtop c0000000 VIRTUAL PHYSICAL c0000000 200000 PAGE DIRECTORY: c0004000 PGD: c0007000 => 21140e PMD: c0007000 => 21140e PAGE: 200000 (1MB) PAGE PHYSICAL MAPPING INDEX CNT FLAGS c1370800 200000 e5d43061 42 1 80068 > > Also, can you read the last physical page of memory? For example, > on > my ARM dump, I can check that by doing this: > > crash> kmem -p | tail -5 > c04dcf60 57fb000 0 0 1 400 > c04dcf80 57fc000 0 0 1 400 > c04dcfa0 57fd000 0 0 1 400 > c04dcfc0 57fe000 0 0 1 400 > c04dcfe0 57ff000 0 0 1 400 > crash> rd -p 57ff000 > 57ff000: ef9f0000 .... > crash> > result is as below: crash> kmem -p |tail -5 c19b934c 2ccfb000 0 0 1 400 c19b9370 2ccfc000 0 0 1 400 c19b9394 2ccfd000 0 0 1 400 c19b93b8 2ccfe000 0 0 1 400 c19b93dc 2ccff000 0 0 1 400 crash> rd -p 2ccff000 2ccff000: fffdffff .... > Also, can you confirm that your kernel's symbol list starts > at c0000000, i.e., something like this: > > crash> sym -l > c0004000 (A) swapper_pg_dir > c0008000 (t) .init > c0008000 (T) __init_begin > c0008000 (T) _sinittext > c0008000 (T) _stext > c0008000 (T) stext > c0008040 (t) __create_page_tables > c00080e4 (t) __enable_mmu_loc > c00080f0 (t) __error_a > c00080f4 (t) __lookup_machine_type > c0008128 (t) __lookup_machine_type_data > ... > > I just want to make sure that the kernel symbols actually start > at c000000, and not c2000000. > yes, the symbols actually start from c0000000: crash> sym -l c0004000 (A) swapper_pg_dir c0005fb8 (A) __crc_scsi_host_get c0008000 (t) .head.text c0008000 (T) _text c0008000 (T) stext c0008050 (t) __create_page_tables c0008104 (t) __turn_mmu_on_loc c0008110 (T) secondary_startup > > 2. i am wondering how does crash utility translate virtual > > address to > > physical address before and after it get the kernel page table? > > before get kernel page table, does it calculate as : > > (virtual_addr - > > p_vaddr + p_paddr) ? after get kernel page table, does it walk > > through the page table and find out the real physical address > > accordingly? > > For kernel unity-mapped kernel virtual addresses, it's not > necessary > to walk the page tables. It simply does this: > > #define VTOP(X) \ > ((unsigned > long)(X)-(machdep->kvbase)+(machdep->machspec->phys_base)) > > You can check your machdep->kvbase and machdep->machspec->phys_base > values by entering "help -m", for example: > > crash> help -m | grep -e kvbase -e phys_base > kvbase: c0000000 > phys_base: 0 > crash> > my result is as below, should be ok: crash> help -m | grep -e kvbase -e phys_base kvbase: c0000000 phys_base: 200000 > Certainly vmalloc (and user-space) virtual addresses require a page > table walkthough, but the arm_kvtop() function does this: > > static int > arm_kvtop(struct task_context *tc, ulong kvaddr, physaddr_t *paddr, > int verbose) > { > if (!IS_KVADDR(kvaddr)) > return FALSE; > > if (!vt->vmalloc_start) { > *paddr = VTOP(kvaddr); > return TRUE; > } > > if (!IS_VMALLOC_ADDR(kvaddr)) { > *paddr = VTOP(kvaddr); <=== unity-mapped kernel virtual addresses > if (!verbose) > return TRUE; > } > > return arm_vtop(kvaddr, (ulong *)vt->kernel_pgd[0], paddr, > verbose); > } > > and where vmalloc addresses fall through and arm_vtop() is called > to walk > the page tables. > > However, you can translate unity-mapped addresses using the kernel > page tables > with the "vtop" command, as shown in the "vtop c000000" example > above. > > > 3. my real purpose is to get the ftrace content from dump file by > > crash utility , but seem the command trace is not for this case, > > do > > i need to compile the extension "trace" of crash utility? is > > there > > any guide to follow? > > That's correct. You can do this: > > $ wget http://people.redhat.com/anderson/crash-6.1.2.tar.gz > ... > $ tar xvzmf crash-6.1.2.tar.gz > ... > $ cd crash-6.1.2 > $ make > ... > $ make extensions > ... > $ ./crash vmlinux vmcore > ... > crash> extend trace.so > ./extensions/trace.so: shared object loaded > crash> help trace > ... i have made the trace extension work, however, trace show need trace-cmd, but in my ubuntu PC, run "sudo apt-get install trace-cmd", i get below error: E: Couldn't find package trace-cmd by Google, i found that there is a project git://git.kernel.org/pub/scm/linux/kernel/git/rostedt/trace-cmd.git. maybe i can only compile this tool and use it ?

That should work: $ rpm -qi trace-cmd Name : trace-cmd Version : 1.2 Release : 4.20120606git8266dff.fc17 Architecture: x86_64 Install Date: Mon 26 Nov 2012 03:22:24 PM EST Group : Development/Tools Size : 298546 License : GPLv2 and LGPLv2 Signature : RSA/SHA256, Sun 16 Sep 2012 01:37:17 PM EDT, Key ID 50e94c991aca3465 Source RPM : trace-cmd-1.2-4.20120606git8266dff.fc17.src.rpm Build Date : Thu 13 Sep 2012 05:34:55 PM EDT Build Host : buildvm-04.phx2.fedoraproject.org Relocations : (not relocatable) Packager : Fedora Project Vendor : Fedora Project URL : http://git.kernel.org/?p=linux/kernel/git/rostedt/trace-cmd.git;a=summary Summary : A user interface to Ftrace Description : trace-cmd is a user interface to Ftrace. Instead of needing to use the debugfs directly, trace-cmd will handle of setting of options and tracers and will record into a data file. $ Dave

卜弋天

Monday, 21 January Mon, 21 Jan

7:40 a.m.

...

Date: Fri, 18 Jan 2013 12:41:38 -0500 From: anderson(a)redhat.com To: crash-utility(a)redhat.com Subject: Re: [Crash-utility] questions about crash utility ----- Original Message ----- > > > > > > > > Date: Fri, 18 Jan 2013 10:23:00 -0500 > > From: anderson(a)redhat.com > > To: crash-utility(a)redhat.com > > Subject: Re: [Crash-utility] questions about crash utility > > > > > > > > ----- Original Message ----- > > > > > > > > > > > > Hi Dave: > > > > > > thank you very much for your detail answer, this really helpful. > > > please see my inline words. thanks. > > > > > > > > > > Date: Thu, 17 Jan 2013 14:17:36 -0500 > > > > From: anderson(a)redhat.com > > > > To: crash-utility(a)redhat.com > > > > Subject: Re: [Crash-utility] questions about crash utility > > > > > > > The fact that crash gets as far as it does at least means that > > > > the > > > > ELF header you've created was deemed acceptable as an ARM > > > > vmcore. > > > > However, the error messages re: "cpu_present_mask indicates..." > > > > and > > > > "cannot determine base kernel version" indicate that the data > > > > that was read from the vmcore was clearly not the correct data. > > > > > > > > The "cpu_present_mask" value that it read contained too > > > > many bits -- presuming that the 32-bit ARM processor is > > > > still limited to only 4 cpus. (looks like upstream that > > > > CONFIG_NR_CPUS is still 2 in the arch/arm/configs files.) > > > > > > > > But more indicative of the wrong data being read is the second > > > > "cannot determine base kernel version" message, which was > > > > generated > > > > after it read the kernel's "init_uts_ns" uts_namespace > > > > structure. > > > > After reading it, it sees that the "release" string contains > > > > non-ASCII data, whereas it should contain the kernel version: > > > > > > > > crash> p init_uts_ns > > > > init_uts_ns = $3 = { > > > > kref = { > > > > refcount = { > > > > counter = 2 > > > > } > > > > }, > > > > name = { > > > > sysname = > > > > "Linux\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", > > > > nodename = > > > > "phenom-01.lab.bos.redhat.com\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", > > > > release = > > > > "2.6.32-313.el6.x86_64\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", > > > > version = "#1 SMP Thu Sep 27 16:25:19 EDT > > > > 2012\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", > > > > machine = > > > > "x86_64\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000", > > > > domainname = > > > > "(none)\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000\000" > > > > } > > > > } > > > > crash> > > > > > > > > So it appears that you're reading data from the wrong > > > > locations in the dumpfile. You should be able to verify > > > > that by bringing up the crash session with the --minimal > > > > flag like this: > > > > > > > > $ crash --minimal vmlinux vmcore > > > > > > > > That will bypass most of the initialization, including all > > > > readmem() calls of the vmcore. Then do this: > > > > > > > > crash> rd linux_banner 20 > > > > ffffffff818000a0: 65762078756e694c 2e33206e6f697372 Linux > > > > version > > > > 3. > > > > ffffffff818000b0: 63662e312d312e35 365f3638782e3731 > > > > 5.1-1.fc17.x86_6 > > > > ffffffff818000c0: 626b636f6d282034 69756240646c6975 > > > > 4(mockbuild@bui > > > > ffffffff818000d0: 2e33322d6d76646c 6465662e32786870 > > > > ldvm-23.phx2.fed > > > > ffffffff818000e0: 656a6f727061726f 202967726f2e7463 > > > > oraproject.org) > > > > ffffffff818000f0: 7265762063636728 372e34206e6f6973 (gcc > > > > version 4.7 > > > > ffffffff81800100: 303231303220302e 6465522820373035 .0 20120507 > > > > (Red > > > > ffffffff81800110: 372e342074614820 47282029352d302e Hat > > > > 4.7.0-5) (G > > > > ffffffff81800120: 3123202920294343 75685420504d5320 CC) ) #1 > > > > SMP Thu > > > > ffffffff81800130: 3120392067754120 2033343a30353a37 Aug 9 > > > > 17:50:43 > > > > crash> rd -a linux_banner > > > > ffffffff818000a0: Linux version 3.5.1-1.fc17.x86_64 > > > > (mockbuild(a)buildvm-23.phx2 > > > > ffffffff818000dc: .fedoraproject.org) (gcc version 4.7.0 > > > > 20120507 (Red Hat 4.7 > > > > ffffffff81800118: .0-5) (GCC) ) #1 SMP Thu Aug 9 17:50:43 UTC > > > > 2012 > > > > crash> > > > > > > > > I'm guessing that you will not see a string starting with > > > > "Linux version" > > > > with your dumpfile as shown above. > > > > > > > > If that's the case, then it's clear that the readmem() function > > > > is ultimately > > > > reading from the wrong vmcore file offset. > > > > > > > > Here's what you can try doing. Taking the linux_banner example > > > > above, > > > > you can check where in the dumpfile it's reading from by > > > > setting the debug > > > > flag, before doing a simple read -- like this example on an ARM > > > > dumpfile: > > > > > > > > crash> set debug 8 > > > > debug: 8 > > > > crash> rd linux_banner > > > > <addr: c033ea10 count: 1 flag: 488 (KVADDR)> > > > > <readmem: c033ea10, KVADDR, "32-bit KVADDR", 4, (FOE), > > > > ff94f048> > > > > <read_kdump: addr: c033ea10 paddr: 33ea10 cnt: 4> > > > > read_netdump: addr: c033ea10 paddr: 33ea10 cnt: 4 offset: > > > > 33f088 > > > > c033ea10: 756e694c Linu > > > > crash> > > > > > > > > The linux_banner is at virtual address c033ea10 (addr). First > > > > it gets translated > > > > into physical address 33ea10 (paddr). Then that paddr is > > > > translated into the > > > > vmcore file offset of 33f088. It lseeks to vmcore file offset > > > > 33f088 and > > > > reads 4 bytes, which contain "756e694c", or the first 4 bytes > > > > of the > > > > "Linux version ..." string. > > > > > > > > Note that if I subtract the physical address from vmcore file > > > > offset > > > > I get this: > > > > > > > > crash> eval 33f088 - 33ea10 > > > > hexadecimal: 678 > > > > decimal: 1656 > > > > octal: 3170 > > > > binary: 00000000000000000000011001111000 > > > > crash> > > > > > > > > which would put physical address 0 at a vmcore file offset of > > > > 0x678, and > > > > therefore implying that that the ELF header comprises the first > > > > 0x678 bytes. > > > > And looking at the vmcore, that can be verified: > > > > > > > > > > yes you are right, here i get the result as below: > > > crash> set debug 8 > > > debug: 8 > > > crash> rd linux_banner > > > <addr: c065a071 count: 1 flag: 488 (KVADDR)> > > > <readmem: c065a071, KVADDR, "32-bit KVADDR", 4, (FOE), ffdf297c> > > > <read_kdump: addr: c065a071 paddr: 85a071 cnt: 4> > > > read_netdump: addr: c065a071 paddr: 85a071 cnt: 4 offset: 65a0e5 > > > c065a071: 03e59130 0... > > > > > > the virtual address is 0xc065a071 , and the physical address is > > > 0x85a071 , and the offset is 0x65a0e5. > > > my elf header is 116 bytes long, 0x65a0e5 - 116=0x65A071, which > > > has a > > > gap 0x00200000 with the physical address 0x85a071. > > > > > > > > > > $ readelf -a vmcore > > > > ELF Header: > > > > Magic: 7f 45 4c 46 01 01 01 00 00 00 00 00 00 00 00 00 > > > > Class: ELF32 > > > > Data: 2's complement, little endian > > > > Version: 1 (current) > > > > OS/ABI: UNIX - System V > > > > ABI Version: 0 > > > > Type: CORE (Core file) > > > > Machine: ARM > > > > Version: 0x1 > > > > Entry point address: 0x0 > > > > Start of program headers: 52 (bytes into file) > > > > Start of section headers: 0 (bytes into file) > > > > Flags: 0x0 > > > > Size of this header: 52 (bytes) > > > > Size of program headers: 32 (bytes) > > > > Number of program headers: 3 > > > > Size of section headers: 0 (bytes) > > > > Number of section headers: 0 > > > > Section header string table index: 0 > > > > > > > > There are no sections in this file. > > > > > > > > There are no sections to group in this file. > > > > > > > > Program Headers: > > > > Type Offset VirtAddr PhysAddr FileSiz MemSiz Flg Align > > > > NOTE 0x000094 0x00000000 0x004e345c 0x005e4 0x005e4 0 > > > > LOAD 0x000678 0xc0000000 0x00000000 0x5600000 0x5600000 RWE 0 > > > > LOAD 0x5600678 0xc5700000 0x05700000 0x100000 0x100000 RWE 0 > > > > ... > > > > > > > > Note that the "Offset" value of the first PT_LOAD segment has a > > > > file offset > > > > value of 0x678. > > > > > > > > > > here i got the result as below: > > > Program Headers: > > > Type Offset VirtAddr PhysAddr FileSiz MemSiz Flg Align > > > NOTE 0x000000 0x00000000 0x00000000 0x00000 0x00000 0 > > > LOAD 0x000074 0xc0000000 0x00200000 0x2fe00000 0x2fe00000 RWE 0 > > > > > > so the problem is i don't understand the elf header meaning > > > accurately. if i modify code as below, everything is ok for me: > > > > > > offset += sizeof(struct elf_phdr); > > > phdr->p_offset = offset+0x00200000; > > > phdr->p_vaddr = 0xc0000000; > > > phdr->p_paddr = 0x00200000; > > > phdr->p_filesz = phdr->p_memsz = = MEMSIZE-0x00200000; > > > > > > > > > although my modification can make crash utility work well, i want > > > to > > > know exactly whether i am doing the right thing. > > > 1. our platform has the ddr address from physical address 0x0. > > > 2. when compiling Linux kernel, our platform set in .config file: > > > CONFIG_PHYS_OFFSET=0x00200000 > > > 3. when Kernel crash, all ddr content will be dumped, from > > > address > > > 0x0~768MB. but kernel data starts from 0x00200000 actually. > > > > > > my questions are: > > > 1. whether my setting of ELF header is correct this time? the > > > offset, > > > paddr, and p_memsz? > > > > I'm not really sure. Even though you've got it to work OK, I don't > > understand your new phdr->p_offset and phdr->p_filesz/phdr->p_memsz > > settings. The phdr->p_offset value typically points to the beginning > > of the physical memory segment, which in your case, would be at physical > > address 0x0 at file offset 0x74. And the phdr->p_filesz/phdr->p_memsz > > values are typically equal to the full size of the physical memory > > segment (MEMSIZE). > > > > if i set p_offset to 0, the file offset seems not correct. for example, when i try to read > linux_banner, i got below result: > crash> set debug 8 > debug: 8 > crash> rd linux_banner > <addr: c065a071 count: 1 flag: 488 (KVADDR)> > <readmem: c065a071, KVADDR, "32-bit KVADDR", 4, (FOE), ffdf297c> > <read_kdump: addr: c065a071 paddr: 85a071 cnt: 4> > read_netdump: addr: c065a071 paddr: 85a071 cnt: 4 offset: 65a0e5 > c065a071: 03e59130 0... No, the phdr->p_offset should *not* be 0 -- it is a file pointer value that should point to the beginning of the physical memory dump in the vmcore. So it should be equal to the header size, or 0x74 in your case. I think maybe your vmcore is being recognized as a NETDUMP_ELF32 instead of a KDUMP_ELF32? What does "help -n" show on your system? On my ELF ARM vmcore, it starts like this: crash> help -n vmcore_data: flags: a0 (KDUMP_LOCAL|KDUMP_ELF32) ndfd: 6 ofp: a7599e8 header_size: 1656 num_pt_load_segments: 2 pt_load_segment[0]: file_offset: 678 phys_start: 0 phys_end: 5600000 zero_fill: 0 pt_load_segment[1]: file_offset: 5600678 phys_start: 5700000 phys_end: 5800000 zero_fill: 0 ... where the pt_load_segment's data shown above come from its 2 PT_LOAD segments: $ readelf -a vmcore ... Program Headers: Type Offset VirtAddr PhysAddr FileSiz MemSiz Flg Align NOTE 0x000094 0x00000000 0x004e345c 0x005e4 0x005e4 0 LOAD 0x000678 0xc0000000 0x00000000 0x5600000 0x5600000 RWE 0 LOAD 0x5600678 0xc5700000 0x05700000 0x100000 0x100000 RWE 0 ... But if your vmcore is being recognized as a NETUMP_ELF32, it will ignore the phdr->p_offset value, and simply do this in read_netdump(): read_netdump(int fd, void *bufptr, int cnt, ulong addr, physaddr_t paddr) { off_t offset; struct pt_load_segment *pls; int i; offset = 0; /* * The Elf32_Phdr has 32-bit fields for p_paddr, p_filesz and * p_memsz, so for now, multiple PT_LOAD segment support is * restricted to 64-bit machines for netdump/diskdump vmcores. * However, kexec/kdump has introduced the optional use of a * 64-bit ELF header for 32-bit processors. */ switch (DUMPFILE_FORMAT(nd->flags)) { case NETDUMP_ELF32: offset = (off_t)paddr + (off_t)nd->header_size; break; But it should be a KDUMP_ELF32 format, which uses the pt_load_segment data: case NETDUMP_ELF64: case KDUMP_ELF32: case KDUMP_ELF64: if (nd->num_pt_load_segments == 1) { offset = (off_t)paddr + (off_t)nd->header_size - (off_t)nd->pt_load_segments[0].phys_start; break; } for (i = offset = 0; i < nd->num_pt_load_segments; i++) { pls = &nd->pt_load_segments[i]; if ((paddr >= pls->phys_start) && (paddr < pls->phys_end)) { offset = (off_t)(paddr - pls->phys_start) + pls->file_offset; break; } if (pls->zero_fill && (paddr >= pls->phys_end) && (paddr < pls->zero_fill)) { memset(bufptr, 0, cnt); if (CRASHDEBUG(8)) fprintf(fp, "read_netdump: zero-fill: " "addr: %lx paddr: %llx cnt: %d\n", addr, (ulonglong)paddr, cnt); return cnt; } } If your vmcore is being seen as a NETDUMP_ELF32, then perhaps that's why your strange ELF header settings are "working". But it should be created such that it's recognized as a KDUMP_ELF32. If your vmcore is being seen as a KDUMP_ELF32, then it's not entirely clear to me why your ELF header settings work OK. But I suppose if everything is working OK, just leave it as is, and go about your business... my information is as below, it is KDUMP_ELF32: vmcore_data:

flags: a0 (KDUMP_LOCAL|KDUMP_ELF32) ndfd: 3 ofp: 98442d0 header_size: 2097268 num_pt_load_segments: 1 pt_load_segment[0]: file_offset: 200074 phys_start: 200000 phys_end: 30000000 zero_fill: 0 elf_header: b7656008 elf32: b7656008 notes32: b765603c load32: b765605c anyway, i can just ignore this issue right now.the remaining issue is how to make my procedures automatically.my purpose is to get trace ring buffer from kernel dump file and create a newfile which name is like trace_$date.txt. the $date should be the time this text fileis generated. what i am doing now is to create a .crashrc file in the ./ directory, and try to use file to make everything automatically. i need to make below proceduresautomatically: 1. load trace.so automatically. it is ok now. 2. read current date. it is ok now. 3. dump the kernel trace ring buffer into new text file the name of which contains current date. cannot do this now. 4. exit crash utility automatically. it is ok now. my .crashrc is as below, i don't know how to do the step 3 above: ---------------------- script start ------------------------------------------------------- echo $(date '+%Y%m%d_%H%M%S') <------ it is ok #cur=$(date '+%Y%m%d_%H%M%S') <----- it failed to use variable to store date #echo $cur <------it failed extend trace.so <------ it is ok trace report >./report_test.txt <------- to dump to a file which uses fixed file name, it is ok. trace report >./trace_$(date '+%Y%m%d_%H%M%S').txt <--- it failed when exit ---------------------- script end ----------------------------------------------- please help, thanks.

...

> virtual address is c065a071, physical address is 85a071, this is ok. > but the tool said the file offset is 65a0e5, which is not correct. because my dump binary > contains ddr content from 0x0, the data of physical address 85a071 should be at dump file > offset 85a071+74= 85a0e5, rather than 65a0e5. > > so i guess the elf header should be modified to set phdr->p_offset =header size + 0x00200000. > i don't know how to tell crash utility it should add value 0x00200000 when read dump file. > > > I only have one ELF ARM dumpfile sample, but it does not have any > > physical offset: > > > > crash> vtop c0000000 > > VIRTUAL PHYSICAL > > c0000000 0 > > > > PAGE DIRECTORY: c0004000 > > PGD: c0007000 => 1140e > > PMD: c0007000 => 1140e > > PAGE: 0 (1MB) > > > > > > PAGE PHYSICAL MAPPING INDEX CNT FLAGS > > c042d000 0 0 0 0 80000 > > crash> > > > > Does "vtop c0000000" work as expected on your vmcore? > > yes i think the vtop command works well on my side: > > crash> vtop c0000000 > VIRTUAL PHYSICAL > c0000000 200000 > PAGE DIRECTORY: c0004000 > PGD: c0007000 => 21140e > PMD: c0007000 => 21140e > PAGE: 200000 (1MB) > > PAGE PHYSICAL MAPPING INDEX CNT FLAGS > c1370800 200000 e5d43061 42 1 80068 > > > > > > Also, can you read the last physical page of memory? For example, > > on > > my ARM dump, I can check that by doing this: > > > > crash> kmem -p | tail -5 > > c04dcf60 57fb000 0 0 1 400 > > c04dcf80 57fc000 0 0 1 400 > > c04dcfa0 57fd000 0 0 1 400 > > c04dcfc0 57fe000 0 0 1 400 > > c04dcfe0 57ff000 0 0 1 400 > > crash> rd -p 57ff000 > > 57ff000: ef9f0000 .... > > crash> > > > > result is as below: > > crash> kmem -p |tail -5 > c19b934c 2ccfb000 0 0 1 400 > c19b9370 2ccfc000 0 0 1 400 > c19b9394 2ccfd000 0 0 1 400 > c19b93b8 2ccfe000 0 0 1 400 > c19b93dc 2ccff000 0 0 1 400 > > crash> rd -p 2ccff000 > 2ccff000: fffdffff .... > > > > Also, can you confirm that your kernel's symbol list starts > > at c0000000, i.e., something like this: > > > > crash> sym -l > > c0004000 (A) swapper_pg_dir > > c0008000 (t) .init > > c0008000 (T) __init_begin > > c0008000 (T) _sinittext > > c0008000 (T) _stext > > c0008000 (T) stext > > c0008040 (t) __create_page_tables > > c00080e4 (t) __enable_mmu_loc > > c00080f0 (t) __error_a > > c00080f4 (t) __lookup_machine_type > > c0008128 (t) __lookup_machine_type_data > > ... > > > > I just want to make sure that the kernel symbols actually start > > at c000000, and not c2000000. > > > > yes, the symbols actually start from c0000000: > > crash> sym -l > c0004000 (A) swapper_pg_dir > c0005fb8 (A) __crc_scsi_host_get > c0008000 (t) .head.text > c0008000 (T) _text > c0008000 (T) stext > c0008050 (t) __create_page_tables > c0008104 (t) __turn_mmu_on_loc > c0008110 (T) secondary_startup > > > > > 2. i am wondering how does crash utility translate virtual > > > address to > > > physical address before and after it get the kernel page table? > > > before get kernel page table, does it calculate as : > > > (virtual_addr - > > > p_vaddr + p_paddr) ? after get kernel page table, does it walk > > > through the page table and find out the real physical address > > > accordingly? > > > > For kernel unity-mapped kernel virtual addresses, it's not > > necessary > > to walk the page tables. It simply does this: > > > > #define VTOP(X) \ > > ((unsigned > > long)(X)-(machdep->kvbase)+(machdep->machspec->phys_base)) > > > > You can check your machdep->kvbase and machdep->machspec->phys_base > > values by entering "help -m", for example: > > > > crash> help -m | grep -e kvbase -e phys_base > > kvbase: c0000000 > > phys_base: 0 > > crash> > > > > my result is as below, should be ok: > > crash> help -m | grep -e kvbase -e phys_base > kvbase: c0000000 > phys_base: 200000 > > > > Certainly vmalloc (and user-space) virtual addresses require a page > > table walkthough, but the arm_kvtop() function does this: > > > > static int > > arm_kvtop(struct task_context *tc, ulong kvaddr, physaddr_t *paddr, > > int verbose) > > { > > if (!IS_KVADDR(kvaddr)) > > return FALSE; > > > > if (!vt->vmalloc_start) { > > *paddr = VTOP(kvaddr); > > return TRUE; > > } > > > > if (!IS_VMALLOC_ADDR(kvaddr)) { > > *paddr = VTOP(kvaddr); <=== unity-mapped kernel virtual addresses > > if (!verbose) > > return TRUE; > > } > > > > return arm_vtop(kvaddr, (ulong *)vt->kernel_pgd[0], paddr, > > verbose); > > } > > > > and where vmalloc addresses fall through and arm_vtop() is called > > to walk > > the page tables. > > > > However, you can translate unity-mapped addresses using the kernel > > page tables > > with the "vtop" command, as shown in the "vtop c000000" example > > above. > > > > > 3. my real purpose is to get the ftrace content from dump file by > > > crash utility , but seem the command trace is not for this case, > > > do > > > i need to compile the extension "trace" of crash utility? is > > > there > > > any guide to follow? > > > > That's correct. You can do this: > > > > $ wget http://people.redhat.com/anderson/crash-6.1.2.tar.gz > > ... > > $ tar xvzmf crash-6.1.2.tar.gz > > ... > > $ cd crash-6.1.2 > > $ make > > ... > > $ make extensions > > ... > > $ ./crash vmlinux vmcore > > ... > > crash> extend trace.so > > ./extensions/trace.so: shared object loaded > > crash> help trace > > ... > > i have made the trace extension work, however, trace show need trace-cmd, > but in my ubuntu PC, run "sudo apt-get install trace-cmd", i get below error: > E: Couldn't find package trace-cmd > > by Google, i found that there is a project > git://git.kernel.org/pub/scm/linux/kernel/git/rostedt/trace-cmd.git. > maybe i can only compile this tool and use it ? > That should work: $ rpm -qi trace-cmd Name : trace-cmd Version : 1.2 Release : 4.20120606git8266dff.fc17 Architecture: x86_64 Install Date: Mon 26 Nov 2012 03:22:24 PM EST Group : Development/Tools Size : 298546 License : GPLv2 and LGPLv2 Signature : RSA/SHA256, Sun 16 Sep 2012 01:37:17 PM EDT, Key ID 50e94c991aca3465 Source RPM : trace-cmd-1.2-4.20120606git8266dff.fc17.src.rpm Build Date : Thu 13 Sep 2012 05:34:55 PM EDT Build Host : buildvm-04.phx2.fedoraproject.org Relocations : (not relocatable) Packager : Fedora Project Vendor : Fedora Project URL : http://git.kernel.org/?p=linux/kernel/git/rostedt/trace-cmd.git;a=summary Summary : A user interface to Ftrace Description : trace-cmd is a user interface to Ftrace. Instead of needing to use the debugfs directly, trace-cmd will handle of setting of options and tracers and will record into a data file. $ Dave -- Crash-utility mailing list Crash-utility(a)redhat.com https://www.redhat.com/mailman/listinfo/crash-utility

Dave Anderson

10:47 a.m.

----- Original Message -----

...

anyway, i can just ignore this issue right now. the remaining issue is how to make my procedures automatically. my purpose is to get trace ring buffer from kernel dump file and create a new file which name is like trace_$date.txt. the $date should be the time this text file is generated. what i am doing now is to create a .crashrc file in the ./ directory, and try to use file to make everything automatically. i need to make below procedures automatically: 1. load trace.so automatically. it is ok now. 2. read current date. it is ok now. 3. dump the kernel trace ring buffer into new text file the name of which contains current date. cannot do this now. 4. exit crash utility automatically. it is ok now. my .crashrc is as below, i don't know how to do the step 3 above: ---------------------- script start ------------------------------------------------------- echo $(date '+%Y%m%d_%H%M%S') <------ it is ok #cur=$(date '+%Y%m%d_%H%M%S') <----- it failed to use variable to store date #echo $cur <------it failed extend trace.so <------ it is ok trace report >./report_test.txt <------- to dump to a file which uses fixed file name, it is ok. trace report >./trace_$(date '+%Y%m%d_%H%M%S').txt <--- it failed when exit ---------------------- script end ----------------------------------------------- please help, thanks.

For a shell command, you would need to precede the command with a "|" or a "!". But the variable assignment would not persist, because each line in the .crashrc (or any input file) is done as separate shell invocation. For example, this input file has four separate shell invocations: crash> !cat input !cur=$(date '+%Y%m%d_%H%M%S'); echo $cur !echo cur: $cur |cur=$(date '+%Y%m%d_%H%M%S'); echo $cur |echo cur: $cur crash> So the "echo" commands have no knowledge of the prior shell command's setting of the "cur" variable: crash> < input 20130121_112503 cur: 20130121_112503 cur: crash> And with respect to this: crash> trace report >./trace_$(date '+%Y%m%d_%H%M%S').txt When the command is invoked from within the crash utility, crash simply opens a file with the name "./trace_$(date '+%Y%m%d_%H%M%S').txt", and redirects the output to it. The crash utility does not do any kind of parsing of the output file name for any embedded $(shell-cmd) or `shell-cmd` usage. Maybe you could do something like this in your .crashrc file:

...

trace_output

!ln trace_output $(date '+%Y%m%d_%H%M%S') extend trace.so trace report

...

trace_output

!rm -f output Dave

卜弋天

Tuesday, 22 January Tue, 22 Jan

12:05 a.m.

thank you very much Dave, i have resolved this issue by using your code.your suggestion and explaination are very helpful.

...

Date: Mon, 21 Jan 2013 11:47:06 -0500 From: anderson(a)redhat.com To: crash-utility(a)redhat.com Subject: Re: [Crash-utility] questions about crash utility ----- Original Message ----- > anyway, i can just ignore this issue right now. > the remaining issue is how to make my procedures automatically. > my purpose is to get trace ring buffer from kernel dump file and create a new > file which name is like trace_$date.txt. the $date should be the time this text file > is generated. > what i am doing now is to create a .crashrc file in the ./ directory, and try to > use file to make everything automatically. i need to make below procedures > automatically: > 1. load trace.so automatically. it is ok now. > 2. read current date. it is ok now. > 3. dump the kernel trace ring buffer into new text file the name of which contains current date. cannot do this now. > 4. exit crash utility automatically. it is ok now. > > my .crashrc is as below, i don't know how to do the step 3 above: > > ---------------------- script start ------------------------------------------------------- > > echo $(date '+%Y%m%d_%H%M%S') <------ it is ok > > #cur=$(date '+%Y%m%d_%H%M%S') <----- it failed to use variable to store date > #echo $cur <------it failed > > extend trace.so <------ it is ok > > trace report >./report_test.txt <------- to dump to a file which uses fixed file name, it is ok. > > > trace report >./trace_$(date '+%Y%m%d_%H%M%S').txt <--- it failed when exit > ---------------------- script end ----------------------------------------------- > > please help, thanks. For a shell command, you would need to precede the command with a "|" or a "!". But the variable assignment would not persist, because each line in the .crashrc (or any input file) is done as separate shell invocation. For example, this input file has four separate shell invocations: crash> !cat input !cur=$(date '+%Y%m%d_%H%M%S'); echo $cur !echo cur: $cur |cur=$(date '+%Y%m%d_%H%M%S'); echo $cur |echo cur: $cur crash> So the "echo" commands have no knowledge of the prior shell command's setting of the "cur" variable: crash> < input 20130121_112503 cur: 20130121_112503 cur: crash> And with respect to this: crash> trace report >./trace_$(date '+%Y%m%d_%H%M%S').txt When the command is invoked from within the crash utility, crash simply opens a file with the name "./trace_$(date '+%Y%m%d_%H%M%S').txt", and redirects the output to it. The crash utility does not do any kind of parsing of the output file name for any embedded $(shell-cmd) or `shell-cmd` usage. Maybe you could do something like this in your .crashrc file: > trace_output !ln trace_output $(date '+%Y%m%d_%H%M%S') extend trace.so trace report > trace_output !rm -f output Dave -- Crash-utility mailing list Crash-utility(a)redhat.com https://www.redhat.com/mailman/listinfo/crash-utility

4596

days inactive

4601

days old

devel@lists.crash-utility.osci.io

Manage subscription

8 comments

2 participants

tags (0)

participants (2)

Dave Anderson
卜弋天

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

questions about crash utility