Hello Kazu,

RTLD_DEEPBIND is not a good way. I'd like to recall the patch.
After my research, I find making functions and global variables static within extensions can avoid such problems.

Taking extensions/trace.so as example, trace_fini and trace_init are non-static functions, if we load trace.so first, everything works fine, but when we rename trace.so to trace2.so and load it again, we are expecting trace_init within trace2.so to be called, but since we have RTLD_GLOBAL, actually it's the trace_init within trace.so to be called. It causes problems because currently some global variables within trace.so are not in their initial status.

As I quoting dlopen manual:
Symbol references in the shared object are resolved using (in order): symbols in the link map of objects loaded for the main program and its dependencies; symbols in shared objects (and their dependencies) that were previously opened with dlopen() using the  RTLD_GLOBAL flag; and definitions in the shared object itself (and any dependencies that were loaded for that object).

When using readelf --dyn-syms trace.so, for non-static trace_fini and trace_init, I got:
Symbol table '.dynsym' contains 66 entries:
    ....
    64: 0000000000007e65    22 FUNC    GLOBAL DEFAULT   11 trace_fini
    65: 0000000000007e3d    40 FUNC    GLOBAL DEFAULT   11 trace_init

Since I'm not an expert to linkers and loaders, I guess the process is:

1. when loading trace.so
2. ld.so sees trace_init and trace_fini in .dynsym
3. ld.so searches the 2 symbols in main program, RTLD_GLOBAL shared objects, and trace.so.
4. ld.so finds them in trace.so, resolves their address.
5. everything works fine.

6. when loading trace2.so
7. ld.so sees trace_init and trace_fini in .dynsym
8. ld.so searches the 2 symbols in main program, RTLD_GLOBAL shared objects, and trace2.so.
9. ld.so finds them in RTLD_GLOBAL shared objects(trace.so), resolves their address.
10. trace_init in trace.so is called again, but global variables are not in initial status, fails.

If I make trace_fini and trace_init static, readelf --dyn-syms trace.so will no longer give the 2 symbols in .dynsym, and ld.so will no longer resolve them to the wrong place, then it works fine.

I think this cannot be a code patch, but a document for future extension developers, if you don't want to export a symbol to subsequent extensions, making it static.

On Mon, Mar 22, 2021 at 1:35 PM HAGIO KAZUHITO(萩尾 一仁) <k-hagio-ab@nec.com> wrote:
Hi Tao Liu,

-----Original Message-----
> If a same extension(Eg: extensions/trace.so) with two different names are loaded by
> "extend" command twice, it sometimes segfaults crash.
>
> It's because crash uses RTLD_NOW|RTLD_GLOBAL flags of dlopen to load an extension.
> RTDL_GLOBAL will make symbols defined by this shared object available for
> symbol resolution of subsequently loaded shared objects. So symbols with the same
> name will be exported from the former to the latter. In this case, when 2 extensions
> only differ from file names, the subsequently loaded extension will have unexpected
> initial values for global varibles.

Thanks for the report.

> This patch adds RTLD_DEEPBIND flag to dlopen, making extensions using its
> own symbols preference to symbols with the same name contained by others.

This looks a big API change for crash extension modules.

As far as I've tested, getopt() in an extension module does not work well
with this patch:

# make extensions

crash> extend extensions/echo.so
./extensions/echo.so: shared object loaded
crash> echo test
test
crash> echo test

crash> echo test test2
test2
crash> echo test test2

crash> echo test

crash> echo test test2

crash> echo test test2 test3
test3

Can we fix this?  And probably all other modules using getopt() imitates
this echo.c, they will also need to be fixed to adopt the patch.
Also I'm concerned that there might be another regression.

Do we need to fix the issue at these costs?  or is there any better way?

Thanks,
Kazu

>
> Signed-off-by: Tao Liu <ltao@redhat.com>
> ---
>  extensions.c | 2 +-
>  1 file changed, 1 insertion(+), 1 deletion(-)
>
> diff --git a/extensions.c b/extensions.c
> index d23b1e3..e07f9a9 100644
> --- a/extensions.c
> +++ b/extensions.c
> @@ -317,7 +317,7 @@ load_extension(char *lib)
>          *  _init() function before dlopen() returns below.
>       */
>       pc->curext = ext;
> -     ext->handle = dlopen(ext->filename, RTLD_NOW|RTLD_GLOBAL);
> +     ext->handle = dlopen(ext->filename, RTLD_NOW|RTLD_GLOBAL|RTLD_DEEPBIND);
>
>       if (!ext->handle) {
>               strcpy(buf, dlerror());
> --
> 2.29.2
>
> --
> Crash-utility mailing list
> Crash-utility@redhat.com
> https://listman.redhat.com/mailman/listinfo/crash-utility


--
Crash-utility mailing list
Crash-utility@redhat.com
https://listman.redhat.com/mailman/listinfo/crash-utility