You are viewing the version of this documentation from Perl 5.41.12. This is a development version of Perl.

CONTENTS

NAME

perlclib - Interacting with standard C library functions

DESCRIPTION

The perl interpreter is written in C; XS code also expands to C. Inevitably, this code will call some functions from the C library, libc. This document gives some guidance on interfacing with that library.

One thing Perl porters should note is that perl doesn't tend to use that much of the C standard library internally; you'll see very little use of, for example, the ctype.h functions in there. This is because Perl tends to reimplement or abstract standard library functions, so that we know exactly how they're going to operate.

libc functions to avoid

There are many many libc functions. Most of them are fair game to use, but some are not. Some of the possible reasons are:

The following commentary and tables give some functions in the first column that shouldn't be used in C or XS code, with the preferred alternative (if any) in the second column.

Conventions

In the following tables:

~

marks the function as deprecated; it should not be used regardless.

t

is a type.

p

is a pointer.

n

is a number.

s

is a string.

sv, av, hv, etc. represent variables of their respective types.

File Operations

Instead of the stdio.h functions, you should use the Perl abstraction layer. Instead of FILE* types, you need to be handling PerlIO* types. Don't forget that with the new PerlIO layered I/O abstraction FILE* types may not even be available. See also the perlapio documentation for more information about the following functions:

Instead Of:                 Use:

stdin                       PerlIO_stdin()
stdout                      PerlIO_stdout()
stderr                      PerlIO_stderr()

fopen(fn, mode)             PerlIO_open(fn, mode)
freopen(fn, mode, stream)   PerlIO_reopen(fn, mode, perlio) (Dep-
                              recated)
fflush(stream)              PerlIO_flush(perlio)
fclose(stream)              PerlIO_close(perlio)

File Input and Output

Instead Of:                 Use:

fprintf(stream, fmt, ...)   PerlIO_printf(perlio, fmt, ...)

[f]getc(stream)             PerlIO_getc(perlio)
[f]putc(stream, n)          PerlIO_putc(perlio, n)
ungetc(n, stream)           PerlIO_ungetc(perlio, n)

Note that the PerlIO equivalents of fread and fwrite are slightly different from their C library counterparts:

fread(p, size, n, stream)   PerlIO_read(perlio, buf, numbytes)
fwrite(p, size, n, stream)  PerlIO_write(perlio, buf, numbytes)

fputs(s, stream)            PerlIO_puts(perlio, s)

There is no equivalent to fgets (or the deprecated gets); one should use sv_gets instead:

fgets(s, n, stream)         sv_gets(sv, perlio, append)

File Positioning

Instead Of:                 Use:

feof(stream)                PerlIO_eof(perlio)
fseek(stream, n, whence)    PerlIO_seek(perlio, n, whence)
rewind(stream)              PerlIO_rewind(perlio)

fgetpos(stream, p)          PerlIO_getpos(perlio, sv)
fsetpos(stream, p)          PerlIO_setpos(perlio, sv)

ferror(stream)              PerlIO_error(perlio)
clearerr(stream)            PerlIO_clearerr(perlio)

Memory Management and String Handling

Instead Of:                    Use:

t* p = malloc(n)               Newx(p, n, t)
t* p = calloc(n, s)            Newxz(p, n, t)
p = realloc(p, n)              Renew(p, n, t)

It is not portable to try to allocate 0 bytes; allocating 1 or more is portable. Never pass pointers between Newx, Renew, Safefree and libc equivalents malloc, realloc, free. They are not from the same memory pool or allocator. Either an instant or delayed SEGV will occur, or subtle memory leaks or subtle heap corruption.

memcpy(dst, src, n)            Copy(src, dst, n, t)
memmove(dst, src, n)           Move(src, dst, n, t)
memcpy(dst, src, sizeof(t))    StructCopy(src, dst, t)

Notice the different order of arguments to Copy and Move than used in memcpy and memmove.

memset(dst, 0, n * sizeof(t))  Zero(dst, n, t)
memzero(dst, 0)                Zero(dst, n, char)
free(p)                        Safefree(p)

strdup(p)                      savepv(p)
strndup(p, n)                  savepvn(p, n) (Hey, strndup doesn't
                                              exist!)

Sometimes instead of zeroing the allocated heap by using Newxz() you should consider "poisoning" the data. This means writing a bit pattern into it that should be illegal as pointers (and floating point numbers), and also hopefully surprising enough as integers, so that any code attempting to use the data without forethought will break sooner rather than later. Poisoning can be done using the Poison() macros, which have similar arguments to Zero():

PoisonWith(dst, n, t, b)    scribble memory with byte b
PoisonNew(dst, n, t)        equal to PoisonWith(dst, n, t, 0xAB)
PoisonFree(dst, n, t)       equal to PoisonWith(dst, n, t, 0xEF)
Poison(dst, n, t)           equal to PoisonFree(dst, n, t)

strstr(big, little)            instr(big, little)
memmem(big, blen, little, len) ninstr(big, bigend, little, little_end)
strcmp(s1, s2)                 strLE(s1, s2) / strEQ(s1, s2)
                                             / strGT(s1,s2)
strncmp(s1, s2, n)             strnNE(s1, s2, n) / strnEQ(s1, s2, n)

memcmp(p1, p2, n)              memNE(p1, p2, n)
!memcmp(p1, p2, n)             memEQ(p1, p2, n)

Most of the time, though, you'll want to be dealing with SVs internally instead of raw char * strings:

strlen(s)                   sv_len(sv)
strcpy(dt, src)             sv_setpv(sv, s)
strncpy(dt, src, n)         sv_setpvn(sv, s, n)
strcat(dt, src)             sv_catpv(sv, s)
strncat(dt, src)            sv_catpvn(sv, s)

If you do need raw strings, use these instead:

my_strnlen(s, maxlen)
my_strlcpy(dt, src, sizeof(dt))
my_strlcat(dt, src, sizeof(dt))

Similiarly, you can use SVs for creating strings from formats

sprintf(s, fmt, ...)         sv_setpvf(sv, fmt, ...)
vsprintf(str, fmt, va_list)  sv_vsetpvf(sv, fmt, va_list)

Or for raw strings,

my_snprintf(dt, len, fmt, ...)
my_vsnprintf(dt, len, fmt, va_list)
vsprintf(str, fmt, va_list)  sv_vsnprintf(sv, fmt, va_list)

Note also the existence of sv_catpvf and sv_vcatpvfn, combining concatenation with formatting; and Perl_form() for another form of formatted populating.

Note that glibc printf(), sprintf(), etc. are buggy before glibc version 2.17. They won't allow a %.s format with a precision to create a string that isn't valid UTF-8 if the current underlying locale of the program is UTF-8. What happens is that the %s and its operand are simply skipped without any notice. https://sourceware.org/bugzilla/show_bug.cgi?id=6530.

Character Class Tests

There are several types of character class tests that Perl implements. All are more fully described in "Character classification" in perlapi and "Character case changing" in perlapi.

The C library routines listed in the table below return values based on the current locale. Use the entries in the final column for that functionality. The other two columns always assume a POSIX (or C) locale. The entries in the ASCII column are only meaningful for ASCII inputs, returning FALSE for anything else. Use these only when you know that is what you want. The entries in the Latin1 column assume that the non-ASCII 8-bit characters are as Unicode defines them, the same as ISO-8859-1, often called Latin 1.

Instead Of:  Use for ASCII:   Use for Latin1:      Use for locale:

isalnum(c)  isALPHANUMERIC(c) isALPHANUMERIC_L1(c) isALPHANUMERIC_LC(c)
isalpha(c)  isALPHA(c)        isALPHA_L1(c)        isALPHA_LC(u )
isascii(c)  isASCII(c)                             isASCII_LC(c)
isblank(c)  isBLANK(c)        isBLANK_L1(c)        isBLANK_LC(c)
iscntrl(c)  isCNTRL(c)        isCNTRL_L1(c)        isCNTRL_LC(c)
isdigit(c)  isDIGIT(c)        isDIGIT_L1(c)        isDIGIT_LC(c)
isgraph(c)  isGRAPH(c)        isGRAPH_L1(c)        isGRAPH_LC(c)
islower(c)  isLOWER(c)        isLOWER_L1(c)        isLOWER_LC(c)
isprint(c)  isPRINT(c)        isPRINT_L1(c)        isPRINT_LC(c)
ispunct(c)  isPUNCT(c)        isPUNCT_L1(c)        isPUNCT_LC(c)
isspace(c)  isSPACE(c)        isSPACE_L1(c)        isSPACE_LC(c)
isupper(c)  isUPPER(c)        isUPPER_L1(c)        isUPPER_LC(c)
isxdigit(c) isXDIGIT(c)       isXDIGIT_L1(c)       isXDIGIT_LC(c)

tolower(c)  toLOWER(c)        toLOWER_L1(c)
toupper(c)  toUPPER(c)

For the corresponding functions like iswupper(), etc., use isUPPER_uvchr() for non-locale; or isUPPER_LC_uvchr() for locale. And use toLOWER_uvchr() instead of towlower(), etc.. There are no direct equivalents for locale; best to put the string into an SV.

Don't use any of the functions like isalnum_l(). Those are non-portable, and interfere with Perl's internal handling.

To emphasize that you are operating only on ASCII characters, you can append _A to each of the macros in the ASCII column: isALPHA_A, isDIGIT_A, and so on.

(There is no entry in the Latin1 column for isascii even though there is an isASCII_L1, which is identical to isASCII; the latter name is clearer. There is no entry in the Latin1 column for toupper because the result can be non-Latin1. You have to use toUPPER_uvchr, as described in "Character case changing" in perlapi.)

Note that the libc caseless comparisons are crippled; Unicode provides a richer set, using the concept of folding. If you need more than equality/non-equality, it's probably best to store your strings in an SV and use SV functions to do the comparision. Similarly for collation.

stdlib.h functions

Instead Of:                 Use:

atof(s)                     my_atof(s) or Atof(s)
atoi(s)                     grok_atoUV(s, &uv, &e)
atol(s)                     grok_atoUV(s, &uv, &e)
strtod(s, &p)               Strtod(s, &p)
strtol(s, &p, n)            Strtol(s, &p, b)
strtoul(s, &p, n)           Strtoul(s, &p, b)

But note that even the alternative functions are subject to locale; see "Dealing with locales".

Typical use is to do range checks on uv before casting:

int i; UV uv;
char* end_ptr = input_end;
if (grok_atoUV(input, &uv, &end_ptr)
    && uv <= INT_MAX)
  i = (int)uv;
  ... /* continue parsing from end_ptr */
} else {
  ... /* parse error: not a decimal integer in range 0 .. MAX_IV */
}

Notice also the grok_bin, grok_hex, and grok_oct functions in numeric.c for converting strings representing numbers in the respective bases into NVs. Note that grok_atoUV() doesn't handle negative inputs, or leading whitespace (being purposefully strict).

Miscellaneous functions

You should not even want to use setjmp.h functions, but if you think you do, use the JMPENV stack in scope.h instead.

~asctime()              Perl_sv_strftime_tm()
~asctime_r()            Perl_sv_strftime_tm()
 chsize()               my_chsize()
~ctime()                Perl_sv_strftime_tm()
~ctime_r()              Perl_sv_strftime_tm()
~cuserid()              DO NOT USE; see its man page
 dirfd()                my_dirfd()
 duplocale()            Perl_setlocale()
~ecvt()                 my_snprintf()
~endgrent_r()           endgrent()
~endhostent_r()         endhostent()
~endnetent_r()          endnetent()
~endprotoent_r()        endprotoent()
~endpwent_r()           endpwent()
~endservent_r()         endservent()
~endutent()             endutxent()
 exit(n)                my_exit(n)
~fcvt()                 my_snprintf()
 freelocale()           Perl_setlocale()
~ftw()                  nftw()
 getenv(s)              PerlEnv_getenv(s)
~gethostbyaddr()        getaddrinfo()
~gethostbyname()        getnameinfo()
~getpass()              DO NOT USE; see its man page
~getpw()                getpwuid()
~getutent()             getutxent()
~getutid()              getutxid()
~getutline()            getutxline()
~gsignal()              DO NOT USE; see its man page
 localeconv()           Perl_localeconv()
 mblen()                mbrlen()
 mbtowc()               mbrtowc()
 newlocale()            Perl_setlocale()
 pclose()               my_pclose()
 popen()                my_popen()
~pututline()            pututxline()
~qecvt()                my_snprintf()
~qfcvt()                my_snprintf()
 querylocale()          Perl_setlocale()
 int rand()             double Drand01()
 srand(n)               { seedDrand01((Rand_seed_t)n);
                          PL_srand_called = TRUE; }
~readdir_r()            readdir()
 realloc()              saferealloc(), Renew() or Renewc()
~re_comp()              regcomp()
~re_exec()              regexec()
~rexec()                rcmd()
~rexec_af()             rcmd()
 setenv(s, val)         my_setenv(s, val)
~setgrent_r()           setgrent()
~sethostent_r()         sethostent()
 setlocale()            Perl_setlocale()
 setlocale_r()          Perl_setlocale()
~setnetent_r()          setnetent()
~setprotoent_r()        setprotoent()
~setpwent_r()           setpwent()
~setservent_r()         setservent()
~setutent()             setutxent()
 sigaction()            rsignal(signo, handler)
~siginterrupt()         rsignal() with the SA_RESTART flag instead
 signal(signo, handler) rsignal(signo, handler)
~ssignal()              DO NOT USE; see its man page
 strcasecmp()           a Perl foldEQ-family function
 strerror()             sv_string_from_errnum()
 strerror_l()           sv_string_from_errnum()
 strerror_r()           sv_string_from_errnum()
 strftime()             Perl_sv_strftime_tm()
 strtod()               my_strtod() or Strtod()
 system(s)              Don't. Look at pp_system or use my_popen.
~tempnam()              mkstemp()
~tmpnam()               mkstemp()
 tmpnam_r()             mkstemp()
 uselocale()            Perl_setlocale()
 vsnprintf()            my_vsnprintf()
 wctob()                wcrtomb()
 wctomb()               wcrtomb()
 wsetlocale()           Perl_setlocale()

The Perl-furnished alternatives are documented in perlapi, which you should peruse anyway to see what all is available to you.

The lists are incomplete. Think when using an unlisted function if it seems likely to interfere with Perl.

Dealing with locales

Like it or not, your code will be executed in the context of a locale, as are all C language programs. See perllocale. Most libc calls are not affected by the locale, but a surprising number are:

addmntent()           getspent_r()        sethostent()
alphasort()           getspnam()          sethostent_r()
asctime()             getspnam_r()        setnetent()
asctime_r()           getwc()             setnetent_r()
asprintf()            getwchar()          setnetgrent()
atof()                glob()              setprotoent()
atoi()                gmtime()            setprotoent_r()
atol()                gmtime_r()          setpwent()
atoll()               grantpt()           setpwent_r()
btowc()               iconv_open()        setrpcent()
catopen()             inet_addr()         setservent()
ctime()               inet_aton()         setservent_r()
ctime_r()             inet_network()      setspent()
cuserid()             inet_ntoa()         sgetspent_r()
daylight              inet_ntop()         shm_open()
dirname()             inet_pton()         shm_unlink()
dprintf()             initgroups()        snprintf()
endaliasent()         innetgr()           sprintf()
endgrent()            iruserok()          sscanf()
endgrent_r()          iruserok_af()       strcasecmp()
endhostent()          isalnum()           strcasestr()
endhostent_r()        isalnum_l()         strcoll()
endnetent()           isalpha()           strerror()
endnetent_r()         isalpha_l()         strerror_l()
endprotoent()         isascii()           strerror_r()
endprotoent_r()       isascii_l()         strfmon()
endpwent()            isblank()           strfmon_l()
endpwent_r()          isblank_l()         strfromd()
endrpcent()           iscntrl()           strfromf()
endservent()          iscntrl_l()         strfroml()
endservent_r()        isdigit()           strftime()
endspent()            isdigit_l()         strftime_l()
err()                 isgraph()           strncasecmp()
error()               isgraph_l()         strptime()
error_at_line()       islower()           strsignal()
errx()                islower_l()         strtod()
fgetwc()              isprint()           strtof()
fgetwc_unlocked()     isprint_l()         strtoimax()
fgetws()              ispunct()           strtol()
fgetws_unlocked()     ispunct_l()         strtold()
fnmatch()             isspace()           strtoll()
forkpty()             isspace_l()         strtoq()
fprintf()             isupper()           strtoul()
fputwc()              isupper_l()         strtoull()
fputwc_unlocked()     iswalnum()          strtoumax()
fputws()              iswalnum_l()        strtouq()
fputws_unlocked()     iswalpha()          strverscmp()
fscanf()              iswalpha_l()        strxfrm()
fwprintf()            iswblank()          swprintf()
fwscanf()             iswblank_l()        swscanf()
getaddrinfo()         iswcntrl()          syslog()
getaliasbyname_r()    iswcntrl_l()        timegm()
getaliasent_r()       iswdigit()          timelocal()
getdate()             iswdigit_l()        timezone
getdate_r()           iswgraph()          tolower()
getfsent()            iswgraph_l()        tolower_l()
getfsfile()           iswlower()          toupper()
getfsspec()           iswlower_l()        toupper_l()
getgrent()            iswprint()          towctrans()
getgrent_r()          iswprint_l()        towlower()
getgrgid()            iswpunct()          towlower_l()
getgrgid_r()          iswpunct_l()        towupper()
getgrnam()            iswspace()          towupper_l()
getgrnam_r()          iswspace_l()        tzname
getgrouplist()        iswupper()          tzset()
gethostbyaddr()       iswupper_l()        ungetwc()
gethostbyaddr_r()     iswxdigit()         vasprintf()
gethostbyname()       iswxdigit_l()       vdprintf()
gethostbyname2()      isxdigit()          verr()
gethostbyname2_r()    isxdigit_l()        verrx()
gethostbyname_r()     localeconv()        versionsort()
gethostent()          localtime()         vfprintf()
gethostent_r()        localtime_r()       vfscanf()
gethostid()           MB_CUR_MAX          vfwprintf()
getlogin()            mblen()             vprintf()
getlogin_r()          mbrlen()            vscanf()
getmntent()           mbrtowc()           vsnprintf()
getmntent_r()         mbsinit()           vsprintf()
getnameinfo()         mbsnrtowcs()        vsscanf()
getnetbyaddr()        mbsrtowcs()         vswprintf()
getnetbyaddr_r()      mbstowcs()          vsyslog()
getnetbyname()        mbtowc()            vwarn()
getnetbyname_r()      mktime()            vwarnx()
getnetent()           nan()               vwprintf()
getnetent_r()         nanf()              warn()
getnetgrent()         nanl()              warnx()
getnetgrent_r()       nl_langinfo()       wcrtomb()
getprotobyname()      openpty()           wcscasecmp()
getprotobyname_r()    printf()            wcschr()
getprotobynumber()    psiginfo()          wcscoll()
getprotobynumber_r()  psignal()           wcsftime()
getprotoent()         putpwent()          wcsncasecmp()
getprotoent_r()       putspent()          wcsnrtombs()
getpw()               putwc()             wcsrchr()
getpwent()            putwchar()          wcsrtombs()
getpwent_r()          regcomp()           wcstod()
getpwnam()            regexec()           wcstof()
getpwnam_r()          res_nclose()        wcstoimax()
getpwuid()            res_ninit()         wcstold()
getpwuid_r()          res_nquery()        wcstombs()
getrpcbyname_r()      res_nquerydomain()  wcstoumax()
getrpcbynumber_r()    res_nsearch()       wcswidth()
getrpcent_r()         res_nsend()         wcsxfrm()
getrpcport()          rpmatch()           wctob()
getservbyname()       ruserok()           wctomb()
getservbyname_r()     ruserok_af()        wctrans()
getservbyport()       scandir()           wctype()
getservbyport_r()     scanf()             wcwidth()
getservent()          setaliasent()       wordexp()
getservent_r()        setgrent()          wprintf()
getspent()            setgrent_r()        wscanf()

(The list doesn't include functions that manipulate the locale, such as setlocale().)

If any of these functions are called directly or indirectly from your code, you are affected by the current locale.

The first thing to know about this list is that there are better alternatives to many of the functions, which it's highly likely that you should be using instead. See "libc functions to avoid" above. This includes using Perl IO perlapio.

The second thing to know is that Perl is documented to not pay attention to the current locale except for code executed within the scope of a use locale statement. If you violate that, you may be creating bugs, depending on the application.

The next thing to know is that many of these functions depend only on the locale in regards to numeric values. Your code is likely to have been written expecting that the decimal point (radix) character is a dot (U+002E: FULL STOP), and that strings of integer numbers are not separated into groups (1,000,000 in an American locale means a million; your code is likely not expecting the commas.) The good news is that normally (as of Perl v5.22), your code will get called with the locale set so those expectations are met. Explicit action has to be taken to change this (described a little ways below). This is accomplished by Perl not actually switching into a locale that doesn't conform to these expectations, except when explicitly told to do so. The Perl input/output and formatting routines do this switching for you automatically, if appropriate, and then switch back. If, for some reason, you need to do it yourself, the easiest way from C and XS code is to use the macro "WITH_LC_NUMERIC_SET_TO_NEEDED" in perlapi. You can wrap this macro around an entire block of code that you want to be executed in the correct environment. The bottom line is that your code is likely to work as expected in this regard without you having to take any action.

This leaves the remaining functions. Your code will get called with all but the numeric locale portions set to the underlying locale. Often, the locale is of not much import to your code, and you also won't have to take any action; things will just work out. But you should examine the man pages of the ones you use to verify this. Often, Perl has better ways of doing the same functionality. Consider using SVs and their access routines rather than calling the low level functions that, for example, find how many bytes are in a UTF-8 encoded character.

You can determine if you have been called from within the scope of a use locale by using the boolen macro "IN_LOCALE" in perlapi.

If you need to not be in the underlying locale, you can call "Perl_setlocale" in perlapi to change it temporarily to the one you need (likely the "C" locale), and then change it back before returning. This can be very problematic on threaded perls on some platforms. See "Dealing with embedded perls and threads".

A problem with changing the locale of a single category is that mojibake can arise on some platforms if the LC_CTYPE category and the changed one are not the same. On the platforms that that isn't an issue, the preprocessor directive LIBC_HANDLES_MISMATCHED_CTYPE will be defined. Otherwise, you may have to change more than one category to correctly accomplish your task. And, there will be many locale combinations where the mojibake likely won't happen, so you won't be confronted with this until the code gets executed in the field by someone who doesn't speak your language very well.

Earlier we mentioned that explicit action is required to have your code get called with the numeric portions of the locale not meeting the typical expectations of having a dot for the radix character and no punctuation separating groups of digits. That action is to call the function "switch_to_global_locale" in perlapi.

switch_to_global_locale() was written initially to cope with the Tk library, but is general enough for other similar situations. Tk changes the global locale to match its expectations (later versions of it allow this to be turned off). This presents a conflict with Perl thinking it also controls the locale. Calling this function tells Perl to yield control. Calling "sync_locale" in perlapi tells Perl to take control again, accepting whatever the locale has been changed to in the interim. If your code is called during that interim, all portions of the locale will be the raw underlying values. Should you need to manipulate numbers, you are on your own with regard to the radix character and grouping. If you find yourself in this situation, it is generally best to make the interval between the calls to these two functions as short as possible, and avoid calculations until after perl has control again.

It is important for perl to know about all the possible locale categories on the platform, even if they aren't apparently used in your program. Perl knows all of the Linux ones. If your platform has others, you can submit an issue at https://github.com/Perl/perl5/issues for inclusion of it in the next release. In the meantime, it is possible to edit the Perl source to teach it about the category, and then recompile. Search for instances of, say, LC_PAPER in the source, and use that as a template to add the omitted one.

There are further complications under multi-threaded operation. Keep on reading.

Dealing with embedded perls and threads

It is possible to embed a Perl interpreter within a larger program. See perlembed.

MULTIPLICITY is the way this is accomplished internally; it is described in "How multiple interpreters and concurrency are supported" in perlguts. Multiple Perl interpreters may be embedded.

It is also possible to compile perl to support threading. See perlthrtut. Perl's implementation of threading requires MULTIPLICITY, but not the other way around.

MULTIPLICITY without threading means that only one thing runs at a time, so there are no concurrency issues, but each component or instance can affect the global state, potentially interfering with the execution of other instances. This can happen if one instance:

If your code doesn't do any of these things, nor depends on any of their values, then Congratulations!!, you don't have to worry about MULTIPLICITY or threading. But wait, a surprising number of libc functions do depend on data global to the process in some way that may not be immediately obvious. For example, calling strtok(3) changes the global state of a process, and thus needs special attention.

The section 3 libc uses that we know about that have MULTIPLICITY and/or multi-thread issues are:

addmntent()             getrpcent_r()        re_exec()
alphasort()             getrpcport()         regcomp()
asctime()               getservbyname()      regerror()
asctime_r()             getservbyname_r()    regexec()
asprintf()              getservbyport()      res_nclose()
atof()                  getservbyport_r()    res_ninit()
atoi()                  getservent()         res_nquery()
atol()                  getservent_r()       res_nquerydomain()
atoll()                 getspent()           res_nsearch()
basename()              getspent_r()         res_nsend()
btowc()                 getspnam()           rexec()
catgets()               getspnam_r()         rexec_af()
catopen()               getttyent()          rpmatch()
clearenv()              getttynam()          ruserok()
clearerr_unlocked()     getusershell()       ruserok_af()
crypt()                 getutent()           scandir()
crypt_gensalt()         getutid()            scanf()
crypt_r()               getutline()          secure_getenv()
ctermid()               getutxent()          seed48()
ctermid_r()             getutxid()           seed48_r()
ctime()                 getutxline()         setaliasent()
ctime_r()               getwc()              setcontext()
cuserid()               getwchar()           setenv()
daylight                getwchar_unlocked()  setfsent()
dbm_clearerr()          getwc_unlocked()     setgrent()
dbm_close()             glob()               setgrent_r()
dbm_delete()            gmtime()             sethostent()
dbm_error()             gmtime_r()           sethostent_r()
dbm_fetch()             grantpt()            sethostid()
dbm_firstkey()          hcreate()            setkey()
dbm_nextkey()           hcreate_r()          setlocale()
dbm_open()              hdestroy()           setlocale_r()
dbm_store()             hdestroy_r()         setlogmask()
dirname()               hsearch()            setnetent()
dlerror()               hsearch_r()          setnetent_r()
dprintf()               iconv()              setnetgrent()
drand48()               iconv_open()         setprotoent()
drand48_r()             inet_addr()          setprotoent_r()
ecvt()                  inet_aton()          setpwent()
encrypt()               inet_network()       setpwent_r()
endaliasent()           inet_ntoa()          setrpcent()
endfsent()              inet_ntop()          setservent()
endgrent()              inet_pton()          setservent_r()
endgrent_r()            initgroups()         setspent()
endhostent()            initstate_r()        setstate_r()
endhostent_r()          innetgr()            setttyent()
endnetent()             iruserok()           setusershell()
endnetent_r()           iruserok_af()        setutent()
endnetgrent()           isalnum()            setutxent()
endprotoent()           isalnum_l()          sgetspent()
endprotoent_r()         isalpha()            sgetspent_r()
endpwent()              isalpha_l()          shm_open()
endpwent_r()            isascii()            shm_unlink()
endrpcent()             isascii_l()          siginterrupt()
endservent()            isblank()            sleep()
endservent_r()          isblank_l()          snprintf()
endspent()              iscntrl()            sprintf()
endttyent()             iscntrl_l()          srand48()
endusershell()          isdigit()            srand48_r()
endutent()              isdigit_l()          srandom_r()
endutxent()             isgraph()            sscanf()
erand48()               isgraph_l()          ssignal()
erand48_r()             islower()            strcasecmp()
err()                   islower_l()          strcasestr()
error()                 isprint()            strcoll()
error_at_line()         isprint_l()          strerror()
errx()                  ispunct()            strerror_l()
ether_aton()            ispunct_l()          strerror_r()
ether_ntoa()            isspace()            strfmon()
execlp()                isspace_l()          strfmon_l()
execvp()                isupper()            strfromd()
execvpe()               isupper_l()          strfromf()
exit()                  iswalnum()           strfroml()
__fbufsize()            iswalnum_l()         strftime()
fcloseall()             iswalpha()           strftime_l()
fcvt()                  iswalpha_l()         strncasecmp()
fflush_unlocked()       iswblank()           strptime()
fgetc_unlocked()        iswblank_l()         strsignal()
fgetgrent()             iswcntrl()           strtod()
fgetpwent()             iswcntrl_l()         strtof()
fgetspent()             iswdigit()           strtoimax()
fgets_unlocked()        iswdigit_l()         strtok()
fgetwc()                iswgraph()           strtol()
fgetwc_unlocked()       iswgraph_l()         strtold()
fgetws()                iswlower()           strtoll()
fgetws_unlocked()       iswlower_l()         strtoq()
fnmatch()               iswprint()           strtoul()
forkpty()               iswprint_l()         strtoull()
__fpending()            iswpunct()           strtoumax()
fprintf()               iswpunct_l()         strtouq()
__fpurge()              iswspace()           strverscmp()
fputc_unlocked()        iswspace_l()         strxfrm()
fputs_unlocked()        iswupper()           swapcontext()
fputwc()                iswupper_l()         swprintf()
fputwc_unlocked()       iswxdigit()          swscanf()
fputws()                iswxdigit_l()        sysconf()
fputws_unlocked()       isxdigit()           syslog()
fread_unlocked()        isxdigit_l()         system()
fscanf()                jrand48()            tdelete()
__fsetlocking()         jrand48_r()          tempnam()
fts_children()          l64a()               tfind()
fts_read()              lcong48()            timegm()
ftw()                   lcong48_r()          timelocal()
fwprintf()              lgamma()             timezone
fwrite_unlocked()       lgammaf()            tmpnam()
fwscanf()               lgammal()            tmpnam_r()
gamma()                 localeconv()         tolower()
gammaf()                localtime()          tolower_l()
gammal()                localtime_r()        toupper()
getaddrinfo()           login()              toupper_l()
getaliasbyname()        login_tty()          towctrans()
getaliasbyname_r()      logout()             towlower()
getaliasent()           logwtmp()            towlower_l()
getaliasent_r()         lrand48()            towupper()
getchar_unlocked()      lrand48_r()          towupper_l()
getcontext()            makecontext()        tsearch()
getc_unlocked()         mallinfo()           ttyname()
get_current_dir_name()  MB_CUR_MAX           ttyname_r()
getdate()               mblen()              ttyslot()
getdate_r()             mbrlen()             twalk()
getenv()                mbrtowc()            twalk_r()
getfsent()              mbsinit()            tzname
getfsfile()             mbsnrtowcs()         tzset()
getfsspec()             mbsrtowcs()          ungetwc()
getgrent()              mbstowcs()           unsetenv()
getgrent_r()            mbtowc()             updwtmp()
getgrgid()              mcheck()             utmpname()
getgrgid_r()            mcheck_check_all()   va_arg()
getgrnam()              mcheck_pedantic()    valloc()
getgrnam_r()            mktime()             vasprintf()
getgrouplist()          mprobe()             vdprintf()
gethostbyaddr()         mrand48()            verr()
gethostbyaddr_r()       mrand48_r()          verrx()
gethostbyname()         mtrace()             versionsort()
gethostbyname2()        muntrace()           vfprintf()
gethostbyname2_r()      nan()                vfscanf()
gethostbyname_r()       nanf()               vfwprintf()
gethostent()            nanl()               vprintf()
gethostent_r()          newlocale()          vscanf()
gethostid()             nftw()               vsnprintf()
getlogin()              nl_langinfo()        vsprintf()
getlogin_r()            nrand48()            vsscanf()
getmntent()             nrand48_r()          vswprintf()
getmntent_r()           openpty()            vsyslog()
getnameinfo()           perror()             vwarn()
getnetbyaddr()          posix_fallocate()    vwarnx()
getnetbyaddr_r()        printf()             vwprintf()
getnetbyname()          profil()             warn()
getnetbyname_r()        psiginfo()           warnx()
getnetent()             psignal()            wcrtomb()
getnetent_r()           ptsname()            wcscasecmp()
getnetgrent()           putchar_unlocked()   wcschr()
getnetgrent_r()         putc_unlocked()      wcscoll()
getopt()                putenv()             wcsftime()
getopt_long()           putpwent()           wcsncasecmp()
getopt_long_only()      putspent()           wcsnrtombs()
getpass()               pututline()          wcsrchr()
getprotobyname()        pututxline()         wcsrtombs()
getprotobyname_r()      putwc()              wcstod()
getprotobynumber()      putwchar()           wcstof()
getprotobynumber_r()    putwchar_unlocked()  wcstoimax()
getprotoent()           putwc_unlocked()     wcstold()
getprotoent_r()         pvalloc()            wcstombs()
getpw()                 qecvt()              wcstoumax()
getpwent()              qfcvt()              wcswidth()
getpwent_r()            querylocale()        wcsxfrm()
getpwnam()              rand()               wctob()
getpwnam_r()            random_r()           wctomb()
getpwuid()              rcmd()               wctrans()
getpwuid_r()            rcmd_af()            wctype()
getrpcbyname()          readdir()            wcwidth()
getrpcbyname_r()        readdir64()          wordexp()
getrpcbynumber()        readdir64_r()        wprintf()
getrpcbynumber_r()      readdir_r()          wscanf()
getrpcent()             re_comp()            wsetlocale()

(If you know of additional functions that are unsafe on some platform or another, notify us via filing a bug report at https://github.com/Perl/perl5/issues.)

Some of these are safe under MULTIPLICITY, problematic only under threading. If a use doesn't appear in the above list, we think it is MULTIPLICITY and thread-safe on all platforms.

All the uses listed above are function calls, except for these:

daylight  MB_CUR_MAX  timezone  tzname

There are three main approaches to coping with issues involving these constructs, each suitable for different circumstances:

Without threading, you don't have to worry about being interrupted by the system giving control to another thread. With threading, you will have to uses mutexes, and be concerned with the possibility of deadlock.

Functions always unsuitable for use under multi-threads

A few functions are considered totally unsuited for use in a multi-thread environment. These must be called only during single-thread operation.

 endusershell()    @getaliasent()      muntrace()   rexec()
 ether_aton()      @getrpcbyname()     profil()     rexec_af()
 ether_ntoa()      @getrpcbynumber()   rcmd()       setusershell()
 fts_children()    @getrpcent()        rcmd_af()    ttyslot()
 fts_read()         getusershell()     re_comp()
@getaliasbyname()   mtrace()           re_exec()

@ above marks the functions for which there are preferred alternatives available on some platforms, and those alternatives may be suitable for multi-thread use.

Functions which must be called at least once before starting threads

Some functions perform initialization on their first call that must be done while still in a single-thread environment, but subsequent calls are thread-safe when executed in a critical section. Therefore, they must be called at least once before switching to multi-threads:

getutent()  getutline()  getutxid()    mallinfo()  valloc()
getutid()   getutxent()  getutxline()  pvalloc()

Functions that are thread-safe when called with appropriate arguments

Some of the functions are thread-safe if called with arguments that comply with certain (easily met) restrictions. These are:

ctermid()        mbrlen()      mbsrtowcs()  wcrtomb()
cuserid()        mbrtowc()     tmpnam()     wcsnrtombs()
error_at_line()  mbsnrtowcs()  va_arg()     wcsrtombs()

See the man pages of each for details. (For completeness, the list includes functions that you shouldn't be using anyway because of other reasons.)