<h3>The question</h3> <p>What is the difference between <code>Cwd::cwd</code> and <code>Cwd::getcwd</code> in Perl, generally, without regard to any specific platform? Why does Perl have both? What is the intended use, which one should I use in which scenarios? (Example use cases will be appreciated.) Does it matter? (Assuming I don’t mix them.) Does choice of either one affect portability in any way? Which one is more commonly used in modules?</p> <p>Even if I interpret the manual is saying that except for corner cases <code>cwd</code> is <code>`pwd`</code> and <code>getcwd</code> just calls <code>getcwd</code> from <code>unistd.h</code>, what is the actual difference? This works only on POSIX systems, anyway.</p> <p>I can always read the implementation but that tells me nothing about the meaning of those functions. Implementation details may change, not so defined meaning. (Otherwise a breaking change occurs, which is serious business.)</p> <h3>What does the manual say</h3> <p>Quoting Perl’s Cwd module manpage:</p> <blockquote> <p>Each of these functions are called without arguments and return the absolute path of the current working directory.</p> <ul> <li> <p>getcwd</p> <p><code>my $cwd = getcwd();</code></p> <p>Returns the current working directory.</p> <p>Exposes the POSIX function getcwd(3) or re-implements it if it's not available.</p> </li> <li> <p>cwd</p> <p><code>my $cwd = cwd();</code></p> <p>The cwd() is the most natural form for the current architecture. For most systems it is identical to `pwd` (but without the trailing line terminator).</p> </li> </ul> </blockquote> <p>And in the Notes section:</p> <blockquote> <ul> <li>Actually, on Mac OS, the <code>getcwd()</code>, <code>fastgetcwd()</code> and <code>fastcwd()</code> functions are all aliases for the <code>cwd()</code> function, which, on Mac OS, calls `pwd`. Likewise, the <code>abs_path()</code> function is an alias for <code>fast_abs_path()</code> </li> </ul> </blockquote> <p>OK, I know that on Mac OS<sup>1</sup> there is no difference between <code>getcwd()</code> and <code>cwd()</code> as both actually boil down to <code>`pwd`</code>. But what on other platforms? (<em>I’m especially interested in Debian Linux.</em>)</p> <hr> <p><sup>1</sup> Classic Mac OS, not OS X. <code>$^O</code> values are <code>MacOS</code> and <code>darwin</code> for Mac OS and OS X, respectively. <em>Thanks, @tobyink and @ikegami.</em></p> <p><sub>And a little meta-question: How to avoid asking similar questions for other modules with very similar functions? Is there a universal way of discovering the difference, other than digging through the implementation? (<em>Currently, I think that if the documentation is not clear about intended use and differences, I have to ask someone more experienced or read the implementation myself.</em>)</sub></p>

<h3>Generally speaking</h3> <p>I think the idea is that <code>cwd()</code> always resolves to the external, OS-specific way of getting the current working directory. That is, running <code>pwd</code> on Linux, <code>command /c cd</code> on DOS, <code>/usr/bin/fullpath -t</code> in QNX, and so on — all examples are from actual <code>Cwd.pm</code>. The <code>getcwd()</code> is supposed to use the POSIX system call if it is available, and falls back to the <code>cwd()</code> if not.</p> <p>Why we have both? In the current implementation I believe exporting just <code>getcwd()</code> would be enough for most of systems, but who knows why the logic of “if syscall is available, use it, else run <code>cwd()</code>” can fail on some system (e.g. on MorphOS in Perl 5.6.1).</p> <h3>On Linux</h3> <p>On Linux, <code>cwd()</code> will run <code>`/bin/pwd`</code> (will actually execute the binary and get its output), while <code>getcwd()</code> will issue <code>getcwd(2)</code> system call.</p> <h3>Actual effect inspected via <code>strace</code> </h3> <p>One can use <code>strace(1)</code> to see that in action:</p> <p>Using <code>cwd()</code>:</p> <pre class="prettyprint lang-none prettyprint-override"><code>$ strace -f perl -MCwd -e 'cwd(); ' 2>&1 | grep execve execve("/usr/bin/perl", ["perl", "-MCwd", "-e", "cwd(); "], [/* 27 vars */]) = 0 [pid 31276] execve("/bin/pwd", ["/bin/pwd"], [/* 27 vars */] <unfinished ...> [pid 31276] <... execve resumed> ) = 0 </code></pre> <p>Using <code>getcwd()</code>:</p> <pre class="prettyprint lang-none prettyprint-override"><code>$ strace -f perl -MCwd -e 'getcwd(); ' 2>&1 | grep execve execve("/usr/bin/perl", ["perl", "-MCwd", "-e", "getcwd(); "], [/* 27 vars */]) = 0 </code></pre> <h3>Reading <code>Cwd.pm</code> source</h3> <p>You can take a look at the sources (<code>Cwd.pm</code>, e.g. in CPAN) and see that for Linux <code>cwd()</code> call is mapped to <code>_backtick_pwd</code> which, as the name suggests, calls the <code>pwd</code> in backticks.</p> <p>Here is a snippet from <code>Cwd.pm</code>, with my comments:</p> <pre class="prettyprint"><code>unless ($METHOD_MAP{$^O}{cwd} or defined &cwd) { ... # some logic to find the pwd binary here, $found_pwd_cmd is set to 1 on Linux ... if( $os eq 'MacOS' || $found_pwd_cmd ) { *cwd = \&_backtick_pwd; # on Linux we actually go here } else { *cwd = \&getcwd; } } </code></pre> <h3>Performance benchmark</h3> <p>Finally, the difference between two is that <code>cwd()</code>, which calls another binary, must be slower. We can make some kind of a performance test:</p> <pre class="prettyprint lang-none prettyprint-override"><code>$ time perl -MCwd -e 'for (1..10000) { cwd(); }' real 0m7.177s user 0m0.380s sys 0m1.440s </code></pre> <p>Now compare it with the system call:</p> <pre class="prettyprint lang-none prettyprint-override"><code>$ time perl -MCwd -e 'for (1..10000) { getcwd(); }' real 0m0.018s user 0m0.009s sys 0m0.008s </code></pre> <h3>Discussion, choice</h3> <p>But as you don't usually query the current working directory too often, both options will work — unless you cannot spawn any more processes for some reason related to <code>ulimit</code>, out of memory situation, etc.</p> <p>Finally, as for selecting which one to use: for Linux, I would always use <code>getcwd()</code>. I suppose you will need to make your tests and select which function to use if you are going to write a portable piece of code that will run on some really strange platform (here, of course, Linux, OS X, and Windows are not in the list of strange platforms).</p>

How do Perl Cwd::cwd and Cwd::getcwd functions differ?

The question

What is the difference between Cwd::cwd and Cwd::getcwd in Perl, generally, without regard to any specific platform? Why does Perl have both? What is the intended use, which one should I use in which scenarios? (Example use cases will be appreciated.) Does it matter? (Assuming I don’t mix them.) Does choice of either one affect portability in any way? Which one is more commonly used in modules?

Even if I interpret the manual is saying that except for corner cases cwd is `pwd` and getcwd just calls getcwd from unistd.h, what is the actual difference? This works only on POSIX systems, anyway.

I can always read the implementation but that tells me nothing about the meaning of those functions. Implementation details may change, not so defined meaning. (Otherwise a breaking change occurs, which is serious business.)

What does the manual say

Quoting Perl’s Cwd module manpage:

Each of these functions are called without arguments and return the absolute path of the current working directory.

getcwd

my $cwd = getcwd();

Returns the current working directory.

Exposes the POSIX function getcwd(3) or re-implements it if it's not available.

cwd

my $cwd = cwd();

The cwd() is the most natural form for the current architecture. For most systems it is identical to `pwd` (but without the trailing line terminator).

And in the Notes section:

Actually, on Mac OS, the getcwd(), fastgetcwd() and fastcwd() functions are all aliases for the cwd() function, which, on Mac OS, calls `pwd`. Likewise, the abs_path() function is an alias for fast_abs_path()

OK, I know that on Mac OS¹ there is no difference between getcwd() and cwd() as both actually boil down to `pwd`. But what on other platforms? (I’m especially interested in Debian Linux.)

¹ Classic Mac OS, not OS X. $^O values are MacOS and darwin for Mac OS and OS X, respectively. Thanks, @tobyink and @ikegami.

_{And a little meta-question: How to avoid asking similar questions for other modules with very similar functions? Is there a universal way of discovering the difference, other than digging through the implementation? (Currently, I think that if the documentation is not clear about intended use and differences, I have to ask someone more experienced or read the implementation myself.)}

811

asked Aug 09 '14 15:08

Palec

1 Answers

Generally speaking

I think the idea is that cwd() always resolves to the external, OS-specific way of getting the current working directory. That is, running pwd on Linux, command /c cd on DOS, /usr/bin/fullpath -t in QNX, and so on — all examples are from actual Cwd.pm. The getcwd() is supposed to use the POSIX system call if it is available, and falls back to the cwd() if not.

Why we have both? In the current implementation I believe exporting just getcwd() would be enough for most of systems, but who knows why the logic of “if syscall is available, use it, else run cwd()” can fail on some system (e.g. on MorphOS in Perl 5.6.1).

On Linux

On Linux, cwd() will run `/bin/pwd` (will actually execute the binary and get its output), while getcwd() will issue getcwd(2) system call.

Actual effect inspected via `strace`

One can use strace(1) to see that in action:

Using cwd():

$ strace -f perl -MCwd -e 'cwd(); ' 2>&1 | grep execve
execve("/usr/bin/perl", ["perl", "-MCwd", "-e", "cwd(); "], [/* 27 vars */]) = 0
[pid 31276] execve("/bin/pwd", ["/bin/pwd"], [/* 27 vars */] <unfinished ...>
[pid 31276] <... execve resumed> )      = 0

Using getcwd():

$ strace -f perl -MCwd -e 'getcwd(); ' 2>&1 | grep execve
execve("/usr/bin/perl", ["perl", "-MCwd", "-e", "getcwd(); "], [/* 27 vars */]) = 0

Reading `Cwd.pm` source

You can take a look at the sources (Cwd.pm, e.g. in CPAN) and see that for Linux cwd() call is mapped to _backtick_pwd which, as the name suggests, calls the pwd in backticks.

Here is a snippet from Cwd.pm, with my comments:

unless ($METHOD_MAP{$^O}{cwd} or defined &cwd) {
    ...
    # some logic to find the pwd binary here, $found_pwd_cmd is set to 1 on Linux
    ...
    if( $os eq 'MacOS' || $found_pwd_cmd )
    {
        *cwd = \&_backtick_pwd;  # on Linux we actually go here
    }
    else {
        *cwd = \&getcwd;
    }
}

Performance benchmark

Finally, the difference between two is that cwd(), which calls another binary, must be slower. We can make some kind of a performance test:

$ time perl -MCwd -e 'for (1..10000) { cwd(); }'

real    0m7.177s
user    0m0.380s
sys     0m1.440s

Now compare it with the system call:

$ time perl -MCwd -e 'for (1..10000) { getcwd(); }'

real    0m0.018s
user    0m0.009s
sys     0m0.008s

Discussion, choice

But as you don't usually query the current working directory too often, both options will work — unless you cannot spawn any more processes for some reason related to ulimit, out of memory situation, etc.

Finally, as for selecting which one to use: for Linux, I would always use getcwd(). I suppose you will need to make your tests and select which function to use if you are going to write a portable piece of code that will run on some really strange platform (here, of course, Linux, OS X, and Windows are not in the list of strange platforms).

131

answered Sep 18 '22 14:09

afenster

Related questions
                            
                                Python: execfile from other file's working directory?
                            
                                Is it possible to pass a workingdirectory with a space to MSBuild EXEC command task?
                            
                                Python file open() in Enthought Canopy fails with: "IOError No such file or directory"
                            
                                Setting a custom working directory for a process started with exec
                            
                                How to get working directory in node-webkit
                            
                                Python in VSCode: Set working directory to python file's path everytime
                            
                                Find Git Revision of a Working Directory Missing the .git Directory
                            
                                git init, add, commit from a different directory
                            
                                How to set the startup directory in Git Bash?
                            
                                Change pytest working directory to test case directory
                            
                                Can I import a patch without touching the working directory?
                            
                                Find Install directory and working directory of VSTO Outlook Addin; or any Office Addin
                            
                                How to get the current (working) directory in Scala?
                            
                                How to change directory in mysql command line tool?
                            
                                Difference between Current Directory and Working Directory in Windows
                            
                                C# Get working directory of another process
                            
                                Visual studio 2017 Developer Command Prompt switches current directory
                            
                                Moving down a folder in working directory
                            
                                Command for "Set working directory to source file location"

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How do Perl Cwd::cwd and Cwd::getcwd functions differ?

Tags:

perl

working-directory

getcwd

The question

What does the manual say

Palec

People also ask

1 Answers

Generally speaking

On Linux

Actual effect inspected via `strace`

Reading `Cwd.pm` source

Performance benchmark

Discussion, choice

afenster

Recent Activity

Donate For Us

How do Perl Cwd::cwd and Cwd::getcwd functions differ?

Tags:

perl

working-directory

getcwd

The question

What does the manual say

Palec

People also ask

1 Answers

Generally speaking

On Linux

Actual effect inspected via strace

Reading Cwd.pm source

Performance benchmark

Discussion, choice

afenster

Related questions

Recent Activity

Donate For Us

Actual effect inspected via `strace`

Reading `Cwd.pm` source