Expected inputs and outputs: <pre class="prettyprint"><code>a -> a a.txt -> a archive.tar.gz -> archive directory/file -> file d.x.y.z/f.a.b.c -> f logs/date.log.txt -> date # Mine! </code></pre> Here's my implementation that feels dirty to me: <pre class="prettyprint"><code>>>> from pathlib import Path >>> example_path = Path("August 08 2015, 01'37'30.log.txt") >>> example_path.stem "August 08 2015, 01'37'30.log" >>> example_path.suffixes ['.log', '.txt'] >>> suffixes_length = sum(map(len, example_path.suffixes)) >>> true_stem = example_path.name[:-suffixes_length] >>> true_stem "August 08 2015, 01'37'30" </code></pre> Because it breaks on <code>Path</code>s without suffixes: <pre class="prettyprint"><code>>>> ns_path = Path("no_suffix") >>> sl = sum(map(len, ns_path.suffixes)) >>> ns_path.name[:-sl] '' </code></pre> So I need to check if the <code>Path</code> has a suffix first: <pre class="prettyprint"><code>>>> def get_true_stem(path: Path): ... if path.suffix: ... sl = sum(map(len, path.suffixes)) ... return path.name[:-sl] ... else: ... return path.stem ... >>> >>> get_true_stem(example_path) "August 08, 2015, 01'37'30" >>> get_true_stem(ns_path) "no_suffix" </code></pre> And this is my current use case: <pre class="prettyprint"><code>>>> file_date = datetime.strptime(true_stem, "%B %d %Y, %H'%M'%S") >>> file_date datetime.datetime(2015, 8, 8, 1, 37, 30) >>> new_dest = format(file_date, "%Y-%m-%dT%H:%M:%S%z") + ".log" # ISO-8601 >>> shutil.move(str(example_path), new_dest) </code></pre> Thanks.

You could just <code>.split</code> it: <pre class="prettyprint"><code>>>> Path('logs/date.log.txt').stem.split('.')[0] 'date' </code></pre> <code>os.path</code> works just as well: <pre class="prettyprint"><code>>>> os.path.basename('logs/date.log.txt').split('.')[0] 'date' </code></pre> It passes all of the tests: <pre class="prettyprint"><code>In [11]: all(Path(k).stem.split('.')[0] == v for k, v in { ....: 'a': 'a', ....: 'a.txt': 'a', ....: 'archive.tar.gz': 'archive', ....: 'directory/file': 'file', ....: 'd.x.y.z/f.a.b.c': 'f', ....: 'logs/date.log.txt': 'date' ....: }.items()) Out[11]: True </code></pre>

How about a while loop method, where you keep taking <code>.stem</code> until the path has no suffixes remaining , Example - <pre class="prettyprint"><code>from pathlib import Path example_path = Path("August 08 2015, 01'37'30.log.txt") example_path_stem = example_path.stem while example_path.suffixes: example_path_stem = example_path.stem example_path = Path(example_path_stem) </code></pre> Please note, the while loop exits the loop when <code>example_path.suffixes</code> returns an empty list (As empty list are False like in boolean context) . <hr> Example/Demo - <pre class="prettyprint"><code>>>> from pathlib import Path >>> example_path = Path("August 08 2015, 01'37'30.log.txt") >>> example_path_stem = example_path.stem >>> while example_path.suffixes: ... example_path_stem = example_path.stem ... example_path = Path(example_path_stem) ... >>> example_path_stem "August 08 2015, 01'37'30" </code></pre> For your second input - <code>no_suffix</code> - <pre class="prettyprint"><code>>>> example_path = Path("no_suffix") >>> example_path_stem = example_path.stem >>> while example_path.suffixes: ... example_path_stem = example_path.stem ... example_path = Path(example_path_stem) ... >>> example_path_stem 'no_suffix' </code></pre>

Clean way to get the "true" stem of a Path object?

Tags:

python

path

pathlib

Expected inputs and outputs:

a                 -> a
a.txt             -> a
archive.tar.gz    -> archive
directory/file    -> file
d.x.y.z/f.a.b.c   -> f
logs/date.log.txt -> date # Mine!

Here's my implementation that feels dirty to me:

>>> from pathlib import Path
>>> example_path = Path("August 08 2015, 01'37'30.log.txt")
>>> example_path.stem
"August 08 2015, 01'37'30.log"
>>> example_path.suffixes
['.log', '.txt']
>>> suffixes_length = sum(map(len, example_path.suffixes))
>>> true_stem = example_path.name[:-suffixes_length]
>>> true_stem
"August 08 2015, 01'37'30"

Because it breaks on Paths without suffixes:

>>> ns_path = Path("no_suffix")
>>> sl = sum(map(len, ns_path.suffixes))
>>> ns_path.name[:-sl]
''

So I need to check if the Path has a suffix first:

>>> def get_true_stem(path: Path):
...     if path.suffix:
...         sl = sum(map(len, path.suffixes))
...         return path.name[:-sl]
...     else:
...         return path.stem
...
>>>
>>> get_true_stem(example_path)
"August 08, 2015, 01'37'30"
>>> get_true_stem(ns_path)
"no_suffix"

And this is my current use case:

>>> file_date = datetime.strptime(true_stem, "%B %d %Y, %H'%M'%S")
>>> file_date
datetime.datetime(2015, 8, 8, 1, 37, 30)
>>> new_dest = format(file_date, "%Y-%m-%dT%H:%M:%S%z") + ".log" # ISO-8601
>>> shutil.move(str(example_path), new_dest)

Thanks.

377

asked Aug 08 '15 06:08

Navith

6 Answers

You could just .split it:

>>> Path('logs/date.log.txt').stem.split('.')[0]
'date'

os.path works just as well:

>>> os.path.basename('logs/date.log.txt').split('.')[0]
'date'

It passes all of the tests:

In [11]: all(Path(k).stem.split('.')[0] == v for k, v in {
   ....:     'a': 'a',
   ....:     'a.txt': 'a',
   ....:     'archive.tar.gz': 'archive',
   ....:     'directory/file': 'file',
   ....:     'd.x.y.z/f.a.b.c': 'f',
   ....:     'logs/date.log.txt': 'date'
   ....: }.items())
Out[11]: True

111

answered Oct 22 '22 05:10

Blender

How about a while loop method, where you keep taking .stem until the path has no suffixes remaining , Example -

from pathlib import Path
example_path = Path("August 08 2015, 01'37'30.log.txt")
example_path_stem = example_path.stem
while example_path.suffixes:
    example_path_stem = example_path.stem
    example_path = Path(example_path_stem)

Please note, the while loop exits the loop when example_path.suffixes returns an empty list (As empty list are False like in boolean context) .

Example/Demo -

>>> from pathlib import Path
>>> example_path = Path("August 08 2015, 01'37'30.log.txt")
>>> example_path_stem = example_path.stem
>>> while example_path.suffixes:
...     example_path_stem = example_path.stem
...     example_path = Path(example_path_stem)
...
>>> example_path_stem
"August 08 2015, 01'37'30"

For your second input - no_suffix -

>>> example_path = Path("no_suffix")
>>> example_path_stem = example_path.stem
>>> while example_path.suffixes:
...     example_path_stem = example_path.stem
...     example_path = Path(example_path_stem)
...
>>> example_path_stem
'no_suffix'

answered Oct 22 '22 03:10

Anand S Kumar

Here's another possible solution to the given problem:

from pathlib import Path

if __name__ == '__main__':
    dataset = [
        ('a', 'a'),
        ('a.txt', 'a'),
        ('archive.tar.gz', 'archive'),
        ('directory/file', 'file'),
        ('d.x.y.z/f.a.b.c', 'f'),
        ('logs/date.log.txt', 'date'),
    ]
    for path, stem in dataset:
        path = Path(path)
        assert path.name.replace("".join(path.suffixes), "") == stem

answered Oct 22 '22 03:10

BPL

Why not go recursively?

from pathlib import Path

def true_stem(path):
   stem = Path(path).stem
   return stem if stem == path else true_stem(stem)

assert(true_stem('d.x.y.z/f.a.b.c') == 'f')

answered Oct 22 '22 05:10

plankthom

Another approach uses pattern matching:

import re
from pathlib import Path
all(re.search('[.]|',Path(k).name) for k,v in {
   'a': 'a',
   'a.txt': 'a',
   'archive.tar.gz': 'archive',
   'directory/file': 'file',
   'd.x.y.z/f.a.b.c': 'f',
   'logs/date.log.txt': 'date'
   }.items())

the pattern '[.]' may be used if all your paths have at least one suffix

answered Oct 22 '22 03:10

Jim Robinson

If you wanted to use pathlib uniquely, you could also use:

>>> Path('logs/date.log.txt').with_suffix('').stem
'date'

EDIT:

As pointed out in the comments this doesn't work if you have an extension with more than 2 suffixes. Although this doesn't sound very likely (and pathlib itself doesn't have a native way to deal with it), if you wanted to use pathlib uniquely, you could use:

>>> Path('logs/date.log.txt.foo').with_suffix('').with_suffix('').stem
'date'

answered Oct 22 '22 04:10

universvm

Related questions
                            
                                Using the same decorator (with arguments) with functions and methods
                            
                                Python: find a list within members of another list(in order)
                            
                                Image color detection using python
                            
                                How do I install M2Crypto on Ubuntu?
                            
                                SSH Tunnel for Python MySQLdb connection
                            
                                Strange PEP8 recommendation on comparing Boolean values to True or False
                            
                                simple inter-process communication
                            
                                Run BASH built-in commands in Python?
                            
                                Check if file system is case-insensitive in Python
                            
                                Using Python's max to return two equally large values
                            
                                Python: JSON string to list of dictionaries - Getting error when iterating
                            
                                Get IP Address when testing flask application through nosetests
                            
                                How can I get Python to automatically create missing key/value pairs in a dictionary? [duplicate]
                            
                                Python write string of bytes to file
                            
                                What does "if var" mean in python?
                            
                                What is the Difference between PySphere and PyVmomi?
                            
                                Python property returning property object
                            
                                Convert date to float for linear regression on Pandas data frame
                            
                                pg_config executable not found when using pgxnclient on Windows 7 x64
                            
                                How do I catch errors with scrapy so I can do something when I get User Timeout error?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Clean way to get the "true" stem of a Path object?

Tags:

python

path

pathlib

Navith

People also ask

6 Answers

Blender

Anand S Kumar

BPL

plankthom

Jim Robinson

universvm

Recent Activity

Donate For Us