How to capitalize the first letter of every sentence?

Tags:

python

python-3.x

capitalization

I'm trying to write a program that capitalizes the first letter of each sentence. This is what I have so far, but I cannot figure out how to add back the period in between sentences. For example, if I input:

hello. goodbye

the output is

Hello Goodbye

and the period has disappeared.

string=input('Enter a sentence/sentences please:')
sentence=string.split('.')
for i in sentence:
    print(i.capitalize(),end='')


asked Apr 02 '14 02:04

user3307366

2 Answers

You could use nltk for sentence segmentation:

#!/usr/bin/env python3
import textwrap
from pprint import pprint
import nltk.data # $ pip install http://www.nltk.org/nltk3-alpha/nltk-3.0a3.tar.gz
# python -c "import nltk; nltk.download('punkt')"

sent_tokenizer = nltk.data.load('tokenizers/punkt/english.pickle')
text = input('Enter a sentence/sentences please:')
print("\n" + textwrap.fill(text))
sentences = sent_tokenizer.tokenize(text)
sentences = [sent.capitalize() for sent in sentences]
pprint(sentences)

Output

Enter a sentence/sentences please:
a period might occur inside a sentence e.g., see! and the sentence may
end without the dot!
['A period might occur inside a sentence e.g., see!',
 'And the sentence may end without the dot!']
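One caveat with the snippet above: str.capitalize() also lowercases the rest of the string, so names or acronyms inside a sentence lose their capitals. A minimal sketch of a workaround that uppercases only the first character (cap_first is a hypothetical helper name, not part of nltk, and the sample sentences are made up):

```python
def cap_first(sentence):
    """Uppercase only the first character and leave the rest untouched."""
    # Slicing keeps this safe for an empty string.
    return sentence[:1].upper() + sentence[1:]

# Illustrative input, standing in for the tokenizer's output.
sentences = ['see NASA for details.', 'and ask Bob!']

print([s.capitalize() for s in sentences])  # 'NASA' and 'Bob' get lowercased
print([cap_first(s) for s in sentences])    # inner capitals are preserved
```

Swapping sent.capitalize() for a helper like this in the list comprehension should keep the original casing of everything after the first letter.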


answered Nov 01 '22 16:11

jfs

You could use regular expressions. Define a regex that matches the first word of a sentence:

import re
p = re.compile(r'(?<=[\.\?!]\s)(\w+)')

This regex contains a positive lookbehind assertion (?<=...), which requires a ., ? or ! followed by a whitespace character \s immediately before the match. It is followed by a group that matches one or more word characters \w+ (letters, digits and the underscore). In effect, the regex matches the first word after the end of a sentence.

You can define a function that will capitalise regex match objects, and feed this function to sub():

def cap(match):
    return match.group().capitalize()

p.sub(cap, 'Your text here. this is fun! yay.')
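As a self-contained sketch of the steps so far, the following puts the lookbehind pattern and the callback together and prints what sub() returns. The second input is the example from the question, added here to show what this pattern alone does not cover:

```python
import re

# Lookbehind pattern from this answer: a word preceded by '.', '?' or '!'
# and one whitespace character.
p = re.compile(r'(?<=[\.\?!]\s)(\w+)')

def cap(match):
    # match.group() is the whole matched word.
    return match.group().capitalize()

result = p.sub(cap, 'Your text here. this is fun! yay.')
print(result)  # Your text here. This is fun! Yay.

# The very first word has no punctuation in front of it, so it is not matched.
first_word_untouched = p.sub(cap, 'hello. goodbye')
print(first_word_untouched)  # hello. Goodbye
```

Because sub() only replaces the matched words, the punctuation and spacing between sentences are left as they were.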

You might want to do the same for another regex that matches the word at the beginning of a string:

p2 = re.compile(r'^\w+')

Or make the original regex even harder to read, by combining them:

p = re.compile(r'((?<=[\.\?!]\s)(\w+)|(^\w+))')
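A short sketch of the combined pattern applied to the input from the question. The wrapper name capitalize_sentences is hypothetical, chosen only for this illustration:

```python
import re

# Combined pattern: a word after sentence-ending punctuation plus one
# whitespace character, or the word at the very start of the string.
p = re.compile(r'((?<=[\.\?!]\s)(\w+)|(^\w+))')

def capitalize_sentences(text):
    """Capitalize the first word of each sentence, keeping punctuation."""
    return p.sub(lambda m: m.group().capitalize(), text)

print(capitalize_sentences('hello. goodbye'))  # Hello. Goodbye
```

Note that the lookbehind expects exactly one whitespace character after the punctuation, so a sentence that follows two spaces or no space at all would not be capitalized by this pattern.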

answered Nov 01 '22 14:11

desired login
