Negating sentences using POS-tagging

Tags:

I'm trying to find a way to negate sentences based on POS-tagging. Please consider:

include_once 'class.postagger.php';

function negate($sentence) {  
  $tagger = new PosTagger('includes/lexicon.txt');
  $tags = $tagger->tag($sentence);
  foreach ($tags as $t) {
    $input[] = trim($t['token']) . "/" . trim($t['tag']) .  " ";
  }
  $sentence = implode(" ", $input);
  $postagged = $sentence;

  // Concatenate "not" to every JJ, RB or VB
  // Todo: ignore negative words (not, never, neither)
  $sentence = preg_replace("/(\w+)\/(JJ|MD|RB|VB|VBD|VBN)\b/", "not$1/$2", $sentence);

  // Remove all POS tags
  $sentence = preg_replace("/\/[A-Z$]+/", "", $sentence);

  return "$postagged<br>$sentence";
}

BTW: In this example, I'm using the POS-tagging implementation and lexicon of Ian Barber. An example of this code running would be:

echo negate("I will never go to their place again");
I/NN will/MD never/RB go/VB to/TO their/PRP$ place/NN again/RB 
I notwill notnever notgo to their place notagain

As you can see, (and this issue is also commented in the code), negating words themselves are being negated as wel: never becomes notnever, which obviously shouldn't happen. Since my regex skills aren't all that, is there a way to exclude these words from the regex used?

[edit] Also, I would very much welcome other comments / critiques you might have in this negating implementation, since I'm sure it's (still) quite flawed :-)

202

asked May 01 '12 13:05

Pr0no

1 Answers

Give this a try:

$sentence = preg_replace("/(\s)(?:(?!never|neither|not)(\w*))\/(JJ|MD|RB|VB|VBD|VBN)\b/", "$1not$2", $sentence);

110

answered Oct 11 '22 18:10

Nate

Related questions
                            
                                PHP Forking: Kill child when it becomes a zombie
                            
                                Access the id of the object inserted after a prepared statement in PHP using MYSQLi
                            
                                What is difference between , (comma) and . (dot) as a concatenation operator?
                            
                                php array reference passing to function
                            
                                How to read wordpress cookies in my website?
                            
                                Doctrine2 + soft delete as a state pattern
                            
                                Most efficient way of detecting and removing elements in array based on elements' first and last value
                            
                                PHP - Reading COM Port from Windows
                            
                                How can I tell if a given string is a valid input to PHP's preg_match?
                            
                                Upgrading huge number of Joomla sites
                            
                                PHP SoapClient creating XML references for identical elements, makes it unacceptable for service
                            
                                image compression in php
                            
                                How to enable php extensions and database support?
                            
                                MySQL string comparison
                            
                                Caching MongoDB objects in PHP
                            
                                Checking if value exists in php array - not working?
                            
                                Video streaming from Android device to LAMP Server
                            
                                Are there any general purpose CRUD client applications?
                            
                                Mysqli insert statement
                            
                                random function: higher values appear less often than lower

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Negating sentences using POS-tagging

Tags:

regex

php

nlp

Pr0no

People also ask

1 Answers

Nate

Recent Activity

Donate For Us