I have a large set of words (about 10,000) and I need to find if any of those words appear in a given block of text. Is there a faster algorithm than doing a simple text search for each of the words in the block of text?


Algorithm for multiple word matching in text

3 Answers

Load the 10,000 words into a hash table, then check each word in the block of text to see whether it has an entry in the hash.

I don't know whether this is faster; it is just another method, and the result will depend on how many words you are searching for.

A simple Perl example:


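use strict;
use warnings;

my $word_block = "the guy went afk after being popped by a brownrabbit";

# Load the word list (one word per line after __DATA__) into a hash.
my %hash;
while (<DATA>) { chomp; $hash{$_} = 1; }

# split ' ' breaks on runs of whitespace, so repeated spaces yield no empty words.
foreach my $word (split ' ', $word_block)
{
    print "found word: $word\n" if exists $hash{$word};
}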

__DATA__
afk
lol
brownrabbit
popped
garbage
trash
sitdown
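
The example above matches exact, whitespace-separated tokens, so "popped," or "AFK" in the text would be missed. Below is a minimal sketch of the same hash-lookup idea with lowercasing and punctuation stripping added. The helper name `find_listed_words` and the definition of a word as a run of ASCII letters, digits, or apostrophes are assumptions made for illustration; they are not part of the answer above.

```perl
use strict;
use warnings;

# Hypothetical helper: returns the distinct words from $text that appear in
# the list referenced by $word_list, in order of first appearance.
sub find_listed_words {
    my ($text, $word_list) = @_;

    # Build the lookup table once; lowercase keys make matching case-insensitive.
    my %wanted;
    $wanted{ lc $_ } = 1 for @$word_list;

    my (@found, %seen);
    # Assumption: a "word" is a run of ASCII letters, digits, or apostrophes.
    for my $token (lc($text) =~ /[a-z0-9']+/g) {
        next unless $wanted{$token};
        push @found, $token unless $seen{$token}++;   # report each word once
    }
    return @found;
}

my @word_list = qw(afk lol brownrabbit popped garbage trash sitdown);
my $text      = "The guy went AFK after being popped, by a brownrabbit. Popped!";

print "found word: $_\n" for find_listed_words($text, \@word_list);
```

This makes a single pass over the text plus a single pass over the word list, and each hash lookup takes expected constant time, so the work does not grow with one full text scan per search word. Note that it only finds whole words; a listed word embedded inside a longer word will not match.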


answered Oct 02 '22 14:10


Tags:

text

algorithm

search

Enrico Detoma


user105033

Cuga

FryGuy
