Fulltext Search DynamoDB

The situation is as follows:

I'm storing elements in a DynamoDB table for my customers. The hash key is an element ID and the range key is the customer ID. In addition to these fields I'm storing an array of strings as tags (e.g. ["Pets", "House"]) and a multiline text.

I want to provide a search function in my application where the user can type free text or select tags and get all related elements.

In my opinion a plain DB query is not the right solution. I was playing around with CloudSearch, but I'm not sure it is the right solution either, because every time the user adds a tag the index must be updated...

I hope you have some hints for me.

asked May 31 '17 17:05

SnowMax

1 Answer

You can use an instant-search engine like Typesense to search through data in your DynamoDB table:

https://github.com/typesense/typesense

There's also Elasticsearch, but it has a steep learning curve and can become a beast to manage, given the number of features and configuration options it supports.

At a high level:

1. Turn on DynamoDB Streams for the table
2. Set up an AWS Lambda trigger to listen to these change events
3. Write code inside your Lambda function to index data into Typesense:

import typesense

def lambda_handler(event, context):
    client = typesense.Client({
        'nodes': [{
            'host': '<Endpoint URL>',
            'port': '<Port Number>',
            'protocol': 'https',
        }],
        'api_key': '<API Key>',
        'connection_timeout_seconds': 2
    })

    processed = 0
    for record in event['Records']:
        ddb_record = record['dynamodb']
        if record['eventName'] == 'REMOVE':
            # 'Keys' is present on every stream record; 'id' is this example's key attribute.
            doc_id = str(ddb_record['Keys']['id']['N'])
            res = client.collections['<collection-name>'].documents[doc_id].delete()
        else:
            # 'NewImage' requires a stream view type that includes new images.
            # It is in DynamoDB's typed JSON format, so format your document here,
            # then use the upsert function to index it.
            document = ddb_record['NewImage']
            res = client.collections['<collection-name>'].documents.upsert(document)
        print(res)
        processed = processed + 1
    # Report once, after every record in the batch has been handled.
    print('Successfully processed {} records'.format(processed))
    return processed
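
The "format your document here" step can be sketched roughly as follows. This is a minimal illustration, not the method from the Typesense guide: the helper names from_dynamodb_value and to_search_document are made up for this example, and the attribute names (id, customer_id, tags, text) are assumptions based on the question. It only relies on the fact that stream images wrap each value in a type tag such as {'S': ...} or {'N': ...}, and that Typesense expects the document id to be a string.

```python
from decimal import Decimal

def from_dynamodb_value(value):
    """Convert one typed attribute, e.g. {'S': 'abc'}, to a plain Python value."""
    (type_tag, raw), = value.items()
    if type_tag == 'S':
        return raw
    if type_tag == 'N':
        # Numbers arrive as strings; keep integers as int, everything else as float.
        number = Decimal(raw)
        return int(number) if number == number.to_integral_value() else float(number)
    if type_tag == 'BOOL':
        return raw
    if type_tag == 'NULL':
        return None
    if type_tag == 'L':
        return [from_dynamodb_value(item) for item in raw]
    if type_tag == 'M':
        return {key: from_dynamodb_value(item) for key, item in raw.items()}
    if type_tag == 'SS':
        return sorted(raw)
    if type_tag == 'NS':
        return sorted(from_dynamodb_value({'N': item}) for item in raw)
    raise ValueError('Unsupported DynamoDB type tag: {}'.format(type_tag))

def to_search_document(new_image):
    """Flatten a stream NewImage into a plain dict suitable for indexing."""
    document = {name: from_dynamodb_value(value) for name, value in new_image.items()}
    # The search engine's document id must be a string (assumes an 'id' attribute).
    document['id'] = str(document['id'])
    return document
```

In the handler above, this would be used as document = to_search_document(ddb_record['NewImage']) before the upsert call. If boto3 is available in your Lambda runtime, its TypeDeserializer class covers the same conversion.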

Here's a detailed article from Typesense's docs on how to do this: https://typesense.org/docs/0.19.0/guide/dynamodb-full-text-search.html
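
Once the data is indexed, the free-text-plus-tags search from the question could be expressed with search parameters along these lines. This is a sketch under assumptions: build_search_params is a hypothetical helper, and the field names text, tags and customer_id assume a collection schema that mirrors the question's data model. The q, query_by and filter_by parameters and the := / && filter syntax are Typesense's own.

```python
def build_search_params(customer_id, free_text='', tags=()):
    """Build Typesense search parameters for one customer's elements."""
    # Always restrict results to the current customer (assumed field name).
    filters = ['customer_id:={}'.format(customer_id)]
    if tags:
        # tags:=[A,B] matches documents that carry any of the listed tags.
        filters.append('tags:=[{}]'.format(','.join(tags)))
    return {
        # '*' is the match-all query, used when the user only selects tags.
        'q': free_text or '*',
        'query_by': 'text',
        'filter_by': ' && '.join(filters),
    }
```

The result would be passed to client.collections['<collection-name>'].documents.search(params). Tag or customer values that contain commas or other special characters would need escaping, which this sketch does not handle.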

answered Sep 21 '22 12:09

ErJab

Tags:

amazon-web-services

elasticsearch

amazon-dynamodb

amazon-cloudsearch
