
sql.DB on AWS Lambda: too many connections

As I understand it, in Go the DB handle is meant to be long-lived and shared between many goroutines.

But when I use Go with AWS Lambda, it's a very different story, since Lambda stops the function when it's finished.

I am using defer db.Close() in the Lambda invoke function, but it has no effect. On the MySQL side, the connection is still kept around as a Sleep query. As a result, it causes too many connections on MySQL.

Currently, I have to set wait_timeout in MySQL to a small number. But that's not the best solution, in my opinion.

Is there any way to close the connection when using the Go SQL driver with Lambda?

Thanks,

nguyenhoai890 asked Jan 09 '19


1 Answer

There are two problems that we need to address:

  • Correctly managing state between lambda invocations
  • Configuring a connection pool

Correctly managing state

Let us understand a bit of how the container is managed by AWS. From the AWS docs:

After a Lambda function is executed, AWS Lambda maintains the execution context for some time in anticipation of another Lambda function invocation. In effect, the service freezes the execution context after a Lambda function completes, and thaws the context for reuse, if AWS Lambda chooses to reuse the context when the Lambda function is invoked again. This execution context reuse approach has the following implications:

  • Any declarations in your Lambda function code (outside the handler code, see Programming Model) remains initialized, providing additional optimization when the function is invoked again. For example, if your Lambda function establishes a database connection, instead of reestablishing the connection, the original connection is used in subsequent invocations. We suggest adding logic in your code to check if a connection exists before creating one.

  • Each execution context provides 500MB of additional disk space in the /tmp directory. The directory content remains when the execution context is frozen, providing transient cache that can be used for multiple invocations. You can add extra code to check if the cache has the data that you stored. For information on deployment limits, see AWS Lambda Limits.

  • Background processes or callbacks initiated by your Lambda function that did not complete when the function ended resume if AWS Lambda chooses to reuse the execution context. You should make sure any background processes or callbacks (in case of Node.js) in your code are complete before the code exits.

This first bullet point says that state is maintained between executions. Let us see this in action:

let counter = 0

module.exports.handler = (event, context, callback) => {
  counter++
  callback(null, { count: counter })
}

If you deploy this and call it multiple times consecutively, you will see that the counter is incremented between calls.

Now that you know that, you should not call defer db.Close(); instead, you should reuse the database instance. You can do that by simply making db a package-level variable.

First, create a database package that will export an Open function:

package database

import (
    "fmt"
    "os"

    _ "github.com/go-sql-driver/mysql"
    "github.com/jinzhu/gorm"
)

var (
    host = os.Getenv("DB_HOST")
    port = os.Getenv("DB_PORT")
    user = os.Getenv("DB_USER")
    name = os.Getenv("DB_NAME")
    pass = os.Getenv("DB_PASS")
)

func Open() (db *gorm.DB) {
    args := fmt.Sprintf("%s:%s@tcp(%s:%s)/%s?parseTime=true", user, pass, host, port, name)
    // Initialize a new db connection.
    db, err := gorm.Open("mysql", args)
    if err != nil {
        panic(err)
    }
    return
}

Then use it on your handler.go file:

package main

import (
    "context"

    "github.com/aws/aws-lambda-go/events"
    "github.com/aws/aws-lambda-go/lambda"
    "github.com/jinzhu/gorm"
    "github.com/<username>/<name-of-lib>/database"
)

var db *gorm.DB

func init() {
    db = database.Open()
}

func Handler() (events.APIGatewayProxyResponse, error) {
    // You can use db here.
    return events.APIGatewayProxyResponse{
        StatusCode: 201,
    }, nil
}

func main() {
    lambda.Start(Handler)
}

Note: don't forget to replace github.com/<username>/<name-of-lib>/database with the right import path.

Now, you might still see the too many connections error. If that happens you will need a connection pool.

Configuring a connection pool

From Wikipedia:

In software engineering, a connection pool is a cache of database connections maintained so that the connections can be reused when future requests to the database are required. Connection pools are used to enhance the performance of executing commands on a database.

You will need a connection pool whose number of allowed connections matches the number of Lambdas running in parallel. You have two choices:

  • MySQL Proxy

MySQL Proxy is a simple program that sits between your client and MySQL server(s) and that can monitor, analyze or transform their communication. Its flexibility allows for a wide variety of uses, including load balancing, failover, query analysis, query filtering and modification, and many more.

  • AWS Aurora:

Amazon Aurora Serverless is an on-demand, auto-scaling configuration for Amazon Aurora (MySQL-compatible edition), where the database will automatically start up, shut down, and scale capacity up or down based on your application's needs. It enables you to run your database in the cloud without managing any database instances. It's a simple, cost-effective option for infrequent, intermittent, or unpredictable workloads.

Regardless of your choice, there are plenty of tutorials on the internet on how to configure both.

celicoo answered Oct 09 '22