String field value length in MongoDB

Tags:

mongodb

field

string-length

The field's data type is String. I would like to fetch the documents where the character length of the name field is greater than 40.

I tried this query, but it returns an error:

db.usercollection.find( {$where: "(this.name.length > 40)"} ).limit(2);

Output:

error: {
    "$err" : "TypeError: Cannot read property 'length' of undefined near '40)' ",
    "code" : 16722
}

This works in 2.4.9, but my version is 2.6.5.

478

asked Apr 11 '15 12:04

SURYA GOKARAJU

2 Answers

For MongoDB 3.6 and newer:

The $expr operator allows the use of aggregation expressions within the query language, thus you can leverage the use of $strLenCP operator to check the length of the string as follows:

db.usercollection.find({
    "name": { "$exists": true },
    "$expr": { "$gt": [ { "$strLenCP": "$name" }, 40 ] }
})
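One detail worth keeping in mind when comparing this with the $where variants: $strLenCP counts UTF-8 code points, while the JavaScript .length used inside $where counts UTF-16 code units, so the two can disagree for characters outside the Basic Multilingual Plane. Here is a small plain-JavaScript sketch of the difference (no database needed; the helper name codePointLength is made up for illustration):

```javascript
// Hypothetical helper: counts code points, which is what $strLenCP reports.
function codePointLength(str) {
  // The string iterator walks code points, not UTF-16 code units.
  return [...str].length;
}

const plain = "a".repeat(41);
const emoji = "😀".repeat(21); // each emoji is 1 code point but 2 UTF-16 units

console.log(plain.length, codePointLength(plain)); // 41 41
console.log(emoji.length, codePointLength(emoji)); // 42 21
```

So a name made of 21 such emoji would pass a `this.name.length > 40` check but not a `$strLenCP` comparison against 40. For plain ASCII names the two agree.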

For MongoDB 3.4 and newer:

You can also use the aggregation framework with the $redact pipeline stage, which lets you evaluate a logical condition with the $cond operator and uses the system variables $$KEEP to "keep" the document when the condition is true or $$PRUNE to "remove" it when the condition is false.

This operation is similar to having a $project pipeline stage that selects the fields in the collection and creates a new field holding the result of the logical condition, followed by a $match, except that $redact does it in a single pipeline stage, which is more efficient.

For the logical condition, you can use the string aggregation operator $strLenCP to check the length of the string. If the length is $gt a specified value, the document matches and is "kept"; otherwise it is "pruned" and discarded.

Consider running the following aggregate operation which demonstrates the above concept:

db.usercollection.aggregate([
    { "$match": { "name": { "$exists": true } } },
    {
        "$redact": {
            "$cond": [
                { "$gt": [ { "$strLenCP": "$name" }, 40 ] },
                "$$KEEP",
                "$$PRUNE"
            ]
        }
    },
    { "$limit": 2 }
])

If using $where, try your query without the enclosing brackets:

db.usercollection.find({$where: "this.name.length > 40"}).limit(2);

A better query would be to check for the field's existence and then check the length:

db.usercollection.find({name: {$type: 2}, $where: "this.name.length > 40"}).limit(2);

or:

db.usercollection.find({name: {$exists: true}, $where: "this.name.length > 40"}).limit(2);
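To see why the guard matters, here is a rough plain-JavaScript sketch of what happens when the length check runs against a document that has no name field. It only imitates the $where predicate with ordinary functions (the sample documents and the function names are made up), but the TypeError is the same kind as the one in the question:

```javascript
// Stand-ins for documents in the collection; the second has no "name" field.
const docs = [
  { _id: 1, name: "x".repeat(45) },
  { _id: 2 },
  { _id: 3, name: "short" },
];

// Same expression as the $where string, written as a plain function.
const unguarded = (doc) => doc.name.length > 40;

// Guarded version: roughly what adding {name: {$type: 2}} to the filter achieves.
const guarded = (doc) => typeof doc.name === "string" && doc.name.length > 40;

let unguardedError = null;
try {
  docs.filter(unguarded);
} catch (err) {
  unguardedError = err; // reading 'length' of undefined throws
}

const matchedIds = docs.filter(guarded).map((doc) => doc._id);
console.log(unguardedError instanceof TypeError, matchedIds); // true [ 1 ]
```

With the $type or $exists condition in place, documents without a usable name are filtered out before the JavaScript expression is evaluated.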

MongoDB evaluates non-$where query operations before $where expressions, and non-$where query statements may use an index. For much better performance, store the length of the string in another field, which you can then index and search on; applying $where will be much slower by comparison. Use JavaScript expressions and the $where operator only as a last resort, when you can't structure the data in any other way or when you are dealing with a small subset of data.
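As a minimal sketch of the stored-length idea, assuming a field called nameLength (a name I made up, not something from the question), the application would compute the length when it writes the document and then query on that number:

```javascript
// Hypothetical field name "nameLength"; pick whatever fits your schema.
function withNameLength(doc) {
  if (typeof doc.name !== "string") return doc; // leave other documents untouched
  return { ...doc, nameLength: [...doc.name].length }; // code-point count
}

// Stand-ins for documents about to be written to the collection.
const stored = [
  { _id: 1, name: "x".repeat(45) },
  { _id: 2 },
].map(withNameLength);

// The query then becomes a plain range filter that an index can serve.
const longNameFilter = { nameLength: { $gt: 40 } };

console.log(stored[0].nameLength, stored[1].nameLength); // 45 undefined
console.log(JSON.stringify(longNameFilter)); // {"nameLength":{"$gt":40}}
```

In the shell you would then create an index with db.usercollection.createIndex({ nameLength: 1 }) and run db.usercollection.find(longNameFilter). The trade-off is that every write path has to keep nameLength in sync with name.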

A different and faster approach that avoids the $where operator is the $regex operator. Consider the following pattern, which matches names longer than 40 characters:

db.usercollection.find({"name": {"$type": 2, "$regex": /^.{41,}$/}}).limit(2);

Note - From the docs:

If an index exists for the field, then MongoDB matches the regular expression against the values in the index, which can be faster than a collection scan. Further optimization can occur if the regular expression is a “prefix expression”, which means that all potential matches start with the same string. This allows MongoDB to construct a “range” from that prefix and only match against those values from the index that fall within that range.

A regular expression is a “prefix expression” if it starts with a caret (^) or a left anchor (\A), followed by a string of simple symbols. For example, the regex /^abc.*/ will be optimized by matching only against the values from the index that start with abc.

Additionally, while /^a/, /^a.*/, and /^a.*$/ match equivalent strings, they have different performance characteristics. All of these expressions use an index if an appropriate index exists; however, /^a.*/, and /^a.*$/ are slower. /^a/ can stop scanning after matching the prefix.
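Before relying on a length pattern such as /^.{41,}$/, it may help to check its boundaries. The sketch below uses plain JavaScript regular expressions; MongoDB's regex engine is PCRE-based, and I am assuming it treats these particular cases the same way. The variable names are made up for illustration:

```javascript
const moreThan40 = /^.{41,}$/;         // dot-based length pattern
const moreThan40Any = /^[\s\S]{41,}$/; // variant that also spans line breaks

const exactly40 = "a".repeat(40);
const exactly41 = "a".repeat(41);
const withNewline = "a".repeat(20) + "\n" + "a".repeat(20); // 41 characters

console.log(moreThan40.test(exactly40));      // false
console.log(moreThan40.test(exactly41));      // true
console.log(moreThan40.test(withNewline));    // false, "." skips line breaks
console.log(moreThan40Any.test(withNewline)); // true
```

Also note that a length pattern has no literal prefix, so the range optimization described in the quote presumably does not apply to it; at best the regex is matched against index values instead of whole documents.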

121

answered Oct 13 '22 07:10

chridam

Queries with $where and $expr are slow when the collection holds many documents.

Using $regex is much faster than $where or $expr.

db.usercollection.find({
    "name": /^[\s\S]{40,}$/, // name.length >= 40
})

or

db.usercollection.find({
    "name": { "$regex": "^[\\s\\S]{40,}$" }, // name.length >= 40; backslashes doubled inside a string
})
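One thing to watch when the pattern is passed as a string rather than a regex literal: the shell is JavaScript, and "\s" is not a string escape, so a single backslash is silently dropped. A quick plain-JavaScript check (variable names are made up for illustration):

```javascript
const unescaped = "^[\s\S]{40,}$";   // the backslashes are dropped by the string parser
const escaped = "^[\\s\\S]{40,}$";   // doubled backslashes survive

console.log(unescaped); // ^[sS]{40,}$
console.log(escaped);   // ^[\s\S]{40,}$

const name = "a".repeat(40);
console.log(new RegExp(unescaped).test(name)); // false, only "s" and "S" are allowed
console.log(new RegExp(escaped).test(name));   // true
```

The regex-literal form does not have this problem, which is one reason to prefer it in the shell.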

These queries have the same meaning as:

db.usercollection.find({
    "$where": "this.name && this.name.length >= 40",
})

or

db.usercollection.find({
    "name": { "$exists": true },
    "$expr": { "$gte": [ { "$strLenCP": "$name" }, 40 ] }
})

I timed each query against my collection:

# find
$where: 10529.359ms
$expr:   5305.801ms
$regex:  2516.124ms

# count
$where: 10872.006ms
$expr:   2630.155ms
$regex:   158.066ms

answered Oct 13 '22 09:10

Fumiya Karasawa
