PostgreSQL Full Text Search Spanish character Ñ

Tags:

full-text-search

postgresql

I am facing an issue when doing full text search with PostgreSQL on text that contains the Spanish character 'Ñ'.

When I tokenize the Spanish word 'AÑO' (year), I get different results depending on whether the input is upper or lower case:

SELECT to_tsvector('spanish','AÑO'),to_tsquery('spanish','año')
"to_tsvector"   "to_tsquery"
"'aÑo':1"   "'año'"

As you can see, the results differ: the uppercase Ñ is not downcased, so my application's full text search queries become case sensitive whenever they contain this character.

Is there any way to overcome this issue? I have searched the PostgreSQL documentation on full text search, but I can't find how to change this behaviour for the installed dictionaries.

Thank you so much. Martí

asked Aug 08 '17 12:08

1 Answer

The ability of to_tsvector to downcase Ñ to ñ depends on the database locale, and specifically on lc_ctype. Presumably your database is using an LC_CTYPE such as C, whose case-mapping knowledge is limited to US-ASCII.
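To see which case applies to your own database, you can read the locale settings from the pg_database catalog. This is a minimal sketch; it only inspects the database you are currently connected to:

```sql
-- Show the encoding and locale settings of the current database.
-- datctype is the LC_CTYPE value that governs case conversion.
SELECT datname,
       pg_encoding_to_char(encoding) AS encoding,
       datcollate,
       datctype
FROM pg_database
WHERE datname = current_database();
```

If datctype comes back as C or POSIX, only ASCII letters are downcased, which would explain the 'aÑo' lexeme in the question.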

Example with an LC_CTYPE compatible with Unicode:

test=> show lc_ctype;
  lc_ctype   
-------------
 fr_FR.UTF-8
(1 row)

test=> SELECT to_tsvector('spanish','AÑO'),to_tsquery('spanish','año');
 to_tsvector | to_tsquery 
-------------+------------
 'año':1     | 'año'
(1 row)

Note that the downcasing is what you expect.

Opposite example with C:

Database creation:

CREATE DATABASE cc lc_ctype 'C' template template0;

Note the lack of downcasing, as in the question:

cc=> show lc_ctype ;
 lc_ctype 
----------
 C
(1 row)

cc=> SELECT to_tsvector('spanish','AÑO'),to_tsquery('spanish','año');
 to_tsvector | to_tsquery 
-------------+------------
 'aÑo':1     | 'año'
(1 row)
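Since LC_CTYPE is fixed when a database is created, one way to get the expected downcasing is to create a new database with a Unicode-aware locale and move the data into it (for example with pg_dump and a restore). The following is only a sketch: the database name myapp_utf8 is hypothetical, and the locale name es_ES.UTF-8 is an assumption that must match a locale actually installed on the server's operating system.

```sql
-- Hypothetical database name; the locale must exist on the server OS.
-- template0 is required when the locale differs from the template's.
CREATE DATABASE myapp_utf8
    ENCODING 'UTF8'
    LC_COLLATE 'es_ES.UTF-8'
    LC_CTYPE 'es_ES.UTF-8'
    TEMPLATE template0;

-- After connecting to the new database, repeat the check from the question:
SELECT to_tsvector('spanish','AÑO'), to_tsquery('spanish','año');
```

In the new database, both columns should contain the same lowercase lexeme, as in the fr_FR.UTF-8 example above. Any tsvector columns or full text indexes restored from the old database may still hold lexemes that were not downcased, so they would need to be recomputed.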

answered Jan 02 '23 12:01

Daniel Vérité

Marti Pàmies Solà
