Which database can be used to store processed data from NLP engine

Tags:

I am looking at taking unstructured data in the form of files, processing it and storing it in a database for retrieval. The data will be in natural language and the queries to get information will also be in natural language. Ex: the data could be "Roses are red" and the query could be "What is the color of a rose?"

I have looked at several nlp systems, focusing more on open-source information extraction and relation extraction system and the following seems apt and easy for quick start: https://www.npmjs.com/package/mitie

This can give data in the form of (word,type) pairs. It also gives a relation as result of running the the processing (check the site example).

I want to know if sql is good database to save this information. For retrieving the information, I will need to convert the natural language query also to some kind of (word, meaning) pairs and for using sql I will have to write a layer that converts natural language to sql queries.

Please suggest if there are any open source database that work well in this situation. I'm open to suggestions for databases that work with other open-source information extraction and relation extraction systems if not MITIE.

456

asked May 24 '17 07:05

Swati Pardeshi

1 Answers

SQL wont be an appropriate choice for your problem. You can use NLP or rules to extract relationships and then store that relationship in a Triple Store or a Graph database. There are many good open source Graph Databases like Neo4j and Apache Titan. You can query Google for Triple-stores, I suppose Apache Jena should be a good choice. After storing your data you can query your graphs using any of the Graph Query Languages like Gremlin or Cypher etc. (like SQL). Note that the heart of your system would be a Knowledge Graph.

You may also setup a Lucene/Solr based Search System on your unstructured data which may help you with answering your queries in conjunction with Graph Databases. All of these (NLP, IR, Graph DB/Triplestores etc.) would coexist to solve your problem.

It would be like an ensemble. No silver bullets :) However to start with look at Graph DB's or Triple-stores.

165

answered Nov 15 '22 08:11

Yavar

Related questions
                            
                                How to select all the sub_category records with its main_category_name and parent_id in a single query in MySQL?
                            
                                How can you merge two tables without losing any of the rows in SQL?
                            
                                Replace/display new line from MYSQL to PHP
                            
                                SELECT using BETWEEN in MYSQL and PHP Variable
                            
                                Laravel orm database query inside model function
                            
                                Ubuntu 16.04 mysql_old_password cannot be loaded
                            
                                mysql select closest date from today
                            
                                Is it more efficient to use AWS s3 versioning to store gzipped mysql dumps?
                            
                                Python pymysql - iterate through mysql table key and value
                            
                                Forward engineering in mysql work bench not working
                            
                                How to select till the sum reach some value
                            
                                A way around having to loop through results to access more information using the first result set
                            
                                How to define own aggregate function in Mysql for GROUP BY?
                            
                                Search for all combinations of first, middle and last name
                            
                                double inner join in linq/ lambda query?
                            
                                SQL query for top 100, displayed in reverse
                            
                                Bash MySQL query
                            
                                Mysqldump update table
                            
                                mysql query: SELECT DISTINCT column1, GROUP BY column2
                            
                                How does memcache with MySQL work?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Which database can be used to store processed data from NLP engine

Tags:

database

mysql

nlp

information-retrieval

information-extraction

Swati Pardeshi

People also ask

1 Answers

Yavar

Recent Activity

Donate For Us