I have a MySQL database filled with one huge table of 80 columns and 10 million rows. The data may have inconsistencies.
I would like to normalize the database in an automated and efficient way.
I could do it using Java/C++/..., but I would like to do as much as possible inside the database. I suspect that any work done outside the database would slow things down considerably.
Suggestions on how to do it? What are good resources/tutorials to start with?
I am not looking for hints on what normalization is (I found plenty of that using Google)!
In the database normalization process there is a series of rules called normal forms. The main ones are First Normal Form (1NF), Second Normal Form (2NF), Third Normal Form (3NF), Boyce-Codd Normal Form (BCNF), Fourth Normal Form (4NF), and Fifth Normal Form (5NF).
Normalization is the process of eliminating data redundancy and improving data integrity. It also helps organize the data in the database. It is a multi-step process that puts the data into tabular form and removes duplicated data from the relational tables.
You need to study the columns to identify 'like' entities and break them out into separate tables. At best, an automated tool might identify groups of rows with identical values for some of the columns, but a person who understands the data would have to decide whether those truly belong to a separate entity.
Here's a contrived example - suppose your columns were first name, last name, address, city, state, zip. An automated tool might identify rows of people who were members of the same family with the same last name, address, city, state, and zip and incorrectly conclude that those five columns represented an entity. It might then split the table up:
First Name, ReferenceID
and another table
ID, Last Name, Address, City, State, Zip
See what I mean?
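In schema terms, that incorrect automated split might look something like this (just a sketch; the table and column names are hypothetical):
-- What the naive tool would produce: a "household" table that is not a
-- real entity, only coincidentally repeated address data
CREATE TABLE household (
    id        INT AUTO_INCREMENT PRIMARY KEY,
    last_name VARCHAR(64),
    address   VARCHAR(128),
    city      VARCHAR(64),
    state     CHAR(2),
    zip       VARCHAR(10)
);

CREATE TABLE person (
    first_name   VARCHAR(64),
    reference_id INT,
    FOREIGN KEY (reference_id) REFERENCES household(id)
);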
I can't think of any way you can automate it. You would have to create the tables that you want, and then go through and move each piece of data over with manual queries.
e.g.,
INSERT INTO contact (first_name, last_name, phone)
SELECT DISTINCT first_name, last_name, phone
FROM massive_table;
then you could drop those columns from the massive table and replace them with a contact_id column, as sketched below.
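The follow-up steps might look roughly like this (a sketch only; it assumes contact has an auto-increment id column and that the table/column names match the example above):
-- link each wide row to its contact, then drop the now-redundant columns
ALTER TABLE massive_table ADD COLUMN contact_id INT;

UPDATE massive_table m
JOIN contact c
  ON c.first_name = m.first_name
 AND c.last_name  = m.last_name
 AND c.phone      = m.phone
SET m.contact_id = c.id;

ALTER TABLE massive_table
  DROP COLUMN first_name,
  DROP COLUMN last_name,
  DROP COLUMN phone;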
You would have a similar process when pulling out rows that go into a one-to-many table.
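For example, if the wide table had repeating groups such as email1, email2, email3 (hypothetical column names), you could fold them into a child table:
-- one-to-many: each contact can have several email addresses
CREATE TABLE contact_email (
    contact_id INT NOT NULL,
    email      VARCHAR(255) NOT NULL,
    FOREIGN KEY (contact_id) REFERENCES contact(id)
);

INSERT INTO contact_email (contact_id, email)
SELECT contact_id, email1 FROM massive_table WHERE email1 IS NOT NULL
UNION ALL
SELECT contact_id, email2 FROM massive_table WHERE email2 IS NOT NULL
UNION ALL
SELECT contact_id, email3 FROM massive_table WHERE email3 IS NOT NULL;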
When cleaning up messy data, I like to create user-defined MySQL functions to do typical data-scrubbing work... that way you can reuse them later. Approaching it this way also lets you check whether existing UDFs have already been written that you can use (with or without modification)... for example, mysqludf.org
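A minimal sketch of what such a reusable scrubbing helper could look like, written here as a plain stored function rather than a compiled UDF (the function name and logic are just an example):
DELIMITER //
CREATE FUNCTION clean_phone(raw VARCHAR(64))
RETURNS VARCHAR(64) DETERMINISTIC
BEGIN
    -- strip spaces, dashes and parentheses so inconsistently formatted
    -- phone numbers compare equal during deduplication
    RETURN REPLACE(REPLACE(REPLACE(REPLACE(raw, ' ', ''), '-', ''), '(', ''), ')', '');
END //
DELIMITER ;

-- then reuse it wherever you scrub or compare, e.g.:
-- SELECT DISTINCT clean_phone(phone) FROM massive_table;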