How to delete rows in hive hadoop database

Question

I'm a newbie with hadoop & hive. I want to delete certain rows in my database - which is on hive-hadoop. I know its not supported out of the box, and that hadoop is a read only file system. I'm curious about what are the best approaches for accomplishing this. If anyone has done this before, can they share their learnings/procedures?

Thanks!

Jerome Banks · Accepted Answer

In Big Data there really aren't deletes. That said, you can overwrite your table or partition if it isn't too big, or isolate your deletes to a particular partition like JamCon suggests.

For datasets which are not too huge, you can do something like

INSERT OVERWRITE TABLE mytable
SELECT * FROM mytable
WHERE ID NOT IN ( 'delete1', 'delete2', 'delete3');

How to delete rows in hive hadoop database

Tags:

delete-row

hadoop

hive

Sunny

1 Answers

Jerome Banks

Recent Activity

Donate For Us

How to delete rows in hive hadoop database

Tags:

delete-row

hadoop

hive

Sunny

1 Answers

Jerome Banks

Related questions

Recent Activity

Donate For Us