My local machine does not have an hdfs installation. I want to retrieve files from a remote hdfs cluster. What's the best way to achieve this? Do I need to get
the files from hdfs to one of the cluster machines fs and then use ssh to retrieve them? I want to be able to do this programmatically through say a bash script.
Here are the steps:
hadoop fs -get
command to get the files directlyAlso there are alternatives
If your host cannot have Hadoop binaries installed to be client, then you can use following instructions.
ssh <user>@<host> "hadoop fs -get <hdfs_path> <os_path>"
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With