Hadoop HDFS supports C API, Java API and even RESTful API support is available. You can find more info on RESTful API at http://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-hdfs/WebHDFS.html. There is even the command line tool available i.e. hadoop fs -copyFromLocal. If you are the architect for your Big Data project, and you are responsible for bringing in files into Hadoop, would you use the C, Java API or RESTful API?
I will use Java API for this project as it continues to be a very popular choice owing to the large number of Java developers in the world, as well as the fact that some popular frameworks, such as Apache Hadoop, were developed in Java. Scala, which runs inside the Java Virtual Machine (JVM), is also widely used in data science; Apache Spark was written in Scala, and Apache Flink was written in a combination of Java and Scala. Java gives you access to a large ecosystem of profilers, debuggers, monitoring tools, libraries for enterprise security and interoperability, and much more.
Get Answers For Free
Most questions answered within 1 hours.