Which two of the following are valid statements? (Choose two)
HDFS is optimized for storing a large number of files smaller than the HDFS block size.
HDFS has the Characteristic of supporting a “write once, read many” data access model.
HDFS is a distributed file system that replaces ext3 or ext4 on Linux nodes in a Hadoop cluster.
HDFS is a distributed file system that runs on top of native OS filesystems and is well suited to
storage of very large data sets.
B:HDFS is designed to support very large files. Applications that are compatible with
HDFS are those that deal with large data sets. These applications write their data only once but
they read it one or more times and require these reads to be satisfied at streaming speeds. HDFS
supports write-once-read-many semantics on files.
*Hadoop Distributed File System: A distributed file system that provides high-throughput access to
*DFS is designed to support very large files.
Reference:24 Interview Questions & Answers for Hadoop MapReduce developers