MapR is the only distribution for Apache Hadoop that provides consistent, atomic snapshots. Other distributions that use HDFS cannot guarantee a consistent snapshot as the system requires the file to be closed before being processed to include any recently appended data. If any directory has files that are active and open, the snapshot can potentially not capture all of the data.
MapR, has been rearchitected to include a random read-write file system this rearchitect file system was redesigned to support redirect on write snapshots that provide consistent point in time recovery without incurring an additional storage penalty. No data is copied in order to create a snapshot . MapR Snapshots provide protection from user and application errors with flexible schedules to accommodate a range of recovery point objectives. MapR Snapshots can be scheduled or performed on demand. Recovering from a snapshot is as easy as dragging and dropping the directory or files to the current directory.
MapR Snapshots offer high performance and space efficiency.. As a result, a snapshot of a petabyte volume can be performed in seconds. A snapshot operation does not have any impact on write performance.