Class GetRDDOfAllElementsHandler

  • All Implemented Interfaces:
    OperationHandler<GetRDDOfAllElements>, OutputOperationHandler<GetRDDOfAllElements,​org.apache.spark.rdd.RDD<Element>>

    public class GetRDDOfAllElementsHandler
    extends AbstractGetRDDHandler<GetRDDOfAllElements,​org.apache.spark.rdd.RDD<Element>>
    A handler for the GetRDDOfAllElements operation.

    If the gaffer.accumulo.spark.directrdd.use_rfile_reader option is set to true then the RDD will be produced by directly reading the RFiles in the Accumulo table, rather than using ElementInputFormat to get data via the tablet servers. In order to read the RFiles directly, the user must have read access to the files. Also note that any data that has not been minor compacted will not be read. Reading the Rfiles directly can increase the performance.

    If the gaffer.accumulo.spark.directrdd.use_rfile_reader option is not set then the standard approach of obtaining data via the tablet servers is used.

    When reading data via the tablet servers, read performance may be improved by setting the gaffer.accumulo.spark.rdd.use_batch_scanner option to true.
    • Constructor Detail

      • GetRDDOfAllElementsHandler

        public GetRDDOfAllElementsHandler()