Definition: Apache Spark


An open source big data framework from the Apache Software Foundation. Spark analyzes huge amounts of data, including real-time streams, by keeping its working data in RAM. This is in contrast to Hadoop's MapReduce engine (Hadoop is another Apache project), which writes intermediate results to the storage drive between processing steps. As a result, Spark is generally many times faster, particularly for jobs that make repeated passes over the same data. Because Spark does not have its own distributed file system, it is often used in conjunction with the Hadoop Distributed File System (HDFS). See Hadoop and big data.
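The RAM-versus-drive contrast in the definition can be pictured with a small sketch. This is only a toy analogy in plain Python, not Spark or Hadoop code: the function names `disk_pipeline`, `memory_pipeline`, and `run_demo`, the JSON file names, and the two processing steps are all made up for illustration. One pipeline writes each intermediate result to a file and reads it back, and the other keeps everything in memory.

```python
import json
import os
import tempfile


def disk_pipeline(numbers, workdir):
    """Toy stand-in for a disk-based engine: every step's output
    is written to a file and read back before the next step runs."""
    step1_path = os.path.join(workdir, "step1.json")  # hypothetical file name
    with open(step1_path, "w") as f:
        json.dump([n * n for n in numbers], f)  # step 1: square each number
    with open(step1_path) as f:
        squares = json.load(f)

    step2_path = os.path.join(workdir, "step2.json")  # hypothetical file name
    with open(step2_path, "w") as f:
        json.dump([s for s in squares if s % 2 == 0], f)  # step 2: keep even values
    with open(step2_path) as f:
        evens = json.load(f)

    return sum(evens)  # step 3: add up what is left


def memory_pipeline(numbers):
    """Toy stand-in for an in-memory engine: the same three steps,
    with intermediate results held in RAM the whole time."""
    squares = [n * n for n in numbers]
    evens = [s for s in squares if s % 2 == 0]
    return sum(evens)


def run_demo():
    """Run both pipelines on the same input and return both answers."""
    numbers = list(range(10))
    with tempfile.TemporaryDirectory() as workdir:
        return disk_pipeline(numbers, workdir), memory_pipeline(numbers)


if __name__ == "__main__":
    print(run_demo())
```

Both pipelines return the same answer. The difference is that the first one pays for a file write and a file read at every step, which is the kind of overhead that keeping data in RAM avoids.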