What is Spark?
è Apache Spark is an open-source cluster computing framework. Its primary purpose is to handle the real-time generated data.
è Spark is a Bigdata Processing Framework.
è Spark software is written in Scala Language.
è Spark Applications we can implement using Java, Scala, Python and R Language
è Spark has only processing. It does not have storage.
è Spark can process any data.
è Spark was built on the top of the Hadoop MapReduce. It was optimized to run in memory whereas alternative approaches like Hadoop's MapReduce writes data to and from computer hard drives. So, Spark process the data much quicker than other alternatives.