  1. Apache Spark™ - Unified Engine for large-scale data analytics

    Apache Spark is a multi-language engine for executing data engineering, data science, and machine learning on single-node machines or clusters.

  2. Overview - Spark 4.0.0 Documentation

    It also supports a rich set of higher-level tools including Spark SQL for SQL and structured data processing, pandas API on Spark for pandas workloads, MLlib for machine learning, GraphX …

  3. Documentation | Apache Spark

    The documentation linked to above covers getting started with Spark, as well as the built-in components MLlib, Spark Streaming, and GraphX. In addition, this page lists other resources …

  4. Quick Start - Spark 4.0.0 Documentation

    This tutorial provides a quick introduction to using Spark. We will first introduce the API through Spark’s interactive shell (in Python or Scala), then show how to …

  5. Examples - Apache Spark

    Spark allows you to perform DataFrame operations with programmatic APIs, write SQL, perform streaming analyses, and do machine learning. Spark saves you from learning multiple …

  6. Downloads - Apache Spark

    As new Spark releases come out for each development stream, previous ones will be archived, but they are still available at Spark release archives. NOTE: Previous releases of Spark may …

  7. PySpark Overview — PySpark 4.0.0 documentation - Apache Spark

    May 19, 2025 · PySpark combines Python’s learnability and ease of use with the power of Apache Spark to enable processing and analysis of data at any size for everyone familiar with Python. …

  8. Spark SQL & DataFrames | Apache Spark

    Integrated: Seamlessly mix SQL queries with Spark programs. Spark SQL lets you query structured data inside Spark programs, using either SQL or a familiar DataFrame API. Usable …

  9. RDD Programming Guide - Spark 3.5.5 Documentation

    This guide shows each of these features in each of Spark’s supported languages. It is easiest to follow along with if you launch Spark’s interactive shell – either bin/spark-shell for the Scala …

  10. Spark SQL and DataFrames - Spark 4.0.0 Documentation

    Spark SQL is a Spark module for structured data processing. Unlike the basic Spark RDD API, the interfaces provided by Spark SQL provide Spark with more information about the structure …
