Commit 2e1e1f8

[MINOR][DOCS] Soften security warning and keep it in cluster management docs only
### What changes were proposed in this pull request?

Soften security warning and keep it in cluster management docs only, not in the main doc page, where it's not necessarily relevant.

### Why are the changes needed?

The statement is perhaps unnecessarily 'frightening' as the first section in the main docs page. It applies to clusters not local mode, anyhow.

### Does this PR introduce _any_ user-facing change?

Just a docs change.

### How was this patch tested?

N/A

Closes #32206 from srowen/SecurityStatement.

Authored-by: Sean Owen <srowen@gmail.com>
Signed-off-by: Sean Owen <srowen@gmail.com>
1 parent 91bd384 commit 2e1e1f8

7 files changed, 17 additions and 16 deletions

docs/index.md

Lines changed: 0 additions & 5 deletions
@@ -25,11 +25,6 @@ It provides high-level APIs in Java, Scala, Python and R,
 and an optimized engine that supports general execution graphs.
 It also supports a rich set of higher-level tools including [Spark SQL](sql-programming-guide.html) for SQL and structured data processing, [MLlib](ml-guide.html) for machine learning, [GraphX](graphx-programming-guide.html) for graph processing, and [Structured Streaming](structured-streaming-programming-guide.html) for incremental computation and stream processing.

-# Security
-
-Security in Spark is OFF by default. This could mean you are vulnerable to attack by default.
-Please see [Spark Security](security.html) before downloading and running Spark.
-
 # Downloading

 Get Spark from the [downloads page](https://spark.apache.org/downloads.html) of the project website. This documentation is for Spark version {{site.SPARK_VERSION}}. Spark uses Hadoop's client libraries for HDFS and YARN. Downloads are pre-packaged for a handful of popular Hadoop versions.

docs/quick-start.md

Lines changed: 0 additions & 5 deletions
@@ -32,11 +32,6 @@ you can download a package for any version of Hadoop.

 Note that, before Spark 2.0, the main programming interface of Spark was the Resilient Distributed Dataset (RDD). After Spark 2.0, RDDs are replaced by Dataset, which is strongly-typed like an RDD, but with richer optimizations under the hood. The RDD interface is still supported, and you can get a more detailed reference at the [RDD programming guide](rdd-programming-guide.html). However, we highly recommend you to switch to use Dataset, which has better performance than RDD. See the [SQL programming guide](sql-programming-guide.html) to get more information about Dataset.

-# Security
-
-Security in Spark is OFF by default. This could mean you are vulnerable to attack by default.
-Please see [Spark Security](security.html) before running Spark.
-
 # Interactive Analysis with the Spark Shell

 ## Basics

docs/running-on-kubernetes.md

Lines changed: 4 additions & 2 deletions
@@ -25,8 +25,10 @@ Kubernetes scheduler that has been added to Spark.

 # Security

-Security in Spark is OFF by default. This could mean you are vulnerable to attack by default.
-Please see [Spark Security](security.html) and the specific advice below before running Spark.
+Security features like authentication are not enabled by default. When deploying a cluster that is open to the internet
+or an untrusted network, it's important to secure access to the cluster to prevent unauthorized applications
+from running on the cluster.
+Please see [Spark Security](security.html) and the specific security sections in this doc before running Spark.

 ## User Identity

docs/running-on-mesos.md

Lines changed: 3 additions & 1 deletion
@@ -32,7 +32,9 @@ The advantages of deploying Spark with Mesos include:

 # Security

-Security in Spark is OFF by default. This could mean you are vulnerable to attack by default.
+Security features like authentication are not enabled by default. When deploying a cluster that is open to the internet
+or an untrusted network, it's important to secure access to the cluster to prevent unauthorized applications
+from running on the cluster.
 Please see [Spark Security](security.html) and the specific security sections in this doc before running Spark.

 # How it Works

docs/running-on-yarn.md

Lines changed: 3 additions & 1 deletion
@@ -26,7 +26,9 @@ was added to Spark in version 0.6.0, and improved in subsequent releases.

 # Security

-Security in Spark is OFF by default. This could mean you are vulnerable to attack by default.
+Security features like authentication are not enabled by default. When deploying a cluster that is open to the internet
+or an untrusted network, it's important to secure access to the cluster to prevent unauthorized applications
+from running on the cluster.
 Please see [Spark Security](security.html) and the specific security sections in this doc before running Spark.

 # Launching Spark on YARN

docs/security.md

Lines changed: 4 additions & 1 deletion
@@ -23,7 +23,10 @@ license: |

 # Spark Security: Things You Need To Know

-Security in Spark is OFF by default. This could mean you are vulnerable to attack by default.
+Security features like authentication are not enabled by default. When deploying a cluster that is open to the internet
+or an untrusted network, it's important to secure access to the cluster to prevent unauthorized applications
+from running on the cluster.
+
 Spark supports multiple deployments types and each one supports different levels of security. Not
 all deployment types will be secure in all environments and none are secure by default. Be
 sure to evaluate your environment, what Spark supports, and take the appropriate measure to secure

docs/spark-standalone.md

Lines changed: 3 additions & 1 deletion
@@ -25,7 +25,9 @@ In addition to running on the Mesos or YARN cluster managers, Spark also provide

 # Security

-Security in Spark is OFF by default. This could mean you are vulnerable to attack by default.
+Security features like authentication are not enabled by default. When deploying a cluster that is open to the internet
+or an untrusted network, it's important to secure access to the cluster to prevent unauthorized applications
+from running on the cluster.
 Please see [Spark Security](security.html) and the specific security sections in this doc before running Spark.

 # Installing Spark Standalone to a Cluster
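
The replacement wording in these hunks says that security features like authentication are not enabled by default. As a rough sketch of what turning on one such feature might look like, the helper below builds `--conf` arguments for `spark-submit` from the `spark.authenticate` and `spark.authenticate.secret` properties described on the Spark Security page. The helper name `auth_conf_args` is hypothetical and is not part of this commit or of Spark. Whether a manually supplied secret is appropriate depends on the cluster manager, so treat this as an illustration under those assumptions rather than a recommended setup.

```python
import secrets


def auth_conf_args(secret=None):
    """Build spark-submit `--conf` arguments that enable shared-secret authentication.

    `auth_conf_args` is a hypothetical helper for illustration only.
    If no secret is supplied, a random 64-character hex string is generated.
    """
    if secret is None:
        secret = secrets.token_hex(32)

    # Both property names come from the Spark Security documentation.
    conf = {
        "spark.authenticate": "true",
        "spark.authenticate.secret": secret,
    }

    # Flatten the mapping into ["--conf", "key=value", ...] pairs.
    args = []
    for key, value in conf.items():
        args += ["--conf", f"{key}={value}"]
    return args
```

A call such as `["spark-submit", *auth_conf_args(), "app.py"]` would then assemble a full command line, where `app.py` is a placeholder application. Values passed on a command line may be visible to other local users, so a properties file with restricted permissions could be a safer place for the secret.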
