Apache Spark
Apache Spark is an open source distributed general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
Here are 6,518 public repositories matching this topic...
At the moment the relu_layer op does not allow a threshold to be configured, whereas the legacy RELU op does.
We should add a threshold configuration option to relu_layer.
I'm trying to use the spec to implement a reader, but the spec is too vague about what the schema of a checkpoint file is supposed to be. The following points are not specified:
- Which columns a checkpoint file contains. The spec only gives an example of what columns it contains for a specific table, but it doe
I have a simple regression task (using a LightGBMRegressor) where I want to penalize negative predictions more than positive ones. Is there a way to achieve this with the default regression LightGBM objectives (see https://lightgbm.readthedocs.io/en/latest/Parameters.html)? If not, is it possible to define a custom regression objective (there are many examples for the default LightGBM model) and pass it to the regressor?
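To make the question concrete, here is a minimal sketch of what such a custom objective could look like. It assumes that "negative predictions" means predicted values below zero, and it uses a weighted squared error as the loss; both are assumptions, not something the question specifies. The function name `asymmetric_squared_error` and the `penalty` parameter are hypothetical. The `(y_true, y_pred) -> (grad, hess)` shape follows the convention of LightGBM's scikit-learn interface for callable objectives; whether the Spark `LightGBMRegressor` accepts such a callable is not confirmed here.

```python
def asymmetric_squared_error(y_true, y_pred, penalty=5.0):
    """Gradient and Hessian of a weighted squared error.

    Loss per sample: w * (pred - target) ** 2, where w = penalty when the
    prediction is below zero and w = 1 otherwise (assumed interpretation).
    """
    grad = []
    hess = []
    for target, pred in zip(y_true, y_pred):
        # Hypothetical weighting rule: heavier weight for predictions < 0.
        weight = penalty if pred < 0 else 1.0
        grad.append(2.0 * weight * (pred - target))  # first derivative w.r.t. pred
        hess.append(2.0 * weight)                    # second derivative w.r.t. pred
    return grad, hess


if __name__ == "__main__":
    g, h = asymmetric_squared_error([1.0, 1.0], [2.0, -1.0])
    print(g, h)
```

Plain lists keep the sketch dependency-free; with the real library the gradient and Hessian would normally be returned as NumPy arrays.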
Used Spark version
Spark Version: 2.4.4
Used Spark Job Server version
SJS version: v0.11.1
Deployed mode
client on Spark Standalone
Actual (wrong) behavior
I can't get the job config when I post a job with 'sync=true'. This is what I get:
http://localhost:8090/jobs/ff99479b-e59c-4215-b17d-4058f8d97d25/config
{"status":"ERROR","result":"No such job ID ff99479b-e59c-4215-b17d-4058f8d97d25"
Created by Matei Zaharia
Released May 26, 2014
- Repository: apache/spark
- Website: spark.apache.org
- Wikipedia


Describe the bug
Using a time dimension on a runningTotal measure on Snowflake mixes quoted and unquoted columns in the generated query. This causes the query to fail, because Snowflake has specific rules about quoted columns. Specifically:
So "date_from" <> date_from
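As a rough illustration of why the two spellings differ, the sketch below mimics how Snowflake resolves identifiers: unquoted names are folded to uppercase, while double-quoted names keep their exact case. The helper `resolve_identifier` is hypothetical and only models this one rule; it is not part of any Snowflake client.

```python
def resolve_identifier(name: str) -> str:
    """Return the name an identifier resolves to under the assumed rule."""
    if len(name) >= 2 and name[0] == '"' and name[-1] == '"':
        # Quoted: case is preserved; a doubled quote is an escaped quote.
        return name[1:-1].replace('""', '"')
    # Unquoted: folded to uppercase.
    return name.upper()


if __name__ == "__main__":
    print(resolve_identifier('"date_from"'))  # quoted spelling
    print(resolve_identifier('date_from'))    # unquoted spelling
```

Under this rule the quoted and unquoted spellings refer to different columns unless the quoted form is already uppercase, which is consistent with the mismatch described above.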
To Reproduce
Steps to reproduce