A Dataproc job for running Apache Spark applications on YARN.
JSON representation |
---|
{ "args" : [ string ] , "jarFileUris" : [ string ] , "fileUris" : [ string ] , "archiveUris" : [ string ] , "properties" : { string : string , ... } , "loggingConfig" : { object ( |
args[]
string
Optional. The arguments to pass to the driver. Do not include arguments, such as --conf
, that can be set as job properties, since a collision may occur that causes an incorrect job submission.
jarFileUris[]
string
Optional. HCFS URIs of jar files to add to the CLASSPATHs of the Spark driver and tasks.
fileUris[]
string
Optional. HCFS URIs of files to be placed in the working directory of each executor. Useful for naively parallel tasks.
archiveUris[]
string
Optional. HCFS URIs of archives to be extracted into the working directory of each executor. Supported file types: .jar, .tar, .tar.gz, .tgz, and .zip.
properties
map (key: string, value: string)
Optional. A mapping of property names to values, used to configure Spark. Properties that conflict with values set by the Dataproc API might be overwritten. Can include properties set in /etc/spark/conf/spark-defaults.conf and classes in user code.
An object containing a list of "key": value
pairs. Example: { "name": "wrench", "mass": "1.3kg", "count": "3" }
.
loggingConfig
object (
LoggingConfig
)
Optional. The runtime log config for job execution.
driver
. Required. The specification of the main method to call to drive the job. Specify either the jar file that contains the main class or the main class name. To pass both a main jar and a main class in that jar, add the jar to jarFileUris
, and then specify the main class name in mainClass
. driver
can be only one of the following:mainJarFileUri
string
The HCFS URI of the jar file that contains the main class.
mainClass
string
The name of the driver's main class. The jar file that contains the class must be in the default CLASSPATH or specified in SparkJob.jar_file_uris.