The LinkedIn Gradle Plugin for Apache Hadoop was created and designed by Alex Bain. The design of the LinkedIn Gradle DSL for Apache Hadoop was influenced by azkaban-rb from Matthew Hayes and with input from Will Vaughan.
The following were contributed by Arpan Agrawal. Thanks, Arpan!
Added AutoTunePigLiJob job type support
The following were contributed by Jun "Tony" Zhou. Thanks, Tony!
Added Python Spark support for runSparkJob task.
Added support for submitting python Spark applications with Spark job type. Corresponsing tests are added.
Added application running mode check in Spark plugin. Spark job now run on YARN cluster unless explicitly specified otherwise.
The following were contributed by Jin Hyuk Chang. Thanks, Jin Hyuk!
DSS-5053: Added Hive ORC table as a source in hdfsToTeradata job type.
DSS-4743: Hadoop DSL for SQL job type. Adding encrypted password support on Teradata job type in Hadoop DSL.
DSS-3870: Hadoop DSL for new Azkaban job type. Teradata <-> HDFS, HDFS -> Espresso, and Gobblin
The following were contributed by Anthony Hsu. Thanks, Anthony!
LIHADOOP-16591: ligradle azkabanUpload fails with "ClassNotFoundException: org.apache.http.client.methods.HttpPost"
The following were contributed by Jonathan Hung. Thanks, Jonathan!
LIHADOOP-37013. Tensorflow execution type multiplexing between TF on Spark and TF on YARN
Remove taskCommand in tensorflow dsl job
LIHADOOP-35318 Implement TensorFlow job in hadoop dsl
TOOLS-67569 Block Hadoop Client Teams from checking in Hadoop Jars in their MP
The following were contributed by Anant Nag. Thanks, Anant!
Minor fix to the test plugin
LIHADOOP-28747 Hadoop plugin's test plugin should have functionality to add assertions on tests.
LIHADOOP-25519 ligradle tasks to test the flow on Azkaban
LIHADOOP-25518 Prototype deploy tests for the workflows
LIHADOOP-23255: Add azkaban jvm properties to the jobConf for bangbang
LIHADOOP-18923: Integrate bangbang with li-hadoop-plugin
LIHADOOP-18499 Add extra typed methods for Hadoop DSL SparkJob properties
LIHADOOP-6730: Add clusterProvided configuration for dependency management
LIHADOOP-17034 Create HadoopShell job type in Hadoop DSL
LIHADOOP-16016 Minor bug fix for li-hadoop-plugin
LIHADOOP-15728 Update dependency pattern blacklist rules for li-hadoop-plugin
LIHADOOP-15501 checkDependencies task fails if any of groupID, name or version is null
LIHADOOP-15077 Hadoop plugin should have a feature to control and monitor dependencies specified by the users
Fixups on HadoopJavaJob and HiveJob
LIHADOOP-14506 Hadoop plugin should directly submit jobs to oozie
LIHADOOP-13682 Oozie Hadoop DSL Compiler support property files
LIHADOOP-13681 Oozie Hadoop DSL Compiler support all job types
Added xsd schemas for different job types
LIHADOOP-14566 SparkPlugin does not read application parameters
LIHADOOP-12557 Hadoop Plugin should support launching Spark scripts from a gateway machine
LIHADOOP-14099 Create spark job type in hadoop dsl
LIHADOOP-13660 Hadoop Plugin oozieUpload task should upload the contents of the hadoopZip task
Minor fixes on OozieUploadTask and LiKerberosUtil
LIHADOOP-13353 Hadoop Plugin oozieUpload task should upload to versioned directory.
LIHADOOP-13457 Fix Stackoverflow exception thrown by OozieUploadTask.
LIHADOOP-12771 Prototype Hadoop Plugin Upload Task for Oozie
HADOOP-12726 Style cleanups for Hadoop zip tests
HADOOP-12243 Fix Hadoop Plugin PCL Breakage
HADOOP-10148 Support exclusion list for Hadoop Plugin sources Zip
The following were contributed by Akshay Rai. Thanks, Akshay!
LIHADOOP-12945 Hadoop Plugin azkabanUpload enhancement to print project URL
HADOOP-12243 Fix Hadoop Plugin PCL Breakage
HADOOP-10773 Rewrite li-azkaban2 zip upload tasks for the Hadoop Plugin
HADOOP-8251 Hadoop DSL doesn't use -p flag for mkdir
The following were contributed by Keith Dsouza. Thanks, Keith!
Added support to the KafkaPushJob to accept the disable.auditing property
The following were contributed by Ragesh Rajagopalan. Thanks, Ragesh!
LIHADOOP-20985 azkabanUpload throwing 500 error
DI-584 Login prompt missing when uploading to Azkaban
The following were contributed by Pranay Hasan Yerra. Thanks, Pranay!
LIHADOOP-32983 Azkaban CLI Tasks should prompt for "password+VIP" internal to Linkedin
Addressed a bug in ready status. Cleaned up the codenarc errors in azkaban-client
LIHADOOP-33260 Bump to Gradle 4.1
LIHADOOP-26107 Parameters to override the setting in .azkabanPlugin.json
LIHADOOP-16252 Prevent core-site.xml from polluting IDE's classpath
LIHADOOP-25566 Update Dr.Elephant URL link in azkabanFlowStatus task
LIHADOOP-25637 Check for trailing slash in AzkabanUrl when running azkabanUpload task
LIHADOOP-24274 Checks for empty fields in .azkabanPlugin.json
LIHADOOP-21680 Azkaban CLI Features Create Project, Flow Status, Execute Flow and Cancel Flow
LIHADOOP-23788 Display custom message after azkabanUpload task
LIHADOOP-22850 Handle null console and minor changes in upload bar's display
LIHADOOP-22822 The azkabanUpload task should automatically go into edit mode if there are missing required fields
LIHADOOP-22809 Display configured Hadoop zips during azkabanUpload task
LIHADOOP-22655 Automate creation of Azkaban Project during azkabanUpload task. Also avoids uploading twice on invalid session
LIHADOOP-22222 Display Upload Status of Zip while running azkabanUpload task
LIHADOOP-21797 Automate the creation of the .azkabanPlugin.json file
LIHADOOP-21658 Refactor AzkabanUploadTask in Hadoop Plugin by creating AzkabanHelper class
The following were contributed by Pratik Satish. Thanks, Pratik!
LIHADOOP-21429 Add HadoopValidatorPlugin with PigValidatorPlugin in its stack
The following were contributed by Ivan Heda. Thanks, Ivan!
Added possibility to have Azkaban password in configuration JSON.
The following were contributed by Alexander Ivaniuk. Thanks, Alexander!
Add possibility to group jobs into subflows in Azkaban UI
The following were contributed by Jennifer Dai. Thanks, Jennifer!
Add ability for Hadoop DSL to understand the PinotBuildAndPushJob job type
The following were contributed by Jianling Zhong. Thanks, Jianling!
Add supports for a "required parameters" check
The following were contributed by Charlie Summers. Thanks, Charlie!
DSL Trigger Implementation
Upgrade li-hadoop-plugin for Flow 2.0
Refactor YamlCompiler to fit closer with AzkabanDslCompiler
Allow configurable Yaml creation for Flow 2.0
Introduce YamlCompiler, YamlWorkflow, YamlJob, and YamlProject for Flow 2.0
Add ability for Hadoop DSL to understand the TableauJob job type
Adding additional condition for TableauJob creation
Allow generateYamlOutput to be defined multiple times
Remove maxWaitMins requirement if no dependencies defined in Trigger
Handling emergent flows for Flow 2.0
The following were contributed by Nicholas Cowan. Thanks, Nick!
Add ability for Hadoop DSL to understand the HdfsWait job type
Add subdirectory hadoop-jobs
Adding timezone parameter and ability for directoryPath to build paths based on date for hdfsWaitJobs
The following were contributed by Charles Gao. Thanks, Charles!
Add ability for Hadoop DSL to understand the VenicePushJob job type
The following were contributed by Michal Trna. Thanks, Michal!
Session IDs are stored separately for each Azkaban server
The following were contributed by Tzu-Han Jan. Thanks, TJ!
Updated Hadoop DSL for Kafka-Push-Job to understand multiTopic configurations
The following were contributed by Chris Li.
Remove jdbcDriverClass requirement from SQL job type
The following were contributed by Jamie Sun. Thanks, Jamie!
Add conditional workflow feature
Fix base properties scope for subflows when generating YAML files
Fix variable substitution issue when generating YAML files
Fix variable substitution issue for dependency list
Fix cloneWorkflow and cloneJob issue when generating conditional workflows
Fix variable substitution issue for flow dependency list
The following were contributed by Rakesh Malladi. Thanks, Rakesh!
Add ability for Hadoop DSL to understand the WormholePushJob job type
Deprecate dataset.group property from the WormholePushJob
The following were contributed by Yeni Bermudez. Thanks, Yeni!
Add a workaround to handle visiting files multiple times when merging tempprops into .flow files.
The following were contributed by Abhishek Nimesh. Thanks, Abhishek!
Add ability for Hadoop DSL to understand the KabootarJob job type
The following were contributed by Zhixiong Chen.
Sort config
The following were contributed by Sayali Kadam.
Accept property 'ai.project.group' for KabootarJob
The following were contributed by Arjun Singh Bora.
Add ability for Hadoop DSL to understand the WormholePushJob2 job type
The following were contributed by Sayali Kadam.
Fixing property aiProjectGroup while cloning Kabootar Job
Kyle Moore (DPUkyle) contributed the following:
Refactored Hadoop DSL "tests" into proper integration tests using Gradle TestKit
Refactored li-hadoop-plugin "tests" into proper integration tests using Gradle TestKit
The following were contributed by Frank Gu.
Add ability for Hadoop DSL to understand the KubernetesJob job type