Skip to content

Bump delta-core_2.12 from 1.0.0 to 2.1.1 in /integration/spark#8

Closed
dependabot[bot] wants to merge 1 commit intomainfrom
dependabot/gradle/integration/spark/io.delta-delta-core_2.12-2.1.1
Closed

Bump delta-core_2.12 from 1.0.0 to 2.1.1 in /integration/spark#8
dependabot[bot] wants to merge 1 commit intomainfrom
dependabot/gradle/integration/spark/io.delta-delta-core_2.12-2.1.1

Conversation

@dependabot
Copy link

@dependabot dependabot bot commented on behalf of github Nov 3, 2022

Bumps delta-core_2.12 from 1.0.0 to 2.1.1.

Release notes

Sourced from delta-core_2.12's releases.

Delta Lake 2.1.1

We are excited to announce the release of Delta Lake 2.1.1 on Apache Spark 3.3. This release contains important bug fixes to 2.1.0 and it is recommended that users update to 2.1.1. Similar to Apache Spark™, we have released Maven artifacts for both Scala 2.12 and Scala 2.13.

This release includes the following bug fixes and improvements:

  • Fix for a bug in the DynamoDB-based S3 multi-cluster mode configuration. The previous version wrote an incorrect timestamp which was used by DynamoDB’s TTL feature to cleanup expired items. This timestamp value has been fixed and the table attribute renamed from commitTime to expireTime. If you already have TTL enabled, please follow the migration steps here.
  • Fix for incorrect MERGE behavior when the Delta statistics are disabled.
  • Fix for accidental protocol downgrades with RESTORE command. Until now, RESTORE TABLE may downgrade the protocol version of the table, which could have resulted in inconsistent reads with time travel. With this fix, the protocol version is never downgraded from the current one.
  • Improve performance of the DELETE command by optimizing the step to search affected files to trigger column pruning.
  • Fix for NotSerializableException when running RESTORE command in Spark SQL with Hadoop2.

Delta Lake 2.1.0

We are excited to announce the release of Delta Lake 2.1.0 on Apache Spark 3.3. Similar to Apache Spark™, we have released Maven artifacts for both Scala 2.12 and Scala 2.13.

The key features in this release are as follows

  • Support for Apache Spark 3.3.
  • Support for [TIMESTAMP | VERSION] AS OF in SQL. With Spark 3.3, Delta now supports time travel in SQL to query older data easily. With this update, time travel is now available both in SQL and through the DataFrame API.
  • Support for Trigger.AvailableNow when streaming from a Delta table. Spark 3.3 introduces Trigger.AvailableNow for running streaming queries like Trigger.Once in multiple batches. This is now supported when using Delta tables as a streaming source.
  • Support for SHOW COLUMNS to return the list of columns in a table.
  • Support for DESCRIBE DETAIL in the Scala and Python DeltaTable API. Retrieve detailed information about a Delta table using the DeltaTable API and in SQL.
  • Support for returning operation metrics from SQL Delete, Merge, and Update commands. Previously these SQL commands returned an empty DataFrame, now they return a DataFrame with useful metrics about the operation performed.
  • Optimize performance improvements
    • Added a config to use repartition(1) instead of coalesce(1) in Optimize for better performance when compacting many small files.
    • Improve Optimize performance by using a queue-based approach to parallelize the compaction jobs.
  • Other notable changes
    • Support for using variables in the VACUUM and OPTIMIZE SQL commands.
    • Improvements for CONVERT TO DELTA with catalog tables.
      • Autofill the partition schema from the catalog when it’s not provided.
      • Use partition information from the catalog to find the data files to commit instead of doing a full directory scan. Instead of committing all data files in the table directory, only data files under the directories of active partitions will be committed.
    • Support for Change Data Feed (CDF) batch reads on column mapping enabled tables when DROP COLUMN and RENAME COLUMN have not been used. See the documentation for more details.
    • Improve Update performance by enabling schema pruning in the first pass.
    • Fix for DeltaTableBuilder to preserve table property case of non-delta properties when setting properties.
    • Fix for duplicate CDF row output for delete-when-matched merges with multiple matches.
    • Fix for consistent timestamps in a MERGE command.
    • Fix for incorrect operation metrics for DataFrame writes with a replaceWhere option.
    • Fix for a bug in Merge that sometimes caused empty files to be committed to the table.
    • Change in log4j properties file format. Apache Spark upgraded the log4j version from 1.x to 2.x which has a different format for the log4j file. Refer to the Spark upgrade notes.

Benchmark framework update

Improvements to the benchmark framework (initial version added in version 1.2.0) including support for benchmarking arbitrary functions and not just SQL queries. We’ve also added Terraform scripts to automatically generate the infrastructure to run benchmarks on AWS and GCP.

Credits

... (truncated)

Commits
  • d8c4fc1 Setting version to 2.1.1
  • eae7e63 Upgrade version in integration tests
  • 41a1dbc Misc integration test updates
  • 4ec7631 Issue #1436: Fix restore delta table NotSerializableException for Hadoop 2
  • 58f539f Fix S3DynamoDBLogStore concurrent writer bug
  • d7845e6 Allow schema pruning for delete first pass
  • 26df795 Fix bug on merge command when DELTA_COLLECT_STATS is disabled
  • 34b52b9 Fix Delta streaming source filter logic to not return incorrect -1 index
  • 82ddcf1 Fix Delta source initialization issue when using AvailableNow
  • 8570049 Prevent Protocol Downgrades during RESTORE in Delta
  • Additional commits viewable in compare view

Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Bumps [delta-core_2.12](https://github.com/delta-io/delta) from 1.0.0 to 2.1.1.
- [Release notes](https://github.com/delta-io/delta/releases)
- [Commits](delta-io/delta@v1.0.0...v2.1.1)

---
updated-dependencies:
- dependency-name: io.delta:delta-core_2.12
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
@dependabot dependabot bot added dependencies Pull requests that update a dependency file java Pull requests that update Java code labels Nov 3, 2022
@dependabot @github
Copy link
Author

dependabot bot commented on behalf of github Dec 6, 2022

Superseded by #49.

@dependabot dependabot bot closed this Dec 6, 2022
@dependabot dependabot bot deleted the dependabot/gradle/integration/spark/io.delta-delta-core_2.12-2.1.1 branch December 6, 2022 14:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

dependencies Pull requests that update a dependency file java Pull requests that update Java code

Projects

None yet

Development

Successfully merging this pull request may close these issues.

0 participants