Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WIP(iox-10578): patched df upgrade 202-04-27 #17

Closed
wants to merge 10 commits into from

Conversation

appletreeisyellow
Copy link

@appletreeisyellow appletreeisyellow commented Apr 30, 2024

⚠️ This will not be merged. ⚠️

  1. Bringing us up to datafusion to 2024-04-27 apache@0f76bd6

  2. This PR is based on April 23, 2024 apache@65ecfda

git co -b chunchun/update-df-apr-week-4-3 65ecfda84f7a105412bbf4040885a1b1774668d1
  1. Applied the following patch(es):

    1. Cherry picked Allow adding user defined metadata to ParquetSink apache/datafusion#10224 /
      apache@9c8873a

      commit 9c8873af12826e47f5743991859790df7a3b6400
      Author: wiedld <[email protected]>
      Date:   Fri Apr 26 03:42:16 2024 -0700
      
          Allow adding user defined metadata to `ParquetSink` (#10224)
    2. cherry picked fix: no longer support the substring function apache/datafusion#10242 / f8c623f

      commit f8c623fe045d70a87eac8dc8620b74ff73be56d5
      Author: Jonah Gao <[email protected]>
      Date:   Sat Apr 27 02:30:09 2024 +0800
      
          fix: no longer support the `substring` function (#10242)
    3. cherry picked feat: add optimizer config param to avoid grouping partitions prefer_existing_union apache/datafusion#10259 / apache@2231183

      commit 22311835bc1b4bd83b50e1c3875b0e725622b872
      Author: Nga Tran <[email protected]>
      Date:   Tue Apr 30 11:45:34 2024 -0400
      
          feat: add optimizer config param to avoid grouping partitions `prefer_existing_union` (#10259)
    4. cherry picked chore: add function to set prefer_existing_union apache/datafusion#10322 / 3b77b6b

      commit 3b77b6b8e884a1bda7579891935466561c235a97
      Author: Nga Tran <[email protected]>
      Date:   Tue Apr 30 17:49:14 2024 -0400
      
      chore: add function to set prefer_existing_union (#10322)

wiedld and others added 4 commits April 30, 2024 12:29
* chore: make explicit what ParquetWriterOptions are created from a subset of TableParquetOptions

* refactor: restore the ability to add kv metadata into the generated file sink

* test: demomnstrate API contract for metadata TableParquetOptions

* chore: update code docs

* fix: parse on proper delimiter, and improve tests

* fix: enable any character in the metadata string value, by having any key parsing be a part of the format.metadata::key
* fix: no longer support the `substring` function

* enable from-for format

* update test comment

* review feedback

* review feedback

Co-authored-by: Jeffrey Vo <[email protected]>

---------

Co-authored-by: Jeffrey Vo <[email protected]>
…_existing_union` (apache#10259)

* feat: add a config param to avoid converting union to interleave

* chore: update config for the tests

* chore: update configs.md
@github-actions github-actions bot removed the sql label Apr 30, 2024
@appletreeisyellow appletreeisyellow changed the title WIP(iox-10578): patched df upgrade 202-04-TBD WIP(iox-10578): patched df upgrade 202-04-27 Apr 30, 2024
@appletreeisyellow
Copy link
Author

The upgrade is done. Closing

@appletreeisyellow appletreeisyellow deleted the chunchun/update-df-apr-week-4-3 branch May 1, 2024 20:39
appletreeisyellow pushed a commit that referenced this pull request Jul 22, 2024
apache#11505)

* fix unparser for derived table with columns

* refactoring

* renaming

* case in tests
wiedld pushed a commit that referenced this pull request Jul 31, 2024
apache#11505)

* fix unparser for derived table with columns

* refactoring

* renaming

* case in tests
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants