Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[MINOR][DNM][TESTING] Flink bundle testing 5.5 #11326

Closed
wants to merge 149 commits into from

Commits on Feb 26, 2024

  1. Move version to 0.15.0-SNAPSHOT

    yihua committed Feb 26, 2024
    Configuration menu
    Copy the full SHA
    7b5b6c7 View commit details
    Browse the repository at this point in the history
  2. [HUDI-6825] Use UTF_8 to encode String to byte array in all places (a…

    …pache#9634)
    
    Unify the encoding of Java `String` to byte array in Hudi,
    especially for writing bytes to the storage,
    by using `UTF_8` encoding only.
    
    ---------
    
    Co-authored-by: Sagar Sumit <[email protected]>
    yihua and codope committed Feb 26, 2024
    Configuration menu
    Copy the full SHA
    6f25f41 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    232255e View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    d0e98e1 View commit details
    Browse the repository at this point in the history
  5. [MINOR] Update cleaner docs (apache#9716)

    Co-authored-by: Jonathan Vexler <=>
    Co-authored-by: Y Ethan Guo <[email protected]>
    jonvex and yihua committed Feb 26, 2024
    Configuration menu
    Copy the full SHA
    ec91bbc View commit details
    Browse the repository at this point in the history
  6. [MINOR] Move hoodie hfile/orc reader/writer test cases from hudi-clie…

    …nt-common to hudi-common (apache#9103)
    
    Co-authored-by: Y Ethan Guo <[email protected]>
    Mulavar and yihua committed Feb 26, 2024
    Configuration menu
    Copy the full SHA
    3998ef6 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    82bd765 View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    52c42f8 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    69d0998 View commit details
    Browse the repository at this point in the history
  10. [HUDI-7016] Fix bundling of RoaringBitmap dependency (apache#9963)

    This commit fixes the bundling of RoaringBitmap dependency in Hudi bundles by including it in the shade rules and shading the classes, to avoid dependency conflict with engine-provided jars, e.g., Spark. Before this fix, with Hudi Spark bundle, NoSuchMethodError exception is thrown by Spark 3.2.
    yihua committed Feb 26, 2024
    Configuration menu
    Copy the full SHA
    1911c27 View commit details
    Browse the repository at this point in the history
  11. [HUDI-6993] Support Flink 1.18 (apache#9949)

    * Address build failures in older Flink Versions
    * Remove unnecessary dependency on flink-connector-hive
    * Fix Flink 1.18 Validate-bundles
    
    ---------
    
    Signed-off-by: Prabhu Joseph <[email protected]>
    Co-authored-by: Prabhu Joseph <[email protected]>
    Co-authored-by: root <[email protected]>
    3 people authored and yihua committed Feb 26, 2024
    Configuration menu
    Copy the full SHA
    7acc41e View commit details
    Browse the repository at this point in the history
  12. Configuration menu
    Copy the full SHA
    8fc4135 View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    c072007 View commit details
    Browse the repository at this point in the history
  14. [HUDI-6806] Support Spark 3.5.0 (apache#9717)

    ---------
    
    Co-authored-by: Shawn Chang <[email protected]>
    Co-authored-by: Y Ethan Guo <[email protected]>
    3 people committed Feb 26, 2024
    Configuration menu
    Copy the full SHA
    ae80cbd View commit details
    Browse the repository at this point in the history
  15. Configuration menu
    Copy the full SHA
    1605c28 View commit details
    Browse the repository at this point in the history
  16. Configuration menu
    Copy the full SHA
    149ca9a View commit details
    Browse the repository at this point in the history
  17. Configuration menu
    Copy the full SHA
    d1366d8 View commit details
    Browse the repository at this point in the history

Commits on Feb 27, 2024

  1. [HUDI-7207] Sequentially delete complete instant files in archival to…

    … prevent inconsistency during data reads (apache#10711)
    majian1998 authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    008320c View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    af3f258 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    50119d2 View commit details
    Browse the repository at this point in the history
  4. [HUDI-7190] Fix nested columns vectorized read for spark33+ legacy fo…

    …rmats (apache#10265)
    
    * [HUDI-7190] Fix legacy parquet format nested columns vectorized read for spark3.3+
    * Fix nested type implicit schema evolution
    * fix legacy format support batch read
    * Add exception messages when vectorized read nested type with type change
    stream2000 authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    155a66c View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    e1625b1 View commit details
    Browse the repository at this point in the history
  6. [HUDI-7242] Avoid unnecessary bigquery table update when using sync t…

    …ool (apache#10374)
    
    Co-authored-by: jp0317 <[email protected]>
    2 people authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    a8ef9d4 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    353d281 View commit details
    Browse the repository at this point in the history
  8. [MINOR] DataStream need in closeure in FileSystemBasedLockProvider (a…

    …pache#10411)
    
    Co-authored-by: xuyu <[email protected]>
    2 people authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    5faefcd View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    1be7447 View commit details
    Browse the repository at this point in the history
  10. [HUDI-7268] HoodieFlinkStreamer should disable compaction in pipeline…

    … with append mode (apache#10430)
    
    Co-authored-by: xuyu <[email protected]>
    2 people authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    94a162a View commit details
    Browse the repository at this point in the history
  11. [HUDI-7260] Fix call repair_overwrite_hoodie_props failure error due …

    …to specify hoodie.properties path (apache#10413)
    empcl authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    acace8f View commit details
    Browse the repository at this point in the history
  12. [MINOR] Fix ArchivalUtils Logger named (apache#10436)

    Co-authored-by: dongsj <[email protected]>
    2 people authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    2601a0e View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    595d230 View commit details
    Browse the repository at this point in the history
  14. [HUDI-7271] Copy a conf in ClusteringOperator to avoid configuration …

    …leak (apache#10441)
    
    Co-authored-by: leixin1 <[email protected]>
    2 people authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    37ff8fe View commit details
    Browse the repository at this point in the history
  15. Configuration menu
    Copy the full SHA
    91d7983 View commit details
    Browse the repository at this point in the history
  16. Configuration menu
    Copy the full SHA
    60b073f View commit details
    Browse the repository at this point in the history
  17. [MINOR] Disable flaky test (apache#10449)

    Co-authored-by: Jonathan Vexler <=>
    jonvex authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    6ffc817 View commit details
    Browse the repository at this point in the history
  18. [HUDI-7279] make sampling rate configurable for BOUNDED_IN_MEMORY exe…

    …cutor type (apache#10459)
    
    * make sampling rate configurable for BOUNDED_IN_MEMORY executor type
    * add sinceVersion for new configs
    ---------
    Co-authored-by: wei.kong <[email protected]>
    waitingF authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    ef1ccce View commit details
    Browse the repository at this point in the history
  19. [HUDI-5973] Fixing refreshing of schemas in HoodieStreamer continuous…

    … mode (apache#10261)
    
    * Add cachedSchema per batch, fix idempotency with getSourceSchema calls
    ---------
    Co-authored-by: danielfordfc <[email protected]>
    nsivabalan authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    fc587b3 View commit details
    Browse the repository at this point in the history
  20. [MINOR] Fix unit tests (apache#10362)

    geserdugarov authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    b712666 View commit details
    Browse the repository at this point in the history
  21. [HUDI-7284] Stream sync doesn't differentiate replace commits (apache…

    …#10467)
    
    Co-authored-by: Jonathan Vexler <=>
    jonvex authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    d1dd4a4 View commit details
    Browse the repository at this point in the history
  22. [HUDI-7241] Avoid always broadcast HUDI relation if not using HoodieS…

    …parkSessionExtension (apache#10373)
    
    * [HUDI-7241] Avoid always broadcast HUDI relation if not using HoodieSparkSessionExtension
    * Update the logical to check whether HoodieExtension is enabled
    beyond1920 authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    c0e59e9 View commit details
    Browse the repository at this point in the history
  23. Configuration menu
    Copy the full SHA
    26df317 View commit details
    Browse the repository at this point in the history
  24. Configuration menu
    Copy the full SHA
    fcd6cd9 View commit details
    Browse the repository at this point in the history
  25. [HUDI-7288] Fix ArrayIndexOutOfBoundsException when upgrade nonPartit…

    …ionedTable created by 0.10/0.11 HUDI version (apache#10482)
    beyond1920 authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    cdefb4b View commit details
    Browse the repository at this point in the history
  26. Configuration menu
    Copy the full SHA
    ef7f523 View commit details
    Browse the repository at this point in the history
  27. [MINOR] Parallelized the check for existence of files in IncrementalR…

    …elation. (apache#10480)
    
    This speedups the check for large datasets when a very large number of files need to be checked for existence.
    prashantwason authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    635d0c6 View commit details
    Browse the repository at this point in the history
  28. [HUDI-7282] Avoid verification failure due to append writing of the c…

    …ow table with cluster configuration when the index is bucket. (apache#10475)
    Akihito-Liang authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    8546cbf View commit details
    Browse the repository at this point in the history
  29. [HUDI-6902] Use mvnw command for hadoo-mr test (apache#10474)

    The reason is to clean up any orphan resources.
    linliu-code authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    744befe View commit details
    Browse the repository at this point in the history
  30. [HUDI-6902] Give minimum memory for unit tests (apache#10469)

    Changes:
    1. Set initial memory 128M.
    linliu-code authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    36eeb94 View commit details
    Browse the repository at this point in the history
  31. [HUDI-7278] make bloom filter skippable for CPU saving (apache#10457)

    * make bloom filter skippable for CPU saving
    ---------
    Co-authored-by: wei.kong <[email protected]>
    waitingF authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    da6a490 View commit details
    Browse the repository at this point in the history
  32. Configuration menu
    Copy the full SHA
    7d97216 View commit details
    Browse the repository at this point in the history
  33. Configuration menu
    Copy the full SHA
    2b2e1a0 View commit details
    Browse the repository at this point in the history
  34. [HUDI-6092] Set the timeout for the forked JVM (apache#10496)

    After we set this parameter, the surefire will try to ping the forked JVM
    after the timeout.
    linliu-code authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    0de5f07 View commit details
    Browse the repository at this point in the history
  35. [MINOR] Clean default Hadoop configuration values in tests (apache#10495

    )
    
    * [MINOR] Clean default Hadoop configurations for SparkContext
    
    These default Hadoop configurations are not used in Hudi tests.
    
    * Consolidating the code into a helper class
    
    ---------
    
    Co-authored-by: vinoth chandar <[email protected]>
    2 people authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    d414b60 View commit details
    Browse the repository at this point in the history
  36. [HUDI-7300] Merge schema in ParuqetDFSSource (apache#10199)

    Rohit Mittapalli authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    9ddcfb1 View commit details
    Browse the repository at this point in the history
  37. [MINOR] Fix eager rollback mdt ut (apache#10506)

    Signed-off-by: wulingqi <[email protected]>
    KnightChess authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    5bc160b View commit details
    Browse the repository at this point in the history
  38. [HUDI-7296] Reduce CI Time by Minimizing Duplicate Code Coverage in T…

    …ests (apache#10492)
    
    * reduce combos of tests
    
    * build success
    
    ---------
    
    Co-authored-by: Jonathan Vexler <=>
    jonvex authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    8048c99 View commit details
    Browse the repository at this point in the history
  39. [HUDI-7246] Fix Data Skipping Issue: No Results When Query Conditions…

    … Involve Both Columns with and without Column Stats (apache#10389)
    majian1998 authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    7c13eb3 View commit details
    Browse the repository at this point in the history
  40. [HUDI-7170] Implement HFile reader independent of HBase (apache#10241)

    This commit adds a Hudi-native HFile reader implementation independent of HBase.
    yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    2337270 View commit details
    Browse the repository at this point in the history
  41. [HUDI-6902] Fix a unit test (apache#10513)

    fixed a test.
    linliu-code authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    a508d54 View commit details
    Browse the repository at this point in the history
  42. Configuration menu
    Copy the full SHA
    3facb0a View commit details
    Browse the repository at this point in the history
  43. Configuration menu
    Copy the full SHA
    e8f34c3 View commit details
    Browse the repository at this point in the history
  44. [HUDI-7297] Fix ambiguous error message when field type defined in sc…

    …hema mismatches that in parquet file (apache#10497)
    paul8263 authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    975ba22 View commit details
    Browse the repository at this point in the history
  45. [HUDI-7309] Disable constructing AND & OR filter predicates when filt…

    …er pushing down for any of its operand's logical type for is unsupported in ExpressionPredicates::toParquetPredicate (apache#10524)
    paul8263 authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    cefc530 View commit details
    Browse the repository at this point in the history
  46. [HUDI-7284] Fix cluster stream sync check (apache#10501)

    Co-authored-by: Jonathan Vexler <=>
    jonvex authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    0705849 View commit details
    Browse the repository at this point in the history
  47. [HUDI-7314] Hudi Create table support index type check (apache#10536)

    Co-authored-by: xuyu <[email protected]>
    Co-authored-by: Y Ethan Guo <[email protected]>
    3 people committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    4361432 View commit details
    Browse the repository at this point in the history
  48. [HUDI-7277] Fix hoodie.bulkinsert.shuffle.parallelism not activated…

    … with no-partitioned table (apache#10532)
    
    Signed-off-by: wulingqi <[email protected]>
    KnightChess authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    ccb5993 View commit details
    Browse the repository at this point in the history
  49. [MINOR] Added descriptive exception if column present in required avr…

    …o schema does not exist in hudi table (apache#10527)
    prathit06 authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    38525de View commit details
    Browse the repository at this point in the history
  50. [HUDI-7315] Disable constructing NOT filter predicate when pushing do…

    …wn its wrapped filter unsupported, as its operand's primitive value is incomparable (apache#10537)
    paul8263 authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    e5cabe6 View commit details
    Browse the repository at this point in the history
  51. [HUDI-7317] FlinkTableFactory snatifyCheck should contains index type (

    …apache#10541)
    
    Co-authored-by: xuyu <[email protected]>
    2 people authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    c9cdc2a View commit details
    Browse the repository at this point in the history
  52. [HUDI-7303] Fix date field type unexpectedly convert to Long when usi…

    …ng date comparison operator (apache#10517)
    paul8263 authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    288898e View commit details
    Browse the repository at this point in the history
  53. [MINOR] Reduce UT spark-datasource test times (apache#10547)

    * [MINOR] Reduce UT spark-datasource test times
    
    * Reverting the parallelism change
    vinothchandar authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    1554908 View commit details
    Browse the repository at this point in the history
  54. Configuration menu
    Copy the full SHA
    1b37ee2 View commit details
    Browse the repository at this point in the history
  55. [HUDI-7316] AbstractHoodieLogRecordReader should accept HoodieTableMe…

    …taClient in order to reduce occurences of executors making file listing calls when reloading active timeline (apache#10540)
    
    Summary:
    Currently some implementors of  AbstractHoodieLogRecordReader create a HoodieTableMetaClient on construction, which implicitly reloads active timeline, causing a `listStatus` HDFS call. Since these are created in executors, each of the hundreds to thousands of executors will make a `listStatus` call at the same time during a stage. To avoid these redundant calls to HDFS NameNode, AbstractHoodieLogRecordReader and the following implementations have been updated to allow an existing HoodieTableMetaClient to be passed in.
    - HoodieUnMergedLogRecordScanner
    - HoodieMergedLogRecordScanner
    - HoodieMetadataMergedLogRecordReader
    As long as the caller passed in a HoodieTableMetaClient with active timeline already loaded, and the implementation doesn't need to re-load the timeline (such as in order to get a more "fresh" timeline) than `listStatus` calls can be avoided in the executor, without causing the logic to be incorrect.
    
    Co-authored-by: Krishen Bhan <[email protected]>
    2 people authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    cef039f View commit details
    Browse the repository at this point in the history
  56. Configuration menu
    Copy the full SHA
    492daf0 View commit details
    Browse the repository at this point in the history
  57. Configuration menu
    Copy the full SHA
    126010b View commit details
    Browse the repository at this point in the history
  58. [HUDI-7298] Write bad records to error table in more cases instead of…

    … failing stream (apache#10500)
    
    Cases:
    - No transformers, with schema provider. Records will go to the error table if they cannot be rewritten in the deduced schema.
    - recordkey is null, even if the column is nullable in the schema
    jonvex authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    9002a02 View commit details
    Browse the repository at this point in the history
  59. Configuration menu
    Copy the full SHA
    31adbb9 View commit details
    Browse the repository at this point in the history
  60. [HUDI-7327] remove meta cols from incoming schema in stream sync (apa…

    …che#10556)
    
    ---------
    
    Co-authored-by: Jonathan Vexler <=>
    jonvex authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    6f27d81 View commit details
    Browse the repository at this point in the history
  61. Configuration menu
    Copy the full SHA
    54a3b67 View commit details
    Browse the repository at this point in the history
  62. [MINOR] add logger to CompactionPlanOperator & ClusteringPlanOperator (

    …apache#10562)
    
    Co-authored-by: dongsj <[email protected]>
    2 people authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    e76f2e8 View commit details
    Browse the repository at this point in the history
  63. [HUDI-7308] LockManager::unlock should not call updateLockHeldTimerMe…

    …trics if lockDurationTimer has not been started (apache#10523)
    kbuci authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    6dd4bea View commit details
    Browse the repository at this point in the history
  64. [HUDI-7335] Create hudi-hadoop-common for hadoop-specific implementat…

    …ion (apache#10564)
    
    This commit creates a new module `hudi-hadoop-common` for hadoop-specific implementation.  This serves as the first step to decouple `hudi-common` module from hadoop dependencies.
    yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    86e3ca6 View commit details
    Browse the repository at this point in the history
  65. Configuration menu
    Copy the full SHA
    b5200bf View commit details
    Browse the repository at this point in the history
  66. [HUDI-7336] Introduce new HoodieStorage abstraction (apache#10567)

    This commit introduces `HoodieStorage` abstraction and Hudi's counterpart classes for Hadoop File System classes (`org.apache.hadoop.fs.`[`FileSystem`, `Path`, `PathFilter`, `FileStatus`]) to decouple Hudi's implementation from Hadoop classes, so it's much easier to plugin different file system implementation.
    yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    005c758 View commit details
    Browse the repository at this point in the history
  67. Configuration menu
    Copy the full SHA
    e00e2d7 View commit details
    Browse the repository at this point in the history
  68. Configuration menu
    Copy the full SHA
    a058344 View commit details
    Browse the repository at this point in the history
  69. Configuration menu
    Copy the full SHA
    8fda151 View commit details
    Browse the repository at this point in the history
  70. Configuration menu
    Copy the full SHA
    90ca4f0 View commit details
    Browse the repository at this point in the history
  71. Configuration menu
    Copy the full SHA
    97ce215 View commit details
    Browse the repository at this point in the history
  72. Configuration menu
    Copy the full SHA
    4d49fa4 View commit details
    Browse the repository at this point in the history
  73. [HUDI-7344] Use Java <Input/Output>Stream instead of FSData<Input/Out…

    …put>Stream when possible (apache#10573)
    yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    bcfcd9f View commit details
    Browse the repository at this point in the history
  74. Configuration menu
    Copy the full SHA
    e38c731 View commit details
    Browse the repository at this point in the history
  75. Configuration menu
    Copy the full SHA
    aef157a View commit details
    Browse the repository at this point in the history
  76. Configuration menu
    Copy the full SHA
    104fa7d View commit details
    Browse the repository at this point in the history
  77. [HUDI-6868] Support extracting passwords from credential store for Hi…

    …ve Sync (apache#10577)
    
    Co-authored-by: Danny Chan <[email protected]>
    2 people authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    cb2d94b View commit details
    Browse the repository at this point in the history
  78. Configuration menu
    Copy the full SHA
    fa6e499 View commit details
    Browse the repository at this point in the history
  79. [HUDI-7284] Fix bad method name getLastPendingClusterCommit to getLas…

    …tPendingClusterInstant (apache#10613)
    
    Co-authored-by: Jonathan Vexler <=>
    jonvex authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    4a04292 View commit details
    Browse the repository at this point in the history
  80. Configuration menu
    Copy the full SHA
    692f0d1 View commit details
    Browse the repository at this point in the history
  81. [HUDI-7375] Disable a flaky test method (apache#10627)

    Which is caused by issues from underlying MiniHDFS.
    linliu-code authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    18f10ba View commit details
    Browse the repository at this point in the history
  82. Configuration menu
    Copy the full SHA
    b8b88cf View commit details
    Browse the repository at this point in the history
  83. [HUDI-7338] Bump HBase, Pulsar, Jetty version (apache#10223)

    Co-authored-by: Shawn Chang <[email protected]>
    2 people authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    d17ae75 View commit details
    Browse the repository at this point in the history
  84. [HUDI-7367] Add makeQualified APIs (apache#10607)

    * [HUDI-7367] Add makeQualified APIs
    
    * Fix checkstyle
    yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    51a364c View commit details
    Browse the repository at this point in the history
  85. Configuration menu
    Copy the full SHA
    66ac9ff View commit details
    Browse the repository at this point in the history
  86. Configuration menu
    Copy the full SHA
    e03a88c View commit details
    Browse the repository at this point in the history
  87. [HUDI-7387] Serializable Class need contains serialVersionUID to keep…

    … compatibility in upgrade (apache#10633)
    xuzifu666 authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    9911497 View commit details
    Browse the repository at this point in the history
  88. [MINOR] fix typo (apache#10634)

    lxliyou001 authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    32fe3b6 View commit details
    Browse the repository at this point in the history
  89. Configuration menu
    Copy the full SHA
    8436feb View commit details
    Browse the repository at this point in the history
  90. [HUDI-7373] revert config hoodie.write.handle.missing.cols.with.lossl…

    …ess.type.promotion (apache#10611)
    
    ---------
    
    Co-authored-by: Jonathan Vexler <=>
    jonvex authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    09f3fb5 View commit details
    Browse the repository at this point in the history
  91. [HUDI-6902] Containerize the Azure CI (apache#10512)

    * [HUDI-6902] Containerize the Azure tests
    
    * remove warning message
    linliu-code authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    a0ebac8 View commit details
    Browse the repository at this point in the history
  92. Configuration menu
    Copy the full SHA
    ff79572 View commit details
    Browse the repository at this point in the history
  93. [HUDI-7379] Exclude jackson-module-afterburner from hudi-aws module (a…

    …pache#10618)
    
    Co-authored-by: Prabhu Joseph <[email protected]>
    2 people authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    ba5dc83 View commit details
    Browse the repository at this point in the history
  94. [HUDI-7381] Fix compaction write stats and metrics for create and ups…

    …ert time (apache#10619)
    
    Co-authored-by: rmahindra123 <[email protected]>
    2 people authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    3f22f6f View commit details
    Browse the repository at this point in the history
  95. [MINOR] Disable Containers the Azure CI (apache#10662)

    This reverts commit 81cddbb.
    linliu-code authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    84c7edd View commit details
    Browse the repository at this point in the history
  96. Configuration menu
    Copy the full SHA
    3dcfbc2 View commit details
    Browse the repository at this point in the history
  97. Configuration menu
    Copy the full SHA
    82c79ce View commit details
    Browse the repository at this point in the history
  98. [HUDI-7357] Introduce generic StorageConfiguration (apache#10586)

    This commit introduces the generic `StorageConfiguration` to store configuration for I/O with `HoodieStorage`. Given there's overhead of reinitializing Hadoop's `Configuration` instance, the approach is to wrap the instance in the `HadoopStorageConfiguration` implementation.  This change will enable us to remove our dependency on Hadoop's `Configuration` class.  When integrated, places using `Configuration` will be replaced by `StorageConfiguration` and the `StorageConfiguration` will be passed around for instantiating `HoodieStorage` (unless Hadoop-based readers need the `Configuration` instance).
    yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    8186c64 View commit details
    Browse the repository at this point in the history
  99. [HUDI-7104] Fixing cleaner savepoint interplay to fix edge case with …

    …incremental cleaning (apache#10651)
    
    * Fixing incremental cleaning with savepoint
    
    * Addressing feedback
    nsivabalan authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    26fb26d View commit details
    Browse the repository at this point in the history
  100. Configuration menu
    Copy the full SHA
    961e941 View commit details
    Browse the repository at this point in the history
  101. Configuration menu
    Copy the full SHA
    3e7b453 View commit details
    Browse the repository at this point in the history
  102. Configuration menu
    Copy the full SHA
    cad5605 View commit details
    Browse the repository at this point in the history
  103. Configuration menu
    Copy the full SHA
    5b94afa View commit details
    Browse the repository at this point in the history
  104. [HUDI-7381] Fix flaky test introduced in PR 10619 (apache#10674)

    Co-authored-by: rmahindra123 <[email protected]>
    2 people authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    05602a1 View commit details
    Browse the repository at this point in the history
  105. [MINOR] Clarify config descriptions (apache#10681)

    This aligns with the doc change here: apache#10680
    bhasudha authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    0f2e6db View commit details
    Browse the repository at this point in the history
  106. Configuration menu
    Copy the full SHA
    6ed3b43 View commit details
    Browse the repository at this point in the history
  107. [HUDI-7147] Fix npe stream sync first batch, empty schema, upsert (ap…

    …ache#10689)
    
    * fix npe
    
    * add empty table support as well
    
    * use empty relation
    
    * fix failing tests
    
    ---------
    
    Co-authored-by: Jonathan Vexler <=>
    jonvex authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    d440d52 View commit details
    Browse the repository at this point in the history
  108. Configuration menu
    Copy the full SHA
    a16b4c6 View commit details
    Browse the repository at this point in the history
  109. Configuration menu
    Copy the full SHA
    926382d View commit details
    Browse the repository at this point in the history
  110. Configuration menu
    Copy the full SHA
    6147fd9 View commit details
    Browse the repository at this point in the history
  111. [HUDI-7418] Add file extension filter for s3 incr source (apache#10694)

    We have support for filtering the input files based on an extension (custom) for GCS Incr Source that can be configured. But we don't have the same for the S3 incr source (which always assumes that file extension is same as the format which may not be the case always).
    
    Co-authored-by: rmahindra123 <[email protected]>
    2 people authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    f2bcdf8 View commit details
    Browse the repository at this point in the history
  112. [HUDI-7423] Support table type name incase-sensitive when create tabl…

    …e in sparksql (apache#10703)
    
    * [HUDI-7423] Support table type name case-sensitive when create table in sparksql
    
    * add comments
    xuzifu666 authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    798fca6 View commit details
    Browse the repository at this point in the history
  113. [HUDI-7416] Remove duplicate code for getFileFormat and Refactor filt…

    …er methods for S3/GCS sources (apache#10701)
    vinishjail97 authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    ba7f48a View commit details
    Browse the repository at this point in the history
  114. Configuration menu
    Copy the full SHA
    026231e View commit details
    Browse the repository at this point in the history
  115. [HUDI-6774] Prefix HiveConf properties to Hoodie catalog properties m…

    …ap with '.hadoop' (apache#10686)
    
    Co-authored-by: Vova Kolmakov <[email protected]>
    2 people authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    5591eb0 View commit details
    Browse the repository at this point in the history
  116. Configuration menu
    Copy the full SHA
    d5cc357 View commit details
    Browse the repository at this point in the history
  117. Configuration menu
    Copy the full SHA
    0479c09 View commit details
    Browse the repository at this point in the history
  118. Configuration menu
    Copy the full SHA
    6b1eb28 View commit details
    Browse the repository at this point in the history
  119. Configuration menu
    Copy the full SHA
    623d0df View commit details
    Browse the repository at this point in the history
  120. [HUDI-7432] Fix excessive object creation in KeyGenUtils (apache#10721)

    Co-authored-by: Vova Kolmakov <[email protected]>
    2 people authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    bef7c9b View commit details
    Browse the repository at this point in the history
  121. Configuration menu
    Copy the full SHA
    23c9d85 View commit details
    Browse the repository at this point in the history
  122. Configuration menu
    Copy the full SHA
    d361e80 View commit details
    Browse the repository at this point in the history
  123. Configuration menu
    Copy the full SHA
    ce0ee2f View commit details
    Browse the repository at this point in the history
  124. [HUDI-7275] Separate use of HoodieTimelineTimeZone.UTC and LOCAL in t…

    …ests to prevent infinite loops (apache#10738)
    geserdugarov authored and yihua committed Feb 27, 2024
    Configuration menu
    Copy the full SHA
    d0b34f0 View commit details
    Browse the repository at this point in the history
  125. Configuration menu
    Copy the full SHA
    a64a0ed View commit details
    Browse the repository at this point in the history
  126. Configuration menu
    Copy the full SHA
    b83f160 View commit details
    Browse the repository at this point in the history
  127. Configuration menu
    Copy the full SHA
    87e6e5e View commit details
    Browse the repository at this point in the history
  128. Configuration menu
    Copy the full SHA
    cfbacf7 View commit details
    Browse the repository at this point in the history
  129. Configuration menu
    Copy the full SHA
    41ee828 View commit details
    Browse the repository at this point in the history

Commits on May 27, 2024

  1. Configuration menu
    Copy the full SHA
    cea9304 View commit details
    Browse the repository at this point in the history
  2. Bundle validation only

    yihua committed May 27, 2024
    Configuration menu
    Copy the full SHA
    4cfcded View commit details
    Browse the repository at this point in the history
  3. Fix ci_run.sh

    yihua committed May 27, 2024
    Configuration menu
    Copy the full SHA
    e34c692 View commit details
    Browse the repository at this point in the history