Skip to content

Conversation

@jerry-024
Copy link
Contributor

@jerry-024 jerry-024 commented Nov 11, 2025

Purpose

format table: insert data after commit sync partitions to hms

  • user could add conf format-table.commit-hive-sync-url to support sync partition to hms.

Tests

API and Format

Documentation

@jerry-024 jerry-024 marked this pull request as draft November 11, 2025 10:00
jerry-024 and others added 8 commits November 12, 2025 11:05
…it_sync_hms

* upstream/master:
  [core] format table: fix partition key not in the end write error (apache#6595)
  [core] Async snapshot expire fallback to sync mode in batch commit (apache#6593)
  [spark] Enable dynamic options for format table (apache#6592)
  [arrow] Fix that complex writers didn't reset inner writer state (apache#6591)
@jerry-024 jerry-024 marked this pull request as ready for review November 13, 2025 09:35
.stringType()
.noDefaultValue()
.withDescription("Format table commit sync partition hive uri.");
public static final ConfigOption<String> FORMAT_TABLE_COMMIT_SYNC_PARTITION_HIVE_WAREHOUSE =
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

warehouse is useless, remove it.

<td>Whether to force the use of lookup for compaction.</td>
</tr>
<tr>
<td><h5>format-table.commit.sync-partition-hive.uri</h5></td>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

format-table.commit-hive-sync-url.


if (options.type() == TableType.FORMAT_TABLE) {
return toFormatTable(identifier, schema, dataFileIO);
return toFormatTable(identifier, schema, dataFileIO, catalog.catalogContext());
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Already has parameter catalogContext...

Copy link
Contributor

@JingsongLi JingsongLi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you also add a test for this? In Spark FormatTableTestBase.

@jerry-024 jerry-024 force-pushed the format_table_commit_sync_hms branch from ccddcf3 to 834a0e0 Compare November 17, 2025 03:40
@jerry-024 jerry-024 force-pushed the format_table_commit_sync_hms branch from e7705fc to 6bf2b94 Compare November 17, 2025 05:41
…it_sync_hms

* upstream/master:
  [core] Fix the serialization of JindoMultiPartUploadCommitter (apache#6613)
  [core] Postpone mode should not mark done when still has compaction (apache#6561)
  [spark] Wrap the corresponding table to ensure it guarantees its capabilities (apache#6607)
@jerry-024 jerry-024 force-pushed the format_table_commit_sync_hms branch from 60faa2f to 42f83f5 Compare November 17, 2025 07:55
Copy link
Contributor

@JingsongLi JingsongLi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1

@JingsongLi JingsongLi merged commit 57d1ac5 into apache:master Nov 17, 2025
23 of 24 checks passed
@jerry-024 jerry-024 deleted the format_table_commit_sync_hms branch November 17, 2025 09:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants