[Feature][SDK] Add Parquet formatted data source for Transform #11134

Open
2 tasks done
Zkplo opened this issue Sep 17, 2024 · 0 comments

Comments

Contributor

Zkplo commented Sep 17, 2024

Description

Parquet is a columnar storage file format designed for efficient processing of large-scale data, and it is particularly well suited to analytical workloads. It reduces storage size through compression and encoding, and it supports complex data structures. It is widely used in big data frameworks such as Hadoop, Spark, and Hive, where its read performance and storage savings have made it a common format. This issue proposes adding Parquet as a supported data source format for Transform in the SDK.
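To illustrate why a columnar layout tends to compress well, here is a toy sketch in plain Python. It is not Parquet and does not use any Parquet library. The sample records and the helper names (`to_columns`, `run_length_encode`, `run_length_decode`) are assumptions made up for this illustration, and Parquet's real encodings are considerably more elaborate.

```python
from itertools import groupby

# Hypothetical sample records; the field names are illustrative only.
rows = [
    {"country": "DE", "status": "ok", "latency_ms": 12},
    {"country": "DE", "status": "ok", "latency_ms": 15},
    {"country": "DE", "status": "error", "latency_ms": 90},
    {"country": "FR", "status": "ok", "latency_ms": 11},
    {"country": "FR", "status": "ok", "latency_ms": 14},
]


def to_columns(records):
    """Pivot row-oriented records into one list of values per field."""
    return {name: [record[name] for record in records] for name in records[0]}


def run_length_encode(values):
    """Collapse consecutive repeated values into (value, count) pairs."""
    return [(value, len(list(group))) for value, group in groupby(values)]


def run_length_decode(pairs):
    """Expand (value, count) pairs back into the original list of values."""
    return [value for value, count in pairs for _ in range(count)]


columns = to_columns(rows)

# Values of the same field sit next to each other in a column, so
# repeated values form runs that a simple encoding can shrink.
encoded_country = run_length_encode(columns["country"])
print(encoded_country)
```

Storing each field contiguously also means a query that only needs `latency_ms` can read that one column and skip the others, which is the main reason the format suits analytical workloads.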

Use case

No response

Are you willing to submit PR?

  • Yes, I am willing to submit a PR!

Code of Conduct
