Skip to content

[SPARK-47995][PYTHON][INFRA][TESTS] Upgrade pyarrow to 17.0.0 in GitHub Action CI #46232

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 1 commit into from

Conversation

dongjoon-hyun
Copy link
Member

@dongjoon-hyun dongjoon-hyun commented Apr 25, 2024

What changes were proposed in this pull request?

This PR aims to use pyarrow 17.0.0 in GitHub Action CI for Apache Spark 4.0.0.

Why are the changes needed?

Our CI is still using 15.0.2.

pyarrow                  15.0.2

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Pass the CIs.

Was this patch authored or co-authored using generative AI tooling?

No.

@github-actions github-actions bot added the BUILD label Apr 25, 2024
@dongjoon-hyun dongjoon-hyun marked this pull request as draft April 25, 2024 17:26
@dongjoon-hyun
Copy link
Member Author

dongjoon-hyun commented Apr 25, 2024

This is blocked by mlflow currently.

mlflow 2.12.1 requires pyarrow<16,>=4.0.0, but you have pyarrow 16.0.0 which is incompatible.

@dongjoon-hyun
Copy link
Member Author

This is still blocked by mlflow 2.12.2

mlflow 2.12.2 requires pyarrow<16,>=4.0.0, but you have pyarrow 16.0.0 which is incompatible.

@dongjoon-hyun
Copy link
Member Author

This is still blocked by mlflow.

mlflow 2.14.1 requires pyarrow<16,>=4.0.0, but you have pyarrow 16.0.0 which is incompatible.

@dongjoon-hyun
Copy link
Member Author

We need to use 17.0.0 due to SPARK-48940

@dongjoon-hyun
Copy link
Member Author

For the record, MLFlow 2.15.0 was released on July 29th, but the requirement is the same. We cannot use pyarrow 16.0.0 and 17.0.0.

mlflow 2.15.0 requires pyarrow<16,>=4.0.0, but you have pyarrow 16.0.0 which is incompatible.

@dongjoon-hyun dongjoon-hyun changed the title [SPARK-47995][PYTHON][INFRA][TESTS] Upgrade pyarrow to 16.0.0 in GitHub Action CI [SPARK-47995][PYTHON][INFRA][TESTS] Upgrade pyarrow to 17.0.0 in GitHub Action CI Aug 4, 2024
IvanK-db pushed a commit to IvanK-db/spark that referenced this pull request Sep 20, 2024
### What changes were proposed in this pull request?
Refresh testing image for pyarrow 17

### Why are the changes needed?
currently the cached `pyarrow==15.0.2` is used in [CI](https://github.com/apache/spark/actions/runs/10674534002/job/29585233434), we need to test Spark with latest pyarrow

### Does this PR introduce _any_ user-facing change?
No, infra only

### How was this patch tested?
updated ci

### Was this patch authored or co-authored using generative AI tooling?
no

Closes apache#46232

Closes apache#47965 from zhengruifeng/infra_refresh_test_doc.

Authored-by: Ruifeng Zheng <[email protected]>
Signed-off-by: Dongjoon Hyun <[email protected]>
attilapiros pushed a commit to attilapiros/spark that referenced this pull request Oct 4, 2024
### What changes were proposed in this pull request?
Refresh testing image for pyarrow 17

### Why are the changes needed?
currently the cached `pyarrow==15.0.2` is used in [CI](https://github.com/apache/spark/actions/runs/10674534002/job/29585233434), we need to test Spark with latest pyarrow

### Does this PR introduce _any_ user-facing change?
No, infra only

### How was this patch tested?
updated ci

### Was this patch authored or co-authored using generative AI tooling?
no

Closes apache#46232

Closes apache#47965 from zhengruifeng/infra_refresh_test_doc.

Authored-by: Ruifeng Zheng <[email protected]>
Signed-off-by: Dongjoon Hyun <[email protected]>
himadripal pushed a commit to himadripal/spark that referenced this pull request Oct 19, 2024
### What changes were proposed in this pull request?
Refresh testing image for pyarrow 17

### Why are the changes needed?
currently the cached `pyarrow==15.0.2` is used in [CI](https://github.com/apache/spark/actions/runs/10674534002/job/29585233434), we need to test Spark with latest pyarrow

### Does this PR introduce _any_ user-facing change?
No, infra only

### How was this patch tested?
updated ci

### Was this patch authored or co-authored using generative AI tooling?
no

Closes apache#46232

Closes apache#47965 from zhengruifeng/infra_refresh_test_doc.

Authored-by: Ruifeng Zheng <[email protected]>
Signed-off-by: Dongjoon Hyun <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant