Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Does this connector work with TF 2.x? #177

Open
dgoldenberg-audiomack opened this issue Dec 19, 2020 · 4 comments
Open

Does this connector work with TF 2.x? #177

dgoldenberg-audiomack opened this issue Dec 19, 2020 · 4 comments

Comments

@dgoldenberg-audiomack
Copy link

The latest TF right now is 2.4.0.
The latest connector on maven central is 1.15.0 published on Oct 23, 2019.

If I build the connector with the instructions from here: https://github.com/tensorflow/ecosystem/tree/master/spark/spark-tensorflow-connector, will it work?

@Echo9573
Copy link

Echo9573 commented Jan 6, 2021

Hi @dgoldenberg-audiomack, when I build the connector in TF 2.0.0, I got fails in the build test stage ( link ), how about you?

@dgoldenberg-audiomack
Copy link
Author

Hey @Echo9573, I think your error is:

/bin/sh: 1: java: not found

judging by the output. However you can try running Maven with -e and -X to get more info. It seems like either you don't have Java installed or it's not on your PATH.

@dgoldenberg-audiomack
Copy link
Author

I think the connector works; would be great if committers verified and added a blurb to the docs.

@jukujala
Copy link

I tried to follow the instructions with TF 2.2 and Spark 3.0.1, but installing ecosystem/hadoop has a missing dependency to org.tensorflow:proto:jar:2.2.0. The error message: Could not resolve dependencies for project org.tensorflow:tensorflow-hadoop:jar:2.2.0: Could not find artifact org.tensorflow:proto:jar:2.2.0 in central (https://repo.maven.apache.org/maven2)

Instructions pass if I manually set org.tensorflow:proto to TF version 1.15.0. However, I'm unsure what is the impact of using an old version of org.tensorflow:proto.

For the build I used dataproc preview-debian10 master image and these commands:

cd ../../hadoop
mvn versions:set -DnewVersion=2.2.0
mvn clean install
cd ../spark/spark-tensorflow-connector
mvn versions:set -DnewVersion=2.2.0
mvn clean install -Dspark.version=3.0.1

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants