switch to apache kafka #10178
iennae
left a comment
Thanks for proposing this change. It doesn't look like Bitnami offers a secure Kafka image in its new bitnamisecure repository, so using the official Apache release looks like the right path forward.
I've got two main points of feedback, with the second being critical for ensuring the sample works as intended.
Image Transition & Configuration
I totally understand that the Kafka setup is secondary to the Dataflow portion, but I'm wondering how thoroughly you've verified the effects of swapping out a "polished" image (Bitnami) for an "upstream" one (Apache):
Documentation Context: Could you please add a brief note to the README explaining why we are moving from bitnami/kafka to apache/kafka? This helps future contributors (including sample reviewers) and users understand the dependency choice.
Default Configuration: The Bitnami image often included built-in security defaults and environment variables. Have you confirmed that the basic apache/kafka image is a drop-in functional equivalent for this simple setup, or whether any additional variables are needed to maintain security/stability? We might want to add a note at the top stating that this sample shows a local development setup, not a production-quality Kafka configuration.
I believe that apache/kafka:latest currently points to the 4.x.x stream (https://hub.docker.com/r/apache/kafka), which runs Kafka without ZooKeeper. The tutorial documentation still contains steps and infrastructure setup (a firewall rule for port 2181) that assume a ZooKeeper-based Kafka instance, which the original bitnami/kafka:3.4.0 was. If we use
I think this means either refactoring the instructions or pinning the image to a specific, recent 3.x.x
@derrickaw any updates?
Hi @iennae,

> Documentation Context: Could you please add a brief note to the README explaining why we are moving from bitnami/kafka to apache/kafka? This helps future contributors (including sample reviewers) and users understand the dependency choice.

Will add.

> Default Configuration: The Bitnami image often included built-in security defaults and environment variables. Have you confirmed that the basic apache/kafka image provides a drop-in functional equivalent for this simple setup, or if any additional variables are needed to maintain security/stability? We might want to add a note at the top around this sample showcasing how to do this for local development and not showing production-quality kafka configuration.

I have run this using the same procedure as the previous ones, and it worked.

> I think this means either refactoring the instructions or pin the image to a specific, recent 3.x.x

Let me investigate this some more. Thanks!
Description
Rerunning these commands failed because the Bitnami image tags were removed from Docker Hub (https://hub.docker.com/r/bitnami/kafka). The images have been moved to the bitnamilegacy repository, and no new tags have been published yet, so this change uses the Apache image instead.
Note: Before submitting a pull request, please open an issue for discussion if you are not associated with Google.
Checklist
- pom.xml parent set to latest shared-configuration
- mvn clean verify (required)
- mvn -P lint checkstyle:check (required)
- mvn -P lint clean compile pmd:cpd-check spotbugs:check (advisory only)