Settings

Theme

Apache Beam for Search: Getting Started by Hacking Time

shopify.engineering

87 points by clandry94 5 years ago · 8 comments

Reader

runT1ME 5 years ago

It would be helpful if this thorough of an example could live on the apache beam website, I think it would avoid a lot of confusion. I certainly found the way it handles windowing with triggers to be quite diffrent than say, Spark.

  • softwaredoug 5 years ago

    Yeah I agree. I basically wrote this blog out of my challenges learning this content. It involved a lot of code spellunking and trial and error to figure out precisely what these concepts meant. I do find Beam powerful, but also to be a bit esoteric at times and difficult to follow how watermarks, windows, and triggers all work. And we encounter sometimes unexpected behavior that frequently causes us to revise our understanding of these concepts.

    There's a fair amount of Stackoverflow highly voted answers out there like "I dunno, try this trigger, see if it works" without much understanding of how everything work underneath. Probably cause it's tricky to grok

  • elric 5 years ago

    This seems to be a general problem with many projects under the Apache Software Foundation. You look at the landing page and end up not figuring out what the project is or what it's for. This seems to be especially true with projects in the whole "big data"/"stream processing" sphere.

kordlessagain 5 years ago

The link for the Apache Beam project has the hostname and domain transposed. The correct URL is: https://beam.apache.org/

waynesonfire 5 years ago

does it support data lineage?

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection