Editorial Summary :
Google promotes Dataflow as one of the main components of the big data architecture on GCP . With the ability to extract data from open sources, this serverless solution is native the to Google Cloud Platform, enabling rapid implementation and integration . Dataflow can also run in ETL solution because it has: building blocks for operational data stores and data warehouses; data filtering and enrichment pipelines; PII de-identification pipeline; function for detecting anomalies in financial transactions; and export logs to external systems .
Key Highlights :
- Dataflow is one of the main components of the big data architecture on GCP .
- Apache beam is a serverless solution that is native the to Google Cloud Platform .
The editorial is based on the content sourced from www.analyticsvidhya.com