Editorial Summary:

Google promotes Dataflow as one of the main components of the big data architecture on GCP. This serverless solution can extract data from open sources and is native to the Google Cloud Platform, which enables rapid implementation and integration. Dataflow also suits ETL solutions because it provides building blocks for operational data stores and data warehouses, data filtering and enrichment pipelines, a PII de-identification pipeline, functions for detecting anomalies in financial transactions, and log export to external systems.
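To make the filtering, enrichment, and PII de-identification stages mentioned above more concrete, here is a minimal sketch in plain Python. It does not use the Dataflow or Apache Beam API; it only shows the shape of such a pipeline as chained stages. The record fields, the `COUNTRY_NAMES` lookup table, and all function names are hypothetical and chosen for illustration. Hashing an email address is pseudonymization only and is used here as a stand-in for a real de-identification step.

```python
import hashlib
from typing import Dict, Iterable, Iterator, List

Record = Dict[str, object]

# Hypothetical lookup table used for enrichment (not from the source article).
COUNTRY_NAMES = {"DE": "Germany", "IN": "India"}


def filter_valid(records: Iterable[Record]) -> Iterator[Record]:
    """Filtering stage: keep only records with a positive amount."""
    for record in records:
        if record.get("amount", 0) > 0:
            yield record


def enrich(records: Iterable[Record]) -> Iterator[Record]:
    """Enrichment stage: add a readable country name to each record."""
    for record in records:
        name = COUNTRY_NAMES.get(record["country"], "unknown")
        yield {**record, "country_name": name}


def deidentify(records: Iterable[Record]) -> Iterator[Record]:
    """De-identification stage: replace the raw email with a SHA-256 digest."""
    for record in records:
        digest = hashlib.sha256(str(record["email"]).encode("utf-8")).hexdigest()
        cleaned = {k: v for k, v in record.items() if k != "email"}
        cleaned["email_hash"] = digest
        yield cleaned


def run_pipeline(records: Iterable[Record]) -> List[Record]:
    """Chain the three stages in order: filter, enrich, de-identify."""
    return list(deidentify(enrich(filter_valid(records))))


# Hypothetical sample input.
SAMPLE = [
    {"email": "a@example.com", "country": "DE", "amount": 12.5},
    {"email": "b@example.com", "country": "XX", "amount": 0},
    {"email": "c@example.com", "country": "IN", "amount": 3.0},
]

if __name__ == "__main__":
    for row in run_pipeline(SAMPLE):
        print(row)
```

In an Apache Beam pipeline, per-record stages like these would typically be expressed with transforms such as `beam.Filter` and `beam.Map`, and Dataflow would run them as a managed job.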

Key Highlights:

  • Dataflow is one of the main components of the big data architecture on GCP.
  • Dataflow, which runs Apache Beam pipelines, is a serverless solution that is native to the Google Cloud Platform.

The editorial is based on the content sourced from www.analyticsvidhya.com

