Google, Cloudera Bring Cloud Dataflow to Spark

Google, Cloudera Bring Cloud Dataflow to Spark

Google and Cloudera have partnered together on a project that will bring Google’s Cloud Dataflow programming model to Apache?s Spark data processing engine. Dataflow arose out of Google’s own internal big data processing efforts and it utilizes Google’s Compute Engine, Cloud Storage and BigQuery cloud computing services. Spark is an Apache project for very fast big data processing.

The two companies have released a “runner” that connects Dataflow to Spark. However, enterprises should note that the tool is still an alpha release and is not ready for production deployment.

View article

Share the Post:
data observability

Data Observability Explained

Data is the lifeblood of any successful business, as it is the driving force behind critical decision-making, insight generation, and strategic development. However, due to its intricate nature, ensuring the

Heading photo, Metadata.

What is Metadata?

What is metadata? Well, It’s an odd concept to wrap your head around. Metadata is essentially the secondary layer of data that tracks details about the “regular” data. The regular

XDR solutions

The Benefits of Using XDR Solutions

Cybercriminals constantly adapt their strategies, developing newer, more powerful, and intelligent ways to attack your network. Since security professionals must innovate as well, more conventional endpoint detection solutions have evolved