Google Cloud Platform Administration

Big data

The following are the big data services:

  • BigQuery: An enterprise data warehouse that lets you store massive datasets and query them quickly with SQL, running on Google's underlying infrastructure.
  • Cloud Dataflow: A fully managed service for both batch and real-time stream data processing. The service also integrates with Stackdriver, Google's unified logging and monitoring solution, letting you monitor and troubleshoot issues as they happen.
  • Cloud Dataproc: A fully managed cloud service for running Apache Spark and Apache Hadoop clusters.
  • Cloud Datalab: An interactive tool for exploring and visualizing large datasets.
  • Cloud Dataprep: A service for visually exploring and cleaning structured and unstructured data in preparation for analysis.
  • Cloud Pub/Sub: A service built for stream analytics that allows you to publish and subscribe to data streams for big data analysis.
  • Google Genomics: A service that allows you to query the genomic information of large research projects.
  • Google Data Studio: A service that allows you to turn your data into informative dashboards.
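
The publish/subscribe model behind Cloud Pub/Sub can be illustrated with a small in-memory sketch. This is not the Cloud Pub/Sub client library: the InMemoryBroker class, its methods, and the "clickstream" topic name are hypothetical and exist only to show how publishers and subscribers are decoupled through a named topic.

```python
from collections import defaultdict
from typing import Callable, DefaultDict, List


class InMemoryBroker:
    """Toy broker for illustration only (hypothetical, not the Cloud Pub/Sub API)."""

    def __init__(self) -> None:
        # Maps a topic name to the callbacks subscribed to it.
        self._subscribers: DefaultDict[str, List[Callable[[str], None]]] = defaultdict(list)

    def subscribe(self, topic: str, callback: Callable[[str], None]) -> None:
        """Register a callback that receives every message published to the topic."""
        self._subscribers[topic].append(callback)

    def publish(self, topic: str, message: str) -> int:
        """Deliver the message to each subscriber and return the delivery count."""
        callbacks = self._subscribers.get(topic, [])
        for callback in callbacks:
            callback(message)
        return len(callbacks)


broker = InMemoryBroker()
received: List[str] = []

# Two independent subscribers listen on the same (hypothetical) topic.
broker.subscribe("clickstream", received.append)
broker.subscribe("clickstream", lambda m: received.append(m.upper()))

# The publisher only knows the topic name, not who is listening.
delivered = broker.publish("clickstream", "page_view")
```

The publisher never references its subscribers directly, which is the property that lets a managed service fan one data stream out to several analysis pipelines.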

We will look at each of these services in greater detail in the following chapters.