Zhixue
data engineer

Data engineer well-acquainted with SQL, Python, linux, Batch data pipeline, Streaming data pipeline, MySQL, BigQuery, Tableau, Spark, Dataflow,
Airflow, DataFusion, Google Cloud Platform, Cloud SQL, Google cloud Storage, Pandas, Numpy, Matplotlib, Plotly @Linkedin
brantdzx@gmail.com

Batch Data pipeline on Google Cloud with Dataflow and Airflow

I create a ETL batch data pipeline on Google Cloud Platform by using Dataflow and Composer (Airflow). This pipeline can periodicly download updated covid-19 cases data in EU from website to Google Cloud Storage (Datalake), perform data cleaning and transform , and load data to Bigquery (data warehouse) for analysis and data visualization (Tableau).