Skills & Experience
· 4+ years Hands on and deep experience working with Google Data Products (e.g. BigQuery, Cloud Composer, Dataflow, Dataproc, Cloud Data Fusion, Dataprep, etc.)
· 4+ years Hands on experience in Python
· Experience in distributed data processing using PySpark (DataProc) and real-time data processing using Kafka(PubSub)
· Experience in Metadata Management, Data Quality, and Data Lineage tools.
· Data Engineering and Lifecycle (including non-functional requirements and operations) management
· Solution Design skills – Prototyping, Usability testing, and data visualization literacy
· Experience with Complex SQL based data processing (using Stored Procs) and NoSQL databases
· Understanding of shell scripting in UNIX and GNU/Linux systems
· Knowledge of data warehousing and data modeling
· Knowledge of how to maintain ETLs operating on a variety of structured and unstructured sources
· Experience working in Agile Software Development Team
· Understanding of working with a Source Code Management system (Git)
Qualifications
· Bachelor’s degree or global equivalent in a Computer Science/Software Engineering/IT/Data Management
· 5+ years of relevant experience in working across data/IT projects, IT application details
· Preference to GCP Certified Data Engineers
Behaviors
Ability to perform under pressure, handling interruptions and changes without losing productivity.
Meet deadlines and demonstrate problem-solving skills
Strong adherence to the collection and reuse of best practices.