We are looking for a skilled and motivated Senior Data Engineer with strong experience in Python programming and Google Cloud Platform (GCP) to join our data engineering team. The ideal candidate will be responsible for designing, developing, and maintaining robust and scalable ETL (Extract, Transform, Load) data pipelines. The role involves working with various GCP services, implementing data ingestion and transformation logic, and ensuring data quality and consistency across systems.
Experience Level: 7 to 10 years of relevant IT experience
Key Responsibilities:
Design, develop, test, and maintain scalable ETL data pipelines using Python.
Work extensively with Google Cloud Platform (GCP) services such as:
Dataflow for real-time and batch data processing
Cloud Functions for lightweight serverless compute
BigQuery for data warehousing and analytics
Cloud Composer for orchestration of data workflows (based on Apache Airflow)
Google Cloud Storage (GCS) for managing data at scale
IAM for access control and security
Cloud Run for containerized applications
Should have experience in the following areas:
API framework: Python FastAPI
Processing engine: Apache Spark
Messaging and streaming data processing: Kafka
Storage: MongoDB, Redis/Bigtable
Orchestration: Airflow
Perform data ingestion from various sources and apply transformation and cleansing logic to ensure high-quality data delivery.
Implement and enforce data quality checks, validation rules, and monitoring.
Collaborate with data scientists, analysts, and other engineering teams to understand data needs and deliver efficient data solutions.
Manage version control using GitHub and participate in CI/CD pipeline deployments for data projects.
Write complex SQL queries for data extraction and validation from relational databases such as SQL Server, Oracle, or PostgreSQL.
Document pipeline designs, data flow diagrams, and operational support procedures.
Design, build, and maintain large-scale data ingestion and ETL pipelines.
Manage Pub/Sub-based catalog feeds, Vertex AI embedding generation, BigQuery analytics workflows, and PP5 Knowledge Graph data pipelines.
Required Skills:
7–10 years of hands-on experience in Python for backend or data engineering projects.
Strong understanding of and working experience with GCP services (especially Dataflow, BigQuery, Cloud Functions, and Cloud Composer).
Solid understanding of data pipeline architecture, data integration, and transformation techniques.
Experience in working with version control systems like GitHub and knowledge of CI/CD practices.
Experience with Apache Spark, Kafka, Redis, FastAPI, Airflow, and Cloud Composer DAGs.
Strong experience in SQL with at least one enterprise database (SQL Server, Oracle, PostgreSQL, etc.).
Experience in data migrations from on-premises data sources to cloud platforms.
Good to Have (Optional Skills):
Experience working with the Snowflake cloud data platform.
Experience deploying to GKE or Cloud Run.
Hands-on knowledge of Databricks for big data processing and analytics.
Familiarity with Azure Data Factory (ADF) and other Azure data engineering tools.
Additional Details:
Excellent problem-solving and analytical skills.
Strong communication skills and ability to collaborate in a team environment.
Education:
Bachelor's degree in Computer Science, a related field, or equivalent experience.