Cellares is seeking an innovative and highly motivated Senior Data Quality Engineer who will contribute to the development of our advanced cell therapy manufacturing platform.
The primary focus of this position is to ensure the accuracy, reliability, and integrity of data within our data platform. The individual will participate on a cross-functional team, design, build, and maintain automated testing frameworks to ensure data integrity at every stage of our data pipelines. The successful candidate should have extensive experience in quality assurance for data platforms, ideally with significant hands-on experience in the Databricks environment. They should be detail-oriented and possess strong analytical and problem-solving skills.
Candidates should enjoy working in a fast-paced, mission-driven environment, and be prepared to tackle a broad selection of challenges as the company grows. Candidates should be great team players with the ability to work with minimal supervision.
Cellares is seeking an innovative and highly motivated Senior Data Quality Engineer who will contribute to the development of our advanced cell therapy manufacturing platform.The primary focus of this position is to ensure the accuracy, reliability, and integrity of data within our data platform. The individual will participate on a cross-functional team, design, build, and maintain automated testing frameworks to ensure data integrity at every stage of our data pipelines. The successful candidate should have extensive experience in quality assurance for data platforms, ideally with significant hands-on experience in the Databricks environment. They should be detail-oriented and possess strong analytical and problem-solving skills.Candidates should enjoy working in a fast-paced, mission-driven environment, and be prepared to tackle a broad selection of challenges as the company grows. Candidates should be great team players with the ability to work with minimal supervision.
Cellares total compensation package contains competitive base salaries, highly subsidized Medical, Dental, and Vision Plans, 401(k) Matching, Free EV Charging, Onsite lunches, and Stock options. All displayed pay ranges are approximate, negotiable, and location dependent.
Responsibilities
Build and maintain automated data validation tests using Databricks notebooks and tools like Pytest
Test data ingestion, transformation, and loading processes within the Databricks Lakehouse, specifically focusing on the Bronze, Silver, and Gold layers of the Medallion architecture
Implement tests for data accuracy, completeness, consistency, timeliness, and uniqueness at different points in the pipeline to catch data issues early
Reconcile data by comparing record counts, schemas, and values between source systems and target tables in Databricks
Implement automated data quality checks within data pipelines to ensure no data regressions occur with new code deployments
Implement automated monitoring and alerting for data quality metrics, identifying anomalies in data freshness, schema evolution, and volume
Work closely with data engineers and product owners to understand data requirements and ensure data quality meets business needs
Ensure compliance with data governance policies by building quality checks that validate data sensitivity, masking, and lineage, leveraging tools like Unity Catalog
Communicate project status and new discoveries in a clear and timely manner during daily stand-ups
Requirements
Bachelor’s or Master’s in Computer Science, Electrical Engineering, or related field and 5+ years of relevant experience
Experience with data pipeline and data quality testing strategy and execution, with significant hands-on experience in the Databricks environment
Strong proficiency in Python for developing and executing data validation scripts
In-depth knowledge of Databricks, Delta Lake, and the Lakehouse architecture. Proficiency in writing complex SQL queries for data validation, reconciliation, and troubleshooting issues
Solid understanding of data warehousing concepts, including dimensional modeling (star/snowflake schemas)
Hands-on experience with Azure, including Azure storage and data services that integrate with Databricks
Ability to process data, interpret testing results and provide feedback to the team
Desire to be part of a rapidly evolving organization, with compelling technology, and taking products and processes to the next level
Self-awareness, integrity, authenticity, and a growth/entrepreneurial mindset
This is Cellares
Cellares is the first Integrated Development and Manufacturing Organization (IDMO) and takes an Industry 4.0 approach to mass manufacturing the living drugs of the 21st century. The company is both developing and operating integrated technologies for cell therapy manufacturing to accelerate access to life-saving cell therapies. The company’s Cell Shuttle integrates all the technologies required for the entire manufacturing process in a flexible and high-throughput platform that delivers true walk-away, end-to-end automation. Cell Shuttles will be deployed in Cellares’ Smart Factories around the world to meet total patient demand for cell therapies at global scale. Partnering with Cellares enables academics, biotechs, and pharma companies to accelerate drug development and scale out manufacturing, lower process failure rates, lower manufacturing costs, and meet global patient demand.
The company is headquartered in South San Francisco, California with its commercial-scale IDMO Smart Factory in Bridgewater, New Jersey. The company is backed by world-class investors and has raised over $355 million in financing.
Leveling will be based on overall experience, education, and demonstration of knowledge throughout the interview process.
Diese Cookies sind für das Funktionieren der Website erforderlich und können in unseren Systemen nicht abgeschaltet werden. Sie können Ihren Browser so einstellen, dass er diese Cookies blockiert, aber dann könnten einige Teile der Website nicht funktionieren.
Sicherheit
Benutzererfahrung
Zielgruppenorientierte Cookies
Diese Cookies werden über unsere Website von unseren Werbepartnern gesetzt. Sie können von diesen Unternehmen verwendet werden, um ein Profil Ihrer Interessen zu erstellen und Ihnen an anderer Stelle relevante Werbung zu zeigen.
Google Analytics
Google Ads
Wir benutzen Cookies
🍪
Unsere Website verwendet Cookies und ähnliche Technologien, um Inhalte zu personalisieren, das Nutzererlebnis zu optimieren und Werbung zu indvidualisieren und auszuwerten. Indem Sie auf Okay klicken oder eine Option in den Cookie-Einstellungen aktivieren, stimmen Sie dem zu.
Die besten Remote-Jobs per E-Mail
Schliess dich über 5'000+ Personen an, die wöchentlich Benachrichtigungen über Remote-Jobs erhalten!