Semantic Data Engineer | Digital Operations en BETA Technologies
BETA Technologies · South Burlington, Estados Unidos De América · Hybrid
- Oficina en South Burlington
How you will contribute to revolutionizing electric aviation:
- Design and build scalable data platform capabilities that empower domain teams through extensible frameworks and standards, enabling them to manage their data assets within a cohesive semantic architecture spanning the enterprise
- Implement robust data integration solutions and unified namespace patterns that create a digital thread connecting real-time and historical information across the entire product lifecycle—from design through manufacturing to field operations
- Build optimized data storage architectures and metadata management systems including data lakes, warehouses, knowledge graphs, and enterprise catalogs that make all organizational knowledge discoverable and actionable
- Develop semantic layers and ontologies that capture domain expertise and business logic, making implicit organizational knowledge explicit and machine-readable in formats that both humans and AI systems can leverage, while preserving traceability across the product lifecycle
- Create unified data access services that expose the digital thread to stakeholders across the organization, breaking down silos between engineering, manufacturing, operations, and business teams
- Implement data quality frameworks and lineage solutions that ensure accuracy across diverse data types while illuminating information flow through complex systems—essential for aerospace compliance, continuous improvement, and establishing the trusted data foundation required for advanced analytics and AI
- Partner with cross-functional teams to strengthen the digital thread through DataOps practices and champion data governance standards that keep our diverse data assets trusted, accessible, and valuable
- Actively contribute to a collaborative team environment through code reviews, knowledge sharing, and constructive feedback that continuously improves our data solutions and maintains high engineering standards
Minimum Qualifications:
- Bachelor's degree in Computer Science, Information Systems, Data Science, or related technical field (or equivalent practical experience)
- 3+ years of professional experience in data engineering or related field
- Hands-on experience with semantic technologies, knowledge graphs, or linked data systems
- Experience designing and implementing ontologies, taxonomies, or semantic data models
- Proficiency with modern data engineering tools and languages (e.g., Python, SQL)
- Experience with graph databases (e.g., Neo4j, Amazon Neptune, Stardog) and/or triple stores
- Strong understanding of data integration patterns and API design
- Excellent communication skills with ability to translate technical concepts for diverse audiences
- Experience with cloud platforms (preferably AWS)
- Experience with DataOps practices and modern data platform architectures (cloud and hybrid)
- Understanding of DevOps practices including CI/CD, version control (Git), and infrastructure as code
- Ability to work collaboratively in a fast-paced environment
- Exceptional troubleshooting skills with the ability to spot issues before they become problems
Above and Beyond Qualifications:
- Advanced degree in Computer Science or related field
- 5+ years of experience specifically working with semantic technologies in production environments
- Expert-level knowledge of semantic web standards (RDF, OWL, SPARQL, SHACL)
- Experience building and scaling knowledge graphs in enterprise settings
- Proficiency with additional programming languages (e.g., Scala, Java) and distributed computing frameworks (Spark, Flink)
- Hands-on experience with metadata management platforms and enterprise data catalogs
- Track record of implementing data lineage and data quality solutions at scale
- Experience working in regulated industries or with complex data governance frameworks
- Familiarity with aerospace/manufacturing data standards
- Knowledge of IoT protocols and time-series data management
- Publications, presentations, or open-source contributions in semantic technologies
- Familiarity with aviation regulations and standards (DO-178, DO-254, DO-330 etc.) as they relate to data systems