Firmenlogo

Kira Studio  ·  nan, · Hybrid

Jetzt bewerben

About the job

We are seeking a skilled Data/Scraping Engineer to join our data-driven team. As a key member of our organization, you will play a crucial role in collecting and processing data from diverse sources, ensuring its compatibility with our cutting-edge Large Language Models (LLMs). Your expertise in data acquisition, cleaning, and transformation will be instrumental in fueling our AI-powered solutions.

What can we expect from you?

Responsibilities

  • Data AcquisitionDesign and implement efficient data scraping pipelines to extract data from a wide range of sources, including websites, APIs, and databases.
  • Navigate complex data structures and handle various data formats, such as JSON, XML, and CSV.
  • Data Cleaning and PreprocessingDevelop robust data cleaning processes to ensure data quality, consistency, and integrity.
  • Apply advanced data preprocessing techniques to handle missing values, outliers, and inconsistencies in the collected data.
  • Data TransformationTransform and structure the cleaned data into formats compatible with our LLMs, such as text, numerical, and categorical features.
  • Optimize data representations to enhance the performance and accuracy of our AI models.
  • Data IntegrationIntegrate the processed data into our existing data pipelines and storage systems, ensuring seamless data flow and accessibility.
  • Collaborate with cross-functional teams to align data requirements and facilitate data-driven decision-making.
  • Data Monitoring and MaintenanceContinuously monitor the data scraping pipelines to ensure data freshness, reliability, and scalability.
  • Proactively identify and resolve data-related issues, such as data drift, inconsistencies, and performance bottlenecks.

Qualifications

  • Educational BackgroundA bachelor's degree in Computer Science, Data Science, or a related field. Advanced degrees are a plus.
  • Data Scraping ExpertiseExtensive experience in web scraping techniques, including using libraries like BeautifulSoup, Scrapy, and Selenium.
  • Proficiency in handling dynamic web pages, authentication, and anti-scraping mechanisms.
  • Data Preprocessing SkillsStrong knowledge of data cleaning, normalization, and feature engineering techniques.
  • Familiarity with data preprocessing libraries like Pandas, NumPy, and scikit-learn.
  • Programming ProficiencyExcellent programming skills in Python, with experience in data manipulation and analysis.
  • Familiarity with SQL and NoSQL databases for data storage and retrieval.
  • Problem-Solving AbilitiesStrong analytical and problem-solving skills, with the ability to tackle complex data challenges.
  • Attention to detail and a meticulous approach to ensuring data quality and integrity.
  • Communication and CollaborationExcellent communication skills to effectively collaborate with cross-functional teams and stakeholders.
  • Ability to translate technical concepts and data requirements to non-technical audiences.

What can you expect from us?

  • Cutting-Edge TechnologiesWork with state-of-the-art LLMs and AI technologies to drive innovation and solve complex problems.
  • Data-Driven CultureBe part of a data-driven organization that values evidence-based decision-making and continuous improvement.
  • Collaborative EnvironmentCollaborate with a diverse team of experts, including data scientists, AI researchers, and domain specialists.
  • Professional GrowthEnjoy opportunities for learning and professional development through training programs, conferences, and mentorship.
  • Impactful WorkContribute to projects that have a significant impact on our organization's success and drive advancements in AI and data-driven solutions.
  • Competitive CompensationReceive a competitive salary and comprehensive benefits package, reflecting the value you bring to our team.

If you are passionate about data, have a strong background in data scraping and preprocessing, and are excited about enabling cutting-edge AI solutions, we would love to have you on our team. Join us in our mission to harness the power of data and drive innovation forward. Apply now and embark on a rewarding career as a Data/Scraping Engineer!

Powered by Webbtree
Jetzt bewerben

Weitere Jobs