Escritório à distância e em casa - Empregos em Reino Unido

Remote Web Scraping Engineer

Legalist · United Kingdom · Remote

2024-11-04 13:00:00.0

Python HTML/CSS SQL Docker Container Platforms Kubernetes Container Platforms AWS DevOps Automation Tools RabbitMQ Cloud

Hybrid Volunteer Youth Mentor. (online) | Creative Active Lives CIC Volunteer Youth Mentor. (online) | Creative Active Lives CIC

Reach Volunteering · Hybrid

2024-11-04 13:00:00.0

Product Owner

Hybrid Nuclear Physics Expertise Sought for AI Training Nuclear Physics Expertise Sought for AI Training

Outlier · Hybrid

2024-11-04 13:00:00.0

AWS

Homeoffice Cryptoeconomics Researcher

Wormhole Foundation · United Kingdom · Remote

2024-11-04 13:00:00.0

Python SQL Microsoft Excel Blockchain

Hybrid Philosophy Expertise Sought for AI Training Philosophy Expertise Sought for AI Training

Outlier · Hybrid

2024-11-04 13:00:00.0

AWS

Hybrid Remote AI Writing Editor (Tier 1) Remote AI Writing Editor (Tier 1)

Outlier · Hybrid

2024-11-04 13:00:00.0

AWS

Remote Software Engineer with verification

Centrica · Remote

2024-11-04 13:00:00.0

Java Python TypeScript SQL NoSQL Kubernetes Container Platforms AWS Microsoft Excel Backend Software Engineer Product Owner

Remote Software Engineer (Mid level)

Flock · United Kingdom · Remote

2024-11-04 13:00:00.0

MySQL TypeScript SQL AWS Software Engineer

Remote Full Stack Product Engineer

zeroheight · United Kingdom · Remote

2024-11-04 13:00:00.0

JavaScript Ruby Slack Communication and Collaboration AWS Microsoft Excel Backend Software Engineer Cloud

Homeoffice Senior Backend Engineer

Happening · United Kingdom · Remote

2024-11-04 13:00:00.0

AWS Cloud

Homeoffice Senior Software Engineer (Product)

Phaidra · Remote

2024-11-04 13:00:00.0

Python JavaScript C# TypeScript SQL Docker Container Platforms Kubernetes Container Platforms Slack Communication and Collaboration DevOps Automation Tools Microsoft Excel Frontend Backend Software Engineer Cloud

Homeoffice Technical Product Manager

Ably · United Kingdom · Remote

2024-11-04 13:00:00.0

Swift DevOps Automation Tools Software Engineer Cloud

Homeoffice Senior Python Software Engineer - Web3, DeFi

Clearmatics · United Kingdom · Remote

2024-11-04 13:00:00.0

Python C++ SQL Microsoft Excel Backend Software Engineer Blockchain Solidity Web3

Remote Product Manager

Zencargo · United Kingdom · Remote

2024-11-04 13:00:00.0

Microsoft Excel Software Engineer

Homeoffice Analytics Engineer

Prolific · Remote

2024-11-04 13:00:00.0

SQL AWS Cloud

Remote Senior Software Engineer

Prolific · Remote

2024-11-04 13:00:00.0

Python JavaScript TypeScript SQL NoSQL AWS Vue.js Backend Software Engineer Cloud

Remote Senior Backend Engineer (AWS) with verification

Lumenalta · United Kingdom · Remote

2024-11-04 13:00:00.0

Python Kubernetes Container Platforms AWS DevOps Automation Tools Microsoft Excel Frontend Backend Cloud

Homeoffice Senior Product Manager

Tracsis PLC · United Kingdom · Remote

2024-11-04 13:00:00.0

Homeoffice Software Engineer

Bugcrowd · United Kingdom · Remote

2024-11-04 13:00:00.0

Python Swift AWS Software Engineer

Hybrid Volunteer Social Media Marketer | OneWig OneSmile Volunteer Social Media Marketer | OneWig OneSmile

Reach Volunteering · Hybrid

2024-11-04 13:00:00.0

Anterior Próximo

Remote Web Scraping Engineer

Legalist · United Kingdom · Remote

2024-11-04 13:00:00.0

About the job

Intro description:Legalist is an institutional alternative asset management firm. Founded in 2016 and incubated at Y Combinator, the firm uses data-driven technology to invest in credit assets at scale. We are always looking for talented people to join our team.Where You Come In:

Help to design and implement the architecture of a large-scale crawling system

Design, implement, and maintain various components of our data acquisition infrastructure (building new crawlers, maintain existing crawlers, data cleaners & loaders)

Work on developing tools to facilitate the scraping at scale, monitor the health of crawlers and ensure data quality of the scraped items.

Collaborate with our product and business teams to understand / anticipate requirements to strive for greater functionality and impact in our data gathering systems

What you'll be bringing to the team:

3+ Years experience with Python for data wrangling and cleaning

2+ Years experience with data crawling & scraping at scale (100+ spiders at least)

Productionized experience with Scrapy is mandatory. Distributed crawling and advanced scrapy experience are a plus.

Familiarity with scraping libraries and monitoring tools highly recommended (BeautifulSoup, Xpaths, Selenium, Puppeteer, Splash)

Familiarity with data pipelining to integrate scraped items into existing data pipelines.

Experience extracting data from multiple disparate sources including HTML, XML, REST, GraphQL, PDF, and spreadsheets.

Experience running, monitoring and maintaining a large set of broad crawlers (100+ spiders)

Sound Knowledge in bypassing Bot Detection Techniques

Experience using techniques to protect web scrapers against site ban, IP leak, browser crash, CAPTCHA and proxy failure.

Experience with cloud environments like GCP, AWS, as well as containerization tools like Docker and orchestration such as kubernetes or others.

Ability to maintain all aspects of a scraping pipeline end to end (building and maintaining spiers, avoiding bot prevention techniques, data cleaning and pipelining, monitoring spider health and performance).

OOP, SQL and Django ORM basics

Even better if you have, but not necessary:

Experience with microservices architecture would be a plus.

Familiarity with message brokers such as Kafka, RabbitMQ, etc

Experience with DevOps

Expertise in data warehouse maintenance, specifically with Google BigQuery (ETLs, data sourcing, modeling, cleansing, documentation, and maintenance)

Familiarity with job scheduling & orchestration frameworks - e.g. Jenkins, Dagster, Prefect

Escritório à distância e em casa - Empregos em Reino Unido

Remote Web Scraping Engineer

Hybrid Volunteer Youth Mentor. (online) | Creative Active Lives CIC Volunteer Youth Mentor. (online) | Creative Active Lives CIC

Hybrid Nuclear Physics Expertise Sought for AI Training Nuclear Physics Expertise Sought for AI Training

Homeoffice Cryptoeconomics Researcher

Hybrid Philosophy Expertise Sought for AI Training Philosophy Expertise Sought for AI Training

Hybrid Remote AI Writing Editor (Tier 1) Remote AI Writing Editor (Tier 1)

Remote Software Engineer with verification

Remote Software Engineer (Mid level)

Remote Full Stack Product Engineer

Homeoffice Senior Backend Engineer

Homeoffice Senior Software Engineer (Product)

Homeoffice Technical Product Manager

Homeoffice Senior Python Software Engineer - Web3, DeFi

Remote Product Manager

Homeoffice Analytics Engineer

Remote Senior Software Engineer

Remote Senior Backend Engineer (AWS) with verification

Homeoffice Senior Product Manager

Homeoffice Software Engineer

Hybrid Volunteer Social Media Marketer | OneWig OneSmile Volunteer Social Media Marketer | OneWig OneSmile

Ainda não foi selecionado nenhum emprego remoto

Remote Web Scraping Engineer

Remote Web Scraping Engineer

About the job

Benefícios adicionais

Dados de contacto

Telefone

Preferências

Experiência de trabalho

Educação

Competências

Criar perfil de aplicação

Só para candidatos

Benefícios adicionais

Definições de cookies

Definições de cookies

Cookies orientados para o grupo-alvo

Utilizamos cookies

Escritório à distância e em casa - Empregos em Reino Unido

Ainda não foi selecionado nenhum emprego remoto

Remote Web Scraping Engineer

Remote Web Scraping Engineer

About the job

Benefícios adicionais

Dados de contacto

Telefone

Preferências

Experiência de trabalho

Educação

Competências

Criar perfil de aplicação

Registar como candidato

Criar uma conta para apresentar o seu perfil às

Só para candidatos

Procurar emprego

Benefícios adicionais

Os mais recentes empregos de escritório em casa Semanalmente por correio eletrónico.

Definições de cookies

Definições de cookies

Cookies orientados para o grupo-alvo

Utilizamos cookies

Os mais recentes empregos de escritório em casa
Semanalmente por correio eletrónico.