Senior Python Developer (web)

legacy systems software
Deadline 25.07.2024

We are looking for a skilled web scraping developer to join our team. You will design and implement web scraping solutions that collect data from various websites. The role requires a strong understanding of web technologies and of data extraction and manipulation techniques. You should be proficient in Python and have experience with popular web scraping frameworks and tools.

Your Responsibilities Will Include:

  • Design and implement web scraping solutions to collect data from diverse sources.
  • Develop and maintain web scrapers using Python and relevant libraries.
  • Handle large volumes of data and ensure data accuracy and integrity.
  • Utilize proxy services and headless browsers to bypass IP blocks and CAPTCHAs.
  • Work with databases to store and manage scraped data.
  • Ensure compliance with website terms of service and legal guidelines.
  • Collaborate with the data engineering team to integrate scraped data into the data pipeline.

Our Technical Requirements:

* Programming Language:

  • Proficiency in Python.

* Libraries and Frameworks:

  • Experience with BeautifulSoup for HTML and XML parsing.
  • Experience with Scrapy for complex and large-scale scraping projects.
  • Proficiency in Selenium for browser automation and interaction with JavaScript-heavy pages.
  • Experience with Requests for sending HTTP requests.

* Databases:

  • Knowledge of SQLite for small projects.
  • Experience with PostgreSQL for larger projects.
  • Familiarity with MongoDB for storing unstructured data.

* Other Tools and Technologies:

  • Experience with Docker for containerizing applications.
  • Understanding of proxy services to bypass IP blocks and restrictions.
  • Familiarity with Headless Chrome for automating interactions with websites without a graphical interface.

* CAPTCHA Bypassing:

  • Experience with 2Captcha or Anti-Captcha services for solving CAPTCHAs.
  • Familiarity with Puppeteer for automating CAPTCHA interactions.
  • Knowledge of Tesseract OCR for recognizing text-based CAPTCHAs.

* Additional Tools:

  • Proficiency in using Regular Expressions (Regex) for data extraction from text.

To work efficiently and comfortably in this role, you should have:

  • Proven experience in developing web scrapers and data extraction solutions.
  • Strong problem-solving skills and the ability to handle complex scraping challenges.
  • Experience with handling large datasets and ensuring data accuracy.
  • Knowledge of web technologies, HTTP protocols, and web scraping best practices.
  • Familiarity with legal and ethical considerations in web scraping.

Education:

  • A degree in Computer Science, Information Technology, or a related field is preferred, but not mandatory if you have relevant experience and skills.

Helmes Offers:

  • Competitive salary with incentives.
  • Professional growth opportunities.
  • Dynamic and collaborative work environment.
  • Flexible working hours and a remote work option.
  • Compensation for learning costs.
  • Legalisation assistance.

Join Us!
