Data scrapping engineer

  • Webbtree.com

Job description:

We are seeking a skilled Data/Scraping Engineer to join our data-driven team. As a key member of our organization, you will play a crucial role in collecting and processing data from diverse sources, ensuring its compatibility with our cutting-edge Large Language Models (LLMs). Your expertise in data acquisition, cleaning, and transformation will be instrumental in fueling our AI-powered solutions.What can we expect from you?ResponsibilitiesData AcquisitionDesign and implement efficient data scraping pipelines to extract data from a wide range of sources, including websites, APIs, and databases.Navigate complex data structures and handle various data formats, such as JSON, XML, and CSV.Data Cleaning and PreprocessingDevelop robust data cleaning processes to ensure data quality, consistency, and integrity.Apply advanced data preprocessing techniques to handle missing values, outliers, and inconsistencies in the collected data.Data TransformationTransform and structure the cleaned data into formats compatible with our LLMs, such as text, numerical, and categorical features.Optimize data representations to enhance the performance and accuracy of our AI models.Data IntegrationIntegrate the processed data into our existing data pipelines and storage systems, ensuring seamless data flow and accessibility.Collaborate with cross-functional teams to align data requirements and facilitate data-driven decision-making.Data Monitoring and MaintenanceContinuously monitor the data scraping pipelines to ensure data freshness, reliability, and scalability.Proactively identify and resolve data-related issues, such as data drift, inconsistencies, and performance bottlenecks.QualificationsEducational BackgroundA bachelor s degree in Computer Science, Data Science, or a related field. Advanced degrees are a plus.Data Scraping ExpertiseExtensive experience in web scraping techniques, including using libraries like BeautifulSoup, Scrapy, and Selenium.Proficiency in handling dynamic web pages, authentication, and anti-scraping mechanisms.Data Preprocessing SkillsStrong knowledge of data cleaning, normalization, and feature engineering techniques.Familiarity with data preprocessing libraries like Pandas, NumPy, and scikit-learn.Programming ProficiencyExcellent programming skills in Python, with experience in data manipulation and analysis.Familiarity with SQL and NoSQL databases for data storage and retrieval.Problem-Solving AbilitiesStrong analytical and problem-solving skills, with the ability to tackle complex data challenges.Attention to detail and a meticulous approach to ensuring data quality and integrity.Communication and CollaborationExcellent communication skills to effectively collaborate with cross-functional teams and stakeholders.Ability to translate technical concepts and data requirements to non-technical audiences.What can you expect from us?Cutting-Edge TechnologiesWork with state-of-the-art LLMs and AI technologies to drive innovation and solve complex problems.Data-Driven CultureBe part of a data-driven organization that values evidence-based decision-making and continuous improvement.Collaborative EnvironmentCollaborate with a diverse team of experts, including data scientists, AI researchers, and domain specialists.Professional GrowthEnjoy opportunities for learning and professional development through training programs, conferences, and mentorship.Impactful WorkContribute to projects that have a significant impact on our organization s success and drive advancements in AI and data-driven solutions.Competitive CompensationReceive a competitive salary and comprehensive benefits package, reflecting the value you bring to our team.If you are passionate about data, have a strong background in data scraping and preprocessing, and are excited about enabling cutting-edge AI solutions, we would love to have you on our team. Join us in our mission to harness the power of data and drive innovation forward. Apply now and embark on a rewarding career as a Data/Scraping Engineer! Powered by Webbtree
Advertisement
Apply for this job

Related jobs

Senior engineering manager email and collaboration security products новая

Unspecified GBP Whitecrow Bengaluru

About our client:Since 2003, our client has stopped bad things from happening to good organizations by enabling them to work protected. They empower more than 40,000 customers to help mitigate risk and manage complexitie

Senior engineering manager threat protection новая

Unspecified GBP Whitecrow Bengaluru

About our client:Since 2003, our client has stopped bad things from happening to good organizations by enabling them to work protected. They empower more than 40,000 customers to help mitigate risk and manage complexitie

Engineering manager identity platform team новая

Unspecified GBP Whitecrow Bengaluru

About our client:Since 2003, our client has stopped bad things from happening to good organizations by enabling them to work protected. They empower more than 40,000 customers to help mitigate risk and manage complexitie

Senior manager quality engineering and devops новая

Unspecified GBP Whitecrow Bengaluru

About our client:Since 2003, our client has stopped bad things from happening to good organizations by enabling them to work protected. They empower more than 40,000 customers to help mitigate risk and manage complexitie

Principal front end software engineer front end ui platform новая

Unspecified GBP Whitecrow Bengaluru

About our client:Since 2003, our client has stopped bad things from happening to good organizations by enabling them to work protected. They empower more than 40,000 customers to help mitigate risk and manage complexitie

Senior engineer general civil m and t новая

Unspecified GBP N A

Requisition ID:276800 : Relocation Authorized:National : Family : Telework Type:Part:Time Telework : Work Location:Various Permanent Bechtel Office Locations Extraordinary teams building inspiring projects: Since 1898, w

Senior cost engineer новая

Unspecified GBP N A

Requisition ID:270520 : Relocation Authorized:National : Family : Telework Type:Full:Time Office/Project : Work Location:Various Permanent Bechtel Office Locations Company Overview: Since 1898, we have helped customers c

Senior engineer cyber security it новая

Unspecified GBP

Additional Location(s):India:New Delhi Diversity : Innovation : Caring : Global Collaboration : Winning Spirit: High Performance At Boston Scientific, we ll give you the opportunity to harness all that s within you by wo

Principal engineer sustaining r and d новая

Unspecified GBP

Additional Locations:India:Maharashtra, Pune Diversity : Innovation : Caring : Global Collaboration : Winning Spirit : High Performance At Boston Scientific, we ll give you the opportunity to harness all that s within yo

Senior engineer control systems новая

Unspecified GBP N A

Requisition ID:275075 : Relocation Authorized:None : Telework Type:Full:Time Office/Project : Work Location:Various Permanent Bechtel Office Locations Company Overview: Since 1898, we have helped customers complete more