Data scrapping engineer

  • Webbtree.com

Job description:

We are seeking a skilled Data/Scraping Engineer to join our data-driven team. As a key member of our organization, you will play a crucial role in collecting and processing data from diverse sources, ensuring its compatibility with our cutting-edge Large Language Models (LLMs). Your expertise in data acquisition, cleaning, and transformation will be instrumental in fueling our AI-powered solutions.What can we expect from you?ResponsibilitiesData AcquisitionDesign and implement efficient data scraping pipelines to extract data from a wide range of sources, including websites, APIs, and databases.Navigate complex data structures and handle various data formats, such as JSON, XML, and CSV.Data Cleaning and PreprocessingDevelop robust data cleaning processes to ensure data quality, consistency, and integrity.Apply advanced data preprocessing techniques to handle missing values, outliers, and inconsistencies in the collected data.Data TransformationTransform and structure the cleaned data into formats compatible with our LLMs, such as text, numerical, and categorical features.Optimize data representations to enhance the performance and accuracy of our AI models.Data IntegrationIntegrate the processed data into our existing data pipelines and storage systems, ensuring seamless data flow and accessibility.Collaborate with cross-functional teams to align data requirements and facilitate data-driven decision-making.Data Monitoring and MaintenanceContinuously monitor the data scraping pipelines to ensure data freshness, reliability, and scalability.Proactively identify and resolve data-related issues, such as data drift, inconsistencies, and performance bottlenecks.QualificationsEducational BackgroundA bachelor s degree in Computer Science, Data Science, or a related field. Advanced degrees are a plus.Data Scraping ExpertiseExtensive experience in web scraping techniques, including using libraries like BeautifulSoup, Scrapy, and Selenium.Proficiency in handling dynamic web pages, authentication, and anti-scraping mechanisms.Data Preprocessing SkillsStrong knowledge of data cleaning, normalization, and feature engineering techniques.Familiarity with data preprocessing libraries like Pandas, NumPy, and scikit-learn.Programming ProficiencyExcellent programming skills in Python, with experience in data manipulation and analysis.Familiarity with SQL and NoSQL databases for data storage and retrieval.Problem-Solving AbilitiesStrong analytical and problem-solving skills, with the ability to tackle complex data challenges.Attention to detail and a meticulous approach to ensuring data quality and integrity.Communication and CollaborationExcellent communication skills to effectively collaborate with cross-functional teams and stakeholders.Ability to translate technical concepts and data requirements to non-technical audiences.What can you expect from us?Cutting-Edge TechnologiesWork with state-of-the-art LLMs and AI technologies to drive innovation and solve complex problems.Data-Driven CultureBe part of a data-driven organization that values evidence-based decision-making and continuous improvement.Collaborative EnvironmentCollaborate with a diverse team of experts, including data scientists, AI researchers, and domain specialists.Professional GrowthEnjoy opportunities for learning and professional development through training programs, conferences, and mentorship.Impactful WorkContribute to projects that have a significant impact on our organization s success and drive advancements in AI and data-driven solutions.Competitive CompensationReceive a competitive salary and comprehensive benefits package, reflecting the value you bring to our team.If you are passionate about data, have a strong background in data scraping and preprocessing, and are excited about enabling cutting-edge AI solutions, we would love to have you on our team. Join us in our mission to harness the power of data and drive innovation forward. Apply now and embark on a rewarding career as a Data/Scraping Engineer! Powered by Webbtree
Advertisement
Apply for this job

Related jobs

Data services engineer hana dba ariba

Unspecified GBP Bangalore

We help the world run better Our company culture is focused on helping our employees enable innovation by building breakthroughs together. How? We focus every day on building the foundation for tomorrow and creating a wo

Data quality engineer

Unspecified GBP Bangalore

Summary We are looking for a Data Quality Engineer to join our Data and Analytics team to develop and automate solutions for operational efficiencies and improved reliability of our Cloud Data Analytics Platform. As we e

Lead data engineer новая

Unspecified GBP Bangalore

Department :Clinical Data Operations and Insights Location: Bangalore Are you passionate about transforming the clinical data flow and provisioning across the clinical data and technology value chain? Do you have experie

Area field engineer dmd новая

Unspecified GBP Dahej

Requisition ID:276337 : Relocation Authorized:International : Camp : Telework Type:Full:Time Office/Project : Work Location:Dahej Extraordinary teams building inspiring projects: Since 1898, we have helped customers comp

Field engineer mechanical dmd новая

Unspecified GBP Dahej

Requisition ID:276347 : Relocation Authorized:International : Camp : Telework Type:Full:Time Office/Project : Work Location:Dahej Extraordinary teams building inspiring projects: Since 1898, we have helped customers comp

Lead discipline field engineer civil dmd новая

Unspecified GBP Dahej

Requisition ID:276338 : Relocation Authorized:International : Camp : Telework Type:Full:Time Office/Project : Work Location:Dahej Company Overview: Since 1898, we have helped customers complete more than 25,000 projects

Field engineer civil dmd новая

Unspecified GBP Dahej

Requisition ID:276343 : Relocation Authorized:National : Camp : Telework Type:Full:Time Office/Project : Work Location:Dahej Extraordinary teams building inspiring projects: Since 1898, we have helped customers complete

Field engineer structural steel dmd новая

Unspecified GBP Dahej

Requisition ID:276345 : Relocation Authorized:National : Camp : Telework Type:Full:Time Office/Project : Work Location:Dahej Extraordinary teams building inspiring projects: Since 1898, we have helped customers complete

Area field engineer ca dmd новая

Unspecified GBP Dahej

Requisition ID:277007 : Relocation Authorized:National/International : Single : Telework Type:Full:Time Office/Project : Work Location:Dahej Extraordinary teams building inspiring projects: Since 1898, we have helped cus

Engineering manager 8+ years for a world s largest media tech company новая

Unspecified GBP Zyoin Bangalore

We are looking for a Engineering Manager for one of our esteemed Clients for Bangalore ,India Location. RESPONSIBILITIES: Handling and managing a team of software engineers and technical leads from various disciplines su