We are seeking an experienced Data Scientist with a deep understanding of Large Language Models (LLMs) to join our team. This role offers the opportunity to work on complex data challenges and contribute to the development of models that will drive our company forward.
As a Data Scientist at SOURCIX, you will use LLMs and advanced data science techniques to manage and analyze large datasets of mechanical items. Your work will involve developing and implementing multi-model approaches to support strategic initiatives within the company. This role requires a strong foundation in data science, natural language processing, and machine learning, with a focus on handling complex, unstructured data and deriving insights from it.
Responsibilities:
- Data Extraction & Analysis: Develop algorithms to extract and analyze relevant information from large datasets, including unstructured data from PDF drawings and other sources.
- LLM Implementation: Utilize Large Language Models to interpret, process, and extract insights from complex data related to mechanical items.
- Model Development: Design and implement multi-model approaches to support various data-driven initiatives within the company.
- Machine Learning: Apply advanced machine learning techniques to improve data processing and analysis, and to increase the accuracy of our models.
- Collaboration: Work closely with the software team to integrate data insights into broader company strategies and initiatives.
- Continuous Improvement: Stay informed about the latest advancements in LLMs, NLP, and machine learning to continuously innovate and improve data models and processes.
Requirements:
- Educational Background: Master’s or Ph.D. in Data Science, Computer Science, Machine Learning, Statistics, or a related field.
- Experience: 3+ years of experience as a Data Scientist, with a strong focus on LLMs, NLP, and machine learning.
- Technical Skills: Proficiency in Python, R, or other programming languages used in data science. Strong experience with TensorFlow, PyTorch, or similar ML frameworks.
- LLM Expertise: Demonstrated experience working with Large Language Models such as GPT, BERT, or similar models, particularly in handling and interpreting complex datasets.
- NLP & ML: Solid understanding of Natural Language Processing techniques and machine learning algorithms.
- Data Handling: Experience in processing and analyzing large datasets, particularly those involving unstructured data like PDF files.
- Problem-Solving: Strong analytical skills with the ability to develop creative solutions to complex problems.
- Communication: Excellent verbal and written communication skills, with the ability to translate technical findings into business insights.
Advantages:
- Mechanical Industry Experience: Previous experience in the mechanical or manufacturing industry, particularly in working with mechanical parts, CAD drawings, or related data.
- Engineering Background: Educational or professional background in mechanical engineering or a closely related field.
- CAD Familiarity: Familiarity with CAD software and the ability to understand and extract relevant data from engineering drawings.
- Industry Standards Knowledge: Understanding of industry standards related to mechanical components, materials, and manufacturing processes.
- Hands-On Experience: Practical experience in handling data related to mechanical items, such as specifications, tolerances, and material properties.