Data Scientist LLM (m/f/d)

Posted 4 days 10 hours ago by Michael Bailey Associates

Contract
Not Specified
Other
Nordrhein-Westfalen, Düsseldorf, Germany, 40210
Job Description

We are currently looking for a Data Scientist (m/f/d) LLM

Start: 01.07.25
End: 31.12.25
Volume: fulltime
Location: remote

Project description: To develop an AI-powered application that automates the analysis, classification, and semantic processing of annual combined corporate reports, with a focus on compliance with regulatory standards such as IFRS and ESRS.

Task (performed independently):

- Building of a scraping and text preparation module for extracting content from combined reports by taking into consideration information and requirements provided by the client in advance based on own knowledge and experience
- Implementation of a search indexing and retrieval system for efficient information access
- Classification of text segments according to regulatory frameworks (IFRS, ESRS)
- Development of a matching engine to link regulatory descriptions to actual document content
- Development of a text consistency checking algorithm
- Integration of Named Entity Recognition (NER) with a labelling interface for additions
- Analysis of evaluation methods for the rewriting and rewording capabilities of large language models (LLMs) for editing the documents
- Creation of a user-facing interface for the above tasks
- Documentation of the results and presentation (online meeting) to client for a sign-off and handover
- Technical consultation of end users with respect to the developed tool and methods based on own expertise
- Client provides all necessary information, access to the systems and requirements in advance.

Skills:
Must haves:
Experience in quantitative text analysis
Proficiency in Python programming and relevant NLP libraries (eg, spaCy, NLTK, sbert)
Experience in developing and deploying Python code in MS Azure or Snowflake environment
English (spoken and written).

Nice haves:
Experience in programming with LLM inference APIs
Background in software development or corporate finance
Knowledge of Microsoft Azure services (AI Foundry, AI Search, Batch, Blob storage, Function, ML Service, Key Vaults, etc.);
Knowledge of Microsoft Office backends such as SharePoint and Outlook APIs
Proficiency in German.

We are looking forward to hearing from you. Please apply with your most recent CV