We are seeking a talented Research Scientist to contribute to our pioneering research initiatives in Advanced Architectures. As a key member of the team, you will participate in research on Transformers, recurrent models, and Large Language Models (LLMs).
* Develop innovative model modifications using TensorFlow, PyTorch, and Keras.
* Rethink and rebuild the core elements of models to enhance performance and efficiency.
* Integrate state-of-the-art techniques such as attention mechanisms, sequence modeling, and neural network optimization.
Your expertise in deep learning is highly valued. You should hold a Master's degree in Computer Science, Data Science, or a related field and have a strong technical foundation.
1. Proficiency in Python programming, including developing and modifying module classes in TensorFlow and Keras.
2. Ability to comprehend complex problems and provide efficient solutions.
3. Familiarity with state-of-the-art techniques in deep learning.
4. Knowledge of Large Language Model (LLM) architectures, such as Generative Pre-trained Transformer models, is desirable.
5. Willingness to learn and work with advanced machine learning development stacks.
6. Strong work ethic: diligent, responsible, structured, goal-oriented, and independent.
7. Excellent communication and interpersonal skills.
8. Fluency in English.
9. German proficiency is a plus.