Passende Schulungen
finden Sie hier:
↪ Microsoft Azure
↪ Microsoft Certified: Azure Enterprise Data Analyst Associate
↪ Data Mining - Machine Learning
↪ AWS Certified Machine Learning - Specialty (ACMLS)
↪ Digitalisierung
↪ Microsoft Dynamics 365 Finance & Supply Chain Management
↪ SQL Server 2017
↪ SQL Server 2016
↪ Alibaba Cloud
↪ Microsoft Office 2016
↪ Microsoft Exchange Server 2019
↪ Azure HDInsight
↪ Agilität
↪ Azure Cosmos DB
↪ Python
↪ Git
↪ PostgreSQL
↪ SQL Server
↪ Docker
↪ NoSQL
↪ Cloudera
↪ Microsoft Certified: Azure Data Scientist Associate
↪ AWS Certified Cloud Practitioner (ACCP)
↪ Cloud
↪ Kubernetes
↪ Python Zertifizierung
↪ Kubernetes Zertifizierungen
↪ WatchGuard
↪ FileMaker
↪ Künstliche Intelligenz
↪ Puppet
↪ AWS
↪ Google Workspace (G Suite)
↪ Microsoft Exchange Online
↪ SharePoint Online

Architect (H/(U/M/A/N)* Data Engineer Azure Databricks Kubernetes Docker #9808


1 keine Angabe 22 Wochen
Registrieren Sie sich jetzt kostenlos
um Ihre Anfrage versenden zu
können!

Oder loggen Sie sich ein!
Anfrage senden

Dieses Projekt wurde bereits erfolgreich vermittelt!
Projektdetails
Projektkategorien: Cloud
Projektbeginn: 01.08.2023
Abrechnung nach: nach Vereinbarung
Nebenkosten abrechenbar: Nein
Projektvolumen: keine Angabe

Projektbeschreibung
  • Good communicator, but great at independent work. Solution oriented. Analytical thinking.
  • Experience in machine learning engineering, exploratory data analysis, and software development and writing of ETL-Pipelines. Optimally, candidate should have a degree in mathematics, physics, computer sciences or in a related field.
  • Experience in Python programming is mandatory, especially with PySpark, whereas XGBoost, Seaborn, Matplotlib and dbutils (Databricks) are nice-to-have.
  • Expertise in Git, GitLab, and CI/CD are beneficial (including Azure CLI and Azure-Cloud specific APIs).
  • Experience in working with Azure, Databricks, Kubernetes and Docker.
  • Candidate should be familiar or inclined to working in an agile environment. Prior experience with predictive maintenance tasks is a plus.

Ihre Aufgaben

Our customer has five different machine learning-based solutions that identify weak points in the medium voltage grid. While targeting the same purpose, they differ in their data source systems, ML features, ML algorithms and especially the Distribution System Operator (DSO) for which they have been developed. Currently, none of these five solutions is easily applicable to a different target DSO or implemented in a scalable way. A newly developed data platform (iPEN) that collects, prepares, and provides data from all DSOs now allows to develop a new solution that consumes all needed data from a single source system. Based on unified data preparation, feature extraction, and machine learning steps, this new solution should be applicable to all customers DSOs. Particular attention will be paid to the scalability, stability, and maintainability of the resulting software. Data Science tasks related to projects, e.g. Predictive Maintenance Solutions.
  • Data Engineering tasks related to projects, e.g. Predictive Maintenance Solutions
  • Advice and designing business-critical data engineering use cases, from the business problem to delivery and operation.
  • Programming of data ingestion pipelines from various sources, e.g Graph Database or Data Lakehouse.
  • Writing of production feature engineering code in Python and PySpark on a Databricks Tech-Stack.
  • Build and maintain data pipelines for static, mixed, and time-series data.
  • Design, implementation and maintenance of data infrastructure for ML algorithms in Azure Databricks.
  • Responsible for the design and implementation of the CI/CD pipelines.
  • Data modeling and architecture (schemes, sources, optimizations).

Kontakt

Cegeka Deutschland GmbH
Andreas Baxmann
Im Mediapark 4d
50670 Köln
Tel. +49 221 160 20 15
Fax +49 221 160 20 13
andreas.baxmann@cegeka.com

Sprachkenntnisse
Deutsch
Experte
Kenntnisse & Fähigkeiten
Kubernetes
Grundkentnisse
Agilität
Grundkentnisse
Cloud Computing
Grundkentnisse
Cloud-Entwicklung
Grundkentnisse
COM/DCOM
Grundkentnisse
Docker
Grundkentnisse
ETL
Grundkentnisse
git
Grundkentnisse
Machine Learning
Grundkentnisse
Microsoft Azure
Grundkentnisse
Python
Grundkentnisse
SQL Azure
Grundkentnisse