- Experience with TDD and Git
- Experience with Scala and/or Python in a Unix/Linux environment
- Proficiency with Microsoft Azure and the Hortonworks stack
- Experience with the Spark framework
- Knowledge of big data technologies such as Hadoop, HBase, Hive, and ETL frameworks
- Ability to build event-streaming pipelines (Storm, Kafka, Kinesis)
- A background in machine learning is preferred
- Good understanding of the Data Science lifecycle
- Good knowledge of infrastructure automation software/tools such as Chef, Terraform and/or Docker
- Knowledge of and experience in administering SQL and NoSQL databases is an advantage
- Knowledge of both Scala and Python is an advantage
- Fluency in English is a must
Your Responsibilities
- Test-driven development with Scala and the Apache Spark framework
- Building data pipelines for the project's data preparation layer
- Translating existing complex data preparation SQL queries to Scala
- Creating content for new analytics modules while coordinating with other teams and developers
- Working as part of an international team
Contact
CEGEKA Deutschland GmbH
Alessia Cosentino
Im Mediapark 4d
50670 Köln
Tel. +49 221 16020 17
Fax +49 221 16020-13
alessia.cosentino@cegeka.de