1. Design, model, implement and maintain Hadoop solutions supporting raw-event collections
2. Help internal clients with pragmatic strategic & ad-hoc data requirements
3. Optimize our data infrastructure to become a reliable, tested real-time solution
4. Work closely with data scientists and engineers to build personalized experiences for our users
5. Drive research initiatives to innovate and support our data-driven approach
6. Perform data extraction and aggregation from multiple sources, including Google Analytics
7. Create confidence in our data through tests, automation, monitoring and neat visualizations
8. Several years of real-world experience with Apache Hadoop (e.g. HDFS, Hive, HBase, Pig)
9. Experienced software engineer, proficient with several languages (we use Scala a lot)
10. Proficient with SQL (we run MySQL and MS-SQL) and NoSQL (e.g. MongoDB, Neo4j, Cassandra) databases and with querying them
11. Sysadmin experience (Linux, AWS, Docker) is a plus
12. Prior experience with scaling ETL projects and a big data platform is advantageous
13. Able to understand complex topics, including business traits and user behavior
14. Uses modern tools and is generally efficient and test-driven
15. True agilist, keen on delivering working software
16. Non-religious about specific systems, frameworks, languages or tools
17. Can focus on deliverables and small increments, but is also not afraid of huge changes and humongous amounts of data
18. Proud to show your github.com profile