Data Pipeline Engineer - AWS, Hadoop, APIs, GIS (SE1)
Our client is an early-stage, well-funded B2B tech start-up building a predictive investment platform at the top end of the real estate industry. The scientific challenge in such a data-heavy business, focused on predicting prices and understanding the factors driving them, is immense. Backed by one of the largest VCs in Europe, they have big dreams and are growing rapidly!
On the back of a new round of funding, they’re now looking for a skilled Data Engineer to help build some of the product and automate the data pipeline: a seriously top-flight Data Scientist experienced in productionising data at the largest scale.
You will be experienced in:
- automating processes and working with real datasets
- building and using APIs
Further, you will know your way around a Linux terminal and are likely to be familiar with:
- Amazon Web Services (EC2, Elastic Beanstalk)
- Hadoop / Hive / Apache Spark
- GIS (e.g. PostGIS)
- pandas, NumPy and/or scikit-learn
- contributing to open source projects
You will be naturally skilled at taking on new ideas and an excellent team player who can independently take charge and, crucially, 'get things done'!
Excellent benefits, including high equity potential, flexible working and 5 weeks' vacation.