Data Engineer

Closing date
13 Aug 2021

View more

Technology & New Media
Contract Type

Job Details

Clausematch is an award-winning global Compliance Technology company with a unique AI-powered SaaS product offering for regulated firms including Banks, Investment Management, Payments and Insurance. Our product is a unique platform allowing people in enterprises to collaborate on documents faster and more efficiently.

The solution is already live with top-tier banks and other financial institutions. It works as a browser-based collaborative document editor containing in its core a detailed workflow, where comments, approvals and changes are a part of a full audit trail, providing full control of content. Every change and approval made in a document is tracked in an organised manner providing full audit trail and unprecedented reporting capabilities. As we are still a small and agile team with a presence across Europe, USA and Asia, we thoroughly believe the opportunities are virtually boundless.

For the right candidate it is an exciting time to help our customers solve difficult challenges and reap the rewards. We are growing very quickly due to the increased demand from new clients.

Role and responsibilities

We are looking for a Data Engineer to support the Head of Data Science & ML, working closely with the Engineering and Product teams. The responsibilities will include, but will not be limited to
  • Wrap Python-based ML models into scalable monitored services and integrate them with the Java/Kotlin-based backend
  • Design and develop cloud infrastructure for ML pipelines for Data Warehouse
  • Automated monitoring, alerting and CI/CD for ML pipelines and orchestration
  • Develop and enhance data preparation pipelines with appropriate thresholds, tests and documentation

  • At least 2 years experience with Data Engineering and ML productization
  • 1+ years of AWS or Azure cloud experience
  • Solid Python and Java/Kotlin coding skills, willingness to write clean and production-ready code
  • Have good understanding of data structures and algorithms
  • Strong knowledge of Data Warehouse, REST API, SQL, data and model storage formats
  • Have solid experience of a Python environment management with docker, conda, pip
  • Have solid experience with ML pipelines orchestration frameworks like Airflow
  • Experience with Linux, numpy, pandas, gunicorn, pytest, kafka
  • Should be familiar with all popular Atlassian product stack tools (Jira, Confluence, BitBucket etc)
  • Able to prioritise tasks, design optimal solutions with the most practical tools
  • Self-driven to constantly learn and grow
  • You speak fluent English; other languages and global exposure are welcomed

  • Interested in NLP / ML / DL
  • Familiar with Data Science environment and workflow
  • Experience with automation and/or configuration management tools like Puppet, Chef
  • Familiar with TensorFlow, knowledge graphs, CPU to GPU cluster ml pipelines refactoring
  • Ability to work with a diverse group of internal and external stakeholders
  • Passionate about technology and how it can be used to drive business impact
  • Outstanding integrity, communication and interpersonal skills, as well as an ability to listen carefully
  • Live within commuting distance from London (although the role will be remote at first)

  • Competitive salary and share options package
  • Be part of fun, high energy, growing technology company in a flexible work environment
  • Annual corporate team retreats and numerous development opportunities
  • Located in the Canary Wharf WeWork (with flexibility to work remotely or another WeWork on occasion)
  • Private health insurance provided by Vitality
  • Coaching provided by More Happi
  • Good holiday allowance 26 days + 3 days in addition to Christmas holidays

Get job alerts

Create a job alert and receive personalised job recommendations straight to your inbox.

Create alert