Key Responsibilities
- Design, build, and maintain ETL pipelines for large-scale structured and unstructured data.
- Work with ClickHouse for fast, reliable querying, aggregation, and performance optimization.
- Develop and apply similarity algorithms (cosine similarity, nearest neighbours, vector embeddings) for search, recommendation, and matching use cases.
- Conduct data cleaning, preprocessing, and analysis to support business and product needs.
- Collaborate with cross-functional teams to drive data-informed decision-making.
- Contribute to building, testing, and deploying predictive models.