We are seeking a Principal Software Developer – Data Architect to drive the technical vision and architectural
strategy of Caseware’s enterprise data platform, including the AI-Ready Data Platform. This role will define
the enterprise data architecture, patterns, and modeling standards that deliver trusted, governed, high-quality
data products forming a foundational data platform for our cloud offerings, enabling AI capabilities and secure
interoperability with customer systems, while powering analytics and strengthening our core products.
This role requires deep experience designing modern data platforms and practical familiarity with how data
supports AI workflows, including retrieval, search, grounding, and secure interoperability patterns. You will
apply this experience to build a data foundation that supports AI workflows and agentic capabilities, analytics,
and customer interoperability.
This is a key leadership role where you will act as a hands-on architect while mentoring the development team,
guiding the long-term technical vision, shaping enterprise data architecture standards across teams, and
contributing to crucial AI and data platform projects.
❗ This is a full-time permanent position
❗ This is a new vacancy
📍 Location: This is a hybrid role requiring the successful candidate to work 3 days a week in our Toronto office located at 351 King St E Suite 1100 Toronto ON.
strategy for a scalable, AI-Ready, enterprise data platform, including Sherlock modernization,
lakehouse architecture, data products, interoperability, and the patterns and capabilities needed to
support AI-Ready use cases.
standards, guardrails, and best practices for our foundational data platform, including Icebergbased lakehouse architecture, medallion patterns, ingestion, normalization, data quality, and
interoperability.
documentation, and prototyping, and mentor teams in responsible usage that improves design
quality, data discovery, and delivery effectiveness.
Sherlock products, schema modernization, and data model evolution.
adherence to high standards in data engineering practices, data modeling, data quality, and
platform architecture.
AI-Ready, and securely interoperable data proucts, including data contracts, ingestion and
normalization standards, and improving consistency and reuse across products
classification, retention, tenant isolation, and access controls for datasets and data products.
make it easy for teams to produce data products and integrate with the data platform.
traceability, data dictionary controls, freshness monitoring, and alerting, so data products are
reliable and audit-ready.
technical leadership role, preferably as a Principal Developer or Data Architect.
lakehouse, medallion, and analytics patterns, ingestion from OLTP systems, ETL/ELT pipelines,
distributed processing with Spark, Trino, and delivering analytics and AI-Ready data lakes at scale, with
strong operational practices.
analysis, design exploration, documentation, prototyping, code assistance, and mentoring teams on
responsible, effective usage.
DocumentDB, MS SQL Server, DynamoDB, AWS ElastiCache for Redis, and Valkey; event streaming and
queueing using SNS/SQS. Postgres, pgvector, and Kafka or Pub/Sub are an asset.
Formation, OpenSearch Serverless, S3 Vector Storage, Iceberg, Lambda, Step Functions, EKS, ETL on
EMR, and EMR Serverless.
guiding teams in data models, storage and integration architectures, data contracts, data domain
taxonomy, schema and event versioning, and resolving performance and scale bottlenecks.
sourcing, and CDC/change tracking strategies, safe historical reprocessing patterns, and performance
optimization through query analysis, indexing, and partitioning.
controls for privacy, access, auditability, safe reuse, and operational guardrails for AI-Ready datasets
and data products.
including governed data access, tenant-aware controls, and safe integration patterns.
retrieval, RAG data workflows, and real-time/event-driven data flows that support AI integrations.
Knowledge Bases, vector retrieval, and RAG workflows is preferred.
standards, and influencing technical direction across multiple teams.
tooling to deliver scalable, resilient data platforms and pipelines.
leadership on technical strategy, trade-offs, and decisions.
leadership to define the AI-Ready Data Platform’s technical direction, ensuring alignment with
business growth, scalability, and interoperability objectives.
patterns and modeling standards backed by reference documentation and architecture decision
records that teams can apply consistently.
and cross-product data architecture improvements, strengthening the foundation for AI
capabilities, interoperability, scalability, and performance.
practices in data modeling, data quality, governance, and operational excellence.
OpenSearch Serverless, S3 Vector Storage, EMR/EMR Serverless, Spark, Trino, MapReduce,
Iceberg, Lambda, Step Functions, EKS, SNS/SQS; MongoDB, Amazon DocumentDB, MS SQL
Server, Redis/Valkey; Java (Spring), Python.
AWS Knowledge Bases, MCP, embeddings, vector retrieval, and RAG.
To apply for this job, please visit the application page

