We are looking for creative, intellectually curious and entrepreneurial Big Data Software Engineers to join our London-based team.
The team
Joina high-profile team to work on ground-breaking problems in health outcomes across disease areas including Ophthalmology, Oncology, Neurology, Chronic diseases such as diabetes, and a variety of very rare conditions. Work hand-in-hand with statisticians, epidemiologists and disease area experts across the wider global RWE Solutions team, leveraging a vast variety of anonymous patient-level information from sources such as electronic health records; The data encompasses IQVIA’s access to over 530 million anonymised patients as well as bespoke, custom partnerships with healthcare providers and payers.
The role
As part of a highly talented Engineering and Data Science team, write highly performant and scalable code that will run on top of our Big Data platform (Spark/Hive/Impala/Hadoop). Collaborate with Data Science & Machine Learning experts on the ETL process, including the cohort building efforts.
What to expect:
Working in a cross-functional team – alongside talented Engineers and Data Scientists
Building scalable and high-performant code
Mentoring less experienced colleagues within the team
Implementing ETL and Feature Extractions pipelines
Monitoring cluster (Spark/Hadoop) performance
Working in an Agile Environment
Refactoring and moving our current libraries and scripts to Scala/Java
Enforcing coding standards and best practices
Working in a geographically dispersed team
Working in an environment with a significant number of unknowns – both technically and functionally.
Our ideal candidate: Essential experience
BSc or MSc in Computer Science or related field
Strong analytical and problem solving skills with personal interest in subjects such as math/statistics, machine learning and AI.
Solid knowledge of data structures and algorithms
Proficient in Scala, Java and SQL
Strong experience with Apache Spark, Hive/Impala and HDFS
Comfortable in an Agile environment using Test Driven Development (TDD) and Continuous Integration (CI)
Experience refactoring code with scale and production in mind
Familiar with Python, Unix/Linux, Git, Jenkins, JUnit and ScalaTest
Experience with integration of data from multiple data sources
NoSQL databases, such as HBase, Cassandra, MongoDB
Experience with any of the following distributions of Hadoop – Cloudera/MapR/Hortonworks.
Bonus points for experience in:
Other functional Languages such as Haskell and Clojure
Big Data ML toolkits such as Mahout, SparkML and H2O
Apache Kafka, Apache Ignite and Druid
Container technologies such as Docker
Cloud Platforms technologies such as DCOS/Marathon/Apache Mesos, Kubernetes and Apache Brooklyn.
This is an exciting opportunity to be part of one of the world’s leading Real World Evidence-based teams, working to help our clients answer specific questions globally, make more informed decisions and deliver results.
Our team within the Real-World & Analytics Solutions (RWAS) Technology division is a fast growing group of collaborative, enthusiastic, and entrepreneurial individuals. In our never-ending quest for opportunities to harness the value of Real World Evidence (RWE), we are at the centre of IQVIA’s advances in areas such as machine learning and cutting-edge statistical approaches. Our efforts improve retrospective clinical studies, under-diagnosis of rare diseases, personalized treatment response profiles, disease progression predictions, and clinical decision-support tools.
We invite you to join IQVIA™.
IQVIA is a strong advocate of diversity and inclusion in the workplace. We believe that a work environment that embraces diversity will give us a competitive advantage in the global marketplace and enhance our success. We believe that an inclusive and respectful workplace culture fosters a sense of belonging among our employees, builds a stronger team, and allows individual employees the opportunity to maximize their personal potential.