about 1 year ago
As a part of the team you will be responsible for managing data flow within the company and the construction of data lake structures for the company's research projects. This is an exciting role that requires an understanding of the company aims and objectives as well as enthusiasm for engaging with various sources of data processed and generated. Previous experience of working with genomic and biological data is an advantage. You must be able to work in line with standard operating procedures, work methodically, and understand the importance of privacy and security with respect to restricted (including clinical) data.
Use agile methodologies to track projects using JIRA
Manage data flow from multiple data providers to archival and analysis resources with the AWS environment.
Harmonise data processing and analyses of research projects within the company
Build data lakes to represent the various research projects
Have data management responsibilities including; ensuring data and metadata adhere to standards, tracking data processing,presentation and availability
Have project management responsibilities for ensuring work for deliverables is scheduled, drafting deliverables ( regular reports and data archival progress updates).
Qualifications and Experience required
·Bachelor's degree, or equivalent experience, in Computer Science, Bioinformatics, Mathematics or related discipline
·Expertise in AWS services such as Athena, Glue, Lambda, S3, DynamoDB, NoSQL, Relational Database Service (RDS), Amazon EMR and Amazon Redshift.
·Hands on experience leading large-scale global data warehousing and analytics projects
·Understanding of database and analytical technologies in the industry including MPP and NoSQL databases, Data Warehouse design and Dashboard development
·Experience of processing biological or experimental data or working within the field of bioinformatics